Review Comment:
The paper presents a novel approach to understanding and simulating the knowledge graph's structure evolution. While the paper is quite interesting to read, I find it difficult to follow. The authors have produced a significant piece of work, however, its presentation as a paper lacks structure and flow between the different sections. The resources and code are openly available and have been documented which is a plus.
The abstract does not provide a good overview of what the main research question is, what motivates the work and why a new approach is needed. Currently, the authors provide only 2 sentences on this at the start of the abstract. I suggest they provide more context here.
Crowdsourcing does not directly result in a knowledge graph. Social editing is only one aspect of crowdsourcing. The start of the introduction needs to be paraphrased as it lacks flow of content and the facts cannot be so easily generalised.
The role of semantics needs to be better elaborated in the introduction.
It would be better if the paper 1st presented the main motivation of the work and then the research questions.
"This approach has gained momentum in fields as diverse as computer networks and biology."-> Provide references to examples
"To answer these two research questions, this paper explores a novel approach that has never yet been applied to KGs: the modelling of complex network structure and dynamics" -> this does not clearly present the approach. What is the approach exactly?
Missing definition of "bipartite".
In the contributions outline, what do the authors mean by "generative model"? Which model exactly? Clarify what the main contributions are: approach, an AI model, or an algorithm? What are the constraints to implementing your approach?
The start of the related work is good, however, here the authors should also mention what the subsections present. Conclusions can be kept for the end of this section.
Now subsection 2.1 discusses deep learning methods and I am missing the connection to what the authors are doing. The introduction was mostly about crowdsourcing and the research questions. I suggest better interconnecting and aligning the work thus far in the paper.
Missing references to the global and sequential approaches mentioned on page 3, line 51. In this section, I believe the authors are confusing crowdsourced data with crowdsourced knowledge graphs. It should be clarified what the authors mean.
A section presenting the followed methodology and the approach itself in short is missing. I would advise the authors to check similar papers accepted in this journal. Usually, such a section is needed after the related work and before the implementation details of the approach.
Why should one note the implementation language of the approach? Does it matter if Python or Java are used? If yes, explain why.
"In this way, all subsequent experimental calculations were based on SQL queries on a local RDBMS, 50
51 which is highly efficient and not harmful to the planet." This needs to be paraphrased. Do you mean sustainability and the environmental impacts of the developed solution?
Define what "ablation study" entails.
Overall, the evaluation strategy and methodology need to be better presented at the start.
The conclusions should start with a summarisation of what the paper presented, followed by the evaluation findings, current limitations and directions for future work. The conclusion now is too long and detailed. It needs to be more concise and up to the point. Not introducing new knowledge but summarising and concluding the paper. The findings, which are presented in the conclusion, can be put in a separate section before the conclusion.
Minor comments:
It is not common practice for sentences to start with a reference. The names of the authors followed by a reference should be mentioned if this is what the authors want the sentence to start with.
Double-check for consistency in the capitalisation of terms.
Avoid jargon such as "craze".
Terms should be defined and abbreviated when 1st mentioned. Later on, the abbreviations should be used consistently. This is not the case now.
Missing introduction to section 4 and its subsections.
Missing definition of RDBMS.
|