Review Comment:
The position paper aims to present a new decentralized vision for Linked Data. It contains several valid discussions points, in particular related to metadata. This aspect seems to have even higher importance in the article than the decentralization aspects.
Generally, the article points out issues in each subsection followed by solution paths. The authors have long standing experience in the field and the mentioned issues are relevant. However, it is not clear to me how feasible the solution paths are. In many cases, it is almost obvious what *should* be done, but the effort and expertise required to actually do it vs. the incentives (expected reward) is a main issue.
Section 4 makes a particularly relevant point in my opinion: "In fact, we would argue that more principled Linked Data publishing could allow to auto-generate LOD clouds from a set of such HDT dumps, which to demonstrate is on our agenda for future work." => I would not limit this statement to HDT, but rather think that the maintenance of metadata outside of the actual knowledge source, as done in some dataset catalogues, is always likely to lead to synchronization problems (and completely unfeasible for frequently changing data).
Overall, I believe that the article could have a clearer structure. Only indirectly, I see how the solution paths lead to a "more decentralized" vision of Linked Data as claimed in the title. The authors may consider structuring the article fully around LOD cloud access and metadata provisioning, which seems to make up a large part of the article.
Moreover, I believe the visionary parts should be strengthened. In particular, the authors could comment / discuss how the suggestions can actually be realized at web scale.
Since the above two weaknesses (in the structure and the solution paths) from my point of view require potentially more significant changes, I opt for a major revision of the article before it can be accepted.
Further specific comments follow below:
- Myths => decentralized network of ontologies: The criticism here seems not that convincing. Gruber himself was talking about ontologies modeling domains of discourse - one could say that those are almost by definition "insular efforts" as they just relate to one domain. I would also like to see a source for backing up the claim that "vocabulary reuse is still extremely limited".
- Myths => knowledge graphs not decentralized: In some sense true, but some of them are linked to various other knowledge graphs. An argument about why these single knowledge graphs should even be decentralized in those specific cases could be added to the article.
- Section 2 is not that well organized in my opinion: Whereas Section 1 seems mostly about general observations, which are valid for the Semantic Web as a whole, Section 2 seems very specific and mostly focussed on LOD cloud metadata rather than its not completely clear how this fits into the story line.
- The solution path in Section 3.1.1 seems to rather focus on observing the status of the LOD cloud rather than providing actual solutions (of course being able to observe the status can be seen as part of the solution).
- "In addition to that, it is mostly impossible to indeed retrieve all triples from a SPARQL endpoint," => It is not clear that this is indeed a weakness of the Semantic Web, since this is by design to keep endpoints alive. Also non-Semantic Web sources can often not easily be fully retrieved via queries.
Minor:
- "authoes" => "authors"
- "wherefor"
- "sizes of triples" ?
- from unfulfilled 50 expectations) => replace bracket by dot
|