Review Comment:
Overall evaluation
Select your choice from the options below and write its number below.
-1
== 3 strong accept
== 2 accept
== 1 weak accept
== 0 borderline paper
== -1 weak reject
== -2 reject
== -3 strong reject
Reviewer's confidence
Select your choice from the options below and write its number below.
3
== 5 (expert)
== 4 (high)
== 3 (medium)
== 2 (low)
== 1 (none)
Interest to the Knowledge Engineering and Knowledge Management Community
Select your choice from the options below and write its number below.
3
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
Novelty
Select your choice from the options below and write its number below.
3
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
Technical quality
Select your choice from the options below and write its number below.
2
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
Evaluation
Select your choice from the options below and write its number below.
1
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 not present
Clarity and presentation
Select your choice from the options below and write its number below.
2
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
Review
This paper discusses the limitations of current Semantic Web standards, such as RDF and SPARQL, and proposes a number of alternative technologies to enhance the development of Semantic Web. I appreciate the efforts of the authors who think out of the box and who try to find innovative solutions to the challenges of the Semantic Web. However, I found the authors' arguments not exactly convincing. I have listed my questions as below:
1. In the introduction section, the authors seem to be suggesting that HTTP and HTML provides more flexibility, while RDF and SPARQL do not. However, this contradicts to my understanding of these two technologies. RDF is a general data model which only asks data to be organized as subjects, predicates, and objects. SPARQL only specifies the general structure and syntax for queries. It is the data providers who decide which content should be organized or should be queried. This process is also similar to HTML, as it specifies a set of HTML tags necessary to annotate your document, and it is the data publishers who decide which content should be put into a HTML document.
2. The authors argue that SPARQL introduces a lot of cost on the server side, which is . While I agree that graph matching requires quite some computational resources, this does not sound like a serious problem of SPARQL. A lot of currently popular and advanced methods require a large amount of computational resources on the server side. The machine learning approaches, complex statistical models, or data indexing often require days, weeks, or even months to run on the server side. Yet, these expensive approaches are favored by many companies, not only big companies, but also small start ups. A major reason is that these methods are useful. Thus, the problem should be whether SPARQL is useful or not? If it is useful, the cost issue can be solved by other approaches, such as high performance computing.
3. The authors states "Use of the SPARQL standard in practice leads to low availability." This is a very hasty conclusion. I agree that the number of available linked datasets is not huge, but this may be due to many reasons, including the limited number of years' development since the LOD initiative. A simple conclusion like this is just unsupported.
4. The authors argue that SPARQL endpoints restrict the types of queries a user can ask. I agree that the types of queries that a SPARQL endpoint can support are limited by the ontologies used by the endpoint. However, traditional static HTML pages only allow a single way to query the data (i.e., simply present the HTML document to the end user). Although a SPARQL endpoint does not support infinite types of queries, it does allow the users to query data in many ways.
5. The authors state that "a single data store provides a single point of failure in the case of unanticipated circumstances." This is not an issue of the semantic web standards, but an issue of infrastructure construction. A data provider can simply avoid the failure of a single data store by providing multiple and duplicated data stores on different machines, and use load balance strategies to evenly distribute requests.
7. The authors suggest that "This means that, in practice, individual data providers are handing over their potentially private data to a
few big disseminating entities, thereby handing over (part of) their agency and ownership of the data as well." Again, I think this is not an issue of the current Semantic Web standards, but about data licenses and policies.
8. "However, putting together such multi-perspective data into one spot makes it difficult
to tell these different views apart." It's difficult to agree with the authors on this opinion. When examining DBpedia entities with two different rdfs:label strings, it's not difficult to tell that such entities have incorporated more than one perspective.
9. The authors proposes ERS as an alternative for data publication. However, the description of ERS is sketchy, without stating how it is going to improve the integration of multiple views, and what better features it can provide compared with the existing Semantic Web standards.
10. The alternatives, such as triple pattern fragments, eRDF, DataHives, look very interesting. I appreciate the authors for proposing them as new possible approaches.
|