Review Comment:
The paper presents a “survey of models and architectures for restricted and local access to the Web of data”. The authors attempt to enumerate and discuss existing and possible approaches that could be used to facilitate RDF access, particularly in remote communities with limited internet connectivity.
In general, the idea is valuable and the paper is well written.
However, the are several points that should be addressed before it reaches the level of being published.
I tried to collect them on different topics below.
== Title and Motivation
It is not clear to me in which sense the approaches listed in the work are particularly fit for “mobile” data access. In fact, I could not find a single approach designed exclusively for mobile applications.
I would rather remove the word “mobile” from the title.
I also would like to see other examples of “offline” Linked Data usage.
For instance, historical facts (such as birthdates or birthplaces) do not change and some knowledge bases are updated monthly or yearly.
It seems to me that there are several scenarios in which online query capabilities are not really necessary.
==Content
The first concern is that the authors mixed peer-to-peer architecture and client-server. They are different and should be distinguished. For instance, approaches relying on SPARQL use the client-server architecture, and seems to me that all RDF approaches cited in your paper are client-server based. I think the different types of architectures deserve a discussion, perhaps a subsection. Youcan later related architectures, protocols advantages, and disadvantages I provide a list of peer-to-peer RDF solutions later in this section.
I would hardly advocate to change your survey for something like “Survey of Models and Architectures for Linked Data Access” and complement the missing works, e.g. HDT (https://www.rdfhdt.org/publications/) and Linked Data Fragments (https://linkeddatafragments.org/publications/) (i.e. https://ieeexplore.ieee.org/abstract/document/8334472).
One of my biggest concerns is regarding the method that was used for selecting the works that were surveyed. There are a lot of works from the Semantic Web that were ignored and do deserve a citation and discussion.
I highlight some of them in the list below.
Please revise and check if I am not missing something else.
Create a systematic method for finding the related work and discuss it in an early section, it should look like chapter two of this work https://www.researchgate.net/publication/284200058_Quality_assessment_fo....
It would also be helpful if you provide the method used for finding and selecting the related work in the introduction or a separated chapter.
I suggest organizing these works into different categories:
-> Cache
I would avoid citing cache systems, they are often part of distributed architectures, but they are not the same. Cache systems operate over tiny chunks of data mapped to a particular query. Thus, can not provide subsets over the cachable data. There are also queries that never happen i.e. users do not query the whole datasets. However, if you think that these approaches are important, I suggest revising your survey and include several works missing. Do not forget to discuss the relation of these systems with the architectures themselves.
A Survey of HTTP Caching Implementations on the Open Semantic Web
https://link.springer.com/chapter/10.1007/978-3-319-18818-8_18
Improving the performance of semantic web applications with SPARQL query caching
https://link.springer.com/chapter/10.1007/978-3-642-13489-0_21
A cache-based method to improve query performance of linked Open Data cloud
https://link.springer.com/article/10.1007%2Fs00607-020-00814-9
Graph-Aware, Workload-Adaptive SPARQL Query Caching
https://dl.acm.org/doi/10.1145/2723372.2723714
-> Architectures
KBox — Transparently Shifting Query Execution on Knowledge Graphs to the Edge
https://www.researchgate.net/publication/305410480_KBox_--_Transparently...
A decentralized architecture for SPARQL query processing and RDF sharing: A position paper
https://ieeexplore.ieee.org/abstract/document/8334472
A Decentralized Architecture for Sharing and Querying Semantic Data
https://link.springer.com/chapter/10.1007/978-3-030-21348-0_1
-> Other relevant works
A Demonstration of the Solid Platform for Social Web Applications
https://dl.acm.org/doi/10.1145/2872518.2890529
A Survey of Structured P2P Systems for RDF Data Storage and Retrieval
https://link.springer.com/chapter/10.1007/978-3-642-23074-5_2
Query Processing in RDF/S-Based P2P Database Systems
https://link.springer.com/chapter/10.1007/3-540-28347-1_4
==Format & Structure:
I found it extremely difficult to evaluate the approaches that you listed in your work. You mixed models and architectures of the general Web with some specifically designed for the Semantic Web. Please organize it differently.
Chapter 3 and 4, give an overview of techniques for the Web and, in a different chapter, discusses methods that are used in Semantic Web, relating them with chapters 3 and 4. That will make it much simpler to read your work.
==Writing
On several occasions, you do not use space between citations e.g. “protocols[13]”. The proper way of doing it is by adding a space between the last word and the citation. Please use the macro “~\cite{}”.
That will ensure there is a space and that the citation does not occur in the other line.
In general, the text is well written except for few words that seem to be written in old archaic English. I would recommend using the modern form: amoung (among), signalling (signaling), behaviour (behavior), availabillity (availality)
I also would suggest the use ‘Z’ which is accepted in both American and British English instead of ‘S’ in some words: decentralised
The second suggestion is to remove etc from your text, if you are just giving some examples you really do not need it. A sentence “models, architectures,etc.” can be written, “such as models and architectures”. It reads better.
Some of the spelling mistakes are, but not limited to:
-Title
“Survey of Model…” -> Survey of Models...
- Abstract
“could contribute to solve...” -> could contribute to solving
- Introduction
“Overcomed” -> overcome
“manipulate Web contents” -> manipulate Web content
“By analogy to traditional Web (of documents)” -> By analogy to traditional Web (of documents),
“responses to users needs” -> responses to users’ needs
“Furthermore” -> Furthermore,
“nodes failure” -> node faulure
“In such model,” -> In such a model,
“the number of the smartphone “ -> the number of smartphone
“In recent years, peer-to-peer” -> In recent years, the peer-to-peer
“classes of the superposed networks” -> classes of superposed networks
“problem of the intermittent access” -> problem of intermittent access
“For each of these approaches” -> For each of these approaches,
“Section 3 discusses logical” -> Section 3 discusses the logical
-Section 3
“peer-to-peer networks have made” -> peer-to-peer networks has made
“gossip based” -> gossip-based
“the peers selection” - the peers’ selection
Section 3.1.1
“Ganesh et al. [14] presents” -> Ganesh et al. [14] present
“Jelasity et al. [11] introduces” -> Jelasity et al. [11] introduce (and many more from this mistake)
“that are pronned to malicious behaviour” -> that are prone to malicious behavior
3.2
“members of same cluster” -> members of the same cluster.
“a geographical, semantic, profile” -> a geographical-semantic profile
*I think “amoung” (used several times in your text) is not used anymore, the right spelling is “among”
“This results on a structure named the target graph” -> These results in a structure named the target graph
“connected and of periodically providing” -> connected and periodically providing
“The sampling approch follows” -> The sampling approach follows
“relies on a attack-resilient” -> relies on an attack-resilient
“as an computational entity that” -> as a computational entity that
-Section 4
“In this section we” -> In this section, we
“applied to solve data intensive problems” -> applied to solve data-intensive problems
“devices/sensors (users) as an middle” - > devices/sensors (users) as a middle
“consists of both linux kernel” -> consists of both Linux kernel
“unit gathers raw sensory data and execute” -> unit gathers raw sensory data and executes
“and pass them on to the higher level fog unit” -> and passes them on to the higher level fog unit
-Section 5
“(models, architectures,etc.)” -> (models, architectures, etc.)
|