Review Comment:
This paper defines an extension of the PROV standard for modeling the provenance smart city applications, making them accountable. The paper exploits the usage of the vocabulary extension proposing a) REST API that implements a reputation service; and b) an explanation service which aims at providing human readable explanations for the terms captured by the vocabulary. Both functionalities are explained with examples.
The paper is well written and very relevant to the call (having semantics, ontologies, semantics for citizens and provenance). I have found some typos though, which I highlight at the end of my review. I also consider the work novel, as it is the first work (that I am aware of) that creates human readable representations from provenance statements. However, I have some concerns with the paper, which I list below. If the authors address my issues, I'll be happy to accept the paper as part of this special issue.
1) The paper makes claims at certain points that are not true, as there is no evaluation. For example "this paper proposes... data elements in a narrative form so it can be EASILY DIGGESTED by users", "...state of the art frameworks tend to focus on logging this information, but do not present in an EASY TO UNDERSTAND FORMAT TO USERS" (this last one is an implicit claim). If a user evaluation showing that the proposed narratives produce explanations that are more useful for users than plain statements is not provided, then any statement similar to the ones I have highlighted should be modified in the manuscript.
That said, I think that such evaluation would make this paper very strong. And I am particularly curious on whether users would prefer this kind of representation versus a table with a set of selected property values, which sometimes is easier to navigate rather a full paragraph.
2) Where are the requirements for the narrative? I found some general requirements in Section 4, but I don't understand why a particular narrative was chosen, and who validated it. Was it validated just by the authors or someone else? For example, for the PROV narrative there are statements that seem unnecessary (e.g., stating that a resource is an entity may not add much value to some users). Maybe it would be enough saying that a resource was derived from another, etc. Different users might have different needs at different granularity. Please discuss this in the paper.
3) There is a missing discussion on how the vocabulary and templates can be extended (just briefly in the conclusions). How difficult would be for anyone trying to adapt this approach to reuse it? Has the prov narrative been used in other contexts?
4) The definition and semantics of the vocabulary is confusing. The authors referr to the model used as a vocabulary. Is this an RDF vocabulary/ontology? I haven't found barely any descriptions at all in http://smartsociety-project.github.io/cas/. It looks like an ongoing documentation (incomplete at the moment).
There seem to be some inconsistencies in the naming: a plan is a prov:Entity but not a prov:Plan? sending_request is an activity, but sending_negotation_response is an entity? Some other activities have a "_activity" at the end. Please use a consistent naming scheme.
I am also a bit confused on the "roles" in the definition of the ride share vocabulary. PROV introduces the notion of roles, which are not types of agents. Are the authors mixing these notions? If a driver is not a role played during an activity, the please avoid using that word in its definition. It can be confused with the notion of role in prov.
5) In the related work the authors say that some related work influences the Smart City vocabulary. How does this happen? I haven't seen any reference when they introduce the approach. Also, last paragraph just enumerates other approaches, but doesn't differentiate them from the current work. Why aren't any of these reused? A model can expand PROV and other existent ontologies as well...
6) I haven't been able to see the figures 5, 6 and 7. With the printed copy is not enough (too small), and the prov store returned a 404 for all the 3 urls.
Minor concerns:
1) Some resources are not available: the images I mentioned above, the REST API (for at least testing the proposed approach) and the documentation of the model.
2) Examples are often not explained or referenced in the text. If an example is added please explain it briefly in the text.
3) The tables require a subject, but the sentences assume that all the provenance information is available. What would happen if the provenance information would be partially missing? Would the sentence only appear in part?
4) Some of the produced paragraphs are quite verbose. Maybe the verbosity would be reduced if the labels of the resources were shown instead of their ids. I wonder what would happen if the ids were urls, probably the text would become very difficult to read. Another question related to that is why not have identifiers/uris that are resolvable instead of having to create an additional GET operation for the ids in the REST API? It would be more simple and compliant with the semantic web principles.
5) I would not use the wikipedia definition for "smart city". Haven't the authors found a better one?
6) Some of the urls used across the in the footnotes are very long. I suggest shortening them with https://goo.gl/ (for example)
7) I found the second paragraph of section 3 quite confusing and verbose. I think it would benefit from some rewording by the authors.
8) The term "accessible account" is never introduced before section 4. I think that I understand what the authors mean, but I would appreciate it if they defined it first.
Typos:
I have found these typos/suggestions. I recommend the authors doing a proof read before resubmitting the paper for the next round of reviews.
1 Introduction:
"they mediate access to real people" -> wouldn't "human users" or something like that would be more appropriate?
"for unlicensed drivers.". -> extra full stop
"reputation systems"-> missing full stop.
References 19 and 20 are used for defining "provenance". Use provenance[19,20] or remove one.
"Prov is a W3C standard [...], which can be a PROV entity, activity or agent. However, [...]" -> I don't understand the use of "however" here.
2 Background work:
"It is well suited to describing provenance data"-> describe (or for describing).
". While, the work presented in..." -> isn't Meanwhile more appropriate here?
3 Smart City Vocabulary
"The type information allows..." -> The type of the information allows...
"facilitating the leverage the information in the provenance..." -> facilitating leveraging
"a Smart City application needs to be able to describes..." ->describe
4 Accountability as a service
"It is important that order to support this" -> in order to support this.
5.1 Feedback and reputation reports
"There will be an investigate" ->there will be an investigation
5.2 Ride Share Vocabulary
"when a feedback is submit" -> when a feedback is submitted.
6 Ride Share Accountability Use Cases
"the explanation peer to generate two narratives" -> the explanation peer generates two narratives.
Table 10: The first sentence goes into the second column
7 Conclusions
"board class of applications" -> broad class
"Firth, investigate..." -> First, investigate.
"json-DL" ->JSON-LD.
|