Review Comment:
Summary:
The paper describes the development of an ontology to represent the data in the public transportation domain, particularly passenger transportation by buses. It covers three major aspects, namely (1) agencies, and operators, (2) lines, routes, and journeys, and (3) journey scheduling. The ontology is aligned with the Transmodel standard, an international standard for public transportation data exchange. The paper is highly relevant to the special issue, and the approach overall appears to be technically solid in most places. However, some issues (e.g., lack of a proper literature review, concrete use cases, and stakeholders) make it hard to judge in specific review dimensions. Therefore I suggest that the paper goes through a major revision.
Design principles, methodologies applied at creation
The LOT methodology is used. The methodology and its adaptation have been explained overall adequately. However, I still see a few relatively significant issues:
* Where are the restrictions? The authors claim that a previous attempt to engineer an ontology based on the Transmodel standard had its drawbacks. One of these drawbacks is the implementation of the restrictions. Does the proposed ontology implement these restrictions? How are they evaluated? Is any language other than OWL (e.g., SHACL, assuming the restrictions are seen more as constraints in Transmodel) used?
* The evaluation and requirement specification is only briefly covered (and no supplemental material submitted with the paper). Who are the domain experts? Who are the stakeholders? There are external links to the competency questions and use cases; however, there are only four questions and three use cases, which are not linked together explicitly. Are these questions comprehensive enough? Where are the generated Knowledge Graphs that are used in the evaluation mentioned in the paper? Did the queries formalizing the competency questions run on a triple store with OWL DL reasoning activated? It is claimed that OOPS! Report showed only minor issues. I do not see any link to this report, so what "minor" means is not very clear. For example, when I run the Hermit reasoner with Protege on the ontology, there are some impossible (equivalent to owl:Nothing) class definitions, e.g., Authority is a subclass of Organization, and these two classes are disjoint). Some properties are a sub-property of every other property in the ontology (e.g.viewedAs. not sure if this is the intended meaning). (Disclaimer: I downloaded the ontology from the linked website on the 4th of December 2020). In general, it would be beneficial to expand the requirement specification and evaluation sections slightly and provide supplemental material (e.g., OOPS! Report) that clarify the evaluation and requirement specification.
* The NOR transformation step is depicted as an extension to the LOT methodology, which is based on the NeOn methodology. According to the paper, NOR re-engineering pattern is already a part of the NeON methodology, then how is it an extension to LOT?
* This is more of a design question for my understanding: Why is the Incidence type a part of the ontology's Organization module? It appears to be more suitable for Part III as it affects the journey of a bus.
Comparison with other ontologies on the same topic
There is no proper related work section. Only the ontology produced by the SNAP project has been analyzed. Given the long line of smart city projects at both the EU and national level, there must be many other ontologies (e.g., [1]) that can be discussed with this work.
Pointers to existing applications or use-case experiments
The use cases and applications are provided minimally. Only very generic use cases are mentioned in the introduction and a bit more in Section 5. There is an external link to the use cases in which the descriptions are not much more detailed than in the paper (Also in Spanish, so I can tell as much as Google Translate allows me). Given that the work is done in the scope of a national "Open data for Smart Cities" project, there should be some concrete use cases, foreseen applications, prototypes, etc.
Convincing evidence for the quality and the relevance of the described ontology
The ontology is aligned with an international standard and reuses several existing ontologies. This fact already indicates a potential wide-adoption. However, judging the paper in this dimension is particularly challenging because there is no proper related work section (only some reused resources are reviewed in detail). The issues mentioned above in "Explanation of Design principles, methodologies applied at creation" makes it hard to judge the quality. The limited concrete use case descriptions and the missing alignment with other international efforts in the field prevent the full evaluation of the ontology's relevance.
Illustration, clarity, and readability of the paper
The paper is overall well-written and understandable. However, some sentences are too long. The first sentence in the introduction already takes ~10 lines!
There are some small typos. The paper can benefit from proof-reading. Examples:
* page 7 line 22 500 mt ratio -> 500 mt radius
* page 4 line 44 idea if -> idea of
* page 5 line 26 based of -> based on
The structure of Section 5 does not reflect the steps of the methodology. It would have been easier to follow if each step was a subsection.
The acronym GTFS is used first in the abstract; however, explained later in the paper.
The introduction should contain the answers to the What, How, and Why questions. However, How is too detailed and can be covered in the sections where the ontology development is explained in detail.
There are also some issues with the references and namespace prefixes:
* A citation to the RDF Mapping Language (RML) is missing.
* Some namespace prefixes are not explicitly connected with an * ontology (e.g., estraf, sf). Overall the representation of reused ontologies and namespaces could be improved.
* "GeoSPARQL location design pattern" is mentioned several times, but I did not see any footnote referring to this specific design pattern.
The concepts of "incidence" and "incident" are mixed up. One means the frequency of occurrence; the other refers to something that occurred. Nevertheless, I assume this confusion comes from one of the reused ontologies. Maybe an explanation with a footnote is in order.
In Table 1, the last row has never been mentioned in the paper.
[1] Katsumi, M., & Fox, M. (2018). Ontologies for transportation research: A survey. Transportation Research Part C-emerging Technologies, 89, 53-82.
|