Review Comment:
This manuscript was submitted as 'Survey Article' and should be reviewed along the following dimensions: (1) Suitability as introductory text, targeted at researchers, PhD students, or practitioners, to get started on the covered topic. (2) How comprehensive and how balanced is the presentation and coverage. (3) Readability and clarity of the presentation. (4) Importance of the covered material to the broader Semantic Web community.
This paper is very insightful about the historical reality and the complexity of modelling place-related entities.
An overall excellent analysis, a good introductory text for researchers and practitioners. The coverage of aspects of complexity of place-related historical phenomena is very good, in particular all things that do not fit well to most of the current practice of designing gazetteers, but also naive application of logic. Its value lies in this deep understanding of the domain, more than in the analysis of the ontological representations.
Since there is currently not any satisfactory implementation for cultural spatiotemporal semantic gazetteers, all struggling with inadequate simplifications, this paper is quite important for the Semantic Web. It is well written.
The literature references are rich and quite comprehensive. I suggest the authors to add Papadakis, M., Doerr, M., & Plexousakis, D. (2014). Fuzzy Times on Space-Time Volumes. eChallenges e-2014, 2014 Conference, Belfast, 29-30 October 2014 (pp. 1-11). IEEE (978-1-905824-45-8). The latter may give the authors a further understanding of the possibilities of the CIDOC CRM and its extensions.
The paper could say a bit more about topological relations - spatial, temporal and spatiotemporal, and that and how semantic relations and topological ones can justify each other, such as creation becoming a terminus antequem for use etc.
I would like to shortly comment some fundamentally alternative views the authors may still at least mention, ones they are core ideas of the CIDOC CRM, being suited to resolve a lot of the riddles about places to my opinion:
The paper uses Yi-Fu Tuan's definition: "a center of meaning constructed by human experience". Personally, I regard this well received definition a major source of ontological confusion.
A clearcut concept of identity must be based on an adequate definition of substance (see David Wiggins,"sameness and substance renewed", ISBN-13: 978-0521456197
ISBN-10: 0521456193). Here we confuse the substance of (1) geometric extent, (2)the materiality present or activities going on within a geometric extent or the human intentions about a geometric extent, and (3) the relationships of meaning.
page 2: "The name as well as the location of a place is just a designation, while the cultural and temporal setting in which it has certain properties, make it unique as an entity".
The identity providing substance is not that of "place", but that of the concepts (2) above. What is observable are the latter. Place is where they happen to be, not what they are.
What makes them a "place" is a psychological-linguistic projection, like a pars-pro toto, focussing on things having spatial projections and some spatial stability, and herefore calling them "place". Tuan confuses the identity of "having a place" with "being a place".
The authors should mention, when describing the CRM, that E53 Place has this different definition (dependent pure geometric extent), and because of that it can describe the relations to events and spatial ambiguity or indeterminacy in a consistent way:
The authors write: "Such a contextual approach offers a more accurate description of what distinguishes
a place from another one." What is the substance of this context that provides an objective distinction?
and Page 3: "Because most places and territories prior to the 19th century lack clearly defined borders, this approach has many advantages for modelling the fuzziness of historical places."
Also, a lot of subjectivity and domain-specific views ("This paper will take a step back from this global perspective") the authors describe would not be necessary if the substance of place were adequately defined. A medieval rulership is a quite objective fact, comparable to ruling all over the world.("One should note, that such a conceptualization is not universal but represents a specific view on reality [14, p. 84].")
I suggest the authors to question a bit more the concept of "hierarchical relations" on page 3 and page 5. Firstly, the phenomena described are often not a tree structure, but rater a DAG. Secondly, what is called hierarchy is a semantic compression of geometric inclusion (missing overlaps), and part-of of material or social phenomena. The idea to press named places into a hierarchical structure simultaneously being geometric inclusion and semantic part-of, as if they were terminology, is a major cause of inadequacy of current gazetteer models. Semantic part-of can justify geometric inclusion, but not otherwise round. Semantic part-of has different semantics for different phenomena, such as administration, building parts of geological areas. IsA hierarchy of terms is not among them.
Page 4: "Firstly, one can distinguish between fiat objects and physical objects. Fiat objects are virtual spaces conceived by men [4, p. 135]." The authors should notice that the CRMgeo declarative places are "fiat" objects. The opposite are "bona fide" objects, which are the phenomenal places, including extents of physical objects.
I regard this as wrong: "But less clearly defined spaces such as the Finno-Ugric language group or ’the Christian world’ can be understood as fiat objects, too.". It confuses fuzziness with virtual conception. The extent of the Finno-Ugric language group is defined by observation, not by conception, all be it fuzzy and prone to subjective differences.
Page 4, third and last paragraph is confusing and should be rewritten: "Place concepts like duchy, republic, prince-bishopric, or parish are specific terms, and their meaning is related to a certain historical context, while general concepts can be understood as concepts of broader categories." and "The assumption that a duchy and a republic can be grasped by the same concept, like secular dominion, is a very broad historical simplification."
The authors confuse here the question of (1)IsA generalization, with the (2) adequacy of specialization, and (3) the use of terms in a system. All concepts have broader generalizations (IsA). No concept grasps the specificity of an individual. Even "duchy" doesn't. The question is the adequacy of the term for the intended documentation. "duchy" and "republic" are secular dominions, regardless all obvious differences. The idea that a classification makes a prejudice about missing more detailed features is wrong. It may only be insufficient for the purpose of a documentation system, but not a "historical simplification"
Page 5: . "A more accurate way would be to model ruling as relations between agents, privileges and places."
In the CRM, this is modelled either as instance of Period or Activity.
page 7: "Using GIS practices as an example, the easiest way to model time is to understand the whole dataset as a
representation of the world at a certain point in time." In principle, this is really never known, not just hard to achieve. It will always be an interpolation of events and fuzzy boundaries.
Page 8: "Like the Julian and Gregorian calendars, these can exist simultaneously.": It is more practical to normalize all calendars with known relations to the modern one. Only Egyptian kings lists etc. pose a problem of unknown point of reference.
"Sometimes it may not be possible to state a date as a definite starting point of an event, but only a terminus
ante quem (or post quem);" : A date is neither a point, but both a terminus ante quem and post quem, from the beginning of the day to its end. Rather, all time information should be understood as approximations, never "definite". To regard a day a point in time is an arbitrary restriction of precision.
"This could be achieved by either simply adding a fictitious or mythological tag to the affected places, or
by using a copy of a class tree with an actual root for first and a fictitious root for the second." :
A tag is ontologically a bad solution, because fictitious items do not behave like real ones. There are no fixed topologies, they have unlimited conceptual variants etc. Cases, in which real places inspire fictitios ones should be modelled by linking.
See also: Theodoridou, M., Bruseker, G., Daskalaki, M., & Doerr, M. (2016). Methodological tips for mappings to CIDOC CRM. 44th Computer Applications and Quantitative Methods in Archaeology Conference (CAA 2016) "Exploring Oceans of Data", Oslo, Norway, March 29 - April 2, 2016.
page 10:
"The second one, SP6 Declarative Place, is the place as we represent it, based on historical source, archeological findings or guessed approximations."
This is not correct. It is not "THE place as...", SP6 is A place in its own right, a FIAT place in the proper sense. It can be used to approximate a phenomenal place. All geometric representations of Phenomenal Places are declarative approximations. In case of legal claims of land, the claim may be described in a purely declarative way. The words "declarative" and "fiat" are virtually synonymous.
Page 11, fig 1: the P160 property does not go from E4 to E92 but only from E92 to E52 and E4 is a subclass of E92
Page 16: "The distinction between fiat places and physical places as well as general and specific concepts is done only by the GOV ontology." See above, except for using the term "declarative" instead of "Fiat", the CRMgeo makes exactly the same distinction, even better, because it generalizes to phenomenal, which is more consistent with historical concepts.
"All ontologies introduced in this paper solve the problem of multiple names. However, only Pleiades takes into account that names retrieved from historical sources can be flawed data and should therefore be modeled as such."
The CRMinf extension of the CIDOC CRM describes rich belief states in general, and FRBRoo describes name use activities of the past.
"By separating the historical source from the editor of a dataset, only the Pleiades ontology allows in parts
a model of provenance considering the needs of academic research."
This is not correct. CRMInf describes provenance and inferencing. It relies on a Named Graph approach. CRM itself has the Attribute assignment construct to describe provenance, a sort of reification.
Page 18: "As shown with the Pleiades ontology, it is to be preferred to also represent the trustworthiness of a source." CRMinf introduces the concept of "Belief Adoption".
In general, Provenance of knowledge should not be integral to some domain specific ontologies, because the respective epistemology is generally applicable to any class and property of many domains. Adding provenance to place-specific classes confuses the situation, because it leaves others without provenance, which cannot be. There is, e.g. PROV-O as general purpose provenance ontology, but not necessarily geared for historical discourse.
|