Rights to Richness: Connecting Cultural Artefacts with Rights and Context

Tracking #: 3908-5122

Authors: 
Ruben Peeters
Giacomo Blanco
Georgios Botsoglou
Isabel Davis
Xuemin Duan
Christian Fuhrhop
Tobias Hendrickx
Maurice Mengel
Deniza Popova
Tommaso Monopoli
Kyriaki Oudatzi
Theodoros Semertzidis
Elisa Simionato
Dieter Suls
Giuseppe Rizzo
Anastasia Dimou

Responsible editor: 
Guest Editors 2025 OD+CH

Submission type: 
Ontology Description
Abstract: 
The Cultural Heritage domain is actively engaged in digitising collections and building a central digital infrastructure, a critical effort supported by European initiatives to preserve artefacts and facilitate their reuse. Integrating diverse cultural heritage data into a central infrastructure is challenging due to the heterogeneity of the data and the limitations of existing data models in preserving contextual information. Furthermore, the lack of machine-understandable rights statements for digital artefacts hinders their reuse. This work addresses these issues by developing a framework that incorporates machine-understandable rights statements and enhances preservation of context during data integration. This paper introduces two key contributions: CACAO, an ontology extending CIDOC-CRM for rights and context with accompanying rights vocabulary and ARTKB, a knowledge graph utilising CACAO. The CACAO ontology was developed in collaboration with four CH institutes, validating its applicability and understandability across diverse data types. The ontology extends CIDOC-CRM by integrating ODRL for structured rights expression and by introducing new, more specific concepts for detailed contextual information. The CACAO ontology serves as a common data model, improving interoperability among CH institutions, enhancing the preservation of original context, and allowing for the definition of flexible usage policies for digital artefacts. This work represents significant progress in Intellectual Property Rights (IPR) management and contextual modelling for cultural heritage.
Full PDF Version: 
Tags: 
Reviewed

Decision/Status: 
Major Revision

Solicited Reviews:
Click to Expand/Collapse
Review #1
By Renato Iannella submitted on 26/Sep/2025
Suggestion:
Major Revision
Review Comment:

Specific comments:
Page 9 Line 19
Reference [26] to ODRL should be to the W3C ODRL specification
Note: Reference [26] is not correct. R Iannella is not the author of that book.

Page 12 Line 49
See the licenses at for CC0: https://rdflicense.linkeddata.es
These are good templates to follow for "instant" licenses

Page 17
For the InC-EDU1.0 "instant" license...
Line 41
- An empty obligation means nothing?
Line 42
- The action is missing?
Line 46
- An empty prohibition means nothing?

General:
Define a profile URI for the ODRL policy
Perhaps crm:E30Right is a subclass of odrl:Policy ?
The same for LegalObject (Asset) and Actor (Party) ?

The CRM model (Figure 3) seems to imply structure for Actors and Legal Objects.
These potentially could be mapped into the ODRL structure for Party and Asset.
For example, a E47Group is an odrl:PartyCollection ?

Overall, there is a lot of content packed into this paper.
I think it would be better to separate into 2 different papers
dealing specific to the ontology development, and the use of ODRL.

Review #2
Anonymous submitted on 30/Jan/2026
Suggestion:
Major Revision
Review Comment:

Summary. This paper presents CACAO, a Cultural Artefacts’ Contextual Ontology extending CIDOC-CRM with ODRL for machine-understandable rights representation and enriched contextual modeling, and ARTKB, a knowledge graph built using CACAO for the REEVALUATE project. The work aims to address two major challenges in Cultural Heritage (CH) data infrastructures: (i) loss of contextual richness during data integration, and (ii) lack of machine-readable rights statements hindering reuse of digital artefacts. The ontology is developed via stakeholder-driven requirements, competency questions, and alignment with established standards (CIDOC-CRM, ODRL, FOAF, Wikidata, EDM). The paper also discusses use cases and implementation choices.

Strong Points

* Rights expression and contextual interoperability in CH data spaces are interesting and real problems
* Reuse and extension of CIDOC-CRM rather than reinvention is a good choice too
* The discussion and positioning relative to EDM, Wikidata, Getty vocabularies, FOAF, and ODRL provides an overview of the ecosystem.

Weak points
* What makes CACAO distinctive beyond combining CIDOC+CRM and ODRL? What is the relation with [Ref-1] below?
* Evaluation if mostly design justification, rather than empirical validation (e.g. have the competency questions used for testing too?).
* A full diagram of CACAO’s core classes and properties is missing. There is Fig 4 but it difficult to read and does not contain many properties.
* Missing related work, not very clear novelty.

Comments/Questions

C1: Missing very related work:
[Ref-1] C Prandoni, M Valentini, M Doerr, Formalising a model for digital rights clearance, TPDL 2009, Springer. That paper proposed an extension of CIDOC CRM for digital rights management: Url: https://www.digitalmeetsculture.net/wp-content/uploads/2011/10/dro_paper...

C2: Motivanting examples: they focus more on usability, I would like to also see a complex rights-related use case

C3: Fig 1: probably the event-centric nature of CIDOC CRM should be stressed more

C4: Section 3.6: One related paper for Section 3.6 and system is
M. Mountantonakis, et al. , Why we need Ontology-specific Data Portals: A case study for CIDOC-CRM, SWODCH (Semantic Web and Ontology Design for Cultural Heritage) Workshop 2023. And the related portal: https://demos.isl.ics.forth.gr/CIDOC-CRM_Porta

C5: Table 1: too few competency questions that are related with right mgmt

C6: Figure 3 is somewhat unclear. It introduces subclasses of CIDOC CRM, but the properties (relationships) it establishes via odrl:function do not appear to align with the CIDOC CRM model. Specifically, odrl:party is treated as a subclass of E39 Actor, and odrl:Asset as a subclass of E72 Legal Entity. However, it is not clear what odrl:function, which links these two, corresponds to in CIDOC. Does it relate to E13 Attribute Assignment? Moreover, the definition of odrl:function is not provided. Based on a preliminary review, ODRL seems to function as a metadata model rather than a formal ontology. I believe there are gaps in this mapping. For comparison, the CACAO ontology primarily reuses CIDOC classes and properties, adding only schema-level classes that in CIDOC would typically be modeled as instances. In short, Figure 3 lacks explicit definitions for the proposed extensions, making the mapping and intended semantics unclear.

C7: Can DSSC (page 9, footnote 20) be considered an EC recommendation? Please clarify

C8: Evaluation of modeling is missing.

Review #3
Anonymous submitted on 02/Apr/2026
Suggestion:
Major Revision
Review Comment:

The manuscript addresses the integration of heterogeneous data from different sources in the domain of Cultural Heritage (CH). This domain presents particular challenges, such as the need for machine-understandable rights statements for digital artifacts and the preservation of contextual information during data integration. According to the authors, the manuscript presents two main contributions: (i) CACAO, an ontology extending CIDOC-CRM for rights and context, along with an accompanying rights vocabulary; and (ii) ARTKB, a knowledge graph that utilizes CACAO. The CACAO ontology aims to serve as a shared data model among different CH institutions.

The work is situated in a relevant context, namely the construction of a central digital infrastructure for Cultural Heritage based on the digitization of collections from Galleries, Libraries, Archives, and Museums. This aligns with important initiatives such as the European Collaborative Cloud for Cultural Heritage. In this context, the need for data integration and the establishment of policies for use, reuse, and access naturally arises.

According to the authors, although there are conceptual models such as the Europeana Data Model and CIDOC Conceptual Reference Model (CIDOC-CRM), these models tend to be abstract in their definitions, often requiring specialization for specific domains. Furthermore, the conditions for (re)use of digital artifacts are typically expressed in plain text, which may lead to misinterpretation and prevents machine interpretability.

Thus, the work aims to integrate data from REEVALUATE CH institutes (and external initiatives) into a central knowledge base. CACAO is intended to represent contextual information associated with artifacts as well as terms for rights statements, while ARTKB serves as the central knowledge base for REEVALUATE.

Despite the contributions of the manuscript, some issues need to be addressed:

1) Regarding Section 3.1, its role in the manuscript is unclear. None of the described top-level ontologies appears to have been directly used in the work. Therefore, it is not evident why they are presented in detail. This section could likely be removed without loss to the manuscript.

2) Section 3.6 appears to describe methodological steps of the ontology engineering process, particularly the identification of ontologies for reuse. Such content is typically part of the early stages of ontology development and would be more appropriately placed in a methodology section rather than under “Related Work.”

3) Similarly, Section 3.7 also seems to address methodological aspects and therefore may not belong in the “Related Work” section.

4) In my opinion, the methodological aspects related to the CACAO development process need to be strengthened. The description could be structured according to the activities and expected outputs of the Linked Open Terms (LOT) methodology:

4.1) The issues mentioned in points 2 and 3 above could be incorporated into a dedicated methodology section.
4.2) The development process could be illustrated with explicit inputs and outputs produced during CACAO development process.
4.3) In the first paragraph of Section 4, the authors state that “CH experts were consulted to formulate both functional and non-functional requirements.” However, the four requirements presented are not clearly classified into these categories. What are the non-functional requirements? Additionally, it is somewhat surprising that only four requirements were identified after consulting domain experts and ontology engineering specialists. One would expect a larger set of requirements. What explains this limited number?
4.4) The process of reusing CIDOC-CRM and ODRL, as well as the alignment/mapping effort between them, is not clearly described (as a step of CACAO development process). The ontology engineering literature provides established approaches for ontology alignment and mapping that could support and better justify how CACAO was constructed from these sources.
4.5) Regarding the alignment/mapping effort, most relationships appear to be modeled as subClassOf. Were other types of relations considered (e.g., equivalence, sameAs, overlap, covering)? If not, are they unnecessary in this context?
4.6) In addition to diagrams, mapping tables could be useful. These are commonly used to document correspondences between ontologies’ concepts and often serve as the basis for diagram construction.

5) Regarding Table 1:
5.1) Should not all competency questions (CQs) be related to requirements R3 and R4? These requirements appear to have a transversal nature, unlike R1 and R2. It is unclear whether R3 and R4 should be associated with all CQs or only some. Additionally, in cases such as CQ7 and CQ8, can R3 and R4 alone adequately capture the requirements? Could these be considered non-functional requirements?
5.2) To improve the understanding of conceptual coverage during CACAO development process, an additional column could be added to Table 1 indicating which ontologies (CIDOC-CRM, ODRL, or CACAO) address each high-level concept. This would offer a good notion of how each ontology contributed answer the CQ.

6) The authors could also discuss the benefits and drawbacks/difficulties of reusing and aligning/mapping ontologies for designing a third one. Besides the benefits, such processes often involve trade-offs and may introduce unintended side effects.

7) The design process of ARTKB is an important aspect of the work. However, given its relevance, it deserves more detailed treatment. In particular, the process of linking Wikidata entities to CACAO concepts and relations should be better explained. Was this process manual, semi-automatic, or automated? The manuscript describes what was done, but it could be more detailed on how it was done.

In summary, the manuscript presents a valuable contribution in a relevant context and is generally well presented. However, in my opinion, some issues need to be addressed.