Review Comment:
Review of: "Repairing EL_$\bot$ Ontologies Using Debugging, Weakening and Completing"
The paper deals with the subject of repairing ontologies. It addresses the combinations of different variants of fundamental operations that constitute a repairing process. The paper introduces an interactive approach for repairing ontologies, and discusses the quality of the obtained repairs in terms of their correctness and completeness with respect to their entailments. Additionally, this work includes both an implementation of the discussed approach and an experiment.
I find the topic of this paper to be of great relevance to the journal.
The paper presents a framework for repairing ontologies that is general enough to characterize previous works on the subject. Given an EL bottom TBox and a set of wrong axioms, the framework allows the combination of debugging, removing, weakening and completing operations to obtain a repaired TBox that does not entail any of the wrong axioms. The framework also incorporates an oracle (user) in order to validate axioms during the entire process of repairing. The paper presents multiple variants of those operations, and discusses their trade-offs in terms of completeness and correctness of the repaired TBoxes, where completeness and correctness concern the entailments of the resulting TBoxes. The paper also introduces new algorithms for weakening and completing. Furthermore, it describes two systems: an implementation of the framework as an extension of the EL version of RepOSE, and a Protégé plugin for a specific instantiation of the framework. Additionally, an experiment evaluating the discussed approach is also included. The authors also provide access to implementations and resources used in this work.
This paper extends the authors' previous work. Here, the considered Description Logic is EL bottom instead of EL. Wrong axioms can be any entailed axiom instead of only asserted axioms. Additionally, the debugging operation has been integrated into the framework.
Overall, the paper is well structured and interesting to read. However, I am not sure if the provided extensions are sufficient to merit a significant delta. Furthermore, there are some issues that require clarification.
The authors assert that all proposed algorithms generate repairs as defined in Definition 1. However, I have concerns about this claim based on several observations. Notably, in the authors' previous work, debugging was not included in the repairing pipeline, and the set W of wrong axioms was assumed to be complete in a way that, once removed from a TBox T, T would not entail any axioms in W. The authors have this assumption in this paper as well. However, since debugging is now part of the framework, the set W of wrong axioms is not exactly the set D of axioms that need to be removed, which now have to be computed instead of being given as part of the input.
Furthermore, the paper mentions that the oracle used in the approach has no restrictions and may provide incorrect or inconsistent answers. This raises questions about the correctness of the set D, which is computed based on the oracle validations, as illustrated by Algorithm C14 for example. The possibility of oracle errors suggests that the set D might not be comprehensive, potentially affecting the reliability of the repair output. For example, if T = {A <= B, B <=C} and W = {A <= C}, an erroneous oracle could lead to a scenario where A = {A <= B, B <= C} and D = {}, resulting in an output T_r identical to the input T, which still entails the incorrect axiom.
I understand the reason behind not requiring the oracle to always be correct, as this mirrors the reality where domain experts can make mistakes. However, it should be clearly stated in the text and the algorithms that generating a repair is not always guaranteed.
I tested the previous example using the provided Protégé plugin. I noticed that, aside from the absence of warnings indicating the TBox has not been repaired, the plugin also requires the axiom A <= C to be asserted, which should no longer be a requirement.
Additionally, here are the remaining points that require further explanation.
- Preliminaries:
- individual names are introduced but they are never used in the paper, as the footnote also suggests. So what is the reason behind introducing them?
- P and Q are used here as arbitrary concepts, but later in the paper they are used as concept names. The authors make it clear that this is the case. For clarity, using different letters to differentiate between concept names and complex concepts might benefit the reader.
- Problem Formulation
- Definition 1: Let W be a finite set of TBox axioms in T ... -> The footnote on the first page indicates that, unlike in the authors' previous work, the wrong axioms in this paper do not need to be explicitly asserted. However, in this definition, "a set of TBox axioms in T" is understood as these axioms are indeed asserted.
- line 6: ... a repair for Debug-Problem DP(T,Or,W) ... -> Symbols like "Or" and "(A,D)" have been introduced before the definition but "DP(T,Or,W)" was not. It may benefit the reader to be introduced to all components used in the definition beforehand.
- Basic operations - debugging, removing, weakening and completing
- line 46: N_C^T and N_R^T -> These symbols are used here but introduced later in Definition 4. It would be easier for the reader if the meaning of these symbol is clarified when they are first introduced.
- line 24 - 26: ... where P, Q, R \in N_C^T and r \in N_R^T ... -> The authors note here that fresh concept names might be introduced. But with normalization, for example, added axioms might not adhere to having concept names only from N_C^T. Similarly with a weakening that can introduce new names.
- Combination strategies
- In Table 2, it is not clear to me what is the purpose of operations "R-none" and "AB-none". If no axioms should be removed (added), then why would one need a special process that removes (adds) nothing?
- page 10 line 20: T_1 \sqsubseteq T_2 -> Subsumption between TBoxes is not defined anywhere in the paper. I assume the authors imply T_2 \models T_1 with this notation. But it is unclear to me why entailment is not simply used?
- line 20: Der(T_1) \sqsubseteq Der(T_2) -> Why not use \subseteq instead of \sqsubseteq?
- line 24: ... sets of wrong asserted axioms D_1 and D_2 such that D_1 \sqsubseteq T ... -> Again, why not use \subseteq instead of \sqsubseteq?
- page 11 lines 10 - 14: I'm not clear on the argument presented here. Which computation is being referred to here? It would be helpful if this could be explained further.
- line 17: ... updating after each wrong axiom is the same ... -> Does this refer to the update after each axiom is weakened or removed, or something else. This requires more clarification.
- I believe discussing the impact of various operation variants and their sequence on the input size for the next operation in the pipeline, as mentioned in this section, is crucial. However, have the authors also considered how the order in which axioms are weakened affects the quality of the repaired ontology (and similarly for completing), as well as the overall quality of the repairing process?
- Appendix
- In Algorithm 11, what is the difference between the if statements in lines 8 and 11?
- Also in Algorithm 11, what happens when concepts Q, R or P are complex concepts? how are these concepts normalized?
- Resources
- Instructions on how to use the provided tools, as well as the data used in the experiment, are made available. However, I could not find any scripts or instructions for reproducing the results.
Finally, I've noted some typos and minor wording issues that can be improved.
- Introduction
- page 1 line 40: ... and the repairing. -> ... and their repair.
- page 2 line 2: In this paper we mitigate these effects of removing wrong axioms by, in addition to removing those axioms,... -> This might be a bit redundant, as it is already understood that certain axioms will be removed.
- Preliminaries:
- line 22: ... during the repairing and ... -> ... during repairing or during the repair process ...
- Problem Formulation
- line 6: ... a repair for Debug-Problem DP(T,Or,W) ... -> ... a repair for a Debug-Problem ...
- line 15: ... that formalize these intuitions, respectively. -> ... that respectively formalize these intuitions.
- reconsidering the line breaks in all definitions, particularly in Definitions 2 and 3, could yield a cleaner format.
- Basic operations - debugging, removing, weakening and completing
- The title of the previous section uses a title case style, whereas the rest of the titles use a sentence case style. Consistency across all section titles would enhance the document's appearance.
- line 43: ... simple complex concept set for a TBox T, ... -> ... simple complex concept set for a TBox T, denoted by SCC(T), ...
- line 14: It might look better if "T" were to fit in the previous line.
- line 31: ... removed from the ontologies ... -> ... removed from the ontology ...
- line 32: ... Figure 1 derived wrong axiom ... -> ... Figure 1, the derived wrong axiom ...
- Hitting Set: sometimes "Hitting set" is used, and other times "hitting set".
- line 38: ... for computing the justifications ... -> ... for computing all justifications ...
- Figure 1: Completion: wanted axiom \alpha2 \sqsubseteq \beta2 is replaced by correct axiom ... -> The term "replace" is somewhat misleading. It implies that an asserted axiom is swapped with another, whereas, in reality, the axiom is not and will instead be entailed as a result of completing.
- page 7 line 16: a similar issue to the one previously mentioned.
- Algorithm 1: Generate the justifications ... -> Generate all justifications ...
Line 3 in the algorithm, the spacing in "GenerateJustifications" looks a bit off, maybe using something like $\mathit{GenerateJustifications}$ could solve the problem.
"Or" is italicized in line 6 but appears in a non-italicized format in the algorithm's input section (this issue appears in all algorithms).
The titles of most of the algorithms provided in this paper describe in length what the algorithms actually do, which results in some lengthy titles, for example Algorithm C15.
- Combination strategies
- line 5: In this section -> In this section,
- line 6 and table 2: one at the time -> one at a time
- line 8: ... the influence of using different choices ... -> ... the influence of using different combinations ...
- line 13: ... between the choices for different combination strategies ... -> ... between the choices of different combination strategies ...
- Table 2: operations are not italicized, whereas in the main text, they are.
- line 47: ... to generate asserted wrong axioms ... -> ... to extract asserted wrong axioms ...
- line 19: ... as soon as one is computed ... -> ... as soon as it is computed ...
- line 23: Similarly as for weakening, ... -> Similarly, as with weakening, ...
- line 34: Figure 2b -> Figure 2c
- line 36: Figure 2c -> Figure 2d
- line 36: ... one at a time completing ... -> ... one at a time completing strategy ...
- Figure 2: It seems that a different font is used for the node labels which makes the style of operation names inconsistent with the one in the text.
- page 10 footnote: I think this should be moved to the main text.
- page 11 line 8: If one wrong axiom at the time is removed ... -> If one wrong axiom is removed at a time ...
- line 11: ... then they will be added back at the end or not. -> ... then they might be added back.
- line 17: First, we note that updating immediately ... -> First, we note that updating the TBox immediately ...
- line 26: When completing one axiom at a time ... -> When completing for one axiom at a time ...
- line 42: ... transformed into the sequence of operators of a second algorithm, ... - > ... transformed into a sequence of operators of another algorithm, ...
- pages 12, 15, 19, etc.: The document contains a significant amount of white space. It may enhance readability if the space were managed more efficiently.
- Implemented systems
- line 4: ... for repairing based ... -> ... for repairing ontologies based ...
- line 11 and 12: ... left/right hand concepts ... -> ... left/right hand side concepts ...
- The paper contains many long sentences; for example, the sentence spanning lines 8 to 12, or the one from lines 46 to 49, just to point out a few. I believe breaking such sentences into shorter ones would enhance the paper's readability.
- line 18: ... that focused on completing ... -> ... that focuses on completing ...
- line 22: ... and save ... -> ... and saving ...
- line 35: After loading the ontology, ... -> After loading an ontology, ...
- All examples in this section are labeled "Example." Numbering them, for instance, might improve their presentation.
- page 22 line 29: For the completion step ... -> for the completing step ...
- line 33: Figure2c -> Figure 2d
- line 35: ... we also implemented the function that the user can remove the specified wrong asserted axioms ... -> ... we also implemented a function that allows users to remove specified wrong asserted axioms ...
|