Review Comment:
I would like to thank the authors for the quite detailed response letter for the previous round of reviews. The paper has improved a lot since the previous version and most of the review comments are either addressed or clarified.
This paper presents an approach called SHACLearner that extracts rules from a KG in the form of paths (Inverse Open Paths) which can be trivially converted to SHACL shapes. The approach is based on embedding-based open path rule learning. The main concerns regarding the previous review related to Originality, Significance of results, Quality of writing. The authors have addressed all three of those major concerns in this revision. There are still a few minor things that can be improved in the paper as listed below.
Other comments:
(1) Regarding the response to 3.4, while I agree that there are various formalisms for shapes defined in the literature to express diverse patterns, as the title of this paper, “Learning SHACL Shapes from Knowledge Graphs” suggests it focuses on SHACL Shapes. Thus, I believe it’s important to provide some insight on which SHACL features are covered, which can be covered theoretically but not covered due to time constraints, and which are fundamentally can’t be covered by the approach out of the constraint types such as value type, cardinality, value range, string-based, property-pair etc defined in https://www.w3.org/TR/shacl/.
(2) Regarding the response 3.6, while I understand the synaptic difference of encapsulating the type in an entity or in a predicate, I still fail to see the benefit of that. Why not follow the RDF/OWL convention of using binary predicates for defining types? What problem that is being solved by unary predicates that was not possible to solve using the binary predicates (similar to how the authors have done in Yago2)? Isn’t the notation P(e,e) for unary predicates conflict with reflexive properties in the KG. Including unary predicates includes a lot additional details to the method and makes the SHACL Shapes have unconventional target classes such as `sh:targetclass class:_;. Thus, I believe still a bit clear description why binary predicates were not possible to use with types has to be included in the paper.
(3) In general, there are several points in the answer letter that are explained there but not explained in the paper such as 3.8, 3.14, 3.21. It might be beneficial to incorporate some of those explanations to the paper itself.
(4) Regarding the response 3.25, even though I understand a fully-fledged human evaluation is out of the scope of this paper, a qualitative analysis by the authors of a smaller sample of discovered trees would still be beneficial for the reader to understand the usefulness of the discovered trees and also the limitations. Such analysis will really validate the results and give readers ideas about the current challenges and potential future work.
|