Review Comment:
Overall evaluation
Select your choice from the options below and write its number below.
== 3 strong accept
== 2 accept
== 1 weak accept
== 0 borderline paper
== -1 weak reject
== -2 reject
== -3 strong reject
-1
Reviewer's confidence
Select your choice from the options below and write its number below.
== 5 (expert)
== 4 (high)
== 3 (medium)
== 2 (low)
== 1 (none)
5
Interest to the Knowledge Engineering and Knowledge Management Community
Select your choice from the options below and write its number below.
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
4
Novelty
Select your choice from the options below and write its number below.
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
4
Technical quality
Select your choice from the options below and write its number below.
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
3
Evaluation
Select your choice from the options below and write its number below.
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 not present
3
Clarity and presentation
Select your choice from the options below and write its number below.
== 5 excellent
== 4 good
== 3 fair
== 2 poor
== 1 very poor
2
Review
Please provide your textual review here.
The paper address and important and pertinent problem, i.e. the acquisition of knowledge from non ontological sources in order to bootstrap the ontology construction process for those domains that are rich in structured or semi structured data, or published as LOD. These non ontological sources can contribute to different stages of the ontology engineering lifecycle. This paper proposes to start building a new ontology by developing a module through ontological patterns and details a transformation process that extracts knowledge from multiple non ontological sources to adapt and enrich the module with ontological elements that are common across the sources, and hence are deemed trusted.
The main issue with the paper is that the presentation is high level, and not provide sufficient details to fully assess the relevance of the contribution and to assess the experimental results provided in the evaluation. The proposed process assesses the quality of knowledge that is common to the different non ontological sources according to three criteria: source reputation, source freshness and its level of agreement with the module used to bootstrap the ontology construction. These are just some of the many measures proposed in the literature. However, these measure are only described in words, and no precise mathematical formulation is provided with their formal characteristics, but the paper only mentions the weighted aggregation function. It would be useful to have a much more precise definition, and especially of the similarity between the sources and the given module. This impacts the experiments, because it is not clear the effect that each of these measure has on the evaluation and whether changing any of the used measures would change the results obtained.
One other issue concerns the merging of the different knowledge bases using logmap. One of the issues with aggregating redundant knowledge sources is that when aligning them the resulting alignment might present more candidate mappings for each given entity in that is being aligned. Logmap uses reasoning to resolve some of these cases, but for some mappings it requires some form of validation by the knowledge engineer. Is this an aspect that affects the proposed process, and if so, how?
More detailed comments
Page 8, Source quality definition. The mathematical formulation of the quality measures and criteria proposed are missing.
Page 10, definition of degree. If the degree(e_i, e_j) is 0 does this mean that there is no mapping or that there is no confidence or trust in the mapping between e_i and e_j? These are two different aspects, and the paper should be more clear. If a different alignment system was used, could the value of the degree function be different and hence could the mapping be considered?
Page 10, Figure 4: is x i-j on the arcs in the graph the similarity between xi and xj (i,j: 1..3)?
Page 10: definition of the sigmoid function. The sigmoid function is used to normalize the intuition of confidence, but wrt what?
Page 11: The relation hasHigherRank is extracted but not fully explained What is the role in the transformation pattern. If this is not always hierarchical it should be defined better, is the measure module dependent? is it just about generality?
Page 13, section 5.2: Only candidate with a trust score of 0.9 are extracted. How is this value determined? Have you tried varying the trust score to see whether there is a minimum trust score below which the experimental results degrade? Why not consider a trust score of 1 to consider only the entities that are certain?
|