Review Comment:
Kamdar et al. extends upon previous research in “Investigating Term Reuse and Overlap in Biomedical Ontologies”. The reuse of terms is particularly important for interoperability and engineering of ontologies. Both papers conclude that term reuse (where class level terms are not shared among two or more ontologies) and term overlap (where two or more class-level terms are related) are minimal among 377 biomedical ontologies, either due to lack of guidelines or lack of tools to correctly reuse terms. Metrics for term reuse and overlap utilizes IRI, xref, and CUI references to identify specific terms. In this paper, the author furthers the previous study with a better metric (Equation 1) to calculate term reuse and term overlap to overcome some of the previous limitations. Also, the author introduces clustering analysis and log data from BioPortal Import Protégé plugin to validate user's term reuse intention.
In this study, Kamdar et al. summarizes that term reuse amongst a corpus of NCBO biomedical ontologies amounted to less than 9% of their total terms (6% for IRI, 5% for xref, and 8% for CUI reuse). This is slightly higher than the previous study of 5%, but nonetheless a low percentage. For term overlap, utilizing the equation and five overlap modules, the authors reported between 25%-30% term overlap that are also higher than the previous study of 18%. Overall, the new metric is said to have produced more precise figures, yet the same conclusion, as before, i.e., despite some term overlap, term reuse is minimal among biomedical ontologies. Furthermore, the authors employed clustering and BioPortal Import Plugin logs (based from 90 countries and 3538 unique IP addresses) to determine user’s intention of term reuse. The authors correlated the log data with the clustering analyses to support that ontology engineers reuse single terms and hierarchical subtrees, as well as, the reuse of terms from parent-child structures located in higher and upper-middle levels of the hierarchy from an ontology. Visualization tools and publication of the results can be found http://onto-apps.stanford.edu
While the paper highlights some new contributions and also improvements from their previous study, it would be recommended to have a subsection that discusses the previous study to give this current paper more context and clarity. This could be accomplished with a subsection near the Related Studies section, for example. In addition, section 3.4 needs more conceptual explanation. Aside from the clustering analyses and log data to support validation of user’s intent to reuse terms, the other findings are more or less similar to the previous study but with more precise figures to make the same claim. Overall, the study reveals some new findings relating to term usage in biomedical ontologies and innovative methods for the analysis of term usage.
|