Review Comment:
The work presented in this article is interesting and very relevant for the Semantic Web community and provides a solution to improve the completeness and performance of virtual OBDA systems. Below are my comments per section.
# 1. Introduction
What are the operators mentioned in the goals?
# 2. Motivating example
The performance issue is only explained in the caption of the figure and not in the text. It helps the reader to also explain it in the text.
# 3. Ontology Based Data Access Over Tabular Data
How do the properties of 3.2 align with the challenges of 3.1?
# 4. The Morph-CSV Framework
Can you add an example of what is explained in 4.2, because else it is pretty hard to understand.
For #answers(eval(Q, theta++(VTD))) >= #answers(eval(Q, theta(VTD))) missing closing bracket for the first.
For #time(eval(Q, theta++(VTD))) <= #time(eval(Q, theta(VTD))) the equation says <= but the text says <
Figure 3 uses "Tabular Dataset" during Source Selection, but how are updates of that dataset processed because it was mentioned in Section 3.1 that for other systems this leads to performance issues.
For Figure 4b it would be good to mention that YARRRML is used, because not everybody might know that and they might expect something like [R2]RML.
On page 10 line 26 what is the problem here actually? That is not clear from the example.
# 5. Evaluation
Nice that there are research questions, but it would be good to have corresponding hypotheses that are checked during the evaluation.
What is meant with research question 3? I don't understand the "different levels of data heterogeneity".
Why are Ontop and Morph-RDB chosen for the evaluation?
Regarding the metrics, I don't see anything about the actual query results. Are all queries correctly answered? This should be discussed in the text.
How is the baseline determined? What tool is used to generate the baseline?
One page 14 line 46 what is meant with the "difference between the two approaches is not very relevant and is maintained across the datasets"?
It was only until reaching the end of page 14 that I understood that Morph-CSV cannot be used on its own (right?). This should be clarified more in the text. I see it in the abstract, but not really in the text.
In Figure 11c what are the red parts?
It might be good to merge Figures 15 and 16 so that the results could be better compared.
# 7. Related Work
It is not clear what the approach of [38] is lacking.
If you have questions or remarks don't hesitate to contact me!
|