Review Comment:
The authors present a mapping study on sensor-based health monitoring systems using semantic technology. A general background is given on the use of ontologies, KG and linked data for sensor data and health domain. The authors discuss other reviews that tackle related fields, but not the particular intersection that is targeted here. The PRISMA guidelines are used for guiding the mapping study. The study classifies and ranks 40 systems around 6 key challenges (and their sub-categories); describes common system architectures and data collection, development, and evaluation aspects. The authors observe multiple gaps, including: situational prediction and handling of uncertainty is inadequately addressed, nearly none of the systems utilize syntactic standards (e.g., FHIR), and there is little use of established guidelines to guide recommendations, among others.
The mapping study is quite impressive and comprehensive. I believe it has the potential to be a very useful review of the field in particular, and of the use of semantic technology in general. Further, it gives a good introduction on the topic in general. The paper is well written and the language is clear.
However, I have a few comments that I think should be resolved before publication:
The authors promise a thorough review of how semantic technology is used to overcome the challenges (this is objective 3 of the study). However, starting from the "context-awareness" part (Section 5.2), very little is mentioned about semantic technology. The authors mention that "Contextual information can be represented using semantic technologies, and most of the systems do so", and Table 7 lists the types of context that are covered by semantic technology. But, a (critical) reflection seems missing. For instance, what types of ontologies are popular for these particular purposes (e.g., context, situation); which types of knowledge graphs are often used (and using which platform, e.g., Jena, StarDog); what types of reasoning are implemented in CDSS (DL, first order); what types of rules / formalism are being used (SWRL, SHACL..)? I believe most of the content is in the paper, and may be resolved (to an extent) by re-shuffling some content from the background, semantic interoperability, and development languages section. This would allow the reader to get a better idea how semantic technology is used to overcome the challenges - currently this has to be puzzled together.
In a similar vein, the conclusion section could benefit from a reflection on (better) use of semantic technology in PHMS. Aside from the part on sensor ontologies, for instance, the part on explainability could be fleshed out; it is indeed an important feature of symbolic reasoning, but is it utilized in semantics-based PHMS? Are state-of-the-art reasoning technologies, such as SHACL, rdf-star, and Notation3 being utilized? Are ontologies being sufficiently re-used or are application-specific ontologies being used too often?
In the conclusion, I also invite the authors to revisit the final conclusion on generalizability of PHMS. Perhaps I'm misunderstanding, but it can be argued that disease-specific systems are more useful that a "one-size-fits-all" approach. Systems for COPD management need to focus on EMI to avoid flare-ups, whereas diabetes rather focuses on long-term behavior change (and apply psychological theory to that end). They have different management requirements and thus require different solutions. Regarding the lack of support for data standards (e.g., FHIR), perhaps this is due to the fact that systems (I assume) mostly rely on RDF? (It is true that FHIR has an RDF representation, but it is a bit awkward to use, requiring a whole lot of blank nodes).
The radar figures can be quite useful, but I feel they are currently giving a bit of a warped view. For instance, scoring "high" for DSS would involve quite a lot of DSS features, and go beyond the needs of a particular chronic illness. From Fig. 6, one would get the impression that most systems do not support DSS - they do, but simply not all possible DSS features the authors can think of.
The proposed reference architecture seems quite straightforward and even a bit reductive, compared to presented architectures. Why not separate important and distinct functionality, such as sensor, network and data storage, into separate layers? Why is it better to put them into a single layer? Why not go for a modular approach? There is insufficient motivation - the statement that it is "consistent with the layered architectures proposed in related reviews" is insufficient.
With regards to the evaluation setup: criterion 3 is not met by the mapping study, at least not as described by the authors. This criterion pertains to the quality of the individual studies (e.g., level of evidence, internal quality, sample size). (Note that some mapping studies even use the lack of robust evaluation (e.g., RCT) as _exclusion criteria_ for individual studies.) Instead, the discussion in Section 7 could be used to justify that the mapping study meets the criterion.
As a minor comment, the authors ruled out non open-access journals - I'm wondering how many papers were ruled out because of this? I don't think this is a common exclusion criterion, but I could be wrong.
Other comments:
- The authors provide a good description of 3 semantic technologies, but I'm unsure whether the basic intro's (in sections 2.2.1, 2.2.2, 2.2.3) are needed for the SWJ intended audience.
- Reasons are provided for using ontologies and LD in PHMS, but not for KG, it seems.
- Instead of ICD 10, refer to its latest (2019) version, ICD 11 (or, you could make a point that the latter is not being used in most systems)
- Provide a reference for: "Additionally, ambient sensors are increasingly being incorporated in health monitoring to monitor the state of the external environment [..]"
- The following should be better supported:
"systems that re-use existing ontologies have a higher degree of expressiveness for sensor and sensor data concepts than those that do not"
- This is a redundant statement: "Related to duration is frequency."
- What is meant by ambiguous sensor data?
- Please explain "photoplethysmography" when first used
- What is drawback of rules being based on existing knowledge?
"Secondly, the manual creation of rules is time-consuming, difficult to scale, and is often static and based on existing knowledge."
- Please elaborate on BSN acronym
- "Similarly, Ali et al. [93] tested compared" -> "Similarly, Ali et al. [93] compared"
|