Review Comment:
NutriLink: An Ontology for Linking Digital Receipts to Food Nutrition Information and Dietary
I would like to commend the authors on their significant contributions research on semantics and data science more broadly, as well as to food and nutrition research. The creation of the NutriLink ontology represents an important advancement in linking digital receipts to nutritional information, which has the potential to provide actionable insights for guide healthier dietary behaviours, as well as a solid foundation for much future research. The integration of NutriLink with other ontologies like FoodOn and GoodRelations, along with its application in a diet recommendation system, is impressive and innovative. Overall, my comments below serve to improve the positioning of the paper, better explain the methods used, and address some other limitations.
Abstract:
- Likely needs to be reworked a little based on the comments below.
- Mention the country where the research was done, as its relevant in relation to nutrition guidelines as well as the retail data used on the case study.
Introduction:
- Most of the motivation for the project relies on legal reasoning (e.g., GDPR, EU regulation 1169/2011), but I am not sure if/how these regulations apply. My understanding was that nutrition information was mandated to be provided on food labels (front/back of pack), whereas I see the authors taking a more informatics lens to aggregate this data across products, shopping trips etc and provide more data than is legally required. Does this legal framework really necessitate providing this more comprehensive data or does it just apply to labelling?
- My understanding would be that retailers may be required to provide the transactional data, whereas the food manufactures are responsible for providing the nutritional content data of the food product. Note that fresh categories (meat, fruit & veg) are typically not required to have a nutrition facts label. Perhaps a brief description of responsibilities for data required from retailers vs. manufacturers vs. producers (farmers) would be helpful.
- The introduction may also benefit by motivating the research project by first discussing the more macro-level nutrition/health challenges facing society (e.g., high incidence of diet-related chronic diseases like diabetes, some cancers, obesity) and then discuss what tools/interventions have been tried before to address. Then maybe move to digital tools and how this project addresses a critical gap (which was started to be discussed on lines 5 to 8 of page 2 with the self-report vs. automated tracking).
- Also, linking nutrition information and digital receipts isn’t necessarily new. I know of several researchers who have used Neilsen IQ market data and linked it to food label information (for example, see Mary L’Abbé’s work at University of Toronto in Canada), or nutrition facts with loyalty card data already (see recent work by Mikael Fogelhom at University of Helsinki in Finland). Explaining what this project contributes beyond this existing work would be important.
- Page 2 lines 30-32 “this ontology also includes fine-grained Nutri-Score [12] details at the levels of both individual products and aggregated baskets.” Need to explain what Nutri-Score is to readers here and why it’s important. At this point the reader does not know what it is, especially for those in geographies where it has not been implemented.
Related work:
- Nutri-Score is just one of many nutrition scoring systems worldwide. There are also traffic light systems, warning labels (e.g., Canada’s new FOP labels; high in sugar/salt etc), guiding stars ratings, and others. This work should be reviewed briefly, and then explain why Nutri-Score was chosen for this project relative to others.
- Regarding the Food Composition Database (FCD) (lines 34-43 on page 3), more details are required regarding how the database was created/ where the data was obtained? Also, what is involved in maintaining the DB as mentioned on line 41?
- A discussion of recommendation systems is warranted. Broadly and within food retailing. What is currently done and what does this project contribute? I know most of the recommenders are based on past purchase history only, and not nutrition per se, but this should be discussed.
- Also, a discussion of what and how the nutrition recommendations are structured in Switzerland would be beneficial before moving to the Nutri-link ontology.
Nutri-link ontology:
- Page 5 lines 16-21. It is not clear how the Nutri-Score is aggregated at the basket level. Please explain because typically it is for an individual product.
- CQ4: Monthly energy and expense calculations… I think, at least, protein and fat are equally important macro-nutrients as energy. Also, maybe explain the difference between energy and how its calculated (vs. carbohydrates) somewhere.
3.2 Semantic model:
- Your list of re-used ontologies (page 5-6) is missing FoodON.
- Why is Dublin Core vocabulary needed? Do you also have images of the products in this ontology and the case study dataset?
- Page 6 lines 5-26. You have referenced how Nutri-link integrates with several different ontologies (FoodON, QUDT etc.) but there is no discussion of the methodology used to link these different ontologies. Was an NLP package used to match text between terms/definitions in the ontologies? What’s the degree of overlap and were there any challenges or missing links?
Case study:
- Page 9 line 21. 100 users at maximum is awkward. Do you have an accurate figure?
- Page 9 lines 30-33. In regard to “linking” the receipt data to the NutriLink, it is unclear how this matching is done. Again, are you using NLP to match item descriptions from the 2 retailers to the terms used in Nutrilink?
- Overall, I still a little confused about the dietary recommendations and the recommendation system. By recommendations, are you only referring to the text (e.g., reduce energy from sweet snacks) or are you also referring to the recommended “healthier products” that are depicted in Figure 2 [bottom, right screenshot]?
Limitations & Future work:
- One of the challenges of using receipt data is not necessarily knowing the household size and/or the occasion that the food was purchased for (e.g., for a party and not consuming all the food themselves). How has this been addressed in your project?
- Also, for the nutritional analysis section, consumers may buy food from other sources (e.g., restaurants, convenience stores, small shops), so how can the system be designed to incorporate this aspect of missing purchases? Or is it just a major limitation that needs to be acknowledged?
- Great to mention extension of this work to sustainability, but a bit more explanation would be nice. Sometimes sustainability refers to environmental footprint (e.g., CO2 emissions during production/processing), but it can also incorporate social aspects relating to equity (e.g., sustainable development goals). This also sometimes appears through food labels (e.g., free trade, B-corp certification, woman/Black/Indigenous-owned) How could social dimensions of sustainability be considered within this project in the future?
- For future research, I also think that there will be a large need for implementation science and designing of the human-computer interface (UX/UI) to promote uptake of the digital tools, as well as ensure that the data, insights, and recommendations actually produce a change in people’s food choices and nudge them in a healthier and more sustainable direction.
- Also, how can other marketing/promotional data beyond just prices (e.g., coupons, discounts, bonus points etc.) be incorporated in future work and perhaps in relation to the product recommendations?
Thank you again for this compelling and well-executed research. I look forward to seeing its continued development and application.
|