Review Comment:
This paper provides an overview of the current state-of-the-art in knowledge-aware news recommendation. This is a particularly challenging domain for recommendation in general, due to the rapidly changing nature of content, and of user interests. The aim of the models discussed is to include structured knowledge (such as that from a knowledge base), to enrich the textual content of the news, and to alleviate some of these issues.
## Main concerns
Recommender systems are not my main area of research, so I have a limited view of how complete the paper is. Nevertheless, it seems thoroughly researched, and presented with attention to detail.
My main concern is the lack of a strict methodology for finding relevant papers. The authors claim on page 2, to provide a comprehensive review, but no discussion seems to be provided for how papers were selected for potential inclusion, and what criteria were used. A standard methodology, like starting from a set of DBLP search keywords to find a set of seed papers and following citations to a certain depth, would help to convince the reader that every effort was made to find relevant work. Since such a methodology can help to surface relevant work that may not have received the attention it deserves when it was published, I think this is an especially worthwhile approach.
While I agree with the authors' approach of highlight only a selection of representative papers to illustrate the basic idea of each approach, I do feel that a comprehensive review ought to include all relevant papers. They need not all be discussed in detail, but they should all be cited (perhaps the authors did follow this approach, the citation list certainly seems long enough, but the way it is written down no makes it seem that a fairly arbitrary selection was made).
Finally, the paper lacks somewhat in a strong message. Perhaps there is simply no such message to be found, but I would expect that a higher degree of synthesis can be achieved. I think it should be possible to answer questions like "Do knowledge-aware RSs actually solve the fundamental problems highlighted in the introduction, and how?" or "What is currently the most promising of the directions proposed, and what are the pros and cons of each?" or "Is knowledge-aware news recommendation currently a promising technology for production systems?". Section 7 covers some of this ground to some extent, but it currently reads more like a lengthy brainstorm, somewhat disconnected from what came before than a strong conclusion, synthesizing and extrapolating the work that has been done.
I believe these issues can be easily solved with a modest amount of extra work, so I recommend an accept with minor revisions.
## Other suggestions for improvement
* In Section 2, the problems that make news recommendation challenging are explained, but the case is never explicitly made for why and _how_ the inclusion of structured knowledge can help solve these problems. for instance, the fact that users do no make accounts, and do not leave explicit feedback, seem at first sight to be entirely unrelated to whether or not the system has access to DBPedia. I can work out how these might relate, but it would be better if the paper made the case explicitly, ideally with a skeptical reader in mind (since many in the Recommender System community may need some persuading).
* It seems that almost all methods discussed require some form of entity linking/named entity recognition to map the concepts discussed in the news articles to the structured knowledge. As far as I can tell, this step is never explicitly discussed. It seems to me that this is a rather important dimension, and choices made here may lead to great differences in performance. There may be other steps like these (general text pre-processing for instance) that any knowledge-aware news recommender requires, that may differ between systems.
* Some statements,in particular in sections 6 and 7 should be made more strongly. For example p34l2: "It can be said that ..." If the authors of a survey are not going to say this, then who? I suggest that the authors be a little more bold and state explicitly that the field lacks reproducibility and strong evaluation. I think they have made that case well enough not to couch their words.
* The paper uses passive tense a little too often. For instance on p10l26 in the sentence reading "These systems are analysed..." it is genuinely unclear whether this analysing is done by the authors, or by the community in general. Active tense solves such problems.
* Some sections could be better structured and read a bit more like a loose assortment of ideas than a coherent narrative. For instance:
** In Section 2, the paragraph starting on p3l40, starts by discussing the difference between long and short-term preferences, and then jumps to the difficulty of tracking users without an account and then jumps to the lack of explicit feedback. These are related concerns, but they are better discussed in turn, each in their own paragraph.
** Section 7.1 is a really important section, containing one of the main takeaways, but it reads like a long list of loose recommendations. I think this section can be made stronger by introducing a little more structure. For instance, stating up front that the authors make N specific recommendations to improve evaluation, and then including these in a numbered list, with the topic sentences for each in bold. It may even be worthwhile to remove some less important recommendations to better emphasize the most important ones.
* Occasionally, references to the origin of methods outside the domain of knowledge-aware RSs is missing. For instance, the Bing similarity is presumably a variant of the Google distance and related methods. In this case it would be good to reference that method briefly to position the research better. Semantic similarity based on graph distance is another example.
## Typos and small mistakes
* "utilize" - > "use"
* "X is comprised of Y" -> "X comprises Y" or "X is composed of Y"
* "If several users read the same two news ..." -> "If several users read the same two news articles ..."
* "This phenomena" -> "This phenomenon"
* "A stricter criteria" - > "A stricter criterion"
* The mathematical typesetting need a little bit more work. For instance:
** Any names consisting of multiple characters ("agg", "softmax") should be set in text mode.
** Angle brackets should be set with \langle and \rangle rather than < and >
* The four different types of approach are hard to pick out as subsections. Perhaps it would be better to make each their own section.
|