Decoding Deception with TAXODIS – a Taxonomy of Disinformation Cues for Fine-grained Text Labeling

Submitted by Konstantin Todorov on 09/17/2025 - 01:24

Tracking #: 3946-5160

Authors:

Isabel Bezzaoui

Pavlos Fafalios

Jonas Fegert

Konstantin Todorov

Achim Rettinger

Responsible editor:

Philipp Cimiano

Submission type:

Ontology Description

Abstract:

The ubiquity of disinformation on digital platforms poses a threat to democracy and social cohesion. Despite significant developments in machine learning for disinformation detection and more specific related tasks (such as fact-checking, check-worthiness detection, claim linking, propaganda and rumor detection), effectively applying empirical knowledge during the training of such models in a standardized and transparent way remains a challenge. In this paper, following the semantic web principles, we propose TAXODIS—the first of its kind openly available Taxonomy of Online Disinformation. It structures an interdisciplinary set of well-defined and analyzed linguistic features of online disinformation discourse and is meant to help annotate training data to nourish machine learning and computational models that deal with the above-mentioned tasks. The systematic clustering of linguistic features into a comprehensive and publicly available framework provides a basis for the empirically grounded training of models and enhances the understanding of disinformation on a textual and linguistic level. Demonstrating and evaluating the artifact, we find that it facilitates data labeling processes by offering annotators a compact yet empirically informed guide to identifying textual indicators of disinformation. This paper, proposing a structured taxonomy as a valuable tool for automated detection systems, contributes to disinformation detection by mapping nuanced linguistic characteristics in disinformation content.

Full PDF Version:

swj3946.pdf

Previous Version:

Decoding Deception with TAXODIS - a Taxonomy of Disinformation Cues for Fine-grained Text Labeling

Tags:

Reviewed

Decision/Status:

Minor Revision

Solicited Reviews:

Review #1

Anonymous submitted on 20/Oct/2025

Suggestion:
Accept

Review Comment:

The authors improved the manuscript following the suggestions given, but they should proofread the new parts to fix typos (e.g. "nor the dataset-focused DeFaktS paper [9] present+s") and, where possible, cite the peer-reviewed versions of papers instead of arXiv preprints, e.g.:

Arcos I., Rosso P., Salaverría R. Divergent Emotional Patterns in Disinformation on Social Media? An Analysis of Tweets and TikToks about the DANA in Valencia. In: ICAART-2025, Proc. 17th Int. Conf. on Agents and Artificial Intelligence, Feb. 23-25 (2025)

instead of:

I. Arcos, P. Rosso and R. Salaverría, Divergent Emotional Patterns in Disinformation on Social Media? An Analysis of Tweets and TikToks about the DANA in Valencia, arXiv preprint arXiv:2501.18640 (2025).

Review #2

Anonymous submitted on 24/Mar/2026

Suggestion:
Major Revision

Review Comment:

The article introduces the TAXODIS taxonomy. The taxonomy provides 66 concepts for annotating online misinformation. The taxonomy is developed through a systematic review of existing research, and it has been published using the SKOS vocabulary. Overall, the paper is easy to follow, and the TAXODIS model responds to the clear need for more structured misinformation annotation.

Detailed comments:
1) Novelty and relationship to prior work
The article appears to extend two previous contributions ([13] and [19]) by only adding a sixth aspect to the previously published taxonomy and providing a formal SKOS implementation. However, the clear difference between this work and the previous publications is not sufficiently discussed. The difference is currently addressed with a single sentence in the introduction and a short paragraph in the second section.

The authors mention that a previous paper presented an earlier unimplemented version of the taxonomy, but do not discuss what has changed besides the formal implementation. These differences remain unclear, and the discussion fails to highlight the extent of the changes. The paper should be more upfront about what is new and what is not, including where content or findings are reused. These distinctions should be highlighted within the body of the paper (e.g., Section 3) rather than only in the related work.

2) Conceptual Scope
The article focuses on the creation of a taxonomy for NLP and machine learning tasks as a facilitating tool for data annotation. This initial premise is somewhat restrictive, as the proposed taxonomy appears to be designed more as a codebook than a model to aid the broader understanding of online disinformation.

3) Related work
While extensive, the related work seems to largely dismiss previous efforts in building categorisation schemes for misinformation (e.g., "None of the mentioned efforts above propose a shared semantic model"). Furthermore, the discussion does not address in detail existing efforts to extract and annotate misinformation automatically, focusing primarily on LIWC.

There is also a lack of discussion regarding existing datasets and knowledge graphs. For example, the paragraph starting at line 37 on page 3 discusses MultiFC and ClaimsKG but fails to mention CimpleKG, which provides specific misinformation, textual and linguistic features. CimpleKG is only mentioned briefly on page 12 without sufficient context.

4) Modelling
The taxonomy follows established practice by reusing SKOS, which facilitates the integration of TAXODIS into other knowledge sources. However, the use of SKOS may not be the most appropriate choice for an aspect-based taxonomy. Such a model might be better coupled with a more traditional ontological model to avoid a faceted taxonomy structure (e.g., by separating veracity, categorisation, and detection features).

The decision to use SKOS rather than a more formal ontological model should be discussed. For instance, characteristic values and boundaries may not be suitably represented in SKOS, and constraints for veracity grades are not well represented by the taxonomy (see how schema.org represents Rating). The example in Section 4.2 may be better represented using an ontological model built upon the ClaimReview model.

5) Methodology and usage
The methodology is sound and draws from multiple sources with a systematic approach. However, as previously noted, the authors should be clearer about what is drawn from [13] and [19] compared to what is new, as these articles overlap with the presented methodology and findings.

Regarding usage, the authors list potential queries combining Schema.org and OA. While this provides context, it is important to note that some of these queries can already be answered using existing vocabularies and knowledge graphs. For example, ClaimReview can already be used to retrieve "mostly false" claims, while CimpleKG can be queried for misinformation factors and mentions. This should be discussed in greater detail.

6) Evaluation
The evaluation section largely refers to a prior evaluation of the model; there is no new evaluation of the SKOS implementation or its integration with other resources. As a result, the paper provides limited novel insights regarding the real-world usage of the taxonomy implementation. Finally, the maintenance and monitoring plan appears to be an afterthought. For example, ClaimsKG is now largely outdated. More current knowledge graphs exist, such as DBKF and CimpleKG.
