CaLiGraph: A Knowledge Graph from Wikipedia Categories and Lists

Tracking #: 3601-4815

This paper is currently under review
Nicolas Heist
Heiko Paulheim

Responsible editor: 
Raghava Mutharaju

Submission type: 
Dataset Description
Knowledge Graphs (KGs) are increasingly used for solving or supporting tasks such as question answering or recommendation. To achieve a useful performance on such tasks, it is important that the knowledge modelled by KGs is as correct and complete as possible. While this is an elusive goal for many domains, techniques for automated KG construction (AKGC) serve as a means to approach it. Yet, AKGC has many open challenges, like learning expressive ontologies or incorporating long-tail entities. With CaLiGraph, we present a KG automatically constructed from categories and lists in Wikipedia, offering a rich taxonomy with semantic class descriptions and a broad coverage of entities. We describe its extraction framework and provide details about its purpose, resources, usage and quality. Further, we evaluate the performance of CaLiGraph on downstream tasks and compare it to other popular KGs.
Full PDF Version: 
Under Review