Managing FAIR Knowledge Graphs as Polyglot Data End Points: A Benchmark based on the rdf2pg Framework and Plant Biology Data

Tracking #: 3702-4916

This paper is currently under review
Marco Brandizi
Carlos Bobed
Luca Garulli
Arné de Klerk
Keywan Hassani-Pak

Responsible editor: 
Guest Editors KG Construction 2024

Submission type: 
Full Paper
Linked data and labelled property graphs (LPG) are two data management approaches with complementary strengths and weaknesses, making their integration beneficial for sharing datasets and supporting software ecosystems. In this paper, we introduce rdf2pg, an extensible framework for mapping RDF data to semantically equivalent LPG formats and data-bases. Utilising this framework, we perform a comparative analysis of three popular graph databases - Virtuoso, Neo4j, and ArcadeDB - and the well-known graph query languages SPARQL, Cypher, and Gremlin. Our qualitative and quanti-tative assessments underline the strengths and limitations of these graph database technologies. Additionally, we high-light the potential of rdf2pg as a versatile tool for enabling polyglot access to knowledge graphs, aligning with estab-lished standards of linked data and the semantic web.
Full PDF Version: 
Under Review