Abstract:
Large Language Models (LLMs) have demonstrated remarkable capabilities in extracting knowledge from, and generating new content based on, a wide range of resources, particularly text-based ones. Beyond unstructured data, LLMs also perform well on structured yet semantically rich resources such as ontologies, schemas, and knowledge graphs. However, directly providing large-scale semantic artifacts as input to LLMs is constrained by prompt size and token limits. The state-of-the-art solution to this challenge is the use of Retrieval-Augmented Generation (RAG) systems.
In this work, we propose IndustrialGraphRAG, a novel graph-based RAG approach specifically designed for large semantic artifacts. Our method integrates LLM-based Named Entity Recognition (NER) and Entity Linking (EL) into a unified pipeline tailored to semantically complex resources. Within this framework, we implement three use cases that combine LLM reasoning with our RAG system: (i) semantic artifact validation, (ii) information retrieval, and (iii) information model generation. The first two tasks translate natural language queries (NLQs) into executable SPARQL queries, whereas the third populates semantic artifacts from NLQ-driven instructions. Across all three use cases, the system performs strongly, confirming the effectiveness of the approach. Comparative experiments against two additional RAG baselines further show that IndustrialGraphRAG surpasses both in accuracy and contextual reasoning.
OPC UA serves as our primary data resource due to its breadth and semantic richness. To demonstrate generalizability, we additionally evaluate the system on the large-scale SAREF ontology, a structurally and semantically distinct artifact. Consistent performance across both resources indicates that the proposed system is not domain-specific and can be reliably applied to diverse semantic datasets.