Schema-Miner Pro: Agentic AI for Ontology Grounding over LLM-Discovered Scientific Schemas in a Human-in-the-Loop Workflow

Tracking #: 3983-5197

Authors: 
Sameer Sadruddin
Jennifer D'Souza
Eleni Poupaki
Alex Watkins
Bora Karasulu
Sören Auer1
Adrie Mackus
Erwin Kessels

Responsible editor: 
Guest Editors 2025 LLM GenAI KGs

Submission type: 
Full Paper
Abstract: 
Scientific processes are often described in free text, making it difficult to represent and reason over them computationally. We present schema-miner pro, a human-in-the-loop framework that automatically extracts and grounds structured schemas from scientific literature. Our approach combines large language models for schema extraction with an agent-based system that aligns extracted elements to external ontologies through interpretable, multi-step reasoning. The agent leverages lexical heuristics, semantic similarity, and expert feedback to ensure accurate grounding. We demonstrate the framework on two semiconductor manufacturing workflows—Atomic Layer Deposition (ALD) and Atomic Layer Etching (ALE)—mapping process parameters and outputs to the QUDT (Quantities, Units, Dimensions, and Types) ontology. By producing ontology-aligned, semantically precise schemas, schema-miner pro lays the groundwork for machine-actionable scientific knowledge and automated reasoning across disciplines.
Full PDF Version: 
Tags: 
Reviewed

Decision/Status: 
Accept

Solicited Reviews:
Click to Expand/Collapse
Review #1
By Antonello Meloni submitted on 18/Jan/2026
Suggestion:
Accept
Review Comment:

The authors have carefully addressed the previous reviewer comments, and the manuscript is now suitable for publication.