Publishing and Using Parliamentary Linked Data on the Semantic Web: ParliamentSampo System for Parliament of Finland

Tracking #: 3605-4819

This paper is currently under review
Authors: 
Eero Hyvönen
Laura Sinikallio
Petri Leskinen
Senka Drobac
Rafael Leal
Matti La Mela
Jouni Tuominen
Henna Poikkimäki
Heikki Rantala

Responsible editor: 
Guest Editors KG Gen 2023

Submission type: 
Tool/System Report
Abstract: 
This paper presents a new infrastructure and semantic portal called ParliamentSampo for studying parliamentary speeches, culture, language, and activities in Finland. For the first time, the entire time series of roughly one million plenary speeches of the Parliament of Finland (PoF) since 1907 has been converted from text into knowledge graphs and data services in unified formats, including CSV, Parla-CLARIN, ParlaMint, and RDF Linked Open Data (LOD). The speech data have been interlinked with a semi-automatically created ontology and a knowledge graph about the activities of over 2,800 Members of Parliament (MPs) and other speakers in the plenary sessions of the PoF. The data were enriched by linking to external data sources and by reasoning, resulting in a broader LOD service. Knowledge extraction techniques based on Natural Language Processing (NLP) were used for automatic semantic annotation and topical classification of the speeches. The data and data services have been used in Digital Humanities (DH) research projects and for application development, especially for developing the in-use semantic portal ParliamentSampo. The infrastructure and the portal were published on the Web on February 14, 2023 under the open CC BY 4.0 license, and quickly gathered thousands of users, including citizens, media, politicians, and researchers of politics. ParliamentSampo is a new member of the "Sampo" series of over 20 interlinked LOD services and semantic portals in Finland, based on a national Semantic Web infrastructure.
Tags: 
Under Review