LegalNERo: A linked corpus for named entity recognition in the Romanian legal domain

Tracking #: 2816-4030

This paper is currently under review
Authors: 
Vasile Pais
Maria Mitrofan
Carol Luca Gasan
Alexandru Ianov
Corvin Ghiță
Vlad Silviu Coneschi
Andrei Onuț

Responsible editor: 
Harald Sack

Submission type: 
Dataset Description
Abstract: 
LegalNERo is a manually annotated corpus for named entity recognition in the Romanian legal domain. It provides gold annotations for organizations, locations, persons, time and legal resources mentioned in legal documents. Furthermore, GeoNames identifiers are provided for location entities, when linking was possible. The resource is available in multiple formats, including span-based, token-based and RDF. The Linked Open Data version, in RDF-Turtle format, is available for both download and interrogation using a SPARQL endpoint.
Full PDF Version: 
Tags: 
Under Review