UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works

Flavio Massimiliano Cecchini, Rachele Sprugnoli, Giovanni Moretti, Marco Carlo Passarotti

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.
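The three evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes the common definitions: UAS is the share of tokens with the correct head, LA the share with the correct dependency relation label, and LAS the share with both correct. The function name `attachment_scores`, the token representation, and the toy sentence are hypothetical and are not taken from the paper, whose exact evaluation setup (for example, the treatment of punctuation) is not stated here.

```python
from typing import Dict, List, Tuple

# Hypothetical token representation: (head index, dependency relation).
# Head index 0 stands for the artificial root, as in CoNLL-U.
Token = Tuple[int, str]


def attachment_scores(gold: List[Token], pred: List[Token]) -> Dict[str, float]:
    """Compute UAS, LA and LAS for two annotations of the same tokens."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must cover the same tokens")
    if not gold:
        raise ValueError("cannot score an empty annotation")

    n = len(gold)
    # UAS: correct head, label ignored.
    head_ok = sum(g[0] == p[0] for g, p in zip(gold, pred))
    # LA: correct label, head ignored.
    label_ok = sum(g[1] == p[1] for g, p in zip(gold, pred))
    # LAS: head and label both correct.
    both_ok = sum(g == p for g, p in zip(gold, pred))

    return {"UAS": head_ok / n, "LA": label_ok / n, "LAS": both_ok / n}


# Toy four-token sentence (invented for illustration only).
gold = [(2, "nsubj"), (0, "root"), (2, "obj"), (3, "amod")]
pred = [(2, "nsubj"), (0, "root"), (2, "obl"), (2, "amod")]

scores = attachment_scores(gold, pred)
print(scores)
```

In this toy case the third token has the right head but the wrong label, and the fourth has the right label but the wrong head, so UAS and LA are both 0.75 while LAS drops to 0.5.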
Original language: English
Title of host publication: Proceedings of the Seventh Italian Conference on Computational Linguistics. Bologna, Italy, March 1-3
Pages: 1-7
Number of pages: 7
Publication status: Published - 2020
Event: Seventh Italian Conference on Computational Linguistics - Bologna
Duration: 1 Mar 2021 - 3 Mar 2021

Conference

Conference: Seventh Italian Conference on Computational Linguistics
City: Bologna
Period: 1/3/21 - 3/3/21

Keywords

  • Dante Alighieri
  • Latin
  • Treebank
  • Universal Dependencies
