Abstract
This paper presents the early stages of the development of a new treebank
containing all of Dante Alighieri’s Latin works. In particular, it describes the
conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard,
the process of training four annotators and the evaluation of the syntactic annotation
in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release
a new resource, in view of the celebrations for the 700th anniversary of Dante’s death,
which can support the development of the Vocabolario Dantesco.
Original language | English |
---|---|
Title of host publication | Proceedings of the Seventh Italian Conference on Computational Linguistics. Bologna, Italy, March 1-3 |
Pages | 1-7 |
Number of pages | 7 |
Publication status | Published - 2020 |
Event | Seventh Italian Conference on Computational Linguistics - Bologna Duration: 1 Mar 2021 → 3 Mar 2021 |
Conference
Conference | Seventh Italian Conference on Computational Linguistics |
---|---|
City | Bologna |
Period | 1/3/21 → 3/3/21 |
Keywords
- Dante Alighieri
- Latin
- Treebank
- Universal Dependencies