Challenges in Converting the Index Thomisticus Treebank into Universal Dependencies

Flavio Massimiliano Cecchini*, Marco Carlo Passarotti*, Paola Marongiu, Daniel Zeman

*Corresponding author for this work

Research output: Contribution to book; Conference contribution

Abstract

This paper describes the changes applied to the original process used to convert the Index Thomisticus Treebank, a corpus including texts in Medieval Latin by Thomas Aquinas, into the annotation style of Universal Dependencies. The changes are made both to harmonise the Universal Dependencies version of the Index Thomisticus Treebank with the two other available Latin treebanks and to fix errors and inconsistencies resulting from the original process. The paper details the treatment of different issues in PoS tagging, lemmatisation and assignment of dependency relations. Finally, it assesses the quality of the new conversion process by providing an evaluation against a gold standard.
Original language: English
Host publication title: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)
Pages: 27-36
Number of pages: 10
Publication status: Published - 2018
Event: Second Workshop on Universal Dependencies (UDW 2018) - Brussels, Belgium
Duration: 1 Nov 2018 to 1 Nov 2018

Workshop

Workshop: Second Workshop on Universal Dependencies (UDW 2018)
City: Brussels, Belgium
Period: 1 Nov 2018 to 1 Nov 2018

Keywords

  • Treebank
  • Universal Dependencies
