The annotation guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank. The treatment of some specific syntactic constructions in Latin

Marco Carlo Passarotti, David Bamman, Roberto Busa, Gregory Crane

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

6 Citations (Scopus)

Abstract

The paper describes the treatment of some specific syntactic constructions in two treebanks of Latin according to a common set of annotation guidelines. Both projects work within the theoretical framework of Dependency Grammar, which has been demonstrated to be an especially appropriate framework for the representation of languages with a moderately free word order, where the linear order of constituents is broken up with elements of other constituents. The two projects are the first of their kind for Latin, so no prior established guidelines for syntactic annotation are available to rely on. The general model for the adopted style of representation is that used by the Prague Dependency Treebank, with departures arising from the Latin grammar of Pinkster, specifically in the traditional grammatical categories of the ablative absolute, the accusative + infinitive, and gerunds/gerundives. Sharing common annotation guidelines allows us to compare the datasets of the two treebanks for tasks such as mutually checking annotation consistency, diachronically studying specific syntactic constructions, and training statistical dependency parsers.
Original language: English
Title of host publication: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)
Pages: 71-76
Number of pages: 6
Publication status: Published - 2008
Event: LREC 2008 - Marrakech
Duration: 28 May 2008 to 30 May 2008

Keywords

  • Latin
  • Treebank
