Despite a centuries-long tradition in lexicography, Latin lacks state-of-the-art computational lexical resources. This situation is strictly related to the still quite limited amount of linguistically annotated textual data for Latin, which can help the building of new lexical resources by supporting them with empirical evidence. However, projects for creating new language resources for Latin have been launched over the last decade to fill this gap. In this paper, we present Latin Vallex, a valency lexicon for Latin built in mutual connection with the semantic and pragmatic annotation of two Latin treebanks featuring texts of different eras. On the one hand, such a connection between the empirical evidence provided by the treebanks and the lexicon allows to enhance each frame entry in the lexicon with its frequency in real data. On the other hand, each valency-capable word in the treebanks is linked to a frame entry in the lexicon.
|Title of host publication||Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)|
|Number of pages||8|
|Publication status||Published - 2016|
|Event||Tenth International Conference on Language Resources and Evaluation (LREC 2016) - Portorož|
Duration: 23 May 2016 → 28 May 2016
|Conference||Tenth International Conference on Language Resources and Evaluation (LREC 2016)|
|Period||23/5/16 → 28/5/16|