Vir is to Moderatus as Mulier is to Intemperans. Lemma Embeddings for Latin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. A qualitative evaluation is also performed on the embeddings of rare lemmas. In addition, we release vectors pre-trained on the “Opera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective.
Original languageEnglish
Title of host publicationProceedings of the Sixth Italian Conference on Computational Linguistics
Pages1-7
Number of pages7
DOIs
Publication statusPublished - 2019
EventSixth Italian Conference on Computational Linguistics - BARI -- ITA
Duration: 13 Nov 201915 Nov 2019

Publication series

NameCOLLANA DELL'ASSOCIAZIONE ITALIANA DI LINGUISTICA COMPUTAZIONALE

Conference

ConferenceSixth Italian Conference on Computational Linguistics
CityBARI -- ITA
Period13/11/1915/11/19

Keywords

  • Latin
  • Word Embeddings

Fingerprint

Dive into the research topics of 'Vir is to Moderatus as Mulier is to Intemperans. Lemma Embeddings for Latin'. Together they form a unique fingerprint.

Cite this