TY - GEN
T1 - Vir is to Moderatus as Mulier is to Intemperans. Lemma Embeddings for Latin
AU - Sprugnoli, Rachele
AU - Passarotti, Marco Carlo
AU - Moretti, Giovanni
PY - 2019
Y1 - 2019
AB - This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. A qualitative evaluation is also performed on the embeddings of rare lemmas. In addition, we release vectors pre-trained on the “Opera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective.
KW - Latin
KW - Word Embeddings
UR - http://hdl.handle.net/10807/144302
UR - http://ceur-ws.org/vol-2481/paper69.pdf
U2 - 10.5281/zenodo.3565572
DO - 10.5281/zenodo.3565572
M3 - Conference contribution
SN - 9791280136008
T3 - COLLANA DELL'ASSOCIAZIONE ITALIANA DI LINGUISTICA COMPUTAZIONALE
SP - 1
EP - 7
BT - Proceedings of the Sixth Italian Conference on Computational Linguistics
T2 - Sixth Italian Conference on Computational Linguistics
Y2 - 13 November 2019 through 15 November 2019
ER -