Abstract
Lemlat is a morphological analyser for Latin, which shows a remarkably wide coverage of the Latin lexicon. However, the performance of the tool is limited by the absence of proper names in its lexical basis. In this paper we present the extension of Lemlat with a large Onomasticon for Latin. First, we describe and motivate the automatic and manual procedures for including the proper names in Lemlat. Then, we compare the new version of Lemlat with the previous one, by evaluating their lexical coverage of four Latin texts of different era and genre.
Original language | English |
---|---|
Title of host publication | Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2016) |
Pages | 90-94 |
Number of pages | 5 |
Publication status | Published - 2016 |
Event | Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities - Berlino Duration: 11 Aug 2016 → 11 Aug 2016 |
Workshop
Workshop | Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities |
---|---|
City | Berlino |
Period | 11/8/16 → 11/8/16 |
Keywords
- Latin, Morphology, NLP