Abstract
Although Latin lexicography has a long tradition dating back to the ancient grammarians, and almost all Latin grammars devote
at least part of their morphology section(s) to wordformation, none of the lexical resources and NLP tools currently available
for Latin features a wordformation-based organization of the Latin lexicon. In this paper, we describe the first steps towards the
semi-automatic development of a wordformation-based lexicon of Latin, detailing several problems that arise while building the
lexicon and presenting our solutions. Developing such a lexicon is now of utmost importance, as recent years have seen a large
growth in annotated corpora of Latin texts from different eras. While these corpora include lemmatization, morphological tagging
and syntactic analysis, none of them features segmentation of word forms or wordformation relations between lexemes. This limits
the browsing and exploitation of the annotated data for linguistic research and for NLP tasks such as information retrieval and
heuristics for PoS tagging of unknown words.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Eighth International Conference on Language Resources and Evaluation |
| Pages | 852-859 |
| Number of pages | 8 |
| Publication status | Published - 2012 |
| Event | LREC 2012, Istanbul (23 May 2012 → 25 May 2012) |
Conference
| Conference | LREC 2012 |
|---|---|
| City | Istanbul |
| Period | 23/5/12 → 25/5/12 |
Keywords
- Latin
- Morphology