Abstract
We present a lexical-based investigation into the corpus of the opera omnia of Seneca. By applying a number of statistical techniques to textual data we aim to automatically collect similar texts into closely related groups. Comparison with the orationes of Cicero, with the Latin New Testament by Jerome (Vulgata) and with the opera maiora of Thomas Aquinas is performed as well. We demonstrate that our objective and unsupervised method is able to distinguish the texts by work, genre and author.
Lingua originale | English |
---|---|
Titolo della pubblicazione ospite | Latinitatis Rationes. Descriptive and Historical Accounts for the Latin Language |
Editor | Paolo Poccetti |
Pagine | 684-706 |
Numero di pagine | 23 |
Stato di pubblicazione | Pubblicato - 2016 |
Keywords
- Clustering
- Computational Linguistics
- Latin
- Seneca