Abstract

We present a lexical-based investigation into the corpus of the opera omnia of Seneca. By applying a number of statistical techniques to textual data we aim to automatically collect similar texts into closely related groups. Comparison with the orationes of Cicero, with the Latin New Testament by Jerome (Vulgata) and with the opera maiora of Thomas Aquinas is performed as well. We demonstrate that our objective and unsupervised method is able to distinguish the texts by work, genre and author.
Lingua originaleEnglish
Titolo della pubblicazione ospiteLatinitatis Rationes. Descriptive and Historical Accounts for the Latin Language
EditorPaolo Poccetti
Pagine684-706
Numero di pagine23
Stato di pubblicazionePubblicato - 2016

Keywords

  • Clustering
  • Computational Linguistics
  • Latin
  • Seneca

Fingerprint Entra nei temi di ricerca di 'A Statistical Investigation into the Corpus of Seneca'. Insieme formano una fingerprint unica.

Cita questo