TY - BOOK
T1 - Paradigm Structure and Predictability in Latin Inflection: An Entropy-based Approach
AU - Pellegrini, Matteo
PY - 2023
Y1 - 2023
N2 - In the last few years, the field of morphology has started to question some fundamental assumptions on the structure of wordforms. In particular, the idea is gaining ground that wordforms should not be viewed as obtained by concatenating smaller meaningful pieces, as in classical morphemic analysis. Instead, the opposite happens: from the comparison of full inflected wordforms, recurrent partials are extracted which can be thought of as having a discriminative function within the paradigm – i.e., what matters is that they are useful for distinguishing wordforms from one another, rather than their association with a particular meaning.
A problem that has been widely investigated in this context is the possibility of predicting full inflected wordforms from one another within the inflectional paradigm of a lexeme, exploiting the presence of more or less reliable implicative relations, in what has been labelled the “Paradigm Cell Filling Problem”. As a way of quantifying the difficulty of this task, the information-theoretic notion of conditional entropy has been used in much recent work.
In this work, the above-mentioned theoretical and methodological innovations are exploited to investigate Latin verbal and nominal paradigms, to obtain a quantitative analysis of the reliability of implicative relations, and thus of the patterns of interpredictability between inflected wordforms – i.e., of the difficulty of the Paradigm Cell Filling Problem.
The book is divided into six chapters. Chapters 1 and 2 provide a more detailed picture of the theoretical framework within which this work is located and of the entropy-based methodology adopted, respectively. As we will see in more detail in Chapter 1, our theoretical framework can be considered abstractive – i.e., considering morphemes as possibly extracted a posteriori from full inflected wordforms, rather than starting from morphemes and assembling them to obtain wordforms – and implicative – i.e., focusing on implicative relations, rather than on the exponence of morphosyntactic properties. Our approach is also quantitative, as the entropy-based assessment of predictability in inflectional paradigms is obtained by taking the type frequency of different inflectional patterns into account – as is shown in Chapter 2, where the details of the adopted methodology are outlined.
To obtain information on the type frequency of inflectional patterns, an inflected lexicon listing the wordforms of a representative selection of lexemes is necessary. In Chapter 3, the lexical resource that was created for the purposes of this work – LatInfLexi – is presented, showing how it was obtained from the large database of a recently renewed morphological analyser of Latin, Lemlat 3.0.
We can then move to the presentation of our results on verb paradigms – in Chapter 4 – and on noun paradigms – in Chapter 5. On the one hand, such results are exploited to obtain a mapping of the paradigm into zones of interpredictability – i.e., groups of cells that can be predicted from one another with no uncertainty. On the other hand, if predictions from more than one cell are taken into account alongside predictions from a single cell, principal parts – i.e., sets of cells from which the whole paradigm of a lexeme can be inferred without uncertainty – or at least near principal parts – which reduce uncertainty greatly, but not completely – can be found in a more principled way than in traditional descriptions.
In the last section of Chapter 5, a methodological innovation with respect to the standard procedure is introduced. In §5.3, uncertainty in predicting one cell from another is quantified assuming that not only the phonotactic shape of the wordforms is known, but also information of a different kind – namely, the gender of a noun, which is partly predictive of its inflectional behaviour, as is already acknowledged in traditional descriptio
KW - Computational morphology
KW - Conditional entropy
KW - Implicative relations
KW - Inflected lexicon
KW - Inflectional morphology
KW - Inflectional predictability
KW - Information theory
KW - Latin linguistics
KW - Paradigm Cell Filling Problem
KW - Paradigm structure
UR - http://hdl.handle.net/10807/227032
U2 - 10.1007/978-3-031-24844-3
DO - 10.1007/978-3-031-24844-3
M3 - Book
SN - 978-3-031-24843-6
VL - 6
T3 - Studies in Morphology
BT - Paradigm Structure and Predictability in Latin Inflection: An Entropy-based Approach
PB - Springer
ER -