TY - JOUR
T1 - Enhancing Derivational Information on Latin Lemmas in the LiLa Knowledge Base. A Structural and Diachronic Extension
AU - Pellegrini, Matteo
AU - Passarotti, Marco Carlo
AU - Litta Modignani Picozzi, Eleonora Maria Gabriella
AU - Mambrini, Francesco
AU - Moretti, Giovanni
AU - Corbetta, Claudia
AU - Verdelli, Martina
PY - 2022
Y1 - 2022
N2 - In this paper we document both the structural and the diachronic extension of the derivational information provided in the LiLa Knowledge Base of interoperable linguistic resources for Latin. Structurally, to the flat information on families (i.e., groups of lemmas that share the same base) and affixes that is already available for the collection of lemmas of the LiLa Lemma Bank, we add hierarchical information on derivation processes provided by the Word Formation Latin (WFL) lexical resource, which in turn is characterised by a step-by-step morphotactic approach, where lexemes that are directly derived from one another are connected through word formation rules of different kinds. This is done by modelling WFL data into an ontology that adheres to the principles of the Linked Data paradigm, and connecting these data to the LiLa Lemma Bank. From a diachronic point of view, while the previous version of WFL only took Classical Latin lemmas into account, in this paper we describe the work conducted to produce a new version of WFL that is enhanced with derivational information on Medieval Latin lemmas. We then show how the data of this new version of WFL were used to extract derivational information in the format required by the LiLa Lemma Bank.
KW - Latin
KW - Linguistic Linked Data
KW - Linguistic resources
KW - Morphology
UR - http://hdl.handle.net/10807/214008
UR - https://ufal.mff.cuni.cz/pbml/119/art-pellegrini-et-al.pdf
M3 - Article
SN - 0032-6585
SP - 67
EP - 92
JO - The Prague Bulletin of Mathematical Linguistics
JF - The Prague Bulletin of Mathematical Linguistics
ER -