Abstract
We present the results of our attempt to use NLP tools to identify named entities in the publications of the Deutsches Archäologisches Institut (DAI) and to link the identified locations to entries in the iDAI.gazetteer. Our case study focuses on articles written in German and published in the journal Chiron between 1971 and 2014. We describe the annotation pipeline, which starts from the digitized texts published in the new DAI portal. We evaluate the performance of geoparsing and NER and test an approach to improve the accuracy of the latter.
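
As an illustration of what evaluating NER performance can involve, the sketch below scores predicted entity spans against a gold annotation. The abstract does not say which metrics were used; precision, recall and F1 over exact span-and-label matches are a common choice and are assumed here. The `Entity` type, the function name and the sample spans are hypothetical and do not come from the paper.

```python
from typing import NamedTuple, Set, Tuple


class Entity(NamedTuple):
    """A hypothetical annotated mention (not the paper's data model)."""
    start: int  # character offset where the mention begins
    end: int    # character offset just past the mention
    label: str  # entity type, e.g. "LOC" or "PER"


def precision_recall_f1(
    gold: Set[Entity], predicted: Set[Entity]
) -> Tuple[float, float, float]:
    """Score predictions by exact match on span boundaries and label."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Made-up example: three gold mentions, four predicted mentions.
gold = {Entity(0, 3, "LOC"), Entity(10, 16, "PER"), Entity(20, 26, "LOC")}
predicted = {
    Entity(0, 3, "LOC"),    # correct
    Entity(10, 16, "LOC"),  # right span, wrong label: counts as an error
    Entity(20, 26, "LOC"),  # correct
    Entity(30, 34, "LOC"),  # spurious mention
}

precision, recall, f1 = precision_recall_f1(gold, predicted)
print(f"P={precision:.2f} R={recall:.2f} F1={f1:.2f}")
```

Exact matching is strict: a mention with the right boundaries but the wrong type is penalised twice, once as a false positive and once as a missed gold entity. Relaxed schemes that credit partial overlaps exist, but they are outside the scope of this sketch.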
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018). 10-12 December 2018, Torino |
| Publisher | CEUR-WS |
| Pages | 253-257 |
| Number of pages | 5 |
| Volume | 2253 |
| ISBN (print) | 978-88-31978-41-5 |
| DOI | |
| Publication status | Published - 2018 |
All Science Journal Classification (ASJC) codes
- General Computer Science
Keywords
- digital archaeology
- entity linking
- named entity recognition
- text mining