Abstract
Code-mixing is the alternation between two or more languages in the same text. This phenomenon is particularly relevant in the travel domain, since it can provide new insight into the way foreign cultures are perceived and described to readers. In this paper, we analyse English-Italian code-mixing in historical English travel writings about Italy. We retrain and compare two existing systems for the automatic detection of code-mixing, and analyse the semantic categories most often associated with Italian. We also release the domain corpus used in our experiments and the output of the extraction.
Original language | English |
---|---|
Title of host publication | Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017) |
Pages | 304-309 |
Number of pages | 6 |
Volume | 2006 |
DOI | |
Publication status | Published - 2017 |
Event | CLiC-it 2017 - Rome, Duration: 11 Dec 2017 → 13 Dec 2017 |
Conference
Conference | CLiC-it 2017 |
---|---|
City | Rome |
Period | 11/12/17 → 13/12/17 |
Keywords
- code-mixing
- computational linguistics
- digital humanities
- travel writing