Abstract
This paper presents QUANDHO (QUestion ANswering Data for italian HistOry), an Italian question answering dataset created to cover a specific domain, namely the history of Italy in the first half of the twentieth century. The dataset includes questions manually classified and annotated with Lexical Answer Types, and a set of question-answer pairs. This resource, freely available for research purposes, has been used to retrain a domain-independent question answering system so as to improve its performance in the domain of interest. Ongoing experiments on the development of a question classifier and an automatic tagger of Lexical Answer Types are also presented.
Original language | English |
---|---|
Title of host publication | Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) |
Pages | 3502-3509 |
Number of pages | 8 |
Publication status | Published - 2016 |
Event | Tenth International Conference on Language Resources and Evaluation (LREC 2016) - Portorož, Slovenia. Duration: 23 May 2016 → 28 May 2016 |
Conference
Conference | Tenth International Conference on Language Resources and Evaluation (LREC 2016) |
---|---|
City | Portorož, Slovenia |
Period | 23/5/16 → 28/5/16 |
Keywords
- Corpus
- Digital History
- Question Answering