The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts

Rachele Sprugnoli*, Tommaso Caselli, Sara Tonelli, Giovanni Moretti

*Autore corrispondente per questo lavoro

Risultato della ricerca: Contributo in libroContributo a convegno

1 Citazioni (Scopus)

Abstract

This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.
Lingua originaleEnglish
Titolo della pubblicazione ospiteProceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
Pagine260-266
Numero di pagine7
Volume2
DOI
Stato di pubblicazionePubblicato - 2017
Evento15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Valencia (Spagna)
Durata: 3 apr 20177 apr 2017

Convegno

Convegno15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017
CittàValencia (Spagna)
Periodo3/4/177/4/17

Keywords

  • semantics, computational linguistics, corpus annotation, information extraction

Fingerprint

Entra nei temi di ricerca di 'The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts'. Insieme formano una fingerprint unica.

Cita questo