Mining and Analysing One Billion Requests to Linguistic Services

Greta Franzini, Marco Buchler, Emily Franzini, Thomas Eckart

Risultato della ricerca: Contributo in libroContributo a convegno

Abstract

From 2004 to 2016 the Leipzig Linguistic Services (LLS) existed as a SOAP-based cyber infrastructure of atomic micro-services for the Wortschatz project, which covered different-sized textual corpora in more than 230 languages. The LLS were developed in 2004 and went live in 2005 in order to provide a Web service-based API to these corpus databases. In 2006, the LLS infrastructure began to systematically log and store requests made to the text collection, and in August 2016 the LLS were shut down. This article summarises the experience of the past ten years of running such a cyberinfrastructure with a total of nearly one billion requests. It includes an explanation of the technical decisions and limitations but also provides an overview of how the services were used.
Lingua originaleEnglish
Titolo della pubblicazione ospiteProceedings of the IEEE International Conference on Big Data 2016 (IEEE BigData 2016)
Pagine3230-3239
Numero di pagine10
DOI
Stato di pubblicazionePubblicato - 2017
EventoIEEE International Conference on Big Data 2016 (IEEE BigData 2016) - Washington, DC
Durata: 5 dic 20168 dic 2016

Convegno

ConvegnoIEEE International Conference on Big Data 2016 (IEEE BigData 2016)
CittàWashington, DC
Periodo5/12/168/12/16

Keywords

  • collective intelligence
  • data analysis
  • data processing
  • internet
  • knowledge discovery
  • natural language
  • software as a service
  • web services

Fingerprint

Entra nei temi di ricerca di 'Mining and Analysing One Billion Requests to Linguistic Services'. Insieme formano una fingerprint unica.

Cita questo