Abstract
From 2004 to 2016 the Leipzig Linguistic Services (LLS) existed as a SOAP-based cyber infrastructure of atomic micro-services for the Wortschatz project, which covered different-sized textual corpora in more than 230 languages. The LLS were developed in 2004 and went live in 2005 in order to provide a Web service-based API to these corpus databases. In 2006, the LLS infrastructure began to systematically log and store requests made to the text collection, and in August 2016 the LLS were shut down. This article summarises the experience of the past ten years of running such a cyberinfrastructure with a total of nearly one billion requests. It includes an explanation of the technical decisions and limitations but also provides an overview of how the services were used.
Lingua originale | English |
---|---|
Titolo della pubblicazione ospite | Proceedings of the IEEE International Conference on Big Data 2016 (IEEE BigData 2016) |
Pagine | 3230-3239 |
Numero di pagine | 10 |
DOI | |
Stato di pubblicazione | Pubblicato - 2017 |
Evento | IEEE International Conference on Big Data 2016 (IEEE BigData 2016) - Washington, DC Durata: 5 dic 2016 → 8 dic 2016 |
Convegno
Convegno | IEEE International Conference on Big Data 2016 (IEEE BigData 2016) |
---|---|
Città | Washington, DC |
Periodo | 5/12/16 → 8/12/16 |
Keywords
- collective intelligence
- data analysis
- data processing
- internet
- knowledge discovery
- natural language
- software as a service
- web services