Experimentation of an automatic resolution method for protein abbreviations in full-text papers

Daniele Toti, Paolo Atzeni, Fabio Polticelli

Risultato della ricerca: Contributo in libroContributo a convegno

8 Citazioni (Scopus)

Abstract

We report and comment the experimental results of the PRAISED system, which implements an automatic method for discovering and resolving a wide range of protein name abbreviations from the full-text versions of scientific articles. This system has been recently proposed as part of a framework for creating and maintaining a publicly-accessible abbreviation repository. The testing phase was carried out against the widely used Medstract Gold Standard Corpus and a relevant subset of real scientific papers extracted from the PubMed database. As far as the Medstract corpus is concerned, we obtained significantly high results in terms of recall, precision and overall correctness. As for the fulltext papers, results inevitably varied, due to the complex and often chaotic nature of the confronted domain; even so, we detected encouraging levels of recall and extremely fast execution times. The major strength of the system lies in addressing the unstructuredness of the scientific publications and being able to save time and effort for extracting protein-related information in an automatic fashion, while at the same time keeping computational overhead to a minimum thanks to its light-weight approach. Copyright © 2011 ACM.
Lingua originaleEnglish
Titolo della pubblicazione ospite2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011
Pagine465-467
Numero di pagine3
DOI
Stato di pubblicazionePubblicato - 2011
Evento2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011 - Chicago, IL, usa
Durata: 1 ago 20113 ago 2011

Convegno

Convegno2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011
CittàChicago, IL, usa
Periodo1/8/113/8/11

Keywords

  • Abbreviations
  • Data mining
  • Extraction
  • Proteins
  • Resolution

Fingerprint

Entra nei temi di ricerca di 'Experimentation of an automatic resolution method for protein abbreviations in full-text papers'. Insieme formano una fingerprint unica.

Cita questo