A framework for semi-automatic identification, disambiguation and storage of protein-related abbreviations in scientific literature

  • P. Atzeni*
  • F. Polticelli
  • Daniele Toti
  • *Corresponding author for this work
  • Roma Tre University

Research output: Chapter in book, Conference contribution

Abstract

We propose a framework for identifying, disambiguating and storing protein-related abbreviations as found in the full texts of scientific papers, in order to build and maintain a publicly available abbreviation repository via a semi-automatic process. This process involves information extraction methods and techniques for acronym identification and resolution, based on lexical clues and syntactical, largely domain-independent criteria. A dictionary and an ontology for proteins provide the means for matching and disambiguating the biological entities. User feedback is gathered at the end of the process and the confirmed entries are then stored and made available to the scientific community for further reviewing. © 2011 IEEE.
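
The abstract mentions acronym identification based on lexical clues. As an illustration only, the following is a minimal sketch of one common clue of that kind: a short form in parentheses immediately after its long form, accepted when its characters match the initials of the preceding words. This is not the authors' implementation; the function name `find_abbreviations`, the regular expression and the initial-matching rule are assumptions made for the example, and the dictionary- and ontology-based disambiguation step is not covered.

```python
import re

# Hypothetical clue: a 2-10 character token in parentheses, starting with a letter.
PAREN = re.compile(r"\(([A-Za-z][A-Za-z0-9-]{1,9})\)")


def find_abbreviations(text):
    """Return (short_form, long_form) pairs found via the 'long form (SF)' pattern."""
    pairs = []
    for match in PAREN.finditer(text):
        short = match.group(1)
        # Characters of the short form that must each match a word initial.
        letters = [c.lower() for c in short if c.isalnum()]
        # Alphanumeric words that precede the opening parenthesis.
        words = re.findall(r"[A-Za-z0-9]+", text[:match.start()])
        candidate = words[-len(letters):]
        # Accept only if every character equals the initial of the aligned word.
        if len(candidate) == len(letters) and all(
            word[0].lower() == c for word, c in zip(candidate, letters)
        ):
            # The long form is rebuilt from the matched words joined by spaces.
            pairs.append((short, " ".join(candidate)))
    return pairs


if __name__ == "__main__":
    sample = "The green fluorescent protein (GFP) is widely used."
    print(find_abbreviations(sample))
```

A rule this strict misses long forms that contain stop words or contribute several letters per word, which is one reason a semi-automatic process with user confirmation, as the abstract describes, is a reasonable design.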
Original language: English
Title of host publication: Proceedings - International Conference on Data Engineering
Publisher: N/A
Pages: 59-61
Number of pages: 3
ISBN (print): 978-1-4244-9195-7
DOI
Publication status: Published - 2011

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Information Systems

Keywords

  • abbreviations
