TY - JOUR
T1 - Medical Information Extraction With NLP-Powered QABots: A Real-World Scenario
AU - Crema, Claudio
AU - Verde, Federico
AU - Tiraboschi, Pietro
AU - Marra, Camillo
AU - Arighi, Andrea
AU - Fostinelli, Silvia
AU - Giuffré, Guido Maria
AU - Maschio, Vera Pacoova Dal
AU - L'Abbate, Federica
AU - Solca, Federica
AU - Poletti, Barbara
AU - Silani, Vincenzo
AU - Rotondo, Emanuela
AU - Borracci, Vittoria
AU - Vimercati, Roberto
AU - Crepaldi, Valeria
AU - Inguscio, Emanuela
AU - Filippi, Massimo
AU - Caso, Francesca
AU - Rosati, Alessandra Maria
AU - Quaranta, Davide
AU - Binetti, Giuliano
AU - Pagnoni, Ilaria
AU - Morreale, Manuela
AU - Burgio, Francesca
AU - Stanzani-Maserati, Michelangelo
AU - Capellari, Sabina
AU - Pardini, Matteo
AU - Girtler, Nicola
AU - Piras, Federica
AU - Piras, Fabrizio
AU - Lalli, Stefania
AU - Perdixi, Elena
AU - Lombardi, Gemma
AU - Di Tella, Sonia
AU - Costa, Alfredo
AU - Capelli, Marco
AU - Fundarò, Cira
AU - Manera, Marina
AU - Muscio, Cristina
AU - Pellencin, Elisa
AU - Lodi, Raffaele
AU - Tagliavini, Fabrizio
AU - Redolfi, Alberto
PY - 2024
Y1 - 2024
N2 - The advent of computerized medical recording systems in healthcare facilities has made data retrieval tasks easier, compared to manual recording. Nevertheless, the potential of the information contained within medical records remains largely untapped, mostly due to the time and effort required to extract data from unstructured documents. Natural Language Processing (NLP) represents a promising solution to this challenge, as it enables the use of automated text-mining tools for clinical practitioners. In this work, we present the architecture of the Virtual Dementia Institute (IVD), a consortium of sixteen Italian hospitals, using the NLP Extraction and Management Tool (NEMT), a (semi-) automated end-to-end pipeline that extracts relevant information from clinical documents and stores it in a centralized REDCap database. After defining a common Case Report Form (CRF) across the IVD hospitals, we implemented NEMT, the core of which is a Question Answering Bot (QABot) based on a modern NLP model. This QABot is fine-tuned on thousands of examples from IVD centers. Detailed descriptions of the process to define a common minimum dataset, Inter-Annotator Agreement calculated on clinical documents, and NEMT results are provided. The best QABot performance show an Exact Match score (EM) of 78.1%, a F1-score of 84.7%, a Lenient Accuracy (LAcc) of 0.834, and a Mean Reciprocal Rank (MRR) of 0.810. EM and F1 scores outperform the same metrics obtained with ChatGPTv3.5 (68.9% and 52.5%, respectively). With NEMT the IVD has been able to populate a database that will contain data from thousands of Italian patients, all screened with the same procedure. NEMT represents an efficient tool that paves the way for medical information extraction and exploitation for new research studies.
AB - The advent of computerized medical recording systems in healthcare facilities has made data retrieval tasks easier, compared to manual recording. Nevertheless, the potential of the information contained within medical records remains largely untapped, mostly due to the time and effort required to extract data from unstructured documents. Natural Language Processing (NLP) represents a promising solution to this challenge, as it enables the use of automated text-mining tools for clinical practitioners. In this work, we present the architecture of the Virtual Dementia Institute (IVD), a consortium of sixteen Italian hospitals, using the NLP Extraction and Management Tool (NEMT), a (semi-) automated end-to-end pipeline that extracts relevant information from clinical documents and stores it in a centralized REDCap database. After defining a common Case Report Form (CRF) across the IVD hospitals, we implemented NEMT, the core of which is a Question Answering Bot (QABot) based on a modern NLP model. This QABot is fine-tuned on thousands of examples from IVD centers. Detailed descriptions of the process to define a common minimum dataset, Inter-Annotator Agreement calculated on clinical documents, and NEMT results are provided. The best QABot performance show an Exact Match score (EM) of 78.1%, a F1-score of 84.7%, a Lenient Accuracy (LAcc) of 0.834, and a Mean Reciprocal Rank (MRR) of 0.810. EM and F1 scores outperform the same metrics obtained with ChatGPTv3.5 (68.9% and 52.5%, respectively). With NEMT the IVD has been able to populate a database that will contain data from thousands of Italian patients, all screened with the same procedure. NEMT represents an efficient tool that paves the way for medical information extraction and exploitation for new research studies.
KW - biomedical informatics
KW - clinical neuroscience
KW - Natural language processing
KW - question answering (information retrieval)
KW - text mining
KW - biomedical informatics
KW - clinical neuroscience
KW - Natural language processing
KW - question answering (information retrieval)
KW - text mining
UR - https://publicatt.unicatt.it/handle/10807/313904
UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85202725656&origin=inward
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85202725656&origin=inward
U2 - 10.1109/jbhi.2024.3450118
DO - 10.1109/jbhi.2024.3450118
M3 - Article
SN - 2168-2194
VL - 28
SP - 6906
EP - 6917
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 11
ER -