Abstract
This work describes an automatic text classification method implemented in a software tool called NETHIC, which takes advantage of the inner capabilities of highly-scalable neural networks combined with the expressiveness of hierarchical taxonomies. As such, NETHIC succeeds in bringing about a mechanism for text classification that proves to be significantly effective as well as efficient. The tool had undergone an experimentation process against both a generic and a domain-specific corpus, outputting promising results. On the basis of this experimentation, NETHIC has been now further refined and extended by adding a document embedding mechanism, which has shown improvements in terms of performance on the individual networks and on the whole hierarchical model.
| Lingua originale | Inglese |
|---|---|
| Titolo della pubblicazione ospite | Lecture Notes in Business Information Processing |
| Editore | Springer |
| Pagine | 57-77 |
| Numero di pagine | 21 |
| Volume | 378 |
| ISBN (stampa) | 978-3-030-40782-7 |
| DOI | |
| Stato di pubblicazione | Pubblicato - 2020 |
All Science Journal Classification (ASJC) codes
- Sistemi Informativi di Gestione
- Ingegneria del Controllo e dei Sistemi
- Business e Management Internazionale
- Sistemi Informativi
- Modellazione e Simulazione
- Sistemi Informativi e Gestione dell’Informazione
Keywords
- Document embedding
- Machine learning
- Neural networks
- Taxonomies
- Text classification