Abstract
Gaussian mixture models (GMM) are the most-widely employed approach to perform model-based clustering of continuous features. Grievously, with the increasing availability of high-dimensional datasets, their direct applicability is put at stake: GMMs suffer from the curse of dimensionality issue, as the number of parameters grows quadratically with the number of variables. To this extent, a methodological link between Gaussian mixtures and Gaussian graphical models has recently been established in order to provide a framework for performing penalized model-based clustering in presence of large precision matrices. Notwithstanding, current methodologies do not account for the fact that groups may be under or over-connected, thus implicitly assuming similar levels of sparsity across clusters. We overcome this limitation by defining data-driven and component specific penalty factors, automatically accounting for different degrees of connections within groups. A real data experiment on handwritten digits recognition showcases the validity of our proposal.
| Lingua originale | Inglese |
|---|---|
| Titolo della pubblicazione ospite | Building Bridges between Soft and Statistical Methodologies for Data Science |
| Editore | Springer |
| Pagine | 73-78 |
| Numero di pagine | 6 |
| Volume | 1433 |
| ISBN (stampa) | 978-3-031-15508-6 |
| DOI | |
| Stato di pubblicazione | Pubblicato - 2023 |
Keywords
- Penalised model-based clustering
Fingerprint
Entra nei temi di ricerca di 'Penalized Model-Based Clustering with Group-Dependent Shrinkage Estimation'. Insieme formano una fingerprint unica.Cita questo
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver