Abstract
The estimation of the intrinsic dimension is an essential step in many data analyses involving, for example, dimensionality reduction. Likelihood-based estimators, which rely on the distributions of the ratios of distances between nearest neighbors, have been recently proposed. However, these distributional results de- pend on several assumptions. One of the most important is the local homogeneity of the point process characterizing the data-generating mechanism. By exploiting a recent theoretical result, we develop the Consecutive Ratio Paths, a graphical tool to assess the validity of the local-homogeneity assumption in a dataset. This tool is also helpful to uncover the presence of multiple latent manifolds, a potential indicator of the existence of heterogeneous intrinsic dimensions.
Lingua originale | English |
---|---|
Titolo della pubblicazione ospite | Book of Short Paper SIS 2022 |
Pagine | 1233-1238 |
Numero di pagine | 6 |
Stato di pubblicazione | Pubblicato - 2022 |
Evento | SIS 2022 - Caserta Durata: 22 giu 2022 → 24 giu 2022 |
Convegno
Convegno | SIS 2022 |
---|---|
Città | Caserta |
Periodo | 22/6/22 → 24/6/22 |
Keywords
- Pareto distribution
- graphic tool
- intrinsic dimension
- model- based estimation
- nearest neighbors