A tool to validate the assumptions on ratios of nearest neighbors’ distances: the Consecutive Ratio Paths

Francesco Denti*, Antonietta Mira

*Autore corrispondente per questo lavoro

Risultato della ricerca: Contributo in libroContributo a convegno

Abstract

The estimation of the intrinsic dimension is an essential step in many data analyses involving, for example, dimensionality reduction. Likelihood-based estimators, which rely on the distributions of the ratios of distances between nearest neighbors, have been recently proposed. However, these distributional results de- pend on several assumptions. One of the most important is the local homogeneity of the point process characterizing the data-generating mechanism. By exploiting a recent theoretical result, we develop the Consecutive Ratio Paths, a graphical tool to assess the validity of the local-homogeneity assumption in a dataset. This tool is also helpful to uncover the presence of multiple latent manifolds, a potential indicator of the existence of heterogeneous intrinsic dimensions.
Lingua originaleEnglish
Titolo della pubblicazione ospiteBook of Short Paper SIS 2022
Pagine1233-1238
Numero di pagine6
Stato di pubblicazionePubblicato - 2022
EventoSIS 2022 - Caserta
Durata: 22 giu 202224 giu 2022

Convegno

ConvegnoSIS 2022
CittàCaserta
Periodo22/6/2224/6/22

Keywords

  • Pareto distribution
  • graphic tool
  • intrinsic dimension
  • model- based estimation
  • nearest neighbors

Fingerprint

Entra nei temi di ricerca di 'A tool to validate the assumptions on ratios of nearest neighbors’ distances: the Consecutive Ratio Paths'. Insieme formano una fingerprint unica.

Cita questo