Abstract
This work aims to design a Gibbs sampling algorithm for posterior Bayesian inference of a Dirichlet process mixture model based on Hamming distributed kernels, a probability measure built upon the Hamming distance. This model is employed to provide model-based clustering analysis of categorical data with no natural ordering. The proposed algorithm leverages a split-and-merge Markov chain Monte Carlo technique to address the curse of dimensionality issue and improve the search over the space of random partitions.
| Lingua originale | Inglese |
|---|---|
| Titolo della pubblicazione ospite | Statistics for Innovation III |
| Editore | Springer |
| Pagine | 147-152 |
| Numero di pagine | 6 |
| ISBN (stampa) | 9783031959943 |
| DOI | |
| Stato di pubblicazione | Pubblicato - 2025 |
Keywords
- Clustering
- Hamming distribution
- Markov chain Monte Carlo
- Nominal data