Abstract
Abstract. Gaussian graphical models are useful tools for exploring network structures in multivariate normal data. In this paper we are interested in situations
where data show departures from Gaussianity, therefore requiring alternative modeling distributions. The multivariate t-distribution, obtained by dividing each component of the data vector by a gamma random variable, is a straightforward generalization to accommodate deviations from normality such as heavy tails. Since
different groups of variables may be contaminated to a different extent, Finegold
and Drton (2014) introduced the Dirichlet t-distribution, where the divisors are
clustered using a Dirichlet process. In this work, we consider a more general class
of nonparametric distributions as the prior on the divisor terms, namely the class
of normalized completely random measures (NormCRMs). To improve the effectiveness of the clustering, we propose modeling the dependence among the divisors
through a nonparametric hierarchical structure, which allows for the sharing of
parameters across the samples in the data set. This desirable feature enables us
to cluster together different components of multivariate data in a parsimonious
way. We demonstrate through simulations that this approach provides accurate
graphical model inference, and apply it to a case study examining the dependence
structure in radiomics data derived from The Cancer Imaging Atlas.
Lingua originale | English |
---|---|
pagine (da-a) | 1271-1301 |
Numero di pagine | 31 |
Rivista | Bayesian Analysis |
Volume | 14 |
DOI | |
Stato di pubblicazione | Pubblicato - 2019 |
Keywords
- graphical models, Bayesian nonparametrics, normalized completely random measures, hierarchical models, radiomics data, t-distribution.