TY - JOUR
T1 - A Laplace-based model with flexible tail behavior
AU - Tortora, Cristina
AU - Franczak, Brian C.
AU - Bagnato, Luca
AU - Punzo, Antonio
PY - 2024
Y1 - 2024
N2 - The proposed multiple scaled contaminated asymmetric Laplace (MSCAL) distribution is an extension of the multivariate asymmetric Laplace distribution to allow for a different excess kurtosis on each dimension and for more flexible shapes of the hyper-contours. These peculiarities are obtained by working on the principal component (PC) space. The structure of the MSCAL distribution has the further advantage of allowing for automatic PC-wise outlier detection – i.e., detection of outliers separately on each PC – when convenient constraints on the parameters are imposed. The MSCAL is fitted using a Monte Carlo expectation-maximization (MCEM) algorithm that uses a Monte Carlo method to estimate the orthogonal matrix of eigenvectors. A simulation study is used to assess the proposed MCEM in terms of computational efficiency and parameter recovery. In a real data application, the MSCAL is fitted to a real data set containing the anthropometric measurements of monozygotic/dizygotic twins. Both a skewed bivariate subset of the full data, perturbed by some outlying points, and the full data are considered.
AB - The proposed multiple scaled contaminated asymmetric Laplace (MSCAL) distribution is an extension of the multivariate asymmetric Laplace distribution to allow for a different excess kurtosis on each dimension and for more flexible shapes of the hyper-contours. These peculiarities are obtained by working on the principal component (PC) space. The structure of the MSCAL distribution has the further advantage of allowing for automatic PC-wise outlier detection – i.e., detection of outliers separately on each PC – when convenient constraints on the parameters are imposed. The MSCAL is fitted using a Monte Carlo expectation-maximization (MCEM) algorithm that uses a Monte Carlo method to estimate the orthogonal matrix of eigenvectors. A simulation study is used to assess the proposed MCEM in terms of computational efficiency and parameter recovery. In a real data application, the MSCAL is fitted to a real data set containing the anthropometric measurements of monozygotic/dizygotic twins. Both a skewed bivariate subset of the full data, perturbed by some outlying points, and the full data are considered.
KW - Contaminated distributions
KW - Directional outlier detection
KW - Monte Carlo expectation-maximization algorithm
KW - Multiple scaled distributions
KW - Normal variance-mean mixtures
KW - Contaminated distributions
KW - Directional outlier detection
KW - Monte Carlo expectation-maximization algorithm
KW - Multiple scaled distributions
KW - Normal variance-mean mixtures
UR - http://hdl.handle.net/10807/277397
U2 - 10.1016/j.csda.2023.107909
DO - 10.1016/j.csda.2023.107909
M3 - Article
SN - 0167-9473
VL - 192
SP - 1
EP - 19
JO - COMPUTATIONAL STATISTICS & DATA ANALYSIS
JF - COMPUTATIONAL STATISTICS & DATA ANALYSIS
ER -