Use of Relevant Principal Components to Define a Simplified Multivarate Test Procedure of Optimal Clutering

Marta Nai Ruscone, Giuseppe Boari

Risultato della ricerca: Contributo in libroContributo a convegno

Abstract

Clustering is the problem of partitioning data into a finite number, k, of homogeneous and separate groups, called clusters. A good choice of k is essential for obtaining meaningful clusters. The intraclass correlation coefficient r is frequently used to measure the degree of intragroup resemblance (for example of characteristics such as blood pressure, weight and height). The theory concerning r is well established for single variables analysis (Sheff`e, 1959; Rao, 1973). In this paper, this task is addressed by means of a multiple test procedure defining the optimal cluster solution under normality assumption of the involved variables. Relevant principal components are used to define a simplified multivariate test of null intraclass correlation procedure and the proposal of a new statistical stopping rule is evaluated.
Lingua originaleEnglish
Titolo della pubblicazione ospiteCladag 2013. 9th Meeting of the Classification and Data Analysis Group. Book of Abstracts
Pagine1-4
Numero di pagine4
Stato di pubblicazionePubblicato - 2013
EventoCladag 2013 - Modena
Durata: 18 set 201320 feb 2014

Convegno

ConvegnoCladag 2013
CittàModena
Periodo18/9/1320/2/14

Keywords

  • Principal components
  • cluster analysis
  • intra class correlation
  • union intersection principle

Fingerprint

Entra nei temi di ricerca di 'Use of Relevant Principal Components to Define a Simplified Multivarate Test Procedure of Optimal Clutering'. Insieme formano una fingerprint unica.

Cita questo