Use of Relevant Principal Components to Define a Simplified Multivarate Test Procedure of Optimal Clutering

Marta Nai Ruscone, Giuseppe Boari

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Clustering is the problem of partitioning data into a finite number, k, of homogeneous and separate groups, called clusters. A good choice of k is essential for obtaining meaningful clusters. The intraclass correlation coefficient r is frequently used to measure the degree of intragroup resemblance (for example of characteristics such as blood pressure, weight and height). The theory concerning r is well established for single variables analysis (Sheff`e, 1959; Rao, 1973). In this paper, this task is addressed by means of a multiple test procedure defining the optimal cluster solution under normality assumption of the involved variables. Relevant principal components are used to define a simplified multivariate test of null intraclass correlation procedure and the proposal of a new statistical stopping rule is evaluated.
Original languageEnglish
Title of host publicationCladag 2013. 9th Meeting of the Classification and Data Analysis Group. Book of Abstracts
Pages1-4
Number of pages4
Publication statusPublished - 2013
EventCladag 2013 - Modena
Duration: 18 Sept 201320 Feb 2014

Conference

ConferenceCladag 2013
CityModena
Period18/9/1320/2/14

Keywords

  • Principal components
  • cluster analysis
  • intra class correlation
  • union intersection principle

Fingerprint

Dive into the research topics of 'Use of Relevant Principal Components to Define a Simplified Multivarate Test Procedure of Optimal Clutering'. Together they form a unique fingerprint.

Cite this