The aim of this article is to propose a procedure to cluster functional observations in a subspace of reduced dimension. The dimensional reduction is obtained by constraining the cluster centroids to lie into a subspace which preserves the maximum amount of discriminative information contained in the original data. The model is estimated by using penalized least squares to take into account the functional nature of the data. The smoothing is carried out within the clustering and its amount is adaptively calibrated. A simulation study shows how the combination of these two elements, feature-extraction and automatic data-driven smoothing, improves the performance of clustering by reducing irrelevant and redundant information in the data. The effectiveness of the proposal is demonstrated by an application to a real dataset regarding a speech recognition problem. Implementation details of the algorithm together with a computer code are available in the online supplements. © 2012 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America.

Clustering curves on a reduced subspace

Gattone S. A.
Primo
;
Rocci R.
Secondo
2012-01-01

Abstract

The aim of this article is to propose a procedure to cluster functional observations in a subspace of reduced dimension. The dimensional reduction is obtained by constraining the cluster centroids to lie into a subspace which preserves the maximum amount of discriminative information contained in the original data. The model is estimated by using penalized least squares to take into account the functional nature of the data. The smoothing is carried out within the clustering and its amount is adaptively calibrated. A simulation study shows how the combination of these two elements, feature-extraction and automatic data-driven smoothing, improves the performance of clustering by reducing irrelevant and redundant information in the data. The effectiveness of the proposal is demonstrated by an application to a real dataset regarding a speech recognition problem. Implementation details of the algorithm together with a computer code are available in the online supplements. © 2012 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America.
File in questo prodotto:
File Dimensione Formato  
jcgs.pdf

Solo gestori archivio

Descrizione: Article
Tipologia: PDF editoriale
Dimensione 772.96 kB
Formato Adobe PDF
772.96 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11564/761949
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 11
social impact