In this work we describe a clustering and feature selection technique applied to the analysis of international dietary profiles. An asymmetric entropy-based measure for assessing the similarity between two clusterizations, also taking into account subclustering relationships, is at the core of the technique, together with PCA. Then, a feature analysis of the dataset with respect to its hierarchical clusterization is performed. This way, most significant features of the dataset are found and a deep understanding of the data distribution is made possible.
PCA Based Feature Selection Applied to the Analysis of the International Variation in Diet
MARIANI COSTANTINI, Renato;VERGINELLI, Fabio
2007-01-01
Abstract
In this work we describe a clustering and feature selection technique applied to the analysis of international dietary profiles. An asymmetric entropy-based measure for assessing the similarity between two clusterizations, also taking into account subclustering relationships, is at the core of the technique, together with PCA. Then, a feature analysis of the dataset with respect to its hierarchical clusterization is performed. This way, most significant features of the dataset are found and a deep understanding of the data distribution is made possible.File in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.