Please use this identifier to cite or link to this item:
Title: Measuring similarity of complex and heterogeneous data in clustering of large data sets
Authors: Nicolau, Helena Bacelar
Nicolau, Fernando
Sousa, Áurea
Nicolau, Leonor Bacelar
Keywords: Cluster analysis
Different type variables
Similarity coefficient
Three-way data
Issue Date: 2009
Publisher: Polish Academy of Sciences
Citation: Biocybernetics and Biomedical Engineering 2009;29(2):9–18
Abstract: Cluster analysis or classification usually concerns a set of exploratory multivariate data analysis methods and techniques for finding a clustering structure on a dataset. That may refer either to groups of statistical data units or to groups of variables. In this work we deal with a generalization of this paradigm concerning clustering of complex data described by three different types of variables, frequently present in a three-way context. We obtain compatible versions of the same affinity coefficient for measuring similarity between statistical data units described by those three types of variables. A global generalized similarity coefficient is analyzed for such kind of mixed data, often arising in data mining or knowledge mining.
Peer review: yes
ISSN: 0208-5216
Appears in Collections:FM-IMP-Artigos em Revistas Internacionais

Files in This Item:
File Description SizeFormat 
Data_cluster_analysis.pdf143,7 kBAdobe PDFView/Open    Request a copy

FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote Degois 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.