|
Repositório da Universidade de Lisboa >
Faculdade de Medicina (FM) >
Instituto de Medicina Preventiva (FM-IMP) >
FM-IMP-Artigos em Revistas Internacionais >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10451/5659
|
| Title: | Measuring similarity of complex and heterogeneous data in clustering of large data sets |
| Authors: | Nicolau, Helena Bacelar Nicolau, Fernando Sousa, Áurea Nicolau, Leonor Bacelar |
| Keywords: | Cluster analysis Different type variables Similarity coefficient Three-way data Statistics |
| Issue Date: | 2009 |
| Publisher: | Polish Academy of Sciences |
| Citation: | Biocybernetics and Biomedical Engineering 2009;29(2):9–18 |
| Abstract: | Cluster analysis or classification usually concerns a set of exploratory multivariate data analysis methods and techniques for finding a clustering structure on a dataset. That may refer either to groups of statistical data units or to groups of variables. In this work we deal with a generalization of this paradigm concerning clustering of complex data described
by three different types of variables, frequently present in a three-way context. We obtain compatible versions of the same affinity coefficient for measuring similarity between statistical data units described by those three types of variables. A global generalized similarity coefficient is analyzed for such kind of mixed data, often arising in data mining or knowledge mining. |
| Peer Reviewed: | yes |
| URI: | http://hdl.handle.net/10451/5659 http://www.ibib.waw.pl/bbe/bbefulltext/BBE_29_2_009_FT.pdf |
| ISSN: | 0208-5216 |
| Appears in Collections: | FM-IMP-Artigos em Revistas Internacionais
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|