Search results for: C. Sakata

Items from 1 to 10 out of 10 results

chapter

CVis — Towards a novel visualization tool to explore the relationship between input and output partitions in multi-objective clustering ensembles

Katti Faceli, Tiemi C. Sakata, Julia Handl

2017 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB) > 1 - 6

2017 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)

Ensemble methods for clustering take a collection of input partitions, produced for the same data set, and generate an ensemble partition that tries to preserve the information carried in this collective. Acceptance of the resulting partition(s) by decision makers can be a problem, due to the inherent complexity of ensemble techniques, and the associated lack of intuition on how a consensus has been...

chapter

PVis — Partitions' visualizer: Extracting knowledge by visualizing a collection of partitions

Katti Faceli, Tiemi C. Sakata, Andre C. P. L. F. de Carvalho, Marcilio C. P. de Souto

2014 International Joint Conference on Neural Networks (IJCNN) > 3056 - 3061

2014 International Joint Conference on Neural Networks (IJCNN)

Recent advances in cluster analysis highlight the importance of finding multiple meaningful partitions and point out to the need for approaches to evaluate them. They also suggest that the evaluation should consider knowledge of a domain expert. In this paper, we present a visualization method, called PVis¹ (Partition's Visualizer), that allows the integrated visualization of a collection of partitions...

chapter

The Assessment of the Quality of Sugar Using Electronic Tongue and Machine Learning Algorithms

Tiemi C. Sakata, Katti Faceli, Tiago A. Almeida, Antonio Riul Junior, more

2012 11th International Conference on Machine Learning and Applications > 1 > 538 - 541

2012 Eleventh International Conference on Machine Learning and Applications (ICMLA)

The correct classification of sugar according to its physico-chemical characteristics directly influences the value of the product and its acceptance by the market. This study shows that using an electronic tongue system along with established techniques of supervised learning leads to the correct classification of sugar samples according to their qualities. In this paper, we offer two new real, public...

chapter

A Comparison of External Clustering Evaluation Indices in the Context of Imbalanced Data Sets

Marcilio C.P. de Souto, Andre L.V. Coelho, Katti Faceli, Tiemi C. Sakata, more

2012 Brazilian Symposium on Neural Networks > 49 - 54

2012 Brazilian Symposium on Neural Networks (SBRN)

For highly imbalanced data sets, almost all the instances are labeled as one class, whereas far fewer examples are labeled as the other classes. In this paper, we present an empirical comparison of seven different clustering evaluation indices when used to assess partitions generated from highly imbalanced data sets. Some of the metrics are based on matching of sets (F-measure), information theory...

chapter

Improvements in the Partitions Selection Strategy for Set of Clustering Solutions

T C Sakata, K Faceli, M C P de Souto, A C P L F de Carvalho

2010 Eleventh Brazilian Symposium on Neural Networks > 49 - 54

2010 Eleventh Brazilian Symposium on Neural Networks (SBRN 2010)

No clustering algorithm is guaranteed to find actual groups in any dataset. Thus, the selection of the most suitable clustering algorithm to be applied to a given dataset is not easy. To deal with this problem, one can apply various clustering algorithms to the dataset, generating a set of partitions (solutions). Next, one can choose the best partition generated, according to a given validation measure...

article

Partitions selection strategy for set of clustering solutions

Katti Faceli, Tiemi C. Sakata, Marcilio C.P. de Souto, André C.P.L.F. de Carvalho

Neurocomputing > 2010 > 73 > 16-18 > 2809-2819

Clustering is a difficult task: there is no single cluster definition and the data can have more than one underlying structure. Pareto-based multi-objective genetic algorithms (e.g., MOCK—Multi-Objective Clustering with automatic K-determination and MOCLE—Multi-Objective Clustering Ensemble) were proposed to tackle these problems. However, the output of such algorithms can often contains a high number...