Search results

chapter

EC3: Combining Clustering and Classification for Ensemble Learning

Tanmoy Chakraborty

2017 IEEE International Conference on Data Mining (ICDM) > 781 - 786

2017 IEEE International Conference on Data Mining (ICDM)

We propose EC3, a novel algorithm that merges classification and clustering together in order to support both binary and multi-class classification. EC3 is based on a principled combination of multiple classification and multiple clustering methods using a convex optimization function. We additionally propose iEC3, a variant of EC3 that handles imbalanced training data. We perform an extensive experimental...

chapter

Ship route extraction and clustering analysis based on automatic identification system data

Sainan Wang, Suixiang Gao, Wenguo Yang

2017 Eighth International Conference on Intelligent Control and Information Processing (ICICIP) > 33 - 38

2017 Eighth International Conference on Intelligent Control and Information Processing (ICICIP)

This paper considers ship route extraction and clustering problem based on Automatic Identification System (AIS) data. For the ships with known Maritime Mobile Service Identify (MMSI), we propose a ship route extraction method by using AIS data. For ship route clustering, hierarchical clustering method is selected. We firstly define a distance between ship routes to measure the dissimilarity of them...

chapter

Using mutual information clustering to discover food allergen cross-reactivity

Kenneth H. Lai, Suzanne V. Blackley, Li Zhou

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 732 - 735

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Mutual information clustering is an agglomerative hierarchical clustering method that has been used to group random variables or sets thereof. Some researchers have found that the normalization method used can lead to oddly-sized clusters that do not line up with expected results. We introduce a new normalization parameter to control the size of the clusters, and apply it to food allergy data from...

chapter

Efficient Computation of Multiple Density-Based Clustering Hierarchies

Antonio Cavalcante Araujo Neto, Joerg Sander, Ricardo J. G. B. Campello, Mario A. Nascimento

2017 IEEE International Conference on Data Mining (ICDM) > 991 - 996

2017 IEEE International Conference on Data Mining (ICDM)

HDBSCAN*, a state-of-the-art density-based hierarchical clustering method, produces a hierarchical organization of clusters in a dataset w.r.t. a parameter mpts. While the performance of HDBSCAN* is robust w.r.t. mpts, choosing a "good" value for it can be challenging: depending on the data distribution, a high or low value for mpts may be more appropriate, and certain data clusters may...

chapter

Automatic density clustering with multiple kernels for high-dimension bioinformatics data

Longlong Liao, Kenli Li, Keqin Li, Qi Tian, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 2105 - 2112

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Clustering is an effective method for data analysis and can be exploited to unknown features of data samples, its applications range from data mining to bioinformatics analysis. Several clustering approaches have been proposed in order to obtain a better trade-off between accuracy and efficiency of the clustering process. It is well-known that no existing clustering algorithm completely satisfies...

chapter

Analysis of clustering algorithms in biological networks

Asuda Sharma, Hesham H. Ali

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 2303 - 2305

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Biological data is often represented as networks, as in the case of protein-protein interactions and metabolic pathways. Modeling, analyzing, and visualizing networks can help make sense of large volumes of data generated by high-throughput experiments. However, due to their size and complex structure, biological networks can be difficult to interpret without further processing. Cluster analysis is...

chapter

A K-medoids based clustering scheme with an application to document clustering

Aytug Onan

2017 International Conference on Computer Science and Engineering (UBMK) > 354 - 359

2017 International Conference on Computer Science and Engineering (UBMK)

Clustering is an important unsupervised data analysis technique, which divides data objects into clusters based on similarity. Clustering has been studied and applied in many different fields, including pattern recognition, data mining, decision science and statistics. Clustering algorithms can be mainly classified as hierarchical and partitional clustering approaches. Partitioning around medoids...

chapter

Dark patches in clustering

Waqar Ishaq, Eliya Buyukkaya

2017 International Conference on Computer Science and Engineering (UBMK) > 806 - 811

2017 International Conference on Computer Science and Engineering (UBMK)

This survey highlights issues in clustering which hinder in achieving optimal solution or generates inconsistent outputs. We called such malignancies as dark patches. We focus on the issues relating to clustering rather than concepts and techniques of clustering. For better insight into the issues of clustering, we categorize dark patches into three classes and then compare various clustering methods...

chapter

Using the combination of particle swarm algorithms and fuzzy approach to provide a clustering method for network nodes with coverage maintenance in wireless sensor networks

Seyyed Amir Reza Taghdisi Heydariyan, Amir Hussein Mohajerzadeh

2017 7th International Conference on Computer and Knowledge Engineering (ICCKE) > 20 - 26

2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)

Wireless sensor network (WSN) is an inexpensive newfound technology with many applications in various fields (such as biology Environment, war and natural disasters). A network consisting of a large number of sensor nodes and collecting information from the environment in a distributed environment. The main limitations include limited energy, low communication capacity, low storage volume, and low...

chapter

Graph-based clustering for identifying region of interest in eye tracker data analysis

Kanghang He, Cheng Yang, Vladimir Stankovic, Lina Stankovic

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Localization of a viewer's region of interest (ROI) on eye gaze signal trajectories acquired by eye trackers is a widely used approach in scene analysis, image compression, and quality of experience assessment. In this paper, we propose a novel clustering approach for ROI estimation from potentially noisy raw eye gaze data, based on signal processing on graphs. The clustering approach adapts graph...

chapter

A correlation-based bi-partition hierarchical clustering method for mode identification of multimode processes

Yilin Wang, Tongshuai Zhang, Hao Ye, Ling Wang

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1645 - 1650

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Clustering is a popular method to deal with the problem for mode identification of multimode processes. Unlike traditional distance-based clustering methods, in this paper, a new correlation-based bi-partition hierarchical clustering (CBHC) method is proposed, which classifies the observations according to their correlation relationships rather than their distances. Motivated by an existing correlation-based...

chapter

Spectral clustering based on JS-divergence for uncertain data

Yingxu Wang, Jiwen Dong, Jin Zhou, Lin Wang, more

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1972 - 1975

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Spectral clustering is one of the most effective methods of data mining, in which the adjacency matrix is constructed by using the similarity matrix. In this paper, to extend spectral clustering method for uncertain data clustering, we propose a new spectral clustering method based on JS-divergence. In the proposed method, the JS-divergence is used to construct the adjacency matrix in the spectral...

chapter

Diagnose for downhole working conditions of the beam pumping unit based on 16-directions chain codes and K-means clustering method

Kun Li, Ying Han

2017 Chinese Automation Congress (CAC) > 7178 - 7182

2017 Chinese Automation Congress (CAC)

The dynamometer card is a main method to analyze downhole working conditions of the beam pumping unit in actual operation. For computer based diagnosis mode, a method based on 16-directions chain codes and K-means clustering is proposed in this paper. First, the 16-directions chain codes are used to recreate boundary contour curve of the dynamometer card; then seven feature vectors which can accurately...

chapter

Deep Adaptive Image Clustering

Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 5880 - 5888

2017 IEEE International Conference on Computer Vision (ICCV)

Image clustering is a crucial but challenging task in machine learning and computer vision. Existing methods often ignore the combination between feature learning and clustering. To tackle this problem, we propose Deep Adaptive Clustering (DAC) that recasts the clustering problem into a binary pairwise-classification framework to judge whether pairs of images belong to the same clusters. In DAC, the...

chapter

Implementation of K-means clustering method to distribution of high school teachers

Triyanna Widiyaningtyas, Martin Indra Wisnu Prabowo, M. Ardhika Mulya Pratama

2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI) > 1 - 6

2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)

Currently, the government is still having difficulties in distributing teachers. The current problem is not just about less teachers, but also more teachers in some cities. The problem of unequal distribution of teachers then became dependent on local government. The distribution of teachers now can not be centralized because of the decentralization system implemented in Indonesia. Clustering in data...

chapter

Improved K-means algorithm based on hybrid rice optimization algorithm

Chuan Liu, Chunzhi Wang, Jixiong Hu, Zhiwei Ye

2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) > 2 > 788 - 791

2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS)

Clustering analysis is an active research branch in the area of data mining due to its simplicity and rapidity. However, K-means algorithm has the shortcomings of heavily depending on the initial clustering center and easily falls into local optimum. In this paper, we consider a deep research on K-means algorithm of optimization. We put forward the first selected initial clustering center of K-means...

chapter

Progressive clustering of manifold-modeled data based on tangent space variations

Gokhan Gokdogan, Elif Vural

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

An important research topic of the recent years has been to understand and analyze manifold-modeled data for clustering and classification applications. Most clustering methods developed for data of non-linear and low-dimensional structure are based on local linearity assumptions. However, clustering algorithms based on locally linear representations can tolerate difficult sampling conditions only...

chapter

HACGA: An artifacts-based clustering approach for malware classification

Oliviu-Bogdan Botocan, Gabriela Czibula

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 5 - 12

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

More and more sophisticated malware attacks are developed nowadays and new variants of existing malicious software are released daily. Malware clustering is often applied to identify patterns of malicious software, with similar samples being grouped together and considered variants of the same malware family. In this paper we propose an automated technique based on agglomerative hierarchical clustering...

chapter

An elliptical-shaped density-based classification algorithm for detection of entangled clusters

Stanley Smith, Mylene Pischella, Michel Terre

2017 25th European Signal Processing Conference (EUSIPCO) > 316 - 320

2017 25th European Signal Processing Conference (EUSIPCO)

We present a density-based clustering method producing a covering of the dataset by ellipsoidal structures in order to detect possibly entangled clusters. We first introduce an unconstrained version of the algorithm which does not require any assumption on the number of clusters. Then a constrained version using a priori knowledge to improve the bare clustering is discussed. We evaluate the performance...

chapter

Grey-incidence clustering decision-making method with three-parameter interval grey number based on regret theory

Ye Li, Yufei Niu, Wenliang Wang, Bingjun Li

2017 International Conference on Grey Systems and Intelligent Services (GSIS) > 211 - 218

2017 International Conference on Grey Systems and Intelligent Services (GSIS)

Aiming at the multiple attribute decision making problem with three-parameter interval grey numbers, a grey-incidence clustering decision making method based on regret theory is proposed in this paper. First, according to the idea of TOPSIS method, a kind of comprehensive grey interval incidence coefficient of three-parameter interval grey number is defined, and the “regret-rejoice” value is calculated...

INFONA - science communication portal

Search results

EC3: Combining Clustering and Classification for Ensemble Learning

Ship route extraction and clustering analysis based on automatic identification system data

Using mutual information clustering to discover food allergen cross-reactivity

Efficient Computation of Multiple Density-Based Clustering Hierarchies

Automatic density clustering with multiple kernels for high-dimension bioinformatics data

Analysis of clustering algorithms in biological networks

A K-medoids based clustering scheme with an application to document clustering

Dark patches in clustering

Using the combination of particle swarm algorithms and fuzzy approach to provide a clustering method for network nodes with coverage maintenance in wireless sensor networks

Graph-based clustering for identifying region of interest in eye tracker data analysis

A correlation-based bi-partition hierarchical clustering method for mode identification of multimode processes

Spectral clustering based on JS-divergence for uncertain data

Diagnose for downhole working conditions of the beam pumping unit based on 16-directions chain codes and K-means clustering method

Deep Adaptive Image Clustering

Implementation of K-means clustering method to distribution of high school teachers

Improved K-means algorithm based on hybrid rice optimization algorithm

Progressive clustering of manifold-modeled data based on tangent space variations

HACGA: An artifacts-based clustering approach for malware classification

An elliptical-shaped density-based classification algorithm for detection of entangled clusters

Grey-incidence clustering decision-making method with three-parameter interval grey number based on regret theory

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options