Search results

chapter

The research on detecting complex network community structure

Wang Zongjiang

2011 3rd International Conference on Computer Research and Development > 3 > 163 - 166

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

This paper studies community detection algorithms for complex networks and improves an algorithm based on K-means. With reference to node density properties, it puts forward a community structure detection algorithm (BSTN) based on the similarity between the nodes of a complex network. The algorithm greatly reduces the number of iterations; using the algorithm in the computer generated stochastic...

chapter

A Modified k-means Algorithm for Clustering Problem with Balancing Constraints

Sun Yuepeng, Liu Min, Wu Cheng

2011 Third International Conference on Measuring Technology and Mechatronics Automation > 1 > 127 - 130

2011 International Conference on Measuring Technology and Mechatronics Automation (ICMTMA)

A clustering problem with balancing constraints is studied in this paper, meaning that the number of samples in each cluster has to be at least a pre-given value. A modified k-means clustering algorithm is proposed, which adopts the proposed heuristic cluster assignment algorithm to deal with the balancing constraints. Numerical computation shows that the proposed algorithm can deal with the balancing...

chapter

A new hybrid approach for data clustering

D Yazdani, S Golyari, M R Meybodi

2010 5th International Symposium on Telecommunications > 914 - 919

2010 5th International Symposium on Telecommunications (IST)

Data clustering has been applied in multiple fields such as machine learning, data mining, wireless sensor networks and pattern recognition. One of the most famous clustering approaches is K-means, which has been used effectively in many clustering problems, but this algorithm has some problems such as convergence to local optima and sensitivity to initial points. Artificial fish swarm algorithm (AFSA)...

chapter

An efficient K-Means clustering algorithm for reducing time complexity using uniform distribution data points

D Napoleon, P G Lakshmi

Trendz in Information Sciences & Computing (TISC 2010) > 42 - 45

2nd International Conference on Trendz in Information Sciences & Computing (TISC 2010)

Data mining has been defined as "The nontrivial extraction of implicit, previously unknown, and potentially useful information from data". Clustering is the automated search for groups of related observations in a data set. The K-Means method is one of the most commonly used clustering techniques for a variety of applications. This paper proposes a method for making the K-Means algorithm...

chapter

An improved clustering technique based on statistical model preprocessing for gene expression dataset

N Tajunisha, V Saravanan

Trendz in Information Sciences & Computing (TISC 2010) > 46 - 49

2nd International Conference on Trendz in Information Sciences & Computing (TISC 2010)

Data mining has become an important topic in effective analysis of gene expression data due to its wide application in the biomedical industry. Within a gene expression matrix there are usually several particular macroscopic phenotypes of samples. Selection of genes most relevant and informative for certain phenotypes is an important aspect in gene expression analysis. Currently most of the research...

chapter

Unsupervised Speaker Clustering in a Linear Discriminant Subspace

T Giannakopoulos, S Petridis

2010 Ninth International Conference on Machine Learning and Applications > 1005 - 1009

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

We present an approach for grouping single-speaker speech segments into speaker-specific clusters. Our approach is based on applying the K-means clustering algorithm to a suitable discriminant subspace, where the Euclidean distance reflects speaker differences. A core feature of our approach is approximating speaker-conditional statistics, which are not available, with single-speaker segment statistics,...

chapter

Patient-Specific Seizure Detection from Intra-cranial EEG Using High Dimensional Clustering

Haimonti Dutta, David Waltz, Karthik M Ramasamy, Phil Gross, et al.

2010 Ninth International Conference on Machine Learning and Applications > 782 - 787

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Automatic seizure detection is becoming popular in modern epilepsy monitoring units since it assists diagnostic monitoring and reduces manual review of large volumes of EEG recordings. In this paper, we describe the application of machine learning algorithms for building patient-specific seizure detectors on multiple frequency bands of intra-cranial electroencephalogram (iEEG) recorded by a dense...

chapter

Geo-visualization and Clustering to Support Epidemiology Surveillance Exploration

Jingyuan Zhang, Hao Shi

2010 International Conference on Digital Image Computing: Techniques and Applications > 381 - 386

2010 International Conference on Digital Image Computing: Techniques and Applications (DICTA 2010)

WebEpi is an epidemiological WebGIS service developed for the Population Health Epidemiology Unit of the Tasmania Department of Health and Human Services (DHHS). Epidemiological geographical studies help analyze public health surveillance and medical situations. It is still a challenge to conduct large-scale geographical information exploration of epidemiology surveillance based on patterns and relationships...

chapter

Space Partitioning for Scalable K-Means

D Pettinger, G Di Fatta

2010 Ninth International Conference on Machine Learning and Applications > 319 - 324

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

K-Means is a popular clustering algorithm which adopts an iterative refinement procedure to determine data partitions and to compute their associated centres of mass, called centroids. The straightforward implementation of the algorithm is often referred to as 'brute force' since it computes a proximity measure from each data point to each centroid at every iteration of the K-Means process. Efficient...

chapter

Clustering of Image Data Set Using K-Means and Fuzzy K-Means Algorithms

V K Dehariya, S K Shrivastava, R C Jain

2010 International Conference on Computational Intelligence and Communication Networks > 386 - 391

2010 International Conference on Computational Intelligence and Communication Networks (CICN 2010)

Clustering, or data grouping, is a key initial procedure in image processing. At present, the size of company databases has increased dramatically; these databases contain large amounts of text and images. Companies need to mine these huge databases and make accurate decisions quickly in order to gain a marketing advantage. As an image is a collection of pixels, it is difficult to...

chapter

An Efficient Dimension Reduction and Optimal Cluster Center Initialization Technique

D S Rajput, P K Singh, M Bhattacharya

2010 International Conference on Computational Intelligence and Communication Networks > 503 - 508

2010 International Conference on Computational Intelligence and Communication Networks (CICN 2010)

Most clustering algorithms perform poorly when the dimensionality of the data set increases, because some dimensions contain irrelevant or noisy data and random initialization of cluster centres yields a locally optimal clustering. In this paper, we propose a technique for reducing the effect of high dimensionality and random initialization of cluster centres. It consists of three phases....

chapter

Research and Implementation of an Anomaly Detection Model Based on Clustering Analysis

Li Han

2010 International Symposium on Intelligence Information Processing and Trusted Computing > 458 - 462

2010 International Symposium on Intelligence Information Processing and Trusted Computing (IPTC 2010)

An IDS (intrusion detection system) is an active defense technology. This paper mainly focuses on intrusion detection based on data mining. The aim is to improve the detection rate and decrease the false alarm rate, and the main research method is clustering analysis. An intrusion detection algorithm and model are proposed, and corresponding simulation experiments are presented. Firstly, a method to reduce...

chapter

Improving the K-means algorithm using improved downhill simplex search

E Saboori, S Parsazad, A Sadeghi

2010 2nd International Conference on Software Technology and Engineering > 2 > V2-350 - V2-354

2010 2nd International Conference on Software Technology and Engineering (ICSTE 2010)

The k-means algorithm is one of the best-known and most popular clustering algorithms. K-means seeks an optimal partition of the data by minimizing the sum of squared error with an iterative optimization procedure, which belongs to the category of hill-climbing algorithms. Hill-climbing searches are known for converging to local optima. Since k-means can converge to a local optimum,...

chapter

Based on k-Means and Fuzzy k-Means Algorithm Classification of Precipitation

Yang Lihua, Deng Meilan

2010 International Symposium on Computational Intelligence and Design > 1 > 218 - 221

2010 3rd International Symposium on Computational Intelligence and Design (ISCID 2010)

In this paper, the authors use K-means and fuzzy K-means to classify precipitation in JingDeZhen City. The results show that fuzzy k-means is the more efficient data clustering algorithm, with greater value for promotion and practical application.

chapter

System anomaly detection in distributed systems through MapReduce-Based log analysis

Yan Liu, Wei Pan, Ning Cao, Guangwei Qiao

2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE) > 6 > V6-410 - V6-413

2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010)

System anomaly detection is very important for development, maintenance and performance refinement in large-scale distributed systems. Analyzing the system logs produced by distributed systems is a good way to troubleshoot and diagnose problems. However, due to the increasing scale and complexity of distributed systems, the logs tend to be very large. Thus, it's inefficient for...

chapter

Augmenting Rapid Clustering Method for Social Network Analysis

J Prabhu, M Sudharshan, M Saravanan, G Prasad

2010 International Conference on Advances in Social Networks Analysis and Mining > 407 - 408

2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2010)

In data mining, clustering of large datasets is currently one of the most important techniques, widely applied in many applications including social network analysis. Applying a more specific pre-processing method to prepare the data for clustering algorithms is considered a significant step in generating meaningful segments. In this paper we propose an innovative clustering technique...

chapter

An Increased Performance of Clustering High Dimensional Data Using Principal Component Analysis

N Tajunisha, V Saravanan

2010 First International Conference on Integrated Intelligent Computing > 17 - 21

2010 First International Conference on Integrated Intelligent Computing (ICIIC 2010)

In many application domains such as information retrieval, computational biology, and image processing, the data dimension is usually very high. Developing effective clustering methods for high-dimensional datasets is a challenging problem due to the curse of dimensionality. The k-means clustering algorithm is used in many practical applications, but it is computationally expensive and the quality...

chapter

Community structure of the Chinese document network based on content similarity

Xin Pan, Jian-Guo Liu, Guishi Deng

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 4 > 1515 - 1519

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Based on complex network theory, we propose a clustering algorithm based on content similarity. First, the Chinese documents are represented by the vector-space model, and the content similarity between any two documents is computed as their cosine similarity. Each network node is then defined as a document, and each edge weight is defined as the similarity obtained by the cosine similarity...

chapter

Improve K-means clustering for audio data by exploring a reasonable sampling rate

Gang Chen, Bo Han

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 4 > 1639 - 1642

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

K-means clustering is sensitive to starting points and is time-consuming for large-scale data, such as audio. Sampling is widely applied to find "better" starting points and so speed up the convergence of clustering. However, how to choose a reasonable sampling rate remains a problem. In this paper, we report our initial exploration of locating reasonable sampling-rates...

chapter

Careful Seeding Based on Independent Component Analysis for k-Means Clustering

Takashi Onoda, Miho Sakai, Seiji Yamada

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 3 > 112 - 115

2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT)

The k-means method is a widely used clustering technique because of its simplicity and speed. However, the clustering result depends heavily on the chosen initial value. In this report, we propose a seeding method with independent component analysis for the k-means method. Using a benchmark dataset, we evaluate the performance of our proposed method and compare it with other seeding methods.

INFONA - science communication portal

