Improvement and Parallelism of k-Means Clustering Algorithm

Jinlan Tian; Lin Zhu; Suqin Zhang; Lu Liu

doi:10.1016/S1007-0214(05)70069-9

Improvement and Parallelism of k-Means Clustering Algorithm

Jinlan Tian, Lin Zhu, Suqin Zhang, Lu Liu

Source

Tsinghua Science & Technology > 2005 > 10 > 3 > 277-281

Abstract

Abstract The k-means clustering algorithm is one of the most commonly used algorithms for clustering analysis. The traditional k-means algorithm is, however, inefficient while working on large numbers of data sets and improving the algorithm efficiency remains a problem. This paper focuses on the efficiency issues of cluster algorithms. A refined initial cluster centers method is designed to reduce the number of iterative procedures in the algorithm. A parallel k-means algorithm is also studied for the problem of the operation limitation of a single processor machine when given huge data sets. The analytical results demonstrate that these improvements can greatly enhance the efficiency of the k-means algorithm, i.e., allow the grouping of a large number of data sets more accurately and more quickly. The analysis has theoretical and practical importance for work on the improvement and parallelism of cluster algorithms.

Identifiers

journal ISSN :	1007-0214
DOI	10.1016/S1007-0214(05)70069-9

Authors

Keywords

data mining cluster analysis k-means algorithm parallelism

Additional information

Publication languages: English

Data set: Elsevier

Publisher

Elsevier Science

Fields of science

No field of science has been suggested yet.

article

Read online
Download
Add to read later
Add to collection
Add to followed
Share

Export to bibliography


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Improvement and Parallelism of k-Means Clustering Algorithm $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Jinlan Tian

Lin Zhu

Suqin Zhang

Lu Liu

Keywords

Additional information

Publisher

Fields of science

Fields of science

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Improvement and Parallelism of k-Means Clustering Algorithm