A clustering algorithm for sample data based on environmental pollution characteristics

Mei Chen; Pengfei Wang; Qiang Chen; Jiadong Wu; Xiaoyun Chen

doi:10.1016/j.atmosenv.2015.02.042

A clustering algorithm for sample data based on environmental pollution characteristics

Mei Chen, Pengfei Wang, Qiang Chen, Jiadong Wu, Xiaoyun Chen

Source

Atmospheric Environment > 2015 > 107 > C > 194-203

Abstract

Environmental pollution has become an issue of serious international concern in recent years. Among the receptor-oriented pollution models, CMB, PMF, UNMIX, and PCA are widely used as source apportionment models. To improve the accuracy of source apportionment and classify the sample data for these models, this study proposes an easy-to-use, high-dimensional EPC algorithm that not only organizes all of the sample data into different groups according to the similarities in pollution characteristics such as pollution sources and concentrations but also simultaneously detects outliers. The main clustering process consists of selecting the first unlabelled point as the cluster centre, then assigning each data point in the sample dataset to its most similar cluster centre according to both the user-defined threshold and the value of similarity function in each iteration, and finally modifying the clusters using a method similar to k-Means. The validity and accuracy of the algorithm are tested using both real and synthetic datasets, which makes the EPC algorithm practical and effective for appropriately classifying sample data for source apportionment models and helpful for better understanding and interpreting the sources of pollution.

Identifiers

journal ISSN :	1352-2310
DOI	10.1016/j.atmosenv.2015.02.042

Authors

Mei Chen

School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China

Pengfei Wang

School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China

Qiang Chen

College of Atmospheric Sciences, Lanzhou University, Lanzhou 730000, China

Jiadong Wu

School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China

see all

Keywords

Environmental pollution High-dimensional sample data Pollution characteristics Clustering algorithm

Additional information

Publication languages: English

Data set: Elsevier

Publisher

Elsevier Science

Fields of science

No field of science has been suggested yet.

article

Read online
Download
Add to read later
Add to collection
Add to followed
Share

Export to bibliography


Assign to other user
	×
Wrong email address

INFONA - science communication portal

A clustering algorithm for sample data based on environmental pollution characteristics $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Mei Chen

Pengfei Wang

Qiang Chen

Jiadong Wu

Keywords

Additional information

Publisher

Fields of science

Fields of science

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

A clustering algorithm for sample data based on environmental pollution characteristics