Search results for: Jing Zhang

Items from 1 to 4 out of 4 results

chapter

K-Means Clustering with Bagging and MapReduce

Hai-Guang Li, Gong-Qing Wu, Xue-Gang Hu, Jing Zhang, more

2011 44th Hawaii International Conference on System Sciences > 1 - 8

2011 44th Hawaii International Conference on System Sciences (HICSS 2011)

Clustering is one of the most widely used techniques for exploratory data analysis. Across all disciplines, from social sciences over biology to computer science, people try to get a first intuition about their data by identifying meaningful groups among the data objects. K-means is one of the most famous clustering algorithms. Its simplicity and speed allow it to run on large data sets. However,...

chapter

A 2-Tier Clustering Algorithm with Map-Reduce

Jing Zhang, Gongqing Wu, Haiguang Li, Xuegang Hu, more

2010 Fifth Annual ChinaGrid Conference > 160 - 166

Fifth ChinaGrid Annual Conference (ChinaGrid 2010)

In the field of data mining, clustering is one of the important methods. K-Means is a typical distance-based clustering algorithm; 2-tier clustering should implement scalable clustering by means of dividing, sampling and knowledge integrating. Among those tools of distributed processing, Map-Reduce has been widely embraced by both academia and industry. Hadoop is an open-source parallel and distributed...

chapter

An Intelligent Spam Filtering System Based on Fuzzy Clustering

Yong Hu, Ce Guo, Xiangzhou Zhang, Zhihui Guo, more

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 7 > 515 - 519

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

Spam, also known as unsolicited bulk email (UBE), is becoming increasingly harmful for email traffics. Filtering is a simple and efficient way to combat against spam. Machine-learning-based classification algorithms are of excellent performance in filtering spam. However, the classifiers need be trained with a group of training samples before being able to work. Heavy manual labor and privacy problems...

chapter

Name Disambiguation Using Atomic Clusters

Feng Wang, Juanzi Li, Jie Tang, Jing Zhang, more

2008 The Ninth International Conference on Web-Age Information Management > 357 - 364

2008 9th International Conference on Web-Age Information Management (WAIM)

Name ambiguity is a critical problem in many applications, in particular in the online bibliography systems, such as DBLP and CiteSeer. Previously, several clustering based methods have been proposed although, the problem still presents to be a big challenge for both research and industry communities. In this paper, we present a complementary study to the problem from another point of view. We propose...

Filter options

Keywords:
CLUSTERING ALGORITHMS
PATTERN CLUSTERING
Publication type:
book

Publication date

Set your own date range

Content availability

Available (3)
None (1)

Keywords

TRAINING (3)
ALGORITHM DESIGN AND ANALYSIS (2)
CLASSIFICATION ALGORITHMS (2)
DATA MINING (2)
DISTRIBUTED COMPUTING (2)
DISTRIBUTED PROCESSING (2)
ELECTRONIC MAIL (2)
2-TIER CLUSTERING (1)
2-TIER CLUSTERING ALGORITHM (1)
ARNETMINER.ORG (1)
ATOMIC CLUSTER (1)
ATOMIC CLUSTER FINDING (1)
BAGGING (1)
BIBLIOGRAPHIC SYSTEMS (1)
CITESEER (1)
CLASSIFICATION ALGORITHM (1)
CLUSTERING METHODS (1)
CLUSTERING-BASED METHOD (1)
COMPUTATIONAL MODELING (1)
COMPUTER SCIENCE (1)
DATA ANALYSIS (1)
DATA OBJECT (1)
DATA PRIVACY (1)
DATA SET (1)
DBLP (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTED CLUSTERING (1)
DISTRIBUTED DATABASES (1)
DISTRIBUTED PROGRAMMING (1)
EMAIL TRAFFICS (1)
ENSEMBLE LEARNING METHOD BAGGING (1)
FEATURE EXTRACTION (1)
FILTERING (1)
FUZZY CLUSTERING (1)
FUZZY SET THEORY (1)
HIDDEN MARKOV MODELS (1)
INTELLIGENT SPAM FILTERING SYSTEM (1)
K-MEANS (1)
K-MEANS CLUSTERING ALGORITHM (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
MACHINE LEARNING ALGORITHMS (1)
MACHINE LEARNING BASED CLASSIFICATION (1)
MAP REDUCE (1)
MAP-REDUCE (1)
MAPREDUCE (1)
MERGING (1)
NAME DISAMBIGUATION (1)
ONLINE BIBLIOGRAPHY SYSTEMS (1)
PATTERN CLASSIFICATION (1)
PRIVACY PROBLEMS (1)
PROGRAMMING (1)
SCALABLE CLUSTERING (1)
SOCIAL SCIENCES (1)
TEXT ANALYSIS (1)
UNSOLICITED BULK EMAIL (1)
UNSOLICITED E-MAIL (1)
more

INFONA - science communication portal

Search results for: Jing Zhang

K-Means Clustering with Bagging and MapReduce

A 2-Tier Clustering Algorithm with Map-Reduce

An Intelligent Spam Filtering System Based on Fuzzy Clustering

Name Disambiguation Using Atomic Clusters

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options