Search results for: Shengzhong Feng

Items from 1 to 7 out of 7 results

article

MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data

Yaobin He, Haoyu Tan, Wuman Luo, Shengzhong Feng, et al.

Frontiers of Computer Science > 2014 > 8 > 1 > 83-99

DBSCAN (density-based spatial clustering of applications with noise) is an important spatial clustering technique that is widely adopted in numerous applications. As the size of datasets is extremely large nowadays, parallel processing of complex data analysis such as DBSCAN becomes indispensable. However, there are three major drawbacks in the existing parallel DBSCAN algorithms. First, they fail...

chapter

MR-DBSCAN: An Efficient Parallel Density-Based Clustering Algorithm Using MapReduce

Yaobin He, Haoyu Tan, Wuman Luo, Huajian Mao, et al.

2011 IEEE 17th International Conference on Parallel and Distributed Systems > 473 - 480

2011 IEEE 17th International Conference on Parallel and Distributed Systems (ICPADS)

Data clustering is an important data mining technology that plays a crucial role in numerous scientific applications. However, it is challenging because the size of real-world datasets has been growing rapidly to an extra-large scale. Meanwhile, MapReduce is a desirable parallel programming platform that is widely applied in many kinds of data processing fields. In this paper, we propose an efficient parallel...

chapter

Minimum Spanning Tree Based Classification Model for Massive Data with MapReduce Implementation

Jin Chang, Jun Luo, Joshua Zhexue Huang, Shengzhong Feng, et al.

2010 IEEE International Conference on Data Mining Workshops > 129 - 137

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

Rapid growth of data has provided us with more information, yet it challenges traditional techniques for extracting useful knowledge. In this paper, we propose MCMM, a Minimum spanning tree (MST) based Classification model for Massive data with MapReduce implementation. It can be viewed as an intermediate model between the traditional K nearest neighbor method and cluster based classification method,...

chapter

Balanced parallel FP-Growth with MapReduce

Le Zhou, Zhiyong Zhong, Jin Chang, Junjie Li, et al.

2010 IEEE Youth Conference on Information, Computing and Telecommunications > 243 - 246

2010 IEEE Youth Conference on Information, Computing and Telecommunications (YC-ICT 2010)

Frequent itemset mining (FIM) plays an essential role in mining associations, correlations and many other important data mining tasks. Unfortunately, as the volume of datasets grows day by day, most of the FIM algorithms in the literature become ineffective due to either excessive resource requirements or excessive communication cost. In this paper, we propose a balanced parallel FP-Growth algorithm...

chapter

Accelerating MapReduce with Distributed Memory Cache

Shubin Zhang, Jizhong Han, Zhiyong Liu, Kai Wang, et al.

2009 15th International Conference on Parallel and Distributed Systems > 472 - 478

2009 IEEE 15th International Conference on Parallel and Distributed Systems (ICPADS 2009)

MapReduce is a partition-based parallel programming model and framework enabling easy development of scalable parallel programs on clusters of commodity machines. In order to make time-intensive applications benefit from MapReduce on small scale clusters, this paper proposes a new method to improve the performance of MapReduce by using distributed memory cache as a high speed access between map tasks...

chapter

Spatial Queries Evaluation with MapReduce

Shubin Zhang, Jizhong Han, Zhiyong Liu, Kai Wang, et al.

2009 Eighth International Conference on Grid and Cooperative Computing > 287 - 292

2009 Eighth International Conference on Grid and Cooperative Computing (GCC)

Spatial queries include spatial selection queries, spatial join queries, nearest neighbor queries, etc. Most spatial queries are compute-intensive, and an individual query evaluation may take minutes or even hours. Parallelization seems a good solution for such problems. However, parallel programs must communicate efficiently, balance work across all nodes, and address problems such as failed nodes. We...

chapter

Mining User's Interest from Interactive Behaviors in QA System

Zhongying Zhao, Shengzhong Feng, Yongquan Liang, Qingtian Zeng, et al.

2009 First International Workshop on Education Technology and Computer Science > 2 > 1025 - 1029

2009 First International Workshop on Education Technology and Computer Science. ETCS 2009

User interest model, as a key component of user model, is very important for personalized or user adaptive E-learning systems. In this paper, we propose an approach for mining user's interest from interactive behaviors. We also develop and implement a domain-specific interactive QA system oriented to Artificial Intelligence. The course ontology, predefined to describe the skeleton of AI course,...


INFONA - science communication portal
