Search results for: Jiarong Cai

Items from 1 to 4 out of 4 results

chapter

Clustering Massive Text Data Streams by Semantic Smoothing Model

Yubao Liu, Jiarong Cai, Jian Yin, Ada Wai-Chee Fu

Lecture Notes in Computer Science > Advanced Data Mining and Applications > Regular Papers > 389-400

Clustering text data streams is an important issue in the data mining community and has a number of applications, such as news group filtering, text crawling, document organization, and topic detection and tracing. However, most methods are similarity-based approaches that use the TF*IDF scheme to represent the semantics of text data, which often leads to poor clustering quality. In this paper, we first...

chapter

An Effective Maximal Subspace Clustering Algorithm Based on Enumeration Tree

Jian Yin, Zhilan Huang, Yubao Liu, Jiarong Cai, et al.

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) > 1 > 572 - 576

2007 International Conference on Fuzzy Systems and Knowledge Discovery

Subspace clustering is one of the best approaches for discovering meaningful clusters in high-dimensional space. However, existing algorithms often produce highly redundant clusters that are not easy to understand. In this paper, based on the enumeration tree of subspaces, we propose a new subspace clustering algorithm, MSC, to find the clusters hidden in the maximal subspace. MSC uses the...

chapter

An Improved Semantic Smoothing Model for Model-Based Document Clustering

Jiarong Cai, Yubao Liu, Jian Yin

Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007) > 3 > 670 - 675

2007 8th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing

Semantic smoothing has recently been proposed as an efficient way to improve document clustering quality. However, the existing semantic smoothing model is not effective at improving the quality of partitional clustering. In this paper, inspired by the TF*IDF schema and the background elimination strategy, we first introduce an improved semantic smoothing model, which is suitable...

chapter

An Efficient Clustering Algorithm for Small Text Documents

Yubao Liu, Jiarong Cai, Jian Yin, Zhilan Huang

2006 Seventh International Conference on Web-Age Information Management Workshops > 16

2006 Seventh International Conference on Web-Age Information Management Workshops

Clustering text documents into different category groups is an important problem. The size of the desired clusters is an important requirement for a clustering solution. In this paper, we present an efficient clustering algorithm called RTC, based on the spherical k-means algorithm, for small text documents. In RTC, we present a new method for choosing initial centers based on the density and farthest distance...

Keywords

PATTERN CLUSTERING (2)
AGGLOMERATIVE-PARTITIONAL CLUSTERING (1)
CLUSTER DISTRIBUTION MONOTONY (1)
CLUSTERING (1)
DATA MINING (1)
ENUMERATION TREE (1)
MAXIMAL SUBSPACE CLUSTERING ALGORITHM (1)
MODEL-BASED TEXT DOCUMENT CLUSTERING (1)
SEMANTIC SMOOTHING (1)
SEMANTIC SMOOTHING MODEL (1)
SUBSPACE ENUMERATION TREE (1)
TEXT ANALYSIS (1)
TEXT DATA STREAMS (1)

Data set

ieee (3)
Springer (1)

INFONA - science communication portal
