Search results

Items from 1 to 5 out of 5 results

chapter

Text Categorization with Considering Temporal Patterns of Term Usages

H Abe, S Tsumoto

2010 IEEE International Conference on Data Mining Workshops > 800 - 807

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In document categorization methods that use similarity measures based on word vectors, it is important to determine the key words that characterize each document. However, conventional methods select key words based on their frequency and/or a particular importance index such as tf-idf. In this paper, we propose a method to characterize each document by using temporal clusters of technical term usages...

chapter

Evaluation of Text Clustering Based on Iterative Classification

Wang Xiaohua, Lou Jia

2009 International Conference on Computational Intelligence and Software Engineering > 1 - 5

2009 International Conference on Computational Intelligence and Software Engineering

Text clustering is a useful and inexpensive way to organize vast text repositories into meaningful topic categories. Although text clustering can be seen as an alternative to supervised text categorization, the question remains of how to determine whether the resulting clusters are of sufficient quality in a real-life application. However, it is difficult to evaluate a given clustering of documents. Furthermore,...

chapter

An efficient k-means algorithm integrated with Jaccard distance measure for document clustering

M.-U.-S. Shameem, R. Ferdous

2009 First Asian Himalayas International Conference on Internet > 1 - 6

2009 First Asian Himalayas International Conference on Internet. AH-ICI 2009

Document clustering is a widely studied problem in text categorization. It is the process of partitioning or grouping a given set of documents into disjoint clusters where documents in the same cluster are similar. K-means, one of the simplest unsupervised learning algorithms, solves the well-known clustering problem following a simple and easy way to classify a given data set through a certain number...

chapter

Evaluation of Partition-Based Text Clustering Techniques to Categorize Indic Language Documents

D.A. Meedeniya, A.S. Perera

2009 IEEE International Advance Computing Conference > 1497 - 1500

2009 IEEE International Advance Computing Conference. IACC 2009

Wide availability of electronic data has led to vast interest in text analysis, information retrieval and text categorization methods. To provide a better service, there is a need for non-English document analysis and categorization systems, as are currently available for English text documents. This study is mainly focused on categorizing Indic language documents. The main techniques examined...

chapter

Recent trends in Data Mining (DM): Document Clustering of DM Publications

Yi Peng, Gang Kou, Zhengxin Chen, Yong Shi

2006 International Conference on Service Systems and Service Management > 2 > 1653 - 1659

2006 International Conference on Service Systems and Service Management

Data mining (DM) brings together knowledge and theories from several fields, including databases, machine learning, optimization, statistics, and data visualization, and has been applied to various real-life applications. A large number of data mining articles have been published. The goal of this study is to establish an overview of the past and current data mining research activities from the title and abstract...

Filter options

Content availability:
Available
Keywords:
PATTERN CLUSTERING
DOCUMENT CLUSTERING

Keywords

CLUSTERING ALGORITHMS (4)
DATA MINING (3)
CLASSIFICATION ALGORITHMS (2)
INDEXES (2)
ALGORITHM DESIGN AND ANALYSIS (1)
ARTIFICIAL NEURAL NETWORKS (1)
BIBLIOGRAPHICAL DOCUMENT (1)
CATEGORIZATION (1)
CONFERENCES (1)
CONTENT ANALYSIS (1)
DATA MINING FIELD (1)
DATA MINING JOURNAL (1)
DATA MINING PUBLICATION (1)
DATA PRE-PROCESSING (1)
DOCUMENT CATEGORIZATION METHOD (1)
ELECTRONIC PUBLISHING (1)
ENTROPY (1)
F1-MEASURE (1)
FEATURE EXTRACTION (1)
FREQUENCY CONVERSION (1)
FREQUENCY MODULATION (1)
GAUSSIAN MIXTURE MODEL CLUSTERING (1)
GAUSSIAN PROCESSES (1)
INDIC LANGUAGE DOCUMENT CATEGORIZATION (1)
INFORMATION RETRIEVAL (1)
INVERSE DOCUMENT FREQUENCY (1)
ITERATIVE CLASSIFICATION (1)
ITERATIVE METHODS (1)
JACCARD DISTANCE MEASURE (1)
K-MEANS ALGORITHM (1)
K-MEANS CLUSTERING (1)
KEYWORD DETERMINATION (1)
LATENT SEMANTIC ANALYSIS (1)
MARINE VEHICLES (1)
MATRIX DECOMPOSITION (1)
NATURAL LANGUAGES (1)
PARTITION-BASED TEXT CLUSTERING TECHNIQUE (1)
PARTITIONING ALGORITHMS (1)
PATTERN CLASSIFICATION (1)
PRECISION (1)
PROBABILITY DENSITY FUNCTION (1)
RECALL (1)
SILICON (1)
SIMILARITY MEASURE (1)
STATISTICS (1)
SUPERVISED TEXT CATEGORIZATION (1)
TECHNICAL TERM USAGE (1)
TEMPORAL CLUSTER (1)
TEMPORAL PATTERN (1)
TEMPORAL PATTERNS (1)
TERM USAGE INDEX (1)
TEXT CLUSTERING (1)
TEXT FREQUENCY (1)
TEXT MINING (1)
TEXT REPOSITORY (1)
TOPICS CATEGORY (1)
TRAINING (1)
UNSUPERVISED LEARNING (1)
UNSUPERVISED LEARNING ALGORITHMS (1)
WORD VECTOR (1)

INFONA - science communication portal
