Search results

Items from 1 to 5 out of 5 results

chapter

Table based KNN for categorizing words

Taeho Jo

2016 18th International Conference on Advanced Communication Technology (ICACT) > 692 - 696

2016 18th International Conference on Advanced Communication Technology (ICACT)

In this research, we propose the table based KNN as the approach to the text categorization. In previous works, we discovered that encoding texts into tables improved the performance in the text categorization, so in this research, become to consider the possibility of encoding words into tables as well as texts. In this research, we encode words into tables where entries are texts and their weights,...

chapter

Table based AHC algorithm for clustering words

Taeho Jo

2016 18th International Conference on Advanced Communication Technology (ICACT) > 570 - 575

2016 18th International Conference on Advanced Communication Technology (ICACT)

This research proposes the table based AHC algorithm as the approach to the word clustering task. The results from encoding texts into tables were successful in the previous works on the text categorization and the text clustering, and if oppositely to the case of the text encoding, texts are assumed to be elements of each word, it becomes to be possible to encode words into tables. In this research,...

chapter

Using semantic similarity matrix for defining operations involved in NTSO for clustering 20NewsGroups

Taeho Jo

IEEE Congress on Evolutionary Computation > 1 - 6

2010 IEEE Congress on Evolutionary Computation

In this research, we propose the similarity matrix based version of NTSO as the approach to the text clustering. For using one of traditional approaches to text clustering, documents should be encoded into numerical vectors; encoding so causes the two main problems: the huge dimensionality and the sparse distribution. In order to solve the problems, in this research, we propose to encode documents...

chapter

A new feature selection algorithm in text categorization

Wei Zhao, Yafei Wang, Dan Li

2010 International Symposium on Computer, Communication, Control and Automation (3CA) > 1 > 146 - 149

2010 International Symposium on Computer, Communication, Control and Automation (3CA 2010)

A major problem with text classification problems is the high dimensionality of the feature space. This paper investigates how genetic algorithm and k-means algorithm can help select relevant features in text classification. which uses the genetic algorithm (GA) optimization features to implement global searching, and uses k-means algorithm to selection operation to control the scope of the search,...

chapter

An Improved Genetic Algorithm for Text Feature Selection

Wei Zhao, Yafei Wang

2010 International Conference on Intelligent Computing and Cognitive Informatics > 7 - 10

2010 International Conference on Intelligent Computing and Cognitive Informatics (ICICCI 2010)

High-dimensional feature space affects the quality and efficiency of text categorization. This paper investigates an improved genetic algorithm that how to help select relevant features in text classification. We follow the so-called "region growing" method to initialize the population, and uses k-means algorithm to selection operation to control the scope of the search, ensure the validity...

Filter options

Keywords:
ENCODING
CLUSTERING ALGORITHMS

Publication date

Set your own date range

Content availability

Available (4)
None (1)

INFONA - science communication portal

Search results

Table based KNN for categorizing words

Table based AHC algorithm for clustering words

Using semantic similarity matrix for defining operations involved in NTSO for clustering 20NewsGroups

A new feature selection algorithm in text categorization

An Improved Genetic Algorithm for Text Feature Selection

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options