Search results

chapter

Unsupervised feature selection for text classification via word embedding

Weikang Rui, Jinwen Liu, Yawei Jia

2016 IEEE International Conference on Big Data Analysis (ICBDA) > 1 - 5

2016 IEEE International Conference on Big Data Analysis (ICBDA)

The key of big text documents data analysis is to classify those text documents. To classify those text documents, it is necessary to represent those text documents as vectors which is vector space model (VSM). A powerful vector space model should remain the classification information with dimensions as little as possible. To achieve that, it is important to select most effective features for text...

chapter

A web text classification technique for unlabeled training samples

Francois Tchiegue, Rui Li, Shilong Ma

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 437 - 440

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

The common classification is conducted under the supervised learning algorithms, which design classifiers through learning the labeled training samples. However, in actual situations, it is very costly to acquire class-labeled samples, because manually labeling documents requires a lot of time and efforts from experts. Therefore, it restrains the text classification to a great extent. To solve the...

chapter

Towards Reliable Clustering of English Text Documents Using Correlation Coefficient

Hrishikesh Bhaumik, Anirban Mukherjee, Siddhartha Bhattacharyya, Manojit Chattopadhyay

2014 International Conference on Computational Intelligence and Communication Networks > 530 - 535

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

This paper proposes a new approach for clustering English text documents, based on finding the pair wise correlation of documents in a given set of text documents. The correlation coefficient for each pair of documents is calculated on the basis of ranks given to the words in the documents. The ranking of the words occurring in a document is computed on the basis of weights of the words calculated...

article

Multilabel Text Categorization Based on Fuzzy Relevance Clustering

Shie-Jue Lee, Jung-Yi Jiang

IEEE Transactions on Fuzzy Systems > 2014 > 22 > 6 > 1457 - 1471

We propose a fuzzy based method for multilabel text classification in which a document can belong to one or more than one category. In text categorization, the number of the involved features is usually huge, causing the curse of the dimensionality problem. Besides, a category can be a nonconvex region, which is a union of several overlapping or disjoint subregions. An automatic classification system,...

chapter

Clustering based two-stage text classification requiring minimal training data

Xue Zhang, Wang-xin Xiao

2012 International Conference on Systems and Informatics (ICSAI2012) > 2233 - 2237

2012 International Conference on Systems and Informatics (ICSAI)

Clustering aided classification methods are based on the assumption that the learned clusters under the guidance of initial training data can somewhat characterize the underlying distribution of the data set. However, our experiments show that whether such assumption holds is based on both the separability of the considered data set and the size of the training data set. It is often violated on data...

INFONA - science communication portal

Search results

Unsupervised feature selection for text classification via word embedding

A web text classification technique for unlabeled training samples

Towards Reliable Clustering of English Text Documents Using Correlation Coefficient

Multilabel Text Categorization Based on Fuzzy Relevance Clustering

Clustering based two-stage text classification requiring minimal training data

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Unsupervised feature selection for text classification via word embedding

A web text classification technique for unlabeled training samples

Towards Reliable Clustering of English Text Documents Using Correlation Coefficient

Multilabel Text Categorization Based on Fuzzy Relevance Clustering

Clustering based two-stage text classification requiring minimal training data

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options