Text classification is the foundation and core of text mining, and Naive Bayes is an effective method for it. This paper improves the accuracy of Naive Bayes classification using an improved information gain measure, a feature-selection method, that reduces the impact of low-frequency words. The experiments use a widely used corpus from NLTK. According to the test results, the accuracy of the...
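As background for the feature-selection step this abstract relies on, here is a minimal sketch of plain information gain over binary word-presence features. The abstract's low-frequency adjustment is not specified, so it is not reproduced; the toy corpus and function names below are illustrative assumptions only.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(docs, labels, word):
    """IG of a word-presence feature: H(C) - H(C | word present/absent)."""
    n = len(docs)
    with_w = [lab for d, lab in zip(docs, labels) if word in d]
    without_w = [lab for d, lab in zip(docs, labels) if word not in d]
    cond = 0.0
    for part in (with_w, without_w):
        if part:
            cond += len(part) / n * entropy(part)
    return entropy(labels) - cond

# Hypothetical toy corpus: rank words by IG and keep the top ones as features.
docs = [{"cheap", "pills"}, {"meeting", "agenda"},
        {"cheap", "offer"}, {"project", "agenda"}]
labels = ["spam", "ham", "spam", "ham"]
vocab = set().union(*docs)
ranked = sorted(vocab, key=lambda w: information_gain(docs, labels, w),
                reverse=True)
```

Here "cheap" perfectly separates the two classes, so its gain equals the full class entropy; a word occurring in a single document gets a much lower score, which is exactly why unadjusted IG can over- or under-weight rare terms.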
This paper presents our experimental work on machine classification of Nepali texts. We implemented a Naive Bayes classifier for the task and then augmented it with multinomial lexicon pooling. The lexicon-pooled Naive Bayes classifier obtains better results on the classification task than a plain Naive Bayes implementation. This hybrid approach also helps deal with the unavailability...
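The abstract does not define "lexicon pooling", so the following is only one plausible reading: pool per-class lexicon entries into the multinomial token counts as pseudo-observations, so that lexicon words unseen in training still carry class evidence. All names, the lexicon format, and the pooling weight are assumptions, not the paper's method.

```python
import math
from collections import Counter

def train_mnb(docs, labels, lexicon=None, pool_weight=1.0):
    """Multinomial NB; optionally pool per-class lexicon weights into the
    token counts as pseudo-observations (one reading of 'lexicon pooling')."""
    class_counts = Counter(labels)
    token_counts = {c: Counter() for c in class_counts}
    for d, lab in zip(docs, labels):
        token_counts[lab].update(d)
    if lexicon:  # hypothetical format: {class: {token: weight}}
        for c, toks in lexicon.items():
            for t, w in toks.items():
                token_counts[c][t] += pool_weight * w
    vocab = set().union(*token_counts.values())
    n = len(docs)
    priors = {c: math.log(k / n) for c, k in class_counts.items()}
    return priors, token_counts, vocab

def predict(model, doc):
    """Log-space multinomial NB scoring with Laplace smoothing."""
    priors, token_counts, vocab = model
    best, best_score = None, -math.inf
    for c in priors:
        total = sum(token_counts[c].values()) + len(vocab)
        score = priors[c] + sum(
            math.log((token_counts[c][t] + 1) / total) for t in doc)
        if score > best_score:
            best, best_score = c, score
    return best

# Toy usage: the lexicon supplies evidence for words absent from training data.
docs = [["good", "film"], ["bad", "plot"]]
labels = ["pos", "neg"]
lexicon = {"pos": {"great": 2}, "neg": {"awful": 2}}
model = train_mnb(docs, labels, lexicon)
```

Because "great" never occurs in the training documents, a plain model would score it identically for both classes; the pooled lexicon counts break that tie, which matches the abstract's point about coping with scarce training data.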
To address noise samples in the training dataset, this paper proposes a new method for reducing the size of the training dataset that is applicable to text classification. The method describes the distribution of the training dataset using a representativeness score for each sample within its own class, distinguishing representative samples from noise samples in each class. The new method...
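The paper's exact representativeness score is not given in this excerpt, so the sketch below uses one common stand-in, assumed here for illustration: cosine similarity of each document to its class centroid, with low scorers treated as noise candidates.

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(v * b.get(t, 0) for t, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def representativeness(docs, labels):
    """Hypothetical score: similarity of each doc to its class centroid."""
    centroids = {}
    for d, lab in zip(docs, labels):
        centroids.setdefault(lab, Counter()).update(d)
    return [cosine(Counter(d), centroids[lab])
            for d, lab in zip(docs, labels)]

# Toy class with one off-topic (noise) sample; it scores lowest.
docs = [["ball", "goal"], ["ball", "match"], ["stock", "price"]]
labels = ["sport", "sport", "sport"]
scores = representativeness(docs, labels)
```

Pruning then amounts to dropping samples whose score falls below a chosen threshold or percentile within each class.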
In order to realize text classification and spam filtering, the Naive Bayes algorithm estimates which class a text belongs to from statistical probability values computed over the features of the training samples, but it easily suffers from floating-point underflow. This article optimizes the algorithm by setting a threshold; the optimization strategy is to compare the times...
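The numerical problem the abstract refers to is easy to reproduce: multiplying many per-token likelihoods drives the product below the smallest representable float, while the standard remedy of summing log-probabilities stays finite. The abstract's own threshold-based strategy is not detailed here; this only demonstrates the underlying failure and the log-space alternative.

```python
import math

# 2000 token likelihoods of about 1e-4: the raw product underflows to 0.0,
# destroying the comparison between classes.
likelihoods = [1e-4] * 2000

product = 1.0
for p in likelihoods:
    product *= p
# product is now exactly 0.0 due to float underflow

# Working in the log domain keeps the score finite and comparable:
# argmax_c P(c) * prod(P(t|c)) == argmax_c log P(c) + sum(log P(t|c))
log_score = sum(math.log(p) for p in likelihoods)
```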
Most conventional incremental learning algorithms select only one optimized text sample at each step, which neither considers the relationships between texts in the unlabeled text set nor improves the efficiency of incremental learning. In addition, because of the classifier's limited information storage, the selected optimized text is easily classified incorrectly...