Search results

Items from 1 to 7 out of 7 results

chapter

A novel text classification based on Mahalanobis distance

Suli Zhang, Xin Pan

2011 3rd International Conference on Computer Research and Development > 3 > 156 - 158

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

In the text mining field, KNN (K Nearest Neighbors) is one of the oldest and simplest methods of text classification. However, it is known to be sensitive to the distance (or similarity) function used in classifying a test instance; this disadvantage can cause low classification accuracy and limit the KNN classifier's use in text classification. In this paper, we introduce Mahalanobis...

chapter

Research and Implement of Chinese Text Classifier Based on Naïve Bayes Method

Jian Huang, Zhongdi Cen, Qiuhong Zheng

2010 Sixth International Conference on Semantics, Knowledge and Grids > 426 - 428

2010 Sixth International Conference on Semantics Knowledge and Grid (SKG 2010)

The Naïve Bayes classifier has proved to be one of the most effective classifiers and is widely used. It applies statistical theory to text classification. This paper researches and implements a Chinese text classifier in Java based on the Naïve Bayes method. First, the paper describes the text classification system, including text information expression, extraction and the method of Chinese...

chapter

A new feature weighting method based on probability distribution in imbalanced text classification

Leilei Chu, Hui Gao, Wenbo Chang

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2335 - 2339

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Many real-world text classification tasks involve imbalanced training examples. Categories with fewer examples are under-represented, and their classifiers often perform far below a satisfactory level. We propose a new approach that uses a probability distribution to assign feature weights and apply it to the Naive Bayes classifier. The method is evaluated in our experiments on the FuDan Chinese Corpus. The experimental...

chapter

Research and improvement for feature selection on naive bayes text classifier

Guo Qiang

2010 2nd International Conference on Future Computer and Communication > 2 > V2-156 - V2-159

2010 2nd International Conference on Future Computer and Communication (ICFCC 2010)

Effective feature selection is very important for a classifier. An improved feature selection method can enhance classifier efficiency, as practical tests validate. This paper studies the principles, merits and limitations of the prevalent feature selection methods. Then, the paper adopts a two-stage selection modulus, which is calculated from the positions of paragraphs and sentences respectively,...

chapter

Classifying Text with Statistically Selected Features to Closely Related Categories

M. Janaki Meena, K.R. Chandran

2009 International Conference on Advances in Recent Technologies in Communication and Computing > 297 - 301

2009 International Conference on Advances in Recent Technologies in Communication and Computing. ARTCom 2009

Text classification continues to be one of the most researched problems due to the continuously increasing amount of electronic documents and digital data. Classifying documents into closely related categories is the most complex task in text categorization. Feature selection is an essential preprocessing step for improving the efficiency and accuracy of text classifiers by removing redundant and...

chapter

Semi-supervised text classification from unlabeled documents using class associated words

Hong-qi Han, Dong-hua Zhu, Xue-feng Wang

2009 International Conference on Computers & Industrial Engineering > 1255 - 1260

2009 International Conference on Computers & Industrial Engineering (CIE39)

Automatically classifying text documents is an important field in machine learning. Unsupervised text classification does not need training data but is often criticized for clustering blindly. Supervised text classification needs large quantities of labeled training data to achieve high accuracy. However, in practice, labeled samples are often difficult, expensive or time-consuming to obtain. Meanwhile,...

chapter

A Novel Text Representation Model for Text Classification

Jun Wang, Yiming Zhou

2008 First International Conference on Intelligent Networks and Intelligent Systems > 702 - 705

2008 First International Conference on Intelligent Networks and Intelligent Systems (ICINIS)

The text representation in text classification is usually a sequence of terms. When the number of terms becomes very high, performing existing text categorization tasks is very time-consuming. In this paper we present a novel text representation model for text classification that greatly reduces the required resources. This model represents text with several features. Each feature corresponds...

Filter options

Data set: ieee
Keywords: TRAINING; BAYES METHODS; NAIVE BAYES CLASSIFIER; TEXT CATEGORIZATION


INFONA - science communication portal
