Search results

Items from 1 to 7 out of 7 results

chapter

Reducing Samples Learning for Text Categorization

Yan Zhan, Hao Chen

2010 3rd International Conference on Information Management, Innovation Management and Industrial Engineering > 2 > 586 - 589

2010 3rd International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII 2010)

Text Categorization (TC) is an important component in many information organization and information management tasks. In Text Categorization question there will be too many instances which need much computation time and memory requirement. It proposes a Generalization Capability (GC) algorithm that has the highest average generalization accuracy in these experiments, especially in the presence of...

chapter

Feature Weighting Scheme for Text Categorization Based on Rough Set

Huang Lican, Xu Xin, Zhao Yuhong, Gao Junzhou

2010 First International Conference on Networking and Distributed Computing > 186 - 188

First International Conference on Networking and Distributed Computing (ICNDC 2010)

Feature weighting is an important issue in text categorization. In this paper we analyze the characteristics of rough set theory and TF-IDF, and propose a feature weighting scheme for text categorization by applying approximation accuracy and approximation quality in variable rough set model. The decision information of a feature for categorization is introduced into the weight, which reflects the...

chapter

Text Classification Algorithm Study Based on Rough Set Theory

Lin Xun, Li Zhishu, Zhou Yong, Xue Yuan

2010 International Forum on Information Technology and Applications > 1 > 117 - 120

2010 International Forum on Information Technology and Applications (IFITA 2010)

Text Classification is an important research area in Chinese information processing, whose goal is on the base of analyzing the text content to give the allocation of one or more of the text to more appropriate classes to enhance the text retrieval, storage, applications such as processing efficiency. In this paper, text dataset is transformed to information system without attribute of decision making...

chapter

Feature selection method based on the improved of mutual information and genetic algorithm

Qiu Ye, Liu Peiyu, Yang Yuzhen

2009 IEEE International Symposium on IT in Medicine&Education > 1 > 836 - 839

2009 IEEE International Symposium on IT in Medicine & Education (ITME2009)

The feature selection is a key method of text categorization technology, this paper proposed a text feature selection method based on the improved of mutual information and genetic algorithm. Used the improved of mutual information algorithm to do the initial choose to removing redundancy and noise words at first, and then used the genetic algorithm to training the template which generate by a subset...

chapter

Structural poisson mixtures for classification of documents

J. Grim, J. Novovicova, P. Somol

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

Considering the statistical text classification problem we approximate class-conditional probability distributions by structurally modified Poisson mixtures. By introducing the structural model we can use different subsets of input variables to evaluate conditional probabilities of different classes in the Bayes formula. The method is applicable to document vectors of arbitrary dimension without any...

chapter

Rough Set-Based SVM Classifier for Text Categorization

Peng Chen, Shuang Liu

2008 Fourth International Conference on Natural Computation > 2 > 153 - 157

2008 Fourth International Conference on Natural Computation (ICNC)

Efficiency of feature selection affects the whole classifier performance in text categorization. Integrating the distinct aspects of indiscernibility capability of rough set theory and good generalization ability of support vector machine, this paper proposes a new classification method named Rough Support Vector Machine. Rough set was employed as an attribute reduction tool to work on the original...

chapter

A Approach for Text Classification Feature Dimensionality Reduction and Rule Generation on Rough Set

Shiqun Yin, Zhixing Huang, Lu Chen, Yuhui Qiu

2008 3rd International Conference on Innovative Computing Information and Control > 554

2008 3rd International Conference on Innovative Computing Information and Control (ICICIC)

The high dimensional data are frequently met when we apply Web text classification. Mining in high dimensional data is extraordinarily difficult because of the curse of dimensionality. We must adopt feature dimensionality reduction to solve these problems. A attribute reduction algorithm based on rough set theory is given in this paper to reduce the text feature term and extract rule. First, the weight...

Filter options

Data set:
ieee
Keywords:
TRAINING
CLASSIFICATION ALGORITHMS
TEXT CATEGORIZATION
SET THEORY

Publication date

Set your own date range

Keywords

ROUGH SET THEORY (4)
ACCURACY (3)
CLASSIFICATION (3)
FEATURE EXTRACTION (3)
SUPPORT VECTOR MACHINE CLASSIFICATION (3)
ROUGH SET (2)
APPROXIMATION ACCURACY (1)
APPROXIMATION METHODS (1)
APPROXIMATION QUALITY (1)
ATTRIBUTE REDUCTION ALGORITHM (1)
ATTRIBUTE VALUE DISCRETIZATION (1)
BAYES FORMULA (1)
BAYES METHODS (1)
CHINESE INFORMATION PROCESSING (1)
CLASS-CONDITIONAL PROBABILITY DISTRIBUTION (1)
CLASSIFICATION RULES (1)
COMPUTATIONAL MODELING (1)
DATA MINING (1)
DECISION ATTRIBUTES (1)
DECISION MAKING (1)
DOCUMENT VECTOR (1)
EXPECTATION-MAXIMISATION ALGORITHM (1)
FEATURE DIMENSIONALITY REDUCTION (1)
FEATURE SELECTION METHOD (1)
FEATURE WEIGHTING (1)
FEATURE WEIGHTING SCHEME (1)
GENERALIZATION CAPABILITY ALGORITHM (1)
GENETIC ALGORITHM (1)
GENETIC ALGORITHMS (1)
GENETICS (1)
HIGH DIMENSIONAL DATA MINING (1)
INFORMATION MANAGEMENT (1)
INFORMATION ORGANIZATION (1)
INFORMATION RETRIEVAL (1)
INFORMATION SYSTEMS (1)
INPUT VARIABLE SUBSET (1)
K-NEAREST NEIGHBOR ALGORITHM (1)
K-NN (1)
MACHINE LEARNING (1)
MACHINE LEARNING ALGORITHMS (1)
MACHINE LEARNING METHODS (1)
MANGANESE (1)
MUTUAL INFORMATION (1)
MUTUAL INFORMATION ALGORITHM (1)
NATURAL LANGUAGE PROCESSING (1)
NEAREST NEIGHBOR SEARCHES (1)
NOISE (1)
OPTIMAL FEATURE SUBSET (1)
OPTIMISATION (1)
POISSON DISTRIBUTION (1)
PRIORI INFORMATION (1)
PROBABILITY (1)
REDUCING SAMPLES (1)
REDUCTION (1)
ROUGHT SET (1)
RULE EXTRACTION (1)
RULE GENERATION (1)
STATISTICAL DOCUMENT TEXT CLASSIFICATION PROBLEM (1)
STRUCTURAL OPTIMIZATION (1)
STRUCTURAL POISSON MIXTURE (1)
SUPPORT VECTOR MACHINE (1)
SUPPORT VECTOR MACHINES (1)
SVM CLASSIFIER (1)
TESTING (1)
TEXT CATEGORIZATION TECHNOLOGY (1)
TEXT CLASSIFICATION (1)
TEXT CLASSIFICATION ALGORITHM (1)
TEXT RETRIEVAL (1)
TEXT SET INFORMATION SYSTEMS (1)
TEXT VECTOR DETERMINATION (1)
TF-IDF (1)
VECTOR SPACE COMPARISON (1)
VOCABULARY (1)
WEB TEXT CLASSIFICATION (1)
more

INFONA - science communication portal

Search results

Reducing Samples Learning for Text Categorization

Feature Weighting Scheme for Text Categorization Based on Rough Set

Text Classification Algorithm Study Based on Rough Set Theory

Feature selection method based on the improved of mutual information and genetic algorithm

Structural poisson mixtures for classification of documents

Rough Set-Based SVM Classifier for Text Categorization

A Approach for Text Classification Feature Dimensionality Reduction and Rule Generation on Rough Set

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options