Search results

Items from 21 to 30 out of 30 results

chapter

Research on Multi-classification and Multi-label in Text Categorization

Liu Hua

2009 International Conference on Intelligent Human-Machine Systems and Cybernetics > 2 > 86 - 89

2009 International Conference on Intelligent Human-Machine Systems and Cybernetics. IHMSC 2009

Aiming at multi-classification and multi-label in text categorization, an apery algorithm is proposed which judges whether document has multi-classification and multi-label by estimating the similarity difference among final classifier values. If the quotient of the biggest category's classifier value divided by the second biggest category's classifier value is less than or equal to a threshold, the...

chapter

Using top n Recognition Candidates to Categorize On-line Handwritten Documents

S.P. Saldarriaga, E. Morin, C. Viard-Gaudin

2009 10th International Conference on Document Analysis and Recognition > 881 - 885

2009 10th International Conference on Document Analysis and Recognition (ICDAR)

The traditional weighting schemes used in text categorization for the vector space model (VSM) cannot exploit information intrinsic to texts obtained through online handwriting recognition or any OCR process. Especially, top n (n > 1) recognition candidates could not be used without flooding the resulting text with false occurrences of spurious terms. In this paper, an improved weighting scheme...

chapter

Categorization, clustering and association rule mining on WWW

S.S. Bedi, H. Yadav, P. Yadav

2009 International Multimedia, Signal Processing and Communication Technologies > 173 - 177

2009 International Multimedia, Signal Processing and Communication Technologies (IMPACT-2009)

Clustering techniques have been used by many intelligent software agents in order to retrieve, filter, and categorize documents available on the World Wide Web. Clustering is also useful in extracting salient features of related Web documents to automatically formulate queries and search for other similar documents on the Web. Traditional clustering algorithms either use a priori knowledge of document...

chapter

Question Classification Based on Incremental Modified Bayes

Li Ying-wei, Yu Zheng-tao, Meng Xiang-yan, Che Wen-gang, more

2008 Second International Conference on Future Generation Communication and Networking > 2 > 149 - 152

2008 Second International Conference on Future Generation Communication and Networking (FGCN)

How to use the incremental training corpus to improve the question classification accuracy rate in the process of question classification based on statistic learning. A question classification method based on the incremental modified Bayes was presented in this paper. The method used the modified Bayes and combined the incremental learning to correct the parameter by the incremental training set stage...

chapter

A Novel Hybrid system for Large-Scale Chinese Text Classification Problem

Zhong Gao, Guanming Lu, Daquan Gu

2008 Japan-China Joint Workshop on Frontier of Computer Science and Technology > 121 - 124

2008 Japan-China Joint Workshop on Frontier of Computer Science and Technology

Most of the Chinese text classification systems are all based on the technology of bag of words (BW) which is a valid probability tool for text representation and can provide a better semantic architecture. But the weakness in classification accuracy is still unconquerable. Support vector machine (SVM) has become a popular classification tool and can be applied in the scheme, but the main disadvantages...

chapter

Structural poisson mixtures for classification of documents

J. Grim, J. Novovicova, P. Somol

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

Considering the statistical text classification problem we approximate class-conditional probability distributions by structurally modified Poisson mixtures. By introducing the structural model we can use different subsets of input variables to evaluate conditional probabilities of different classes in the Bayes formula. The method is applicable to document vectors of arbitrary dimension without any...

chapter

Linear Neighborhood Spread: A Way for Semi-Supervised Learning

Hui He, Bo Chen, Jun Guo

2008 Second International Symposium on Intelligent Information Technology Application > 2 > 80 - 83

2008 Second International Symposium on Intelligent Information Technology Application

This paper is to introduce a novel semi-supervised learning algorithm named linear neighborhood spread (LNS), which is capable for learning manifold structures. Labeled and unlabeled data are represented as vertices in a weighted graph, and each data point is assumed can be linearly constructed from its neighborhood. Labels are spread through the edges, and the weighted graph is regarded as probabilistic...

chapter

A hybrid approach to automatic text summarization

Te-Min Chang, Wen-Feng Hsiao

2008 8th IEEE International Conference on Computer and Information Technology > 65 - 70

2008 8th IEEE International Conference on Computer and Information Technology

Automatic text summarization is to compress an original document into an abridged version by extracting almost all of the essential concepts with text mining techniques. This research focuses on developing a hybrid automatic text summarization approach, KCS, to enhancing the quality of summaries. KCS employs the K-mixture probabilistic model to establish term weights in a statistical sense, and further...

chapter

An Incremental Chinese Text Classification Algorithm Based on Quick Clustering

Houfeng Ma, Xinghua Fan, Ji Chen

2008 International Symposiums on Information Processing > 308 - 312

2008 International Symposiums on Information Processing - ISIP 2008; 2008 International Pacific Workshop on Web Mining and Web-Based Application - WMWA 2008

Most conventional incremental learning algorithms perform incremental learning by selecting only one optimized text sample each time, which neither considers the relationship between texts in the unlabeled text set, nor improves incremental learning efficiency. In addition, because of the shortage of the classifierpsilas information storage, the selected optimized text is easily classified incorrectly...

chapter

Web Search with Text Categorization Using Probabilistic Framework of SVM

B.P.C. Lim, M.H. Tsui, V. Charastrakul, D. Shi

2006 IEEE International Conference on Systems, Man and Cybernetics > 4 > 2950 - 2955

2006 IEEE International Conference on Systems, Man and Cybernetics

The role of text categorization algorithms is to deal with the ever increasing amount of documents either online or offline. Its capability to organize numerous documents into pre-defined categories significantly increases the efficiency and decreases human resources. Recently, support vector machine (SVM) gained popularity due to its excellent generalization ability and fast training speed on large...

Keywords:
PROBABILITY
Publication type:
book

Publication date

Set your own date range

Keywords

TRAINING (21)
CLASSIFICATION ALGORITHMS (19)
TEXT ANALYSIS (19)
ACCURACY (9)
FEATURE EXTRACTION (9)
MACHINE LEARNING (9)
SUPPORT VECTOR MACHINES (8)
TEXT CLASSIFICATION (8)
DATA MINING (7)
ALGORITHM DESIGN AND ANALYSIS (6)
CLASSIFICATION (6)
PATTERN CLASSIFICATION (5)
TEXT MINING (5)
BAYES METHODS (4)
DISTANCE MEASUREMENT (4)
FEATURE SELECTION (4)
INFORMATION RETRIEVAL (4)
LEARNING (ARTIFICIAL INTELLIGENCE) (4)
NAIVE BAYES (4)
NATURAL LANGUAGE PROCESSING (4)
SUPPORT VECTOR MACHINE (4)
SUPPORT VECTOR MACHINE CLASSIFICATION (4)
TEXT RECOGNITION (4)
COMPUTERS (3)
GRAPH THEORY (3)
MATHEMATICAL MODEL (3)
PRAGMATICS (3)
VOCABULARY (3)
BAYES (2)
CLUSTERING ALGORITHMS (2)
EDUCATIONAL INSTITUTIONS (2)
ENCODING (2)
INCREMENTAL LEARNING (2)
INFORMATION FILTERING (2)
INTERNET (2)
LANGUAGE IDENTIFICATION (2)
NOISE (2)
PATTERN CLUSTERING (2)
POSTERIOR PROBABILITY (2)
SEMANTICS (2)
TRAINING DATA (2)
WORD PROCESSING (2)
ACADEMIC DISSERTATIONS (1)
ACCUMULATED PROBABILITY VALUES (1)
AFFINITY PROPAGATION (1)
AGENT (1)
ANT COLONY ALGORITHM (1)
APERY ALGORITHM (1)
APPROXIMATION METHODS (1)
ASSOCIATION RULE MINING (1)
AUTOMATIC TEXT CATEGORIZATION (1)
AUTOMATIC TEXT SUMMARIZATION (1)
BAG OF WORDS (1)
BAYES CLASSIFICATION (1)
BAYES FORMULA (1)
BAYESIAN CLASSIFICATION METHOD (1)
BAYESIAN METHODS (1)
BAYESIAN TEXT CLASSIFICATION METHODS (1)
BETA PROBABILITY DENSITY FUNCTION (1)
CHARACTERISTIC COLLECTION (1)
CHARACTERISTIC COLLECTION DEFLATION (1)
CHI (1)
CHI-SQUARE STATISTIC (1)
CHINESE TEXT CATEGORIZATION (1)
CHINESE TEXT CLASSIFICATION ALGORITHM (1)
CLASS-CONDITIONAL PROBABILITY DISTRIBUTION (1)
CLUSTERING TECHNIQUE (1)
COGNITION (1)
COMBINATORIAL FUSION ANALYSIS (1)
COMBINATORIAL FUSION ANALYSIS (CFA) (1)
COMPUTATIONAL MODELING (1)
CONNECTIVE STRENGTH (1)
CONVENTIONAL INCREMENTAL LEARNING ALGORITHM (1)
COOCCURRENCE PROBABILITY (1)
DATA SKEW (1)
DATA SPARSE CATEGORIES (1)
DEPENDENCY PARSING (1)
DICTIONARIES (1)
DIGIT CLASSIFICATION (1)
DOCUMENT IMAGE PROCESSING (1)
DOCUMENT VECTOR (1)
DOCUMENTS CATEGORIES (1)
EDUCATIONAL TECHNOLOGY (1)
ELEMENTS EXTRACTION (1)
EMOTION COMPUTATION (1)
ENTROPY (1)
EQUATIONS (1)
ERROR ANALYSIS (1)
ERROR CORRECTION CODES (1)
ERROR PROPAGATION (1)
ERROR PROPAGATION REDUCTION (1)
EVENT EXTRACTION (1)
EXPECTATION-MAXIMISATION ALGORITHM (1)
FEATURE EXTRACTION METHODS (1)
FEATURE REDUCTION (1)
FEATURE TERMS EXTRACTION (1)
FEEDBACK (1)
FILTERING (1)
more

INFONA - science communication portal

Search results

Research on Multi-classification and Multi-label in Text Categorization

Using top n Recognition Candidates to Categorize On-line Handwritten Documents

Categorization, clustering and association rule mining on WWW

Question Classification Based on Incremental Modified Bayes

A Novel Hybrid system for Large-Scale Chinese Text Classification Problem

Structural poisson mixtures for classification of documents

Linear Neighborhood Spread: A Way for Semi-Supervised Learning

A hybrid approach to automatic text summarization

An Incremental Chinese Text Classification Algorithm Based on Quick Clustering

Web Search with Text Categorization Using Probabilistic Framework of SVM

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options