Search results

Items from 1 to 5 out of 5 results

chapter

Aspect Guided Text Categorization with Unobserved Labels

D. Roth, Yuancheng Tu

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009), pp. 962-967

This paper proposes a novel multiclass classification method and exhibits its advantage in the domain of text categorization with a large label space and, most importantly, when some of the labels were not observed in the training data. The key insight is the introduction of intermediate aspect variables that encode properties of the labels. Aspect variables serve as a joint representation for observed...

chapter

Cross-domain classification: Trade-off between complexity and accuracy

E. Lex, C. Seifert, M. Granitzer, A. Juffinger

2009 4th International Conference for Internet Technology and Secured Transactions (ICITST 2009), pp. 1-6

Text classification is one of the core applications in data mining due to the huge amount of uncategorized digital data available. Training a text classifier generates a model that reflects the characteristics of the domain. However, if no training data is available, labeled data from a related but different domain might be exploited to perform cross-domain classification. In our work, we aim to...

chapter

Document classification efficiency of phrase-based techniques

N. Kapalavayi, S.N.J. Murthy, Gongzhu Hu

2009 7th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA-2009), pp. 174-178

Due to the exponential growth of available text documents in digital form, it is of great importance to develop techniques for automatic document classification based on the textual contents. Earlier document classification techniques have used keyword-based features and related statistics to achieve good results when applied to certain datasets. More recently, some of these techniques have been extended...

chapter

A Novel Text Representation Model for Text Classification

Jun Wang, Yiming Zhou

2008 First International Conference on Intelligent Networks and Intelligent Systems (ICINIS), pp. 702-705

The text representation in text classification is usually a sequence of terms. As the number of terms becomes very high, performing existing text categorization tasks becomes very time-consuming. In this paper we present a novel text representation model for text classification which greatly reduces the required resources. This model represents text with several features. Each feature corresponds...

chapter

An efficient method of language identification using LVQ network

Han Xiao, Lei Yu, Kai Chen

2008 9th International Conference on Signal Processing (ICSP 2008), pp. 1690-1694

This paper presents a new method to identify languages. An LVQ (learning vector quantization) network aimed at language identification is introduced. The presence of particular characters and words and the statistical information on word lengths are used as a feature vector. The new classification technique is faster than the conventional N-gram based classification approach, but it performs similarly...

Filter options

Data set:
ieee
Keywords:
ACCURACY
CLASSIFICATION
TRAINING DATA
TEXT ANALYSIS

INFONA - science communication portal
