Search results

Items from 1 to 6 out of 6 results

chapter

A refined weighted K-Nearest Neighbors algorithm for text categorization

Fang Lu, Qingyuan Bai

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering > 326 - 330

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010)

Text categorization is an important task in text mining: the automated classification of large numbers of documents. Many useful supervised learning methods have been introduced to the field of text classification. Among them, the K-Nearest Neighbor (KNN) algorithm is widely used and is one of the best text classifiers because of its simplicity and efficiency. For text categorization,...
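
For orientation, here is a minimal sketch of a generic similarity-weighted KNN text classifier. It is not the refined algorithm of the paper listed above, whose details the truncated abstract does not give. The whitespace tokenizer, the cosine-similarity weighting, the function names, and the toy data in TRAIN are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

# Toy labeled documents (hypothetical data, for illustration only).
TRAIN = [
    ("stock market shares rise", "finance"),
    ("bank raises interest rates", "finance"),
    ("team wins football match", "sports"),
    ("player scores goal in match", "sports"),
]

def bag_of_words(text):
    """Represent a document as raw term counts (assumed whitespace tokenizer)."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def weighted_knn(train, query, k=3):
    """Label the query by a similarity-weighted vote of its k nearest documents."""
    q = bag_of_words(query)
    scored = sorted(
        ((cosine(bag_of_words(text), q), label) for text, label in train),
        reverse=True,
    )
    votes = defaultdict(float)
    for similarity, label in scored[:k]:
        votes[label] += similarity  # closer neighbors contribute more to the vote
    return max(votes, key=votes.get)

print(weighted_knn(TRAIN, "football player scores in the match"))
```

Weighting each vote by similarity, rather than counting neighbors equally, keeps a distant neighbor from outvoting a close one when k is large relative to the class sizes.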

chapter

A new topic-bridged model for transfer learning

Meng-Sung Wu, Jen-Tzung Chien

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5346 - 5349

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In real-world information systems, unlabeled data are abundant but labeled data are sparse. It is challenging to construct an adaptive model to classify a large number of documents from different domains. Classifiers trained on a source domain perform poorly on test data in a target domain because of the domain mismatch. In this study, we build a topic-bridged latent Dirichlet...

chapter

Applying machine learning algorithms for automatic Persian text classification

Mojgan Farhoodi, Alireza Yari

2010 6th International Conference on Advanced Information Management and Service (IMS) > 318 - 323

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

Because of its many applications in data mining and information technology, automatic document classification is an important topic in computer science. Classification plays a vital role in many information management and retrieval tasks. Document classification, also known as document categorization, is the process of assigning a document to one or more predefined category labels. Classification...

chapter

CCPR 2008 Keynote Speech 2

Chin-Hui Lee

2008 Chinese Conference on Pattern Recognition > 1

2008 Chinese Conference on Pattern Recognition

With an increasing amount of audio and video materials made available on the web, information extraction from multimedia documents is becoming a key area of growing business and technology interest. Research opportunities range from traditional topics, such as multimedia signal representation, processing, coding, modeling, authentication, and recognition, to emerging subjects, such as language modeling,...

chapter

Class document frequency as a learned feature for text categorization

A. Sharma, A. Kuh

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) > 2988 - 2993

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence)

Document classification uses different types of word weightings as features for representing documents. We find that the class document frequency, dfc, of a word is the most important feature in document classification. Machine learning algorithms trained with dfc of words show similar performance in terms of correct classification of test documents when compared to more complicated...
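
The truncated abstract above does not define dfc. The sketch below assumes the usual reading: for each class, the number of documents in that class that contain the word at least once. The function name, the whitespace tokenizer, and the toy data are hypothetical and not taken from the paper.

```python
from collections import Counter, defaultdict

def class_document_frequency(docs):
    """Count, per class, how many documents contain each term (assumed definition of dfc)."""
    dfc = defaultdict(Counter)
    for text, label in docs:
        # set() ensures a term is counted once per document, however often it repeats
        for term in set(text.lower().split()):
            dfc[label][term] += 1
    return dfc

# Hypothetical toy corpus of (text, class label) pairs.
docs = [
    ("match goal match", "sports"),
    ("goal scored", "sports"),
    ("market goal", "finance"),
]
dfc = class_document_frequency(docs)
print(dfc["sports"]["goal"], dfc["sports"]["match"], dfc["finance"]["goal"])
```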

chapter

Document Classification with ACM Subject Hierarchy

Tao Wang, B.C. Desai

2007 Canadian Conference on Electrical and Computer Engineering > 792 - 795

2007 Canadian Conference on Electrical and Computer Engineering

Text categorization or text classification (TC) has recently received increased research attention from the information retrieval and machine learning communities; this focus is driven mostly by the ever-growing demand for effective and efficient content-based document management. In the context of a digital library or Web portal application, the problem of text categorization is normally that of classification...

Filter options

Keywords:
LEARNING (ARTIFICIAL INTELLIGENCE)
DOCUMENT CLASSIFICATION

Keywords

MACHINE LEARNING (5)
CLASSIFICATION ALGORITHMS (4)
PATTERN CLASSIFICATION (4)
TEXT ANALYSIS (4)
INFORMATION RETRIEVAL (3)
SUPPORT VECTOR MACHINES (3)
TRAINING (3)
DATA MINING (2)
FEATURE EXTRACTION (2)
KNN (2)
MACHINE LEARNING ALGORITHMS (2)
SUPPORT VECTOR MACHINE (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
TEXT CLASSIFICATION (2)
TEXT PROCESSING (2)
ADAPTATION MODEL (1)
ALGORITHM DESIGN AND ANALYSIS (1)
AUTOMATIC DOCUMENT CLASSIFICATION (1)
AUTOMATIC PERSIAN TEXT CLASSIFICATION (1)
BAYES PROCEDURES (1)
BAYESIAN METHODS (1)
CLASS DOCUMENT FREQUENCY (1)
CLASSIFICATION (1)
COARSE-TO-FINE CATEGORIZATION PROCEDURE (1)
CONTENT-BASED, DOCUMENT MANAGEMENT (1)
DATA MODELS (1)
DECISION-FEEDBACK DISCRIMINATIVE LEARNING (1)
DISCRIMINATIVE CLASSIFIER LEARNING (1)
DOCUMENT CATEGORIZATION (1)
DOCUMENT CLASSIFIER (1)
DOCUMENT HANDLING (1)
FEATURE REDUCTION (1)
FEATURE REPRESENTATION (1)
FEATURE SELECTION (1)
FEATURE SELECTION ALGORITHM (1)
FEATURE VECTOR CONSTRUCTION (1)
FEATURE WEIGHTING (1)
HAMSHAHRI (1)
HAMSHAHRI DATASET (1)
IMAGE CODING (1)
INFERENCE MECHANISMS (1)
INFORMATION EXTRACTION (1)
INFORMATION MANAGEMENT (1)
INFORMATION TECHNOLOGY (1)
K-NEAREST NEIGHBOR ALGORITHM (1)
K-NEAREST NEIGHBORS ALGORITHM (1)
KERNEL (1)
KNOWLEDGE DISCOVERY (1)
LANGUAGE MODELING (1)
LANGUAGE PROCESSING (1)
LATENT SEMANTIC ANALYSIS (1)
MACHINE LEARNING ALGORITHM (1)
MAXIMAL FIGURE-OF-MERIT (1)
MEDIA DATA MINING (1)
MULTIMEDIA DOCUMENTS (1)
MULTIMEDIA PATTERN RECOGNITION (1)
MULTIMEDIA SIGNAL AUTHENTICATION (1)
MULTIMEDIA SIGNAL CODING (1)
MULTIMEDIA SIGNAL MODELING (1)
MULTIMEDIA SIGNAL PROCESSING (1)
MULTIMEDIA SIGNAL RECOGNITION (1)
MULTIMEDIA SIGNAL REPRESENTATION (1)
MULTIMEDIA SYSTEMS (1)
NATURAL LANGUAGE PROCESSING (1)
NEWSGROUP DATASET (1)
NIOBIUM (1)
PROBABILISTIC LOGIC (1)
SEMANTIC CONCEPT DECODING (1)
SEMISUPERVISED LEARNING (1)
SPARSE LABELED DATA (1)
SPEECH CODING (1)
SUPERVISED LEARNING METHOD (1)
SUPERVISED LEARNING PROBLEM (1)
SVM (1)
TEXT MINING (1)
TEXT RECOGNITION (1)
TOKENIZATION (1)
TOPIC BRIDGED LATENT DIRICHLET ALLOCATION (1)
TOPIC IDENTIFICATION (1)
TRANSFER LEARNING (1)
VARIATIONAL INFERENCE (1)
VARIATIONAL TECHNIQUES (1)
VECTOR SPACE MODEL (1)
VECTOR SPACE REPRESENTATION (1)
VECTORS (1)
VOCABULARY (1)
WEB PORTAL (1)
WEIGHT CALCULATION (1)
WEIGHT MEASUREMENT (1)
WORD PROCESSING (1)

INFONA - science communication portal
