Search results

Items from 1 to 6 out of 6 results

chapter

A corpus-based approach for keyword identification using supervised learning techniques

J. TeCho, C. Nattee, T. Theeramunkong

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology > 1 > 33 - 36

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

This paper presents a corpus-based approach for extracting keywords from a text written in a language that has no word boundary. Based on the concept of Thai character cluster, a Thai running text is preliminarily segmented into a sequence of inseparable units, called TCCs. To enable the handling of a large-scaled

chapter

An Automatic Document Classifier System based on Naíve Bayes Classifier and Ontology

Yi-Hsing Chang, Hsiu-Yi Huang

2008 International Conference on Machine Learning and Cybernetics > 6 > 3144 - 3149

2008 International Conference on Machine Learning and Cybernetics (ICMLC)

An automatic document classifier system based on ontology and the naive Bayes classifier is proposed in this paper. The main concept is to first establish a keyword synonymous table by experts for narrowing down the range and getting the consistency of keywords. The formal concept analysis is then used for

chapter

A Framework for the Classification of Unstructured Data

D.A. Ostrowski

2009 IEEE International Conference on Semantic Computing > 373 - 377

2009 IEEE International Conference on Semantic Computing (ICSC)

mechanisms with a traditional indexing method. The goal is to identify a higher semantic content and more meaningful keyword combinations, considering both supervised and unsupervised techniques. Within a specific implementation both Bayesian learning as well as clustering are integrated to support a boost parameter towards

chapter

A supervised machine learning approach of extracting coexpression relationship among genes from literature

Richa Tiwari, Chengcui Zhang, Thamar Solorio

2010 IEEE International Conference on Information Reuse&Integration > 98 - 103

2010 IEEE International Conference on Information Reuse & Integration (IRI 2010)

learning approach. We use a graphical model, Dynamic Conditional Random Fields (DCRFs), for training our classifier. Our approach is based on semantic analysis of text to classify the predicates describing coexpression relationship rather than detecting the presence of keywords. We compared our results of sentence

chapter

Hybrid text mining model for document classification

K A Vidhya, G Aghila

2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE) > 1 > 210 - 214

2nd International Conference on Computer and Automation Engineering (ICCAE 2010)

likelihood in the entire training documents where the training and test data are split randomly into k-subsets like 2/3 for training and 1/3 for test data. In addition, it also utilizes two level hierarchy structures for training documents like features from title, keywords and content with the predefined knowledge available

chapter

A Large-Scale Evaluation of an E-mail Management Assistant

W. Wobcke, A. Krzywicki, Yiu-Wa Chan

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 2 > 438 - 442

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology

EMMA is an e-mail management assistant based on ripple down rules, providing a high degree of classification accuracy while simplifying the task of maintaining the consistency of the rule base. A naive Bayes algorithm is used to improve the usability of EMMA by suggesting keywords to help the user define rules. In

Filter options

Keywords:
TRAINING
CLASSIFICATION ALGORITHMS
BAYES METHODS

Publication date

Set your own date range

Keywords

TEXT ANALYSIS (4)
ACCURACY (3)
BAYESIAN METHODS (3)
DATA MINING (3)
MACHINE LEARNING (3)
PATTERN CLASSIFICATION (3)
FEATURE EXTRACTION (2)
INFORMATION RETRIEVAL (2)
KNOWLEDGE ACQUISITION (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
MACHINE LEARNING ALGORITHMS (2)
NAIVE BAYES (2)
ONTOLOGIES (2)
TESTING (2)
APPROXIMATION METHODS (1)
ASSESSMENT CRITERIA (1)
AUTOMATIC DOCUMENT CLASSIFIER SYSTEM (1)
BAYESIAN LEARNING (1)
CLASSIFICATION (1)
CLUSTERING (1)
COEXPRESSION RELATIONSHIP EXTRACTION (1)
CONFERENCES (1)
DICTIONARIES (1)
DOCUMENT CLASSIFICATION (1)
DOCUMENT HANDLING (1)
DYNAMIC CONDITIONAL RANDOM FIELD (1)
DYNAMIC CONDITIONAL RANDOM FIELDS (1)
E-MAIL MANAGEMENT ASSISTANT (1)
ELECTRONIC MAIL (1)
F1-MEASURE (1)
FEATURE REDUCTION (1)
FEATURE SELECTION (1)
FORMAL CONCEPT ANALYSIS (1)
GENE COEXPRESSION (1)
GENE INFORMATION (1)
HIDDEN MARKOV MODELS (1)
HYBRID TEXT MINING MODEL (1)
INDEXING (1)
INDEXING METHOD (1)
INTERNET (1)
KEYWORD COMBINATIONS (1)
KEYWORD IDENTIFICATION (1)
KEYWORD SYNONYMOUS TABLE (1)
KNOWLEDGE ACQUISITION METHOD (1)
KNOWLEDGE ENGINEERING (1)
LEARNING SYSTEMS (1)
LUCENE INDEX (1)
MEDICAL ADMINISTRATIVE DATA PROCESSING (1)
MEDICAL TEXT (1)
NAïVE BAYES (1)
NA&#X00ED;VE BAYES CLASSIFIER (1)
NAIVE BAYES ALGORITHM (1)
NAIVE BAYES CLASSIFICATION (1)
NAIVE BAYES CLASSIFIER (1)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (1)
ONTOLOGY (1)
PRE-CLASSIFIED MESSAGES (1)
PROBABILISTIC VALUE (1)
PROBABILITY (1)
PUBLISHED LITERATURE (1)
RANDOM FUNCTIONS (1)
RELATIONSHIP EXTRACTION (1)
RIPPLE DOWN RULES (1)
ROUGH SET THEORY (1)
ROUGH SETS (1)
SEMANTIC CONTENT (1)
STANDARD MACHINE LEARNING METHODS (1)
SUPERVISED LEARNING TECHNIQUES (1)
SUPERVISED MACHINE LEARNING (1)
SUPERVISED MACHINE LEARNING TECHNIQUES (1)
SUPERVISED TECHNIQUE (1)
TEXT CATEGORIZATION (1)
TEXT DOCUMENT CLASSIFICATION (1)
TEXT MINING (1)
THAI CHARACTER CLUSTER (1)
THAI RUNNING TEXT (1)
TRAINING DOCUMENT (1)
UNSTRUCTURED DATA (1)
UNSTRUCTURED DATA CLASSIFICATION (1)
UNSTRUCTURED INFORMATION (1)
UNSTRUCTURED TEXT CLASSIFICATION (1)
UNSUPERVISED LEARNING (1)
UNSUPERVISED TECHNIQUE (1)
USER DEFINE RULES (1)
WORD BOUNDARY (1)
WORD MATCHING (1)
WORD PROBABILITY (1)
WORD PROCESSING (1)
more

INFONA - science communication portal

Search results

A corpus-based approach for keyword identification using supervised learning techniques

An Automatic Document Classifier System based on Naíve Bayes Classifier and Ontology

A Framework for the Classification of Unstructured Data

A supervised machine learning approach of extracting coexpression relationship among genes from literature

Hybrid text mining model for document classification

A Large-Scale Evaluation of an E-mail Management Assistant

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options