Search results

Items from 1 to 10 out of 10 results

chapter

Research and Implement of Chinese Text Classifier Based on Naïve Bayes Method

Jian Huang, Zhongdi Cen, Qiuhong Zheng

2010 Sixth International Conference on Semantics, Knowledge and Grids > 426 - 428

2010 Sixth International Conference on Semantics Knowledge and Grid (SKG 2010)

Naïve Bayes classifier is proved to be one of the most effective classifier an be used widely. It applies statistical theory to text classification. This paper researched and implemented a Chinese text classifier using JAVA base on Naïve Bayes Method. First of all, this paper described test classification system, the content includes text information expressing, extracting and the method of Chinese...

chapter

Naïve Bayes text classification with positive features selected by statistical method

M.J. Meena, K.R. Chandran

2009 First International Conference on Advanced Computing > 28 - 33

2009 First International Conference on Advanced Computing (ICAC 2009)

Text classification is enduring to be one of the most researched problems due to continuously-increasing amount of electronic documents and digital data. Naive Bayes is an effective and a simple classifier for data mining tasks, but does not show much satisfactory results in automatic text classification problems. In this paper, the performance of naive Bayes classifier is analyzed by training the...

chapter

Increasing the Accuracy of Discriminative of Multinomial Bayesian Classifier in Text Classification

T. Mouratis, S. Kotsiantis

2009 Fourth International Conference on Computer Sciences and Convergence Information Technology > 1246 - 1251

2009 Fourth International Conference on Computer Sciences and Convergence Information Technology

Text classification plays an important role in information extraction and summarization, text retrieval, and question-answering. The discriminative multinomial naive Bayes classifier has been a focus of research in the field of text classification. This paper increases the accuracy of discriminative multinomial Bayesian classifier with the usage of the feature selection technique that evaluates the...

chapter

Classifying Text with Statistically Selected Features to Closely Related Categories

M. Janaki Meena, K.R. Chandran

2009 International Conference on Advances in Recent Technologies in Communication and Computing > 297 - 301

2009 International Conference on Advances in Recent Technologies in Communication and Computing. ARTCom 2009

Text classification is continuing to be one of the most researched problems due to continuously-increasing amount of electronic documents and digital data. Classifying documents to closely related categories is the most complex task in text categorization. Feature selection is an essential preprocessing step for improving the efficiency and accuracy of the text classifiers by removing redundant and...

chapter

Classifying non-gaussian and mixed data sets in their natural parameter space

C. Levasseur, U.F. Mayer, K. Kreutz-Delgado

2009 IEEE International Workshop on Machine Learning for Signal Processing > 1 - 6

2009 IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2009)

We consider the problem of both supervised and unsupervised classification for multidimensional data that are non-Gaussian and of mixed types (continuous and/or discrete). An important subclass of graphical model techniques called generalized linear statistics (GLS) is used to capture the underlying statistical structure of these complex data. GLS exploits the properties of exponential family distributions,...

chapter

A Text Classification Method with an Effective Feature Extraction Based on Category Analysis

Yun Li, Yan Sheng, Luan Luan, Ling Chen

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 1 > 95 - 99

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

Text classification refers to determine the class of an unknown text according to its content in the given classification system. In order to extract fewer features to express the information in the text as much as possible, the paper analysis the various features' statistical properties and to extract the global features according to Zipf's law; and then, based on the statistical analysis of the...

chapter

Automatic Genre Classification by Using Co-training

Rui Liu, Minghu Jiang, Zheng Tie

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 1 > 129 - 132

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

Researchers have concentrated on topic-based text classification while the genre of a document is rarely considered. In this article, we discuss the automatic genre classification and its application. We argue that word level features and sentence level features are two important measures which vary in number among different genres. Word level features include word frequency and POS (part of speech)...

chapter

Studies of Comprehensive Auto-Indexing System Based on Key Words' Subject Degree

Liu Hua

2009 International Forum on Information Technology and Applications > 3 > 334 - 336

2009 International Forum on Information Technology and Applications (IFITA)

Key words are expressions that indicate and express the subject concept of a text, the major property of key words is to denote subject. Based on the domainal inhomogeneity and critical region of key words, subject degree is brought up and calculated by statistical model to cue textpsila subject concept. Based on key words and itspsila subject degree, constructed a comprehensive auto-indexing system,...

chapter

Fine Text Categorization: Using Very Aggressive Feature Selection to Cope with Mass Duplicated Features

Liuling Dai, Jinwu Hu, ShiKun Wu

2008 International Conference on Intelligent Computation Technology and Automation (ICICTA) > 2 > 984 - 988

2008 International Conference on Intelligent Computation Technology and Automation (ICICTA)

Text categorization is a key issue of text mining. Although there are many studies on this problem, the majority of them are focused on classification of rough categories. In this kind of problem, there are obviously different features that can differentiate one category from others. Only very few researches concerned fine text categorization (FTC) problem which is characterized by many duplicated...

chapter

A novel risk assessment system for port state control inspection

Zhong Gao, Guanming Lu, Mengjue Liu, Meng Cui

2008 IEEE International Conference on Intelligence and Security Informatics > 242 - 244

2008 IEEE International Conference on Intelligence and Security Informatics (ISI 2008)

Port state control (PSC) inspection is the most important mechanism to ensure world marine safe. Recently, some SVM-based risk assessment systems have been presented in the world. They estimate the risk of each candidate ship based on its generic factors and history inspection factors to select high-risk one before conducting on-board PSC inspection. However, how to improve the performance of the...

Filter options

Data set:
ieee
Keywords:
TRAINING
CLASSIFICATION ALGORITHMS
TEXT CATEGORIZATION
STATISTICAL ANALYSIS

Publication date

Set your own date range

Keywords

TEXT ANALYSIS (9)
ACCURACY (5)
PATTERN CLASSIFICATION (5)
BAYES METHODS (4)
DATA MINING (4)
FEATURE EXTRACTION (4)
SUPPORT VECTOR MACHINES (4)
MACHINE LEARNING (3)
SUPPORT VECTOR MACHINE CLASSIFICATION (3)
ALGORITHM DESIGN AND ANALYSIS (2)
DICTIONARIES (2)
DIGITAL DATA (2)
FEATURE SELECTION (2)
NAIVE BAYES CLASSIFIER (2)
TESTING (2)
TEXT CLASSIFICATION (2)
TEXT MINING (2)
TRAINING DATA (2)
AEROSPACE ELECTRONICS (1)
AGGRESSIVE FEATURE SELECTION (1)
ARRAYS (1)
ARTIFICIAL INTELLIGENCE (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTO-INDEXING SYSTEM (1)
AUTOMATIC GENRE CLASSIFICATION (1)
BAG OF WORDS (1)
BAG-OF-WORDS (1)
BAYESIAN METHODS (1)
BELIEF NETWORKS (1)
BUILDINGS (1)
CANDIDATE SHIP (1)
CATEGORICAL DATA TEXT CATEGORIZATION (1)
CATEGORY ANALYSIS (1)
CATEGORY FREQUENCY (1)
CHI-SQUARE MAX METHOD (1)
CHI-SQUARE STATISTICS (1)
CHI-SQUARED STATISTIC (1)
CHINESE TEXT CLASSIFIER (1)
CHIR (1)
CLASSICAL STATISTICAL TECHNIQUES (1)
CLASSIFICATION (1)
CO-TRAINING (1)
COMPUTATIONAL LINGUISTICS (1)
COMPUTATIONAL MODELING (1)
COMPUTER LANGUAGES (1)
COMPUTERS (1)
CONDITIONAL MUTUAL INFORMATION MAXIMIZATION (1)
DATABASES (1)
DECISION MAKING (1)
DIRECTED GRAPH (1)
DISCRIMINATIVE MULTINOMIAL NAIVE BAYES CLASSIFIER (1)
DOCUMENT CLASSIFICATION (1)
ELECTRONIC DOCUMENT (1)
ELECTRONIC DOCUMENTS (1)
ENTROPY (1)
ESTIMATION (1)
EXPONENTIAL FAMILY DISTRIBUTION (1)
FEATURE SELECTION ALGORITHM (1)
FEATURE SELECTION TECHNIQUE (1)
FEATURE WEIGHT (1)
FINE TEXT CATEGORIZATION (1)
FREQUENCY DOMAIN ANALYSIS (1)
GAIN (1)
GENERALIZED LINEAR STATISTICS (1)
GENRE CLASSIFICATION (1)
GOVERNMENT (1)
GRAMMAR RULES (1)
GRAPHICAL MODEL TECHNIQUES (1)
GRAPHICAL MODELS (1)
HIDDEN MARKOV MODELS (1)
HISTORY (1)
HTML (1)
INDEXING (1)
INFORMATION CLASSIFICATION (1)
INFORMATION EXTRACTION (1)
INFORMATION SUMMARIZATION (1)
INSPECTION (1)
INTELLIGENT CONTROL (1)
INTERNET (1)
JAVA (1)
K-NEAREST NEIGHBOR (1)
KEY WORD SUBJECT DEGREE (1)
KEY WORDS INDEXING (1)
KNN (1)
KNN-SVM CLASSIFIER (1)
KNOWLEDGE BASED SYSTEMS (1)
KNOWLEDGE ENGINEERING (1)
LEAD (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEARNING ALGORITHM (1)
LEARNING ALGORITHMS (1)
LOWER DIMENSIONAL PARAMETER SUBSPACE (1)
MAINTENANCE ENGINEERING (1)
MARINE ENGINEERING (1)
MARINE SAFETY (1)
MARINE VEHICLES (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options