Search results

Items from 1 to 5 out of 5 results

Fusing Gini Index and Term Frequency for Text Feature Selection

Lin Wu, Yongbin Wang, Shengyan Zhang, Yannan Zhang

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 280 - 283

Automatic text classification is a key technology for processing and organizing large-scale text data. It is well known that the high dimensionality of the feature space is a main challenge for text classification. To mitigate this problem, and inspired by existing work, we propose an effective text feature selection algorithm that fuses the classical methodologies of Gini index and...

Optimized Approach of Feature Selection Based on Information Gain

Guohua Wu, Junjun Xu

2015 International Conference on Computer Science and Mechanical Automation (CSMA) > 157 - 161

Text feature selection is a key technology in text classification and text information retrieval. Information gain, a feature selection method, is widely applied in text categorization. This paper theoretically analyzes the deficiencies of information gain as a feature selection method and then introduces two improvement factors, LDFWF (Limiting Document Frequency's Word Frequency)...

Extreme learning machines in the field of text classification

Rajendra Kumar Roul, Ashish Nanda, Viraj Patel, Sanjay Kumar Sahay

2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 1 - 7

The World Wide Web serves as a huge repository of information that is highly dynamic, diverse and growing at an exponential rate. To speed up and further improve tasks such as information search and retrieval and personalization, it is important to develop techniques that classify text documents more accurately and efficiently than before. This paper is an effort...

An empirical evaluation of linear and nonlinear kernels for text classification using Support Vector Machines

Ya Gao, Shiliang Sun

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 4 > 1502 - 1505

This paper compares the performance of linear and nonlinear kernels of Support Vector Machines (SVM) used for text classification. The study is motivated by the earlier view that a linear SVM performs better than a nonlinear one, and by the fact that, although many investigations have shown that SVM performs well in text classification, there is no serious investigation on the comparison between...

A study of the identification of authorship for Chinese texts

Zhang Jian, Yao Tianfang

2008 IEEE International Conference on Intelligence and Security Informatics (ISI 2008) > 263 - 264

Style-based text authorship identification extracts features from texts of known authorship, constructs a classifier and then identifies disputed texts. Authorship identification belongs to the domain of style classification and is a branch of text classification. In contrast with text classification, which deals with the content of texts, authorship identification focuses on the form property of texts...

INFONA - science communication portal
