Search results

Items from 1 to 20 out of 114 results

chapter

An Improved Information Gain Feature Selection Algorithm for SVM Text Classifier

Jiamin Xu, Hong Jiang

2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery > 273 - 276

2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)

Feature selection algorithm has a great influence on the accuracy of text categorization. The traditional information gain (IG) feature selection algorithm usually selects the features that rarely appear in the specified categories, but frequently appear in other categories. To overcome this drawback, on the basis of in-depth analysis of the related algorithms, an improved IG feature selection method...

chapter

A novel classifier based on meaning for text classification

Murat Can Ganiz, Melike Tutkan, Selim Akyokus

2015 International Symposium on Innovations in Intelligent SysTems and Applications (INISTA) > 1 - 5

2015 International Symposium on Innovations in Intelligent SysTems and Applications (INISTA)

Text classification is one of the key methods used in text mining. Generally, traditional classification algorithms from machine learning field are used in text classification. These algorithms are primarily designed for structured data. In this paper, we propose a new classifier for textual data, called Supervised Meaning Classifier (SMC). The new SMC classifier uses meaning measure, which is based...

chapter

A novel feature selection based on Tibetan grammar for Tibetan text classification

Tao Jiang, Hongzhi Yu

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 445 - 448

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

Feature selection is a strategy that aims at making text classifiers more efficient and accurate. In this paper, we proposed a novel feature selection method based on Tibetan grammar for Tibetan classification. Tibetan language express grammatical meaning through the function words and word order, and the function word has large proportions. By analyzing the Tibetan grammar and distribution of part...

chapter

Classification and clustering for neuroinformatics: Assessing the efficacy on reverse-mapped NeuroNLP data using standard ML techniques

Nidheesh Melethadathil, Priya Chellaiah, Bipin Nair, Shyam Diwakar

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1065 - 1070

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

NeuroinformaticsNatural Language Processing (NeuroNLP) relies on clustering and classification for information categorization of biologically relevant extraction targets and for interconnections to knowledge-related patterns in event and text mined datasets. The accuracy of machine learning algorithms depended on quality of text-mined data while efficacy relied on the context of the choice of techniques...

chapter

Comparison of Four Text Classifiers on Movie Reviews

Yaguang Wang, Wenlong Fu, Aina Sui, Yuqing Ding

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence > 495 - 498

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI)

Text Categorization plays an important role in the fields of information retrieval, machine learning, natural language processing, data mining and others. With the development of computer and information technology, there have been many classification algorithms. Each text classification algorithms will get result at differing speeds and efficiency due to the various feature of test text. It has been...

chapter

A Text Classifier of English Movie Reviews Based on Information Gain

Lianjing Jin, Wei Gong, Wenlong Fu, Hongbin Wu

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence > 454 - 457

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI)

Text classification is the foundation and core of text mining. Naive Bayes is an effective method for text classification. This paper improves the accuracy of Naive Bayes classification using improved information gain, one of methods of feature extraction, by reducing the impact of low-frequency words. In this paper, we use a widely corpus of NLTK. According to the test results, The accuracy of the...

chapter

Performance of using LDA for Chinese news text classification

Xiaojun Wu, Liying Fang, Pu Wang, Nan Yu

2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE) > 1260 - 1264

2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE)

Chinese text classification is always challenging, especially when data are high dimensional and sparse. In this paper, we are interested in the way of text representation and dimension reduction in Chinese text classification. First, we introduces a topic model — Latent Dirichlet Allocation(LDA), which is uses LDA model as a dimension reduction method. Second, we choose Support Vector Machine(SVM)...

chapter

Parallel Processing System for Marathi Content Generation

Sushma R. Vispute, Shrikant Patil, Sagar Sangale, Akshay Padwal, more

2015 International Conference on Computing Communication Control and Automation > 575 - 579

2015 International Conference on Computing Communication Control and automation(ICCUBEA)

The objective of the present work is to design a HADOOP based parallel Marathi content retrieval system using clustering technique to get the efficient and optimized result than existing systems. The system also focuses on providing the personalized documents in Marathi language to the end user based on their interests identified from the browsing history and using time session mechanism for re ranking...

chapter

Augmenting the novice-expert overlay model in an intelligent tutoring system: Using confidence-weighted linear classifiers

Tenzin Doleck, Ram B. Basnet, Eric Poitras, Susanne Lajoie

2014 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2014 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

In BioWorld, a medical intelligent tutoring system, novice physicians are tasked with solving virtual patient cases. Whilst the importance of modeling and predicting clinical reasoning is recognized, an important aspect of the learner contribution remains unexplored — the written case summary prepared by the learner. The premise of investigating the case summaries is that it captures the thought and...

chapter

News classification based on their headlines: A review

Mazhar Iqbal Rana, Shehzad Khalid, Muhammad Usman Akbar

17th IEEE International Multi Topic Conference 2014 > 211 - 216

2014 IEEE 17th International Multi-Topic Conference (INMIC)

For the last few years, text mining has been gaining significant importance. Since Knowledge is now available to users through variety of sources i.e. electronic media, digital media, print media, and many more. Due to huge availability of text in numerous forms, a lot of unstructured data has been recorded by research experts and have found numerous ways in literature to convert this scattered text...

chapter

Active learning for text classification: Using the LSI Subspace Signature Model

Weizhong Zhu, Robert B. Allen

2014 International Conference on Data Science and Advanced Analytics (DSAA) > 149 - 155

2014 International Conference on Data Science and Advanced Analytics (DSAA)

Supervised learning methods rely on large sets of labeled training examples. However, large training sets are rare and making them is expensive. In this research, Latent Semantic Indexing Subspace Signature Model (LSISSM) is applied to labeling for active learning of unstructured text. Based on Singular Value Decomposition (SVD), LSISSM represents terms and documents as semantic signatures by the...

chapter

Automated document classification for news article in Bahasa Indonesia based on term frequency inverse document frequency (TF-IDF) approach

Ari Aulia Hakim, Alva Erwin, Kho I Eng, Maulahikmah Galinium, more

2014 6th International Conference on Information Technology and Electrical Engineering (ICITEE) > 1 - 4

2014 6th International Conference on Information Technology and Electrical Engineering (ICITEE)

The exponential growth of the data may lead us to the information explosion era, an era where most of the data cannot be managed easily. Text mining study is believed to prevent the world from entering that era. One of the text mining studies that may prevent the explosion era is text classification. It is a way to classify articles into several predefined categories. In this research, the classifier...

chapter

A high performance hybrid algorithm for text classification

Prema Nedungadi, Haripriya Harikumar, Maneesha Ramesh

The Fifth International Conference on the Applications of Digital Information and Web Technologies (ICADIWT 2014) > 118 - 123

2014 Fifth International Conference on the Applications of Digital Information and Web Technologies (ICADIWT)

The high computational complexity of text classification is a significant problem with the growing surge in text data. An effective but computationally expensive classification is the k-nearest-neighbor (kNN) algorithm. Principal Component Analysis (PCA) has commonly been used as a preprocessing phase to reduce the dimensionality followed by kNN. However, though the dimensionality is reduced, the...

chapter

A term weighting method for identifying emotions from text content

Jenomi De Silva, Prasanna S. Haddela

2013 IEEE 8th International Conference on Industrial and Information Systems > 381 - 386

2013 IEEE 8th International Conference on Industrial and Information Systems (ICIIS)

Since the inception of the concept of social networking, communication patterns have shifted drastically with the unmitigated trend in socializing over the Internet, especially when people began connecting via mobile devices. Nowadays people tend to use these modern communication systems to share their emotions with each other. Human emotions play a vital role in human relationships and people share...

chapter

A Simple Study of Webpage Text Classification Algorithms for Arabic and English Languages

Sumaia Mohammed Al-Ghuribi, Saleh Alshomrani

2013 International Conference on IT Convergence and Security (ICITCS) > 1 - 5

2013 International Conference on IT Convergence and Security (ICITCS)

Webpage text Classification is an important problem that has been studied through different approaches and algorithms. It aims to assign a predefined category to a Webpage based on its content and linguistic features. It has many applications such as word sense disambiguation, document indexing, text filtering, Webpages hierarchical categorization and document organization. This study is a part of...

chapter

Research on Large Scale Hierarchical Classification Based on Candidate Search

Li He, Yan Jia, Zhaoyun Ding, Weihong Han

2013 10th Web Information System and Application Conference > 355 - 360

2013 10th Web Information System and Application Conference (WISA)

Large scale hierarchical classification problem researches how to classify web documents into the categories among a class hierarchy. As the class hierarchy is very large that containing thousands or even tens of thousands of categories, the performance of the classification is still lower. While a reduce-and-conquer strategy has been proposed to make the problem tractable, candidate search is a bottleneck...

chapter

The Effect of Combining Different Feature Selection Methods on Arabic Text Classification

Abdulmohsen Al-Thubaity, Norah Abanumay, Sara Al-Jerayyed, Aljoharah Alrukban, more

2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing > 211 - 216

2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

Feature selection is one of several factors affecting text classification systems. Feature selection aims to choose a representative subset of all features to reduce the complexity of classification problems. Usually a single method is used for feature selection. For English, several attempts were reported examining the combination of different feature selection methods. To the best of our knowledge...

chapter

An approach to meta feature selection

JianLin Li

2013 26th IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 4

2013 26th IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)

Many methods, such as mutual information (MI), document frequency (DF), information gain (IG) and χ² statistics (CHI) algorithm, have been discussed and applied to the study of meta feature selection. This paper gives a brief review of the recent approaches on this topic. By summarizing and synthesizing these approaches, we propose a framework of the application of meta feature selections, where the...

chapter

Weirdness Coefficient as a Feature Selection Method for Arabic Special Domain Text Classification

Abdulmohsen Al-Thubaity, Albandari Alanazi, Itisam Hazzaa, Haya Al-Tuwaijri

2012 International Conference on Asian Language Processing > 69 - 72

2012 International Conference on Asian Language Processing (IALP)

Given the importance of organizing and managing the rapid growth in knowledge of Arabic electronic content, this study introduces the Weirdness Coefficient (W) as a new feature selection method for Arabic special domain text classification. The proposed method was used to classify a dataset comprising five Islamic topics using NaÃ¯ve base (NB) and K-nearest neighbor (K-NN) classifiers, and three representation...

chapter

Dynamic feature selection strategy in incremental Chinese text classification

Dan Yang, Xinghua Fan

2012 2nd International Conference on Applied Robotics for the Power Industry (CARPI) > 1123 - 1126

2012 2nd International Conference on Applied Robotics for the Power Industry (CARPI 2012)

In Chinese text classification field, the content and size of feature space have decisive impact on accuracy and efficiency. Those two kinds feature information of incremental unlabeled training samples are ignored during current incremental learning research. For large scale of high dimensional Chinese texts, this paper presents a flexible, effective and universal feature selection strategy. In this...

Keywords:
CLASSIFICATION ALGORITHMS
TEXT CATEGORIZATION

Publication date

Set your own date range

Content availability

Available (113)
None (1)

Keywords

TEXT ANALYSIS (63)
TRAINING (63)
TEXT CLASSIFICATION (44)
SUPPORT VECTOR MACHINES (37)
FEATURE EXTRACTION (31)
PATTERN CLASSIFICATION (27)
CLASSIFICATION (25)
MACHINE LEARNING (25)
FEATURE SELECTION (23)
DATA MINING (22)
SUPPORT VECTOR MACHINE CLASSIFICATION (19)
ALGORITHM DESIGN AND ANALYSIS (16)
LEARNING (ARTIFICIAL INTELLIGENCE) (13)
SUPPORT VECTOR MACHINE (12)
BAYES METHODS (11)
NATURAL LANGUAGE PROCESSING (11)
MACHINE LEARNING ALGORITHMS (10)
TEXT MINING (10)
COMPUTERS (9)
ENTROPY (9)
INTERNET (9)
ARTIFICIAL NEURAL NETWORKS (8)
DECISION TREES (8)
STATISTICAL ANALYSIS (8)
TESTING (8)
NIOBIUM (7)
BAYESIAN METHODS (6)
EDUCATIONAL INSTITUTIONS (6)
NAIVE BAYES CLASSIFIER (6)
PROBABILITY (6)
ROUGH SET (6)
TRAINING DATA (6)
WEB PAGES (6)
CLUSTERING ALGORITHMS (5)
CORRELATION (5)
INFORMATION GAIN (5)
INFORMATION RETRIEVAL (5)
KNN (5)
MUTUAL INFORMATION (5)
NOISE (5)
ROUGH SET THEORY (5)
SVM (5)
ARABIC TEXT CLASSIFICATION (4)
CHINESE TEXT CATEGORIZATION (4)
CLASSIFICATION TREE ANALYSIS (4)
CLUSTERING (4)
DIMENSIONALITY REDUCTION (4)
GAIN (4)
MATHEMATICAL MODEL (4)
MATRIX DECOMPOSITION (4)
PATTERN CLUSTERING (4)
SEMANTICS (4)
VECTOR SPACE MODEL (4)
ARABIC TEXT CATEGORIZATION (3)
ARTIFICIAL INTELLIGENCE (3)
BUILDINGS (3)
COMPUTATIONAL MODELING (3)
CONTEXT (3)
DATABASES (3)
DISPERSION (3)
ELECTRONIC MAIL (3)
FEATURE SUBSET (3)
FEATURE WEIGHT (3)
FILTERING (3)
GENETIC ALGORITHMS (3)
GENETICS (3)
HEURISTIC ALGORITHMS (3)
INDEXING (3)
INFORMATION FILTERING (3)
INFORMATION MANAGEMENT (3)
K-NN (3)
KERNEL (3)
NAIVE BAYES (3)
NEAREST NEIGHBOR SEARCHES (3)
NEURAL NETS (3)
SEARCH PROBLEMS (3)
SET THEORY (3)
SVM CLASSIFIER (3)
TERM WEIGHTING (3)
TEXT CLASSIFICATION ALGORITHM (3)
UNSOLICITED E-MAIL (3)
WEB SITES (3)
WORD PROCESSING (3)
ACTIVE LEARNING (2)
ANN (2)
ANT COLONY OPTIMIZATION (2)
APPROXIMATION METHODS (2)
ARABIC CORPUS (2)
ARABIC DOCUMENT CATEGORIZATION (2)
ARTIFICIAL NEURAL NETWORK (2)
BIOLOGICAL SYSTEM MODELING (2)
BUSINESS (2)
CATEGORIZATION (2)
CLASSIFICATION ACCURACY (2)
CLASSIFICATION ALGORITHM (2)
CLASSIFICATION MODEL (2)
CLASSIFIERS (2)
more

INFONA - science communication portal

Search results

An Improved Information Gain Feature Selection Algorithm for SVM Text Classifier

A novel classifier based on meaning for text classification

A novel feature selection based on Tibetan grammar for Tibetan text classification

Classification and clustering for neuroinformatics: Assessing the efficacy on reverse-mapped NeuroNLP data using standard ML techniques

Comparison of Four Text Classifiers on Movie Reviews

A Text Classifier of English Movie Reviews Based on Information Gain

Performance of using LDA for Chinese news text classification

Parallel Processing System for Marathi Content Generation

Augmenting the novice-expert overlay model in an intelligent tutoring system: Using confidence-weighted linear classifiers

News classification based on their headlines: A review

Active learning for text classification: Using the LSI Subspace Signature Model

Automated document classification for news article in Bahasa Indonesia based on term frequency inverse document frequency (TF-IDF) approach

A high performance hybrid algorithm for text classification

A term weighting method for identifying emotions from text content

A Simple Study of Webpage Text Classification Algorithms for Arabic and English Languages

Research on Large Scale Hierarchical Classification Based on Candidate Search

The Effect of Combining Different Feature Selection Methods on Arabic Text Classification

An approach to meta feature selection

Weirdness Coefficient as a Feature Selection Method for Arabic Special Domain Text Classification

Dynamic feature selection strategy in incremental Chinese text classification

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options