Search results

Items from 1 to 20 out of 252 results

chapter

A comprehensive study of text classification algorithms

Vikas K Vijayan, K. R. Bindu, Latha Parameswaran

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1109 - 1113

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Huge amount of data in today's world are stored in the form of electronic documents. Text mining is the process of extracting the information out of those textual documents. Text classification is the process of classifying text documents into fixed number of predefined classes. The application of text classification includes spam filtering, email routing, sentiment analysis, language identification...

chapter

Reasearch on feature mapping based on labels information in multi-label text classification

Tao Wang, Tao Luo, Jianfeng Li, Cong Wang

2017 7th IEEE International Conference on Electronics Information and Emergency Communication (ICEIEC) > 452 - 456

2017 7th IEEE International Conference on Electronics Information and Emergency Communication (ICEIEC)

Feature representation plays an important role in text classification. Feature mapping based on labels information is an algorithm suitable for Binary Relevance. Compared with the conventional text representation, it makes the dimension of the text under control by means of word embedding. More importantly, it takes full advantage of the general characteristics of the label on text representation...

chapter

The evaluation of heterogeneous classifier ensembles for Turkish texts

Zeynep Hilal Kilimci, Selim Akyokus, Sevinc Ilhan Omurca

2017 IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA) > 307 - 311

2017 IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA)

The basic idea behind the classifier ensembles is to use more than one classifier by expecting to improve the overall accuracy. It is known that the classifier ensembles boost the overall classification performance by depending on two factors namely, individual success of the base learners and diversity. One way of providing diversity is to use the same or different type of base learners. When the...

chapter

On multiclass text classification algorithm based on 1-a-r and multiconlitron

Yuping Qin, Fengfeng Qin, Qiangkui Leng, Aihua Zhang

2017 6th Data Driven Control and Learning Systems (DDCLS) > 370 - 373

2017 IEEE 6th Data Driven Control and Learning Systems Conference (DDCLS)

Aim to multiclass text categorization problem, a classification algorithm based on multiconlitron and 1-a-r method is presented. 1-a-r method is used to convert a multiclass categorization problem to several binary problems. Multiconlitron is constructed for each binary problem in input space. For the text to be classified, its class is decided by multiconlitrons. The classification experiments are...

chapter

Using KNN algorithm for classification of textual documents

Aiman Moldagulova, Rosnafisah Bte. Sulaiman

2017 8th International Conference on Information Technology (ICIT) > 665 - 671

2017 8th International Conference on Information Technology (ICIT)

Nowadays the exponential growth of generation of textual documents and the emergent need to structure them increase the attention to the automated classification of documents into predefined categories. There is wide range of supervised learning algorithms that deal with text classification. This paper deals with an approach for building a machine learning system in R that uses K-Nearest Neighbors...

chapter

A preprocessing method of AdaBoost for mislabeled data classification

Xiangyang Liu, Yaping Dai, Yan Zhang, Qiao Yuan, more

2017 29th Chinese Control And Decision Conference (CCDC) > 2738 - 2742

2017 29th Chinese Control And Decision Conference (CCDC)

AdaBoost is one of the most popular algorithm for classification and has been successfully used for text classification, face detection and tracking. However noise sensitivity is regarded as a major disadvantage and previous works show that AdaBoost will be overfitting when dealing with the data sets with noisy data. To improve the noise tolerance of conventional AdaBoost, this paper proposed a preprocessing...

chapter

Research review on key techniques of topic-based news elements extraction

Song Qing, Zhang Ying, Zhang Pengzhou

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS) > 585 - 590

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)

With the development of computer and network techniques, and the digital Chinese news texts explosion, facing a massive unstructured news data, a better way for knowledge extraction and storage, on the one hand, can help readers understand the core content of news, on the other hand, completed news knowledge accumulation will support the reportage. In recent years, information extraction technology...

chapter

Naive Bayes classifiers for music emotion classification based on lyrics

Yunjing An, Shutao Sun, Shujuan Wang

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS) > 635 - 638

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)

There is a constantly growing interest in evaluating music information retrieval (MIR) systems that can provide effective management of the music resources. The crucial characteristic of music is its emotion, which reflect the human's perception. To do the automatic classification of Chinese music emotions more effective, we use the lyrics of music to analysis and classify music based on emotion....

chapter

Research on text categorization model based on LDA — KNN

Weihua Chen, Xian Zhang

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 2719 - 2726

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

In the text classification, The similarity between the text need to be calculated, but the existing classification methods only consider the similarity between feature words and categories and does not involve the semantic similarity between feature words. In this paper, a new classification model LDA (Latent Dirichlet Allocation) — KNN (K-Nearest Neighbor) is proposed. LDA is used to solve the problem...

chapter

An improved text classification model for mobile data security testing

Feng Xiaorong, Lin Jun, Mai Songtao, Jia Shizhun

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1732 - 1736

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

In the view of mobile data security detection, text classification model can be realized in the application layer to detect malicious attacks. Since traditional C4.5 decision tree has the disadvantage of no considering about interaction influence between properties in attribute selection, an improved model of C4.5 decision tree based on AdaBoost algorithm is put forward. The problem in measuring the...

chapter

Effective text classification using multi-level fuzzy neural network

Shima Zobeidi, Marjan Naderan, Seyed Enayatollah Alavi

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) > 91 - 96

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)

Nowadays, large volumes of text data are being produced in real time due to expansion of communication. It is necessary to organize this data for exploitation and extraction of useful information. Text classification based on the topic is one of the efficient solutions to this problem. Efficient algorithms are applied for text classification if they address high dimensional data. In this paper, a...

chapter

Investigation of the influence of outliers on text documents probabilistic classifier quality

Andrey I. Kapitanov, Ilona I. Kapitanova, Vladimir M. Troyanovskiy, Valentin V. Slyusar, more

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus) > 438 - 439

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)

In this paper we investigate the influence of outliers in the training set on the probabilistic classifier quality. By the example of naive Bayes classifier we show how the qualitative characteristics depend on the percentage of outliers' ratio. This dependence is built on three basic metrics of the classifier quality: precision, recall and F1 score. At the end we propose method for reducing the outliers...

chapter

Effective threshold estimation for filter-based feature selection

Past Pramokchon, Punpiti Piamsa-nga

2016 International Computer Science and Engineering Conference (ICSEC) > 1 - 6

2016 International Computer Science and Engineering Conference (ICSEC)

For data classification, a feature subset is selected from all features by prior knowledge or determined by empirical experiments; however, it varies to contents, feature measures, and classifiers. This paper presents a filter based algorithm to select a subset of features by using outlier cut-offs of relevance between features and targeted categories. This algorithm uses the statistical techniques...

chapter

Research on the text sentiment classification about the social hot events on Weibo

Fulian Yin, Beibei Zhang, Pei Su, Juanfang Chai

2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) > 1537 - 1541

2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)

The public comments on the social hot events on Weibo has attracted lots of attentions in recent years. To remedy the shortage of sentiment analysis about typical events on Weibo, the classification method based on sentiment dictionary is put forward in this paper, and the accuracy rate is close to 50%. This paper also proposed a sentiment classification method based on Naive Bayesian in order to...

chapter

Nonlinearly assembling method and its application in large-scale text classification

Zhong-bao Liu, Zhang Jing

2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) > 1466 - 1468

2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)

Support Vector Machine (SVM) is one of widely-used text classification method. Although SVM performs well in practice, SVM encounters two problems: the data distribution is not taken into consideration in the process of classification and its performance is greatly influenced by noises. In view of this, Fuzzy Support Vector Machine based on Manifold Discriminant Analysis (FSVM-MDA) is proposed and...

chapter

Semantic text classification with tensor space model-based naïve Bayes

Han-joon Kim, Jiyun Kim, Jinseog Kim

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 4206 - 4210

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

This paper presents a semantic naïve Bayes classification technique that is based upon our tensor space model for text representation. In our work, each of Wikipedia articles is defined as a single concept, and a document is represented as a 2^nd-order tensor. Our method expands the conventional naïve Bayes by incorporating the semantic concept features into term feature statistics under the tensor-space...

chapter

Presenting an improved combination for classification of Persian texts

Morteza Jahantigh, Negin Daneshpour, Mohammad Erfani, Nargess Orojlou

2016 Eighth International Conference on Information and Knowledge Technology (IKT) > 234 - 240

2016 Eighth International Conference on Information and Knowledge Technology (IKT)

Since text mining saves a large amount of information in text format, it has a very high potential application. One of the main applications of text mining is to classify texts in subject order. In this paper, we tried to propose a aarianew method in order to increase classification accuracy and efficiency, by considering different methods of Persian text classification. We used a number of 5330 news...

chapter

Study on Short Text Classification with Integrated Algorithm

Dexin Zhao, Nana Du, Liangliang Qin

2016 13th Web Information Systems and Applications Conference (WISA) > 121 - 124

2016 13th Web Information Systems and Applications Conference (WISA)

With the rapid growth of the number of short text, how to effectively realize the automatic classification of short text is needed to be solved in the information domain. According to the characteristics of short text, this paper proposes Bagging_NB & Bagging_BSJ, which are two classification algorithms based on the improvement of current integrated classifiers. Traditional classifier NB, SVM,...

chapter

Enhancing spam detection on mobile phone Short Message Service (SMS) performance using FP-growth and Naive Bayes Classifier

Dea Delvia Arifin, Shaufiah, Moch. Arif Bijaksana

2016 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob) > 80 - 84

2016 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob)

SMS (Short Message Service) is still the primary choice as a communication medium even though nowadays mobile phone is growing with a variety of communication media messenger applications. However, nowadays along with the SMS tariff reduction leads to the increase of SMS spam, as used by some people as an alternative to advertise and fraud. Therefore, it becomes an important issue as it can bug and...

chapter

Dynamic Neural Networks for Text Classification

Lea Vega, Andres Mendez-Vazquez

2016 International Conference on Computational Intelligence and Applications (ICCIA) > 6 - 11

2016 International Conference on Computational Intelligence and Applications (ICCIA)

This research proposes an approach for text classification that uses a simple neural network called Dynamic Text Classifier Neural Network (DTCNN). The neural network uses as input vectors of words with variable dimension without information loss called Dynamic Token Vectors (DTV). The proposed neural network is designed for the classification of large and short text into categories. The learning...

Data set:
ieee
Keywords:
TRAINING
CLASSIFICATION ALGORITHMS
TEXT CATEGORIZATION

Publication date

Set your own date range

Content availability

Available (248)
None (4)

Keywords

TEXT ANALYSIS (144)
SUPPORT VECTOR MACHINES (85)
TEXT CLASSIFICATION (81)
PATTERN CLASSIFICATION (77)
ACCURACY (63)
FEATURE EXTRACTION (61)
SUPPORT VECTOR MACHINE CLASSIFICATION (55)
MACHINE LEARNING (46)
ALGORITHM DESIGN AND ANALYSIS (43)
CLASSIFICATION (42)
LEARNING (ARTIFICIAL INTELLIGENCE) (39)
DATA MINING (38)
FEATURE SELECTION (28)
BAYES METHODS (25)
TESTING (25)
NATURAL LANGUAGE PROCESSING (23)
SUPPORT VECTOR MACHINE (19)
COMPUTERS (18)
KNN (17)
SVM (17)
MACHINE LEARNING ALGORITHMS (16)
ENTROPY (15)
INTERNET (15)
TEXT MINING (15)
INFORMATION RETRIEVAL (13)
PROBABILITY (13)
MUTUAL INFORMATION (12)
SEMANTICS (12)
VECTOR SPACE MODEL (12)
CLUSTERING ALGORITHMS (11)
DECISION TREES (11)
KERNEL (11)
MATHEMATICAL MODEL (11)
NIOBIUM (11)
TRAINING DATA (11)
VECTORS (11)
ARTIFICIAL NEURAL NETWORKS (10)
CLASSIFICATION TREE ANALYSIS (10)
STATISTICAL ANALYSIS (10)
WORD PROCESSING (10)
COMPUTATIONAL MODELING (9)
CORRELATION (9)
DATABASES (9)
NAIVE BAYES (9)
NEAREST NEIGHBOR SEARCHES (9)
ROUGH SET THEORY (9)
WEB PAGES (9)
NAIVE BAYES CLASSIFIER (8)
PREDICTION ALGORITHMS (8)
SEMI-SUPERVISED LEARNING (8)
BAYESIAN METHODS (7)
DISTANCE MEASUREMENT (7)
FILTERING (7)
INDEXING (7)
INFORMATION FILTERING (7)
PATTERN CLUSTERING (7)
SET THEORY (7)
DATA MODELS (6)
DICTIONARIES (6)
DOCUMENT HANDLING (6)
EQUATIONS (6)
FEATURE SELECTION METHOD (6)
GAIN (6)
INFORMATION GAIN (6)
NOISE (6)
ROUGH SET (6)
TERM WEIGHTING (6)
TEXT CLASSIFICATION ALGORITHM (6)
ARTIFICIAL INTELLIGENCE (5)
BAGGING (5)
CHINESE TEXT CATEGORIZATION (5)
CONTEXT (5)
DECISION MAKING (5)
DIMENSION REDUCTION (5)
DOCUMENT CATEGORIZATION (5)
DOCUMENT CLASSIFICATION (5)
K-NEAREST NEIGHBOR (5)
KNN ALGORITHM (5)
MATRIX DECOMPOSITION (5)
NAïVE BAYES (5)
NEURAL NETWORKS (5)
OPTIMISATION (5)
OPTIMIZATION (5)
PROTOTYPES (5)
SENTIMENT ANALYSIS (5)
TEXT REPRESENTATION (5)
TF-IDF (5)
VOCABULARY (5)
WEB SITES (5)
ACTIVE LEARNING (4)
AUTOMATIC TEXT CATEGORIZATION (4)
BOOSTING (4)
CLASSIFICATION ALGORITHM (4)
CO-TRAINING (4)
DECISION TREE (4)
DIMENSIONALITY REDUCTION (4)
EDUCATIONAL INSTITUTIONS (4)
more

INFONA - science communication portal

Search results

A comprehensive study of text classification algorithms

Reasearch on feature mapping based on labels information in multi-label text classification

The evaluation of heterogeneous classifier ensembles for Turkish texts

On multiclass text classification algorithm based on 1-a-r and multiconlitron

Using KNN algorithm for classification of textual documents

A preprocessing method of AdaBoost for mislabeled data classification

Research review on key techniques of topic-based news elements extraction

Naive Bayes classifiers for music emotion classification based on lyrics

Research on text categorization model based on LDA — KNN

An improved text classification model for mobile data security testing

Effective text classification using multi-level fuzzy neural network

Investigation of the influence of outliers on text documents probabilistic classifier quality

Effective threshold estimation for filter-based feature selection

Research on the text sentiment classification about the social hot events on Weibo

Nonlinearly assembling method and its application in large-scale text classification

Semantic text classification with tensor space model-based naïve Bayes

Presenting an improved combination for classification of Persian texts

Study on Short Text Classification with Integrated Algorithm

Enhancing spam detection on mobile phone Short Message Service (SMS) performance using FP-growth and Naive Bayes Classifier

Dynamic Neural Networks for Text Classification

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options