Search results

Items from 1 to 20 out of 128 results

chapter

Vietnamese news classification based on BoW with keywords extraction and neural network

Toan Pham Van, Ta Minh Thanh

2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES) > 43 - 48

2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES)

Nowadays, text classification (TC) becomes the main applications of NLP (natural language processing). Actually, we have a lot of researches in classifying text documents, such as Random Forest, Support Vector Machines and Naive Bayes. However, most of them are applied for English documents. Therefore, the text classification researches on Vietnamese still are limited. By using a Vietnamese news corpus,...

chapter

Document embedding approach for efficient authorship attribution

Hayri Volkan Agun, Ozgur Yilmazel

2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA) > 194 - 198

2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA)

Authorship attribution has been well studied in terms of text classification with many diverse feature sets. However, finding topic independent features is hard and trained models with hand crafted features in one domain may not work in another domain. In this study we used a semi-supervised neural language model which is known as document embeddings for authorship attribution problem. This method...

chapter

Character-Level neural networks for short text classification

Jingxue Liu, Fanrong Meng, Yong Zhou, Bing Liu

2017 International Smart Cities Conference (ISC2) > 1 - 7

2017 International Smart Cities Conference (ISC2)

Since short text is characterized of the short length, sparse features and strong context dependency, the traditional models have a limited precision. Motivated by this, this article offers an empirical exploration on a character-level model which implements a combination of convolutional neural network(CNN) and recurrent neural networks(RNN) for short text classification. Including the highway networks...

chapter

A comprehensive study of text classification algorithms

Vikas K Vijayan, K. R. Bindu, Latha Parameswaran

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1109 - 1113

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Huge amount of data in today's world are stored in the form of electronic documents. Text mining is the process of extracting the information out of those textual documents. Text classification is the process of classifying text documents into fixed number of predefined classes. The application of text classification includes spam filtering, email routing, sentiment analysis, language identification...

chapter

Using KNN algorithm for classification of textual documents

Aiman Moldagulova, Rosnafisah Bte. Sulaiman

2017 8th International Conference on Information Technology (ICIT) > 665 - 671

2017 8th International Conference on Information Technology (ICIT)

Nowadays the exponential growth of generation of textual documents and the emergent need to structure them increase the attention to the automated classification of documents into predefined categories. There is wide range of supervised learning algorithms that deal with text classification. This paper deals with an approach for building a machine learning system in R that uses K-Nearest Neighbors...

chapter

Research on text categorization model based on LDA — KNN

Weihua Chen, Xian Zhang

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 2719 - 2726

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

In the text classification, The similarity between the text need to be calculated, but the existing classification methods only consider the similarity between feature words and categories and does not involve the semantic similarity between feature words. In this paper, a new classification model LDA (Latent Dirichlet Allocation) — KNN (K-Nearest Neighbor) is proposed. LDA is used to solve the problem...

chapter

Effective text classification using multi-level fuzzy neural network

Shima Zobeidi, Marjan Naderan, Seyed Enayatollah Alavi

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) > 91 - 96

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)

Nowadays, large volumes of text data are being produced in real time due to expansion of communication. It is necessary to organize this data for exploitation and extraction of useful information. Text classification based on the topic is one of the efficient solutions to this problem. Efficient algorithms are applied for text classification if they address high dimensional data. In this paper, a...

chapter

Performance analysis of supervised machine learning algorithms for text classification

Sadia Zaman Mishu, S. M. Rafiuddin

2016 19th International Conference on Computer and Information Technology (ICCIT) > 409 - 413

2016 19th International Conference on Computer and Information Technology (ICCIT)

The demand of text classification is growing significantly in web searching, data mining, web ranking, recommendation systems and so many other fields of information and technology. This paper illustrates the text classification process on different dataset using some standard supervised machine learning techniques. Text documents can be classified through various kinds of classifiers. Labeled text...

chapter

Recurrent convolutional neural networks for structured speech act tagging

Takashi Ushio, Hongjie Shi, Mitsuru Endo, Katsuyoshi Yamagami, more

2016 IEEE Spoken Language Technology Workshop (SLT) > 518 - 524

2016 IEEE Spoken Language Technology Workshop (SLT)

Spoken language understanding (SLU) is one of the important problem in natural language processing, and especially in dialog system. Fifth Dialog State Tracking Challenge (DSTC5) introduced a SLU challenge task, which is automatic tagging to speech utterances by two speaker roles with speech acts tag and semantic slots tag. In this paper, we focus on speech acts tagging. We propose local coactivate...

chapter

Importance weighted feature selection strategy for text classification

Baoli Li

2016 International Conference on Asian Language Processing (IALP) > 344 - 347

2016 International Conference on Asian Language Processing (IALP)

Feature selection, which aims at obtaining a compact and effective feature subset for better performance and higher efficiency, has been studied for decades. The traditional feature selection metrics, such as Chi-square and information gain, fail to consider how important a feature is in a document. Features, no matter how much effective semantic information they hold, are treated equally. Intuitively,...

chapter

Semantic text classification with tensor space model-based naïve Bayes

Han-joon Kim, Jiyun Kim, Jinseog Kim

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 4206 - 4210

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

This paper presents a semantic naïve Bayes classification technique that is based upon our tensor space model for text representation. In our work, each of Wikipedia articles is defined as a single concept, and a document is represented as a 2^nd-order tensor. Our method expands the conventional naïve Bayes by incorporating the semantic concept features into term feature statistics under the tensor-space...

chapter

Text classification using combined sparse representation classifiers and support vector machines

Neeraj Sharma, Anshu Sharma, Veena Thenkanidiyoor, A. D. Dileep

2016 4th International Symposium on Computational and Business Intelligence (ISCBI) > 181 - 185

2016 4th International Symposium on Computational and Business Intelligence (ISCBI)

Text classification is an important task in managing huge repository of textual content prevailing in various domains. In this paper, we propose to use sparse representation classifier (SRC) and support vector machines (SVMs) based classifiers using frequency-based kernels for text classification. We consider term-frequency (TF) representation for a text document. The sparse representation of an example...

chapter

Study on Short Text Classification with Integrated Algorithm

Dexin Zhao, Nana Du, Liangliang Qin

2016 13th Web Information Systems and Applications Conference (WISA) > 121 - 124

2016 13th Web Information Systems and Applications Conference (WISA)

With the rapid growth of the number of short text, how to effectively realize the automatic classification of short text is needed to be solved in the information domain. According to the characteristics of short text, this paper proposes Bagging_NB & Bagging_BSJ, which are two classification algorithms based on the improvement of current integrated classifiers. Traditional classifier NB, SVM,...

chapter

Enhancing spam detection on mobile phone Short Message Service (SMS) performance using FP-growth and Naive Bayes Classifier

Dea Delvia Arifin, Shaufiah, Moch. Arif Bijaksana

2016 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob) > 80 - 84

2016 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob)

SMS (Short Message Service) is still the primary choice as a communication medium even though nowadays mobile phone is growing with a variety of communication media messenger applications. However, nowadays along with the SMS tariff reduction leads to the increase of SMS spam, as used by some people as an alternative to advertise and fraud. Therefore, it becomes an important issue as it can bug and...

chapter

Dynamic Neural Networks for Text Classification

Lea Vega, Andres Mendez-Vazquez

2016 International Conference on Computational Intelligence and Applications (ICCIA) > 6 - 11

2016 International Conference on Computational Intelligence and Applications (ICCIA)

This research proposes an approach for text classification that uses a simple neural network called Dynamic Text Classifier Neural Network (DTCNN). The neural network uses as input vectors of words with variable dimension without information loss called Dynamic Token Vectors (DTV). The proposed neural network is designed for the classification of large and short text into categories. The learning...

chapter

Using latent Dirichlet allocation to improve text classification performance of support vector machine

Yaw-Huei Chen, Shu-Fong Li

2016 IEEE Congress on Evolutionary Computation (CEC) > 1280 - 1286

2016 IEEE Congress on Evolutionary Computation (CEC)

Text classification is an important task in natural language processing that aims to determine the category of a document. In the simplest settings, we adopt the bag-of-words model and convert documents in the corpus into term frequency vectors so that the classifier can process them. Because the bag-of-words model retains only the number of occurrences of each individual term, the classifier cannot...

chapter

News text classification model based on topic model

Zhenzhong Li, Wenqian Shang, Menghan Yan

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS) > 1 - 5

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS)

In modern society, some famous news websites such as Sina and Times server to provide information every day for millions of users. But with the continuous development of information technology, the amount of disorder data is increasing. How to organize the text and make automatically text classification has become a challenge. The traditional manual classification of news text not only consumes a...

chapter

Text classification using KM-ELM classifier

K S Neethu, T S Jyothis, Jithin Dev

2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT) > 1 - 5

2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT)

Classification systems adapts many machine learning techniques for quality performance in data classification. The neural networks has some unique characteristics and features which can handle high dimensional features and documents with noise and contradictory data. Classification is important to classify the input text into different domains appropriately. This paper give out a move towards classification...

chapter

Unsupervised feature selection for text classification via word embedding

Weikang Rui, Jinwen Liu, Yawei Jia

2016 IEEE International Conference on Big Data Analysis (ICBDA) > 1 - 5

2016 IEEE International Conference on Big Data Analysis (ICBDA)

The key of big text documents data analysis is to classify those text documents. To classify those text documents, it is necessary to represent those text documents as vectors which is vector space model (VSM). A powerful vector space model should remain the classification information with dimensions as little as possible. To achieve that, it is important to select most effective features for text...

chapter

A technical study and analysis of text classification techniques in N - Lingual documents

Shalini Puri, S. P. Singh

2016 International Conference on Computer Communication and Informatics (ICCCI) > 1 - 6

2016 International Conference on Computer Communication and Informatics

In the current era, there is a high demand of accurate text identification and categorization methods in N - Lingual non-scanned and scanned machine printed documents, where N represents mono, bi, tri or multi mode. In this paper, a technical study and analysis is presented to show N-lingual document classification for normal text, printed and handwritten documents. Text classification for normal...

Keywords:
TEXT CATEGORIZATION
TRAINING

Publication date

Set your own date range

Content availability

Available (124)
None (4)

Publication type

book (127)
article (1)

Keywords

CLASSIFICATION ALGORITHMS (81)
TEXT ANALYSIS (67)
SUPPORT VECTOR MACHINES (42)
FEATURE EXTRACTION (38)
ACCURACY (34)
PATTERN CLASSIFICATION (33)
SUPPORT VECTOR MACHINE CLASSIFICATION (27)
MACHINE LEARNING (22)
CLASSIFICATION (21)
DATA MINING (21)
LEARNING (ARTIFICIAL INTELLIGENCE) (17)
FEATURE SELECTION (16)
ALGORITHM DESIGN AND ANALYSIS (15)
SVM (14)
VECTORS (12)
BAYES METHODS (11)
NATURAL LANGUAGE PROCESSING (11)
KNN (10)
SUPPORT VECTOR MACHINE (10)
COMPUTERS (9)
SEMANTICS (9)
TESTING (9)
TEXT MINING (9)
KERNEL (8)
TRAINING DATA (8)
ARTIFICIAL NEURAL NETWORKS (7)
MUTUAL INFORMATION (7)
NEURAL NETWORKS (7)
VECTOR SPACE MODEL (7)
DATA MODELS (6)
DICTIONARIES (6)
FILTERING (6)
INTERNET (6)
BAYESIAN METHODS (5)
CLUSTERING ALGORITHMS (5)
COMPUTATIONAL MODELING (5)
CORRELATION (5)
EDUCATIONAL INSTITUTIONS (5)
INFORMATION RETRIEVAL (5)
MACHINE LEARNING ALGORITHMS (5)
MEASUREMENT (5)
NAIVE BAYES CLASSIFIER (5)
NEURAL NETWORK (5)
NEURONS (5)
NIOBIUM (5)
OPTIMIZATION (5)
PROBABILITY (5)
ARTIFICIAL INTELLIGENCE (4)
DATABASES (4)
DIMENSIONALITY REDUCTION (4)
EQUATIONS (4)
FREQUENCY MEASUREMENT (4)
INFORMATION FILTERING (4)
LEARNING SYSTEMS (4)
NAïVE BAYES (4)
OPTIMISATION (4)
PRINCIPAL COMPONENT ANALYSIS (4)
RESOURCE MANAGEMENT (4)
SEMI-SUPERVISED LEARNING (4)
TDT EVALUATION (4)
TEXT CLASSIFICATION ALGORITHM (4)
TOPIC TRACKING (4)
VOCABULARY (4)
VSM (4)
CLUSTERING (3)
CONFERENCES (3)
COVARIANCE MATRIX (3)
DIMENSION REDUCTION (3)
ELECTRONIC MAIL (3)
ENTROPY (3)
FEATURE SELECTION METHOD (3)
HIDDEN MARKOV MODELS (3)
INDEXES (3)
INFORMATION EXTRACTION (3)
INFORMATION GAIN (3)
KNN ALGORITHM (3)
LATENT DIRICHLET ALLOCATION (3)
LOGISTICS (3)
MATHEMATICAL MODEL (3)
NAIVE BAYES (3)
NOISE (3)
POLYNOMIALS (3)
PREDICTION ALGORITHMS (3)
PROTOTYPES (3)
TAGGING (3)
WEB PAGES (3)
ACTIVE LEARNING (2)
AFFINITY PROPAGATION (2)
ANALYTICAL MODELS (2)
AUTOMATIC TEXT CATEGORIZATION (2)
BACK PROPAGATION NETWORK (2)
BUSINESS (2)
CLASSIFICATION TREE ANALYSIS (2)
CO-TRAINING (2)
COLLABORATION (2)
COLLABORATIVE OPTIMIZATION (2)
COMPANIES (2)
more

INFONA - science communication portal

Search results

Vietnamese news classification based on BoW with keywords extraction and neural network

Document embedding approach for efficient authorship attribution

Character-Level neural networks for short text classification

A comprehensive study of text classification algorithms

Using KNN algorithm for classification of textual documents

Research on text categorization model based on LDA — KNN

Effective text classification using multi-level fuzzy neural network

Performance analysis of supervised machine learning algorithms for text classification

Recurrent convolutional neural networks for structured speech act tagging

Importance weighted feature selection strategy for text classification

Semantic text classification with tensor space model-based naïve Bayes

Text classification using combined sparse representation classifiers and support vector machines

Study on Short Text Classification with Integrated Algorithm

Enhancing spam detection on mobile phone Short Message Service (SMS) performance using FP-growth and Naive Bayes Classifier

Dynamic Neural Networks for Text Classification

Using latent Dirichlet allocation to improve text classification performance of support vector machine

News text classification model based on topic model

Text classification using KM-ELM classifier

Unsupervised feature selection for text classification via word embedding

A technical study and analysis of text classification techniques in N - Lingual documents

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options