Search results

Items from 1 to 20 out of 210 results

chapter

Vietnamese news classification based on BoW with keywords extraction and neural network

Toan Pham Van, Ta Minh Thanh

2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES) > 43 - 48

2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES)

Nowadays, text classification (TC) becomes the main applications of NLP (natural language processing). Actually, we have a lot of researches in classifying text documents, such as Random Forest, Support Vector Machines and Naive Bayes. However, most of them are applied for English documents. Therefore, the text classification researches on Vietnamese still are limited. By using a Vietnamese news corpus,...

chapter

Detection of cyberbullying on social media messages in Turkish

Selma Ayse Ozel, Esra Sarac, Seyran Akdemir, Hulya Aksu

2017 International Conference on Computer Science and Engineering (UBMK) > 366 - 370

2017 International Conference on Computer Science and Engineering (UBMK)

The increased use of the Internet and the ease of access to online communities like social media have provided an avenue for cybercrimes. Cyberbullying, which is a kind of cybercrime, is defined as an aggressive, intentional action against a defenseless person by using the Internet, social media, or other electronic contents. Researchers have found that many of the bullying cases have tragically ended...

chapter

Categorizing the Turkish web pages by data mining techniques

Secil Sekerci Husem, Ayla Gulcu

2017 International Conference on Computer Science and Engineering (UBMK) > 255 - 260

2017 International Conference on Computer Science and Engineering (UBMK)

Today, it is not possible to use human power alone to cope with the increasing amount of data. For this reason, some automated methods are needed to group similar documents together or to place documents in predefined categories according to certain rules. The use of automated classification techniques is becoming increasingly important for this reason. In this study, a database consisting of 22 thousand...

chapter

Distribution shift resilient discrimination information space for SVM classification

Khurum Nazir Junejo

2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) > 378 - 383

2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)

There has been a phenomenal increase in the utility of text classification (TC) in applications like targeted advertisement and sentiment analysis. Most applications demand that the model be efficient and robust, yet produce accurate categorizations. This is quite challenging as their is a dearth of labelled training data because it requires assigning labels after reading the whole document. Secondly,...

chapter

N-gram based approach to recognize the twitter accounts of Turkish daily newspapers

Islam Mayda, Mirsat Yesiltepe

2017 International Artificial Intelligence and Data Processing Symposium (IDAP) > 1 - 5

2017 International Artificial Intelligence and Data Processing Symposium (IDAP)

Twitter is one of the most popular social media networks in the world. It is also mostly used by corporate companies, media as well as individual users. Media organizations use Twitter to announce about the news. Although the language of the given news is formal and preferred words to share information are different for each organization. In this study, we proposed an approach to recognize the Twitter...

chapter

A comprehensive study of text classification algorithms

Vikas K Vijayan, K. R. Bindu, Latha Parameswaran

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1109 - 1113

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Huge amount of data in today's world are stored in the form of electronic documents. Text mining is the process of extracting the information out of those textual documents. Text classification is the process of classifying text documents into fixed number of predefined classes. The application of text classification includes spam filtering, email routing, sentiment analysis, language identification...

chapter

Domain specific syntax based approach for text classification in machine learning context

Alaa Mohasseb, Mohamed Bader-El-Den, Han Liu, Mihaela Cocea

2017 International Conference on Machine Learning and Cybernetics (ICMLC) > 2 > 658 - 663

2017 International Conference on Machine Learning and Cybernetics (ICMLC)

Due to the vast amount of data, searching and obtaining relevant information on the web is a challenging task. Despite that a broad range of classification techniques have been proposed to improve the information retrieval methods, many difficulties are still present because of the continuous increase in the amount of web contents, as well as its diversity. In this paper, we propose a method that...

chapter

Labeled LDA-Kernel SVM: A Short Chinese Text Supervised Classification Based on Sina Weibo

Xueli Wang, Jiao Wang, Yang Yang, Jinbao Duan

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 428 - 432

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

At present, it is a great challenge that solving high-dimension and text sparsity problems in short text classification. To resolve these problems, this paper proposes a method which takes the correlation between lexical items and tags before completing Latent Dirichlet Allocation(LDA) topic model. Meanwhile, this paper adjusts parameters of Support Vector Machine(SVM) to find the optimal values by...

chapter

Sentiment analysis of social network posts in Slovak language

Rastislav Krchnavy, Marian Simko

2017 12th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP) > 20 - 25

2017 12th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP)

In this paper we tackle the issue of sentiment analysis of social network posts in a not well targeted language — Slovak. There is a significant lack of research in this area for minor languages, as they often introduce additional language-specific issues for text processing. In case of Slovak, common issues are high flection, complex morphology and syntax. User-generated content of social networks...

chapter

Exploring linguistic features for extremist texts detection (on the material of Russian-speaking illegal texts)

Dmitry Devyatkin, Ivan Smirnov, Ananyeva Margarita, Kobozeva Maria, more

2017 IEEE International Conference on Intelligence and Security Informatics (ISI) > 188 - 190

2017 IEEE International Conference on Intelligence and Security Informatics (ISI)

In this paper we present results of a research on automatic extremist text detection. For this purpose an experimental dataset in the Russian language was created. According to the Russian legislation we cannot make it publicly available. We compared various classification methods (multinomial naive Bayes, logistic regression, linear SVM, random forest, and gradient boosting) and evaluated the contribution...

article

Enhancing Binary Classification by Modeling Uncertain Boundary in Three-Way Decisions

Yuefeng Li, Libiao Zhang, Yue Xu, Yiyu Yao, more

IEEE Transactions on Knowledge and Data Engineering > 2017 > 29 > 7 > 1438 - 1451

Text classification is a process of classifying documents into predefined categories through different classifiers learned from labelled or unlabelled training samples. Many researchers who work on binary text classification attempt to find a more effective way to separate relevant texts from a large data set. However, current text classifiers cannot unambiguously describe the decision boundary between...

chapter

Category Classification of Text Data with Machine Learning Technique for Visualizing Flow of Conversation in Counseling

Yuma Hayashida, Tomoya Uetsuji, Yasuo Ebara, Koji Koyamada

2017 Nicograph International (NicoInt) > 37 - 40

2017 Nicograph International (NicoInt)

The beginner counselors have more likely to continue counseling in their own interest, they have a high tendency to make great use of the closed-ended question in order to confirm the interpretation with the client. While expert counselors are instructing the counseling skill to beginner counselors, we consider that the reaction of a client for a beginner counselor's question is important to visualize...

chapter

Turkish tweet sentiment analysis with word embedding and machine learning

Deger Ayata, Murat Saraclar, Arzucan Ozgur

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2017 25th Signal Processing and Communications Applications Conference (SIU)

This work includes processing and classification of tweets which are written in Turkish language. Four different sector tweet datasets are vectorized with Word Embedding model and classified with Support Vector Machine and Random Forests classifiers and results have been compared. We have showed that sector based tweet classification is more successful compared to general tweets. Accuracy rates for...

chapter

Comparison of feature selection methods for sentiment analysis on Turkish Twitter data

Tuba Parlar, Esra Sarac, Selma Ayse Ozel

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2017 25th Signal Processing and Communications Applications Conference (SIU)

The Internet and social media provide a major source of information about people's opinions. Due to the rapidly growing number of online documents, it becomes both time-consuming and hard task to obtain and analyze the desired opinionated information. Sentiment analysis is the classification of sentiments expressed in documents. To improve classification perfromance feature selection methods which...

chapter

An ensemble based NLP feature assessment in binary classification

Saurabh Kr. Srivasatava, Roshan Kumari, Sandeep Kr. Singh

2017 International Conference on Computing, Communication and Automation (ICCCA) > 345 - 349

2017 International Conference on Computing, Communication and Automation (ICCCA)

Text feature selection plays an important role in text mining. Terms are the key players in document representation. The document representation can help application in following areas-indexing, summarization, classification, clustering and filtering. Text instances come with a challenge of high dimensional feature space and using such features can be extremely useful in text analysis. Hence it is...

chapter

Fusing Gini Index and Term Frequency for Text Feature Selection

Lin Wu, Yongbin Wang, Shengyan Zhang, Yannan Zhang

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 280 - 283

2017 IEEE Third International Conference on Multimedia Big Data (BigMM)

Automatic text classification is the key technology to process and organize large-scale text data. It is well known that the high dimensionality of feature space is a main challenge for text classification. In order to attenuate such a problem as well as inspired by existing arts, we propose an effective text feature selection algorithm by novelly fusing the classical methodologies of Gini index and...

chapter

Feature selection algorithm for hierarchical text classification using Kullback-Leibler divergence

Yao Lifang, Qin Sijun, Zhu Huan

2017 IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA) > 421 - 424

2017 IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA)

Text classification, a simple and effective method, is considered as the key technology to deal with and organize a large amount of text data. At present, the simple text classification is unable to meet the increasing of user's demand, hierarchical text classification has received extensive attention and has broad application prospects. Hierarchical feature selection algorithm is the key technology...

chapter

Effective text classification using multi-level fuzzy neural network

Shima Zobeidi, Marjan Naderan, Seyed Enayatollah Alavi

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) > 91 - 96

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)

Nowadays, large volumes of text data are being produced in real time due to expansion of communication. It is necessary to organize this data for exploitation and extraction of useful information. Text classification based on the topic is one of the efficient solutions to this problem. Efficient algorithms are applied for text classification if they address high dimensional data. In this paper, a...

chapter

Performance analysis of supervised machine learning algorithms for text classification

Sadia Zaman Mishu, S. M. Rafiuddin

2016 19th International Conference on Computer and Information Technology (ICCIT) > 409 - 413

2016 19th International Conference on Computer and Information Technology (ICCIT)

The demand of text classification is growing significantly in web searching, data mining, web ranking, recommendation systems and so many other fields of information and technology. This paper illustrates the text classification process on different dataset using some standard supervised machine learning techniques. Text documents can be classified through various kinds of classifiers. Labeled text...

chapter

An empirical analysis and classification of crisis related tweets

J. Rexiline Ragini, P. M. Rubesh Anand

2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 4

2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

The social media generates large volume of data through tweets and text messages during and after any disaster. The analysis and classification of the obtained data at the time of disaster is essential for conveying the information to the appropriate rescue personnel. In this paper, an automated text classification system is proposed in order to classify the data effectively. The classification of...

Keywords:
SUPPORT VECTOR MACHINES

Publication date

Set your own date range

Content availability

Available (204)
None (6)

Publication type

book (194)
article (16)

Keywords

TEXT CATEGORIZATION (94)
TEXT ANALYSIS (92)
TRAINING (77)
CLASSIFICATION ALGORITHMS (69)
FEATURE EXTRACTION (62)
SUPPORT VECTOR MACHINE (57)
MACHINE LEARNING (55)
ACCURACY (46)
PATTERN CLASSIFICATION (44)
SVM (40)
LEARNING (ARTIFICIAL INTELLIGENCE) (33)
DATA MINING (32)
FEATURE SELECTION (31)
CLASSIFICATION (30)
KERNEL (27)
NATURAL LANGUAGE PROCESSING (20)
NIOBIUM (19)
ALGORITHM DESIGN AND ANALYSIS (17)
INTERNET (16)
SENTIMENT ANALYSIS (16)
SUPPORT VECTOR MACHINE CLASSIFICATION (16)
INFORMATION RETRIEVAL (13)
TRAINING DATA (13)
TEXT MINING (12)
COMPUTATIONAL MODELING (11)
TESTING (10)
DATA MODELS (9)
ELECTRONIC MAIL (9)
MACHINE LEARNING ALGORITHMS (9)
PATTERN CLUSTERING (9)
SEMANTICS (9)
VECTORS (9)
COMPUTERS (8)
INDEXING (8)
BAYES METHODS (7)
CLUSTERING ALGORITHMS (7)
DECISION TREES (7)
EDUCATIONAL INSTITUTIONS (7)
FILTERING (7)
KNN (7)
STATISTICAL ANALYSIS (7)
TWITTER (7)
VECTOR SPACE MODEL (7)
ARTIFICIAL NEURAL NETWORKS (6)
DOCUMENT CLASSIFICATION (6)
INDEXES (6)
INFORMATION FILTERING (6)
K-NEAREST NEIGHBOR (6)
ONTOLOGIES (6)
OPINION MINING (6)
ROUGH SET THEORY (6)
SVM CLASSIFIER (6)
UNSOLICITED E-MAIL (6)
COMPUTER SCIENCE (5)
CONTEXT (5)
DATABASES (5)
DICTIONARIES (5)
DOCUMENT HANDLING (5)
LOGISTICS (5)
MEASUREMENT (5)
MUTUAL INFORMATION (5)
SUPERVISED LEARNING (5)
WEB SITES (5)
ACTIVE LEARNING (4)
BAYESIAN METHODS (4)
BLOGS (4)
CORRELATION (4)
DECISION TREE (4)
DIMENSION REDUCTION (4)
FEATURE SELECTION ALGORITHM (4)
FUZZY SET THEORY (4)
IMAGE CLASSIFICATION (4)
IMAGE SEGMENTATION (4)
INFORMATION EXTRACTION (4)
INFORMATION GAIN (4)
MANIFOLDS (4)
MATHEMATICAL MODEL (4)
MEDIA (4)
NAïVE BAYES (4)
NAIVE BAYES (4)
NATURAL LANGUAGES (4)
NEURAL NETWORKS (4)
OPTIMIZATION (4)
PRINCIPAL COMPONENT ANALYSIS (4)
ROUGH SET (4)
SECURITY OF DATA (4)
SEMANTIC KERNEL (4)
SOFTWARE (4)
STANDARDS (4)
STEMMING (4)
WEB MINING (4)
WEB PAGES (4)
ARTIFICIAL INTELLIGENCE (3)
AUTHORSHIP ATTRIBUTION (3)
BIOLOGICAL SYSTEM MODELING (3)
BOOSTING (3)
BUSINESS (3)
CLASSIFICATION ALGORITHM (3)
more

Data set

ieee (199)
Elsevier (6)
Springer (4)
Wiley (1)

INFONA - science communication portal

Search results

Vietnamese news classification based on BoW with keywords extraction and neural network

Detection of cyberbullying on social media messages in Turkish

Categorizing the Turkish web pages by data mining techniques

Distribution shift resilient discrimination information space for SVM classification

N-gram based approach to recognize the twitter accounts of Turkish daily newspapers

A comprehensive study of text classification algorithms

Domain specific syntax based approach for text classification in machine learning context

Labeled LDA-Kernel SVM: A Short Chinese Text Supervised Classification Based on Sina Weibo

Sentiment analysis of social network posts in Slovak language

Exploring linguistic features for extremist texts detection (on the material of Russian-speaking illegal texts)

Enhancing Binary Classification by Modeling Uncertain Boundary in Three-Way Decisions

Category Classification of Text Data with Machine Learning Technique for Visualizing Flow of Conversation in Counseling

Turkish tweet sentiment analysis with word embedding and machine learning

Comparison of feature selection methods for sentiment analysis on Turkish Twitter data

An ensemble based NLP feature assessment in binary classification

Fusing Gini Index and Term Frequency for Text Feature Selection

Feature selection algorithm for hierarchical text classification using Kullback-Leibler divergence

Effective text classification using multi-level fuzzy neural network

Performance analysis of supervised machine learning algorithms for text classification

An empirical analysis and classification of crisis related tweets

Filter options

Publication date

Content availability

Publication type

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options