Search results

Items from 1 to 20 out of 62 results

chapter

Detection of cyberbullying on social media messages in Turkish

Selma Ayse Ozel, Esra Sarac, Seyran Akdemir, Hulya Aksu

2017 International Conference on Computer Science and Engineering (UBMK) > 366 - 370

2017 International Conference on Computer Science and Engineering (UBMK)

The increased use of the Internet and the ease of access to online communities like social media have provided an avenue for cybercrimes. Cyberbullying, which is a kind of cybercrime, is defined as an aggressive, intentional action against a defenseless person by using the Internet, social media, or other electronic contents. Researchers have found that many of the bullying cases have tragically ended...

chapter

A comprehensive study of text classification algorithms

Vikas K Vijayan, K. R. Bindu, Latha Parameswaran

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1109 - 1113

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Huge amount of data in today's world are stored in the form of electronic documents. Text mining is the process of extracting the information out of those textual documents. Text classification is the process of classifying text documents into fixed number of predefined classes. The application of text classification includes spam filtering, email routing, sentiment analysis, language identification...

chapter

Labeled LDA-Kernel SVM: A Short Chinese Text Supervised Classification Based on Sina Weibo

Xueli Wang, Jiao Wang, Yang Yang, Jinbao Duan

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 428 - 432

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

At present, it is a great challenge that solving high-dimension and text sparsity problems in short text classification. To resolve these problems, this paper proposes a method which takes the correlation between lexical items and tags before completing Latent Dirichlet Allocation(LDA) topic model. Meanwhile, this paper adjusts parameters of Support Vector Machine(SVM) to find the optimal values by...

chapter

Sentiment analysis of social network posts in Slovak language

Rastislav Krchnavy, Marian Simko

2017 12th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP) > 20 - 25

2017 12th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP)

In this paper we tackle the issue of sentiment analysis of social network posts in a not well targeted language — Slovak. There is a significant lack of research in this area for minor languages, as they often introduce additional language-specific issues for text processing. In case of Slovak, common issues are high flection, complex morphology and syntax. User-generated content of social networks...

chapter

Exploring linguistic features for extremist texts detection (on the material of Russian-speaking illegal texts)

Dmitry Devyatkin, Ivan Smirnov, Ananyeva Margarita, Kobozeva Maria, more

2017 IEEE International Conference on Intelligence and Security Informatics (ISI) > 188 - 190

2017 IEEE International Conference on Intelligence and Security Informatics (ISI)

In this paper we present results of a research on automatic extremist text detection. For this purpose an experimental dataset in the Russian language was created. According to the Russian legislation we cannot make it publicly available. We compared various classification methods (multinomial naive Bayes, logistic regression, linear SVM, random forest, and gradient boosting) and evaluated the contribution...

chapter

An ensemble based NLP feature assessment in binary classification

Saurabh Kr. Srivasatava, Roshan Kumari, Sandeep Kr. Singh

2017 International Conference on Computing, Communication and Automation (ICCCA) > 345 - 349

2017 International Conference on Computing, Communication and Automation (ICCCA)

Text feature selection plays an important role in text mining. Terms are the key players in document representation. The document representation can help application in following areas-indexing, summarization, classification, clustering and filtering. Text instances come with a challenge of high dimensional feature space and using such features can be extremely useful in text analysis. Hence it is...

chapter

Effective text classification using multi-level fuzzy neural network

Shima Zobeidi, Marjan Naderan, Seyed Enayatollah Alavi

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) > 91 - 96

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)

Nowadays, large volumes of text data are being produced in real time due to expansion of communication. It is necessary to organize this data for exploitation and extraction of useful information. Text classification based on the topic is one of the efficient solutions to this problem. Efficient algorithms are applied for text classification if they address high dimensional data. In this paper, a...

chapter

Multi-layer text classification with voting for consumer reviews

Yan Zhu, Melody Moh, Teng-Sheng Moh

2016 IEEE International Conference on Big Data (Big Data) > 1991 - 1999

2016 IEEE International Conference on Big Data (Big Data)

As social media has become increasingly popular in the modern world, people are using these platforms to express their opinions about products, businesses, and services. The need for categorizing these consumer reviews has been prominent. One effective solution is sentiment analysis (SA), which has been an active research topic. The goal of SA is to automatically extracting and classifying user opinions...

chapter

Authorship attribution in Arabic poetry using NB, SVM, SMO

Alfalahi Ahmed, Ramdani Mohamed, Bellafkih Mostafa

2016 11th International Conference on Intelligent Systems: Theories and Applications (SITA) > 1 - 5

2016 11th International Conference on Intelligent Systems: Theories and Applications (SITA)

We study in this paper an authorship attribution in Arabic poetry using text mining classification. Several features such as Characters, Poetry Sentence length; Word length, Rhyme, Meter and First word in the sentence are used as input data for text mining classification algorithms Naïve Bayes NB, Support Vector Machine SVM, and Sequential Minimal Optimization SMO. The data set of experiment was divided...

chapter

A framework to detect unqualified restaurant reviews

Boonjira Angsumalee, Natthapat Sotthisopha, Peerapol Vateekul

2016 Eighth International Conference on Knowledge and Systems Engineering (KSE) > 115 - 120

2016 Eighth International Conference on Knowledge and Systems Engineering (KSE)

Nowadays there are numerous user-generated restaurant reviews available on the Internet, of which they are considered valuable resources for decision making to customers. In reality, not every reviews available online are helpful to users, so the need for filtering unqualified reviews is realized. There have been several studies on spam review detection that attempt to detect unqualified reviews using...

chapter

A Bayesian classifiers based combination model for automatic text classification

Amna Rahman, Usman Qamar

2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 63 - 67

2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)

Text classification deals with allocating a text document to a predetermined class. Generally, this involves learning about a class from representations of documents belonging to that class. In this paper, we propose a classifier combination that uses a Multinomial Naïve Bayesian (MNB) classifier along with Bayesian Networks (BN) classifier. The results of two classifiers are combined by taking an...

chapter

Iterative evolution of feature space in text classification

Liutao Zhao, Yitian Ren, Bo Yan

2015 8th International Congress on Image and Signal Processing (CISP) > 1210 - 1214

2015 8th International Congress on Image and Signal Processing (CISP)

Nature language processing is an important part in data mining, which counts a lot in the internet age. Feature extraction effects the accuracy of text classification. This paper proposes a method of iterative feature space evolution to optimize the result. Adjusting the extended dictionary and the stop word list, we optimize the feature space time and again to get a better classifier model. The final...

chapter

Evaluating text features for lyrics-based songwriter prediction

Basar Kirmaci, Hasan Ogul

2015 IEEE 19th International Conference on Intelligent Engineering Systems (INES) > 405 - 409

2015 IEEE 19th International Conference on Intelligent Engineering Systems (INES)

We offer an automated way of estimating the author of a song using only its lyrics content. To this end, we introduce a complete text classification framework which takes raw lyrics data as input and report estimated songwriter. The performance of the system is evaluated based on its classification and retrieval ability on a large dataset of Turkish songs, which was collected in this study. The results...

chapter

Predicting Effectiveness of IR-Based Bug Localization Techniques

Tien-Duy B. Le, Ferdian Thung, David Lo

2014 IEEE 25th International Symposium on Software Reliability Engineering > 335 - 345

2014 IEEE 25th International Symposium on Software Reliability Engineering (ISSRE)

Recently, many information retrieval (IR) based bug localization approaches have been proposed in the literature. These approaches use information retrieval techniques to process a textual bug report and a collection of source code files to find buggy files. They output a ranked list of files sorted by their likelihood to contain the bug. Recent approaches can achieve reasonable accuracy, however,...

chapter

Sentiment analysis on Weibo data

Di Li, Jianwei Niu, Meikang Qiu, Meiqin Liu

2014 IEEE Computers, Communications and IT Applications Conference > 249 - 254

2014 IEEE Computing, Communications and IT Applications Conference (ComComAp)

With the development of the Internet, people share their emotion statuses or attitudes on online social websites, leading to an explosive rise on the scale of data. Mining sentiment information behind these data helps people know about public opinions and social trends. In this paper a sentiment analysis algorithm adapting to Weibo (Microblog) data is proposed. Given that a Weibo post is usually short,...

chapter

QuIET: A Text Classification Technique Using Automatically Generated Span Queries

Vassilis Polychronopoulos, Nick Pendar, Shawn R. Jeffery

2014 IEEE International Conference on Semantic Computing > 52 - 59

2014 IEEE International Conference on Semantic Computing (ICSC)

We propose a novel algorithm, QuIET, for binary classification of texts. The method automatically generates a set of span queries from a set of annotated documents and uses the query set to categorize unlabeled texts. QuIET generates models that are human understandable. We describe the method and evaluate it empirically against Support Vector Machines, demonstrating a comparable performance for a...

chapter

Research on web topic detection based on domain lexicon

Zhao Zhibin, Jia Yanfeng, Bao Yubin

2013 25th Chinese Control and Decision Conference (CCDC) > 3655 - 3660

2013 25th Chinese Control and Decision Conference (CCDC)

Web topic detection is a crucial prerequisite to web-based data integration and also a key component for Vertical Search Engine. So, it attracts much attention from not only the industry but also the literature. In this paper, we proposed a domain-lexicon-based framework for Web topic detection. In our framework, we extracted the topical features from the web page first. Next, we employed Vector Space...

chapter

RLS-MARS: An Effective Feature Selection Tool for Text Classification

Li Xi, Dai Hang, Wang Mingwen

2012 Fourth International Conference on Multimedia Information Networking and Security > 254 - 257

2012 4th International Conference on Multimedia Information Networking and Security (MINES)

The RLS-MARS (Regularized Least Squares-Multi Angle Regression and Shrinkage) feature selection model is used to select the relevant information, in which both, the keeping and the leaving-out of the regularizer are present. The RLS-MARS model is to find a series of directions in multidimensional space, leading the gradient vectors to change along those directions which would make the gradient matrix's...

chapter

Sentiment Classification for Microblog by Machine Learning

Zhen Niu, Zelong Yin, Xiangyu Kong

2012 Fourth International Conference on Computational and Information Sciences > 286 - 289

2012 Fourth International Conference on Computational and Information Sciences (ICCIS)

With the development of microblog, many studies pay special attention to sentiment classification of the reviews in microblog. This paper summarizes three well-known methods for text classification and then improves one of them for sentiment analysis. We come up with a new model in which we introduce efficient approaches to select features, calculate weights, train samples and evaluate classifier...

chapter

Machine Learning for Author Affiliation within Web Forums -- Using Statistical Techniques on NLP Features for Online Group Identification

Jeffrey Ellen, Shibin Parameswaran

2011 10th International Conference on Machine Learning and Applications and Workshops > 1 > 100 - 105

2011 Tenth International Conference on Machine Learning and Applications (ICMLA 2011)

Although there have been previous studies performing authorship attribution to a specific individual, we find a shortage of efforts to group authors based on their affiliations. This paper presents our work on classification of website forum posts by the author's group affiliation. Specifically, we seek to classify translated website forum posts by the (inferred) political affiliation of the author...

Keywords:
SUPPORT VECTOR MACHINES
FEATURE EXTRACTION

Publication date

Set your own date range

INFONA - science communication portal

Search results

Detection of cyberbullying on social media messages in Turkish

A comprehensive study of text classification algorithms

Labeled LDA-Kernel SVM: A Short Chinese Text Supervised Classification Based on Sina Weibo

Sentiment analysis of social network posts in Slovak language

Exploring linguistic features for extremist texts detection (on the material of Russian-speaking illegal texts)

An ensemble based NLP feature assessment in binary classification

Effective text classification using multi-level fuzzy neural network

Multi-layer text classification with voting for consumer reviews

Authorship attribution in Arabic poetry using NB, SVM, SMO

A framework to detect unqualified restaurant reviews

A Bayesian classifiers based combination model for automatic text classification

Iterative evolution of feature space in text classification

Evaluating text features for lyrics-based songwriter prediction

Predicting Effectiveness of IR-Based Bug Localization Techniques

Sentiment analysis on Weibo data

QuIET: A Text Classification Technique Using Automatically Generated Span Queries

Research on web topic detection based on domain lexicon

RLS-MARS: An Effective Feature Selection Tool for Text Classification

Sentiment Classification for Microblog by Machine Learning

Machine Learning for Author Affiliation within Web Forums -- Using Statistical Techniques on NLP Features for Online Group Identification

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options