Search results

Items from 61 to 80 out of 474 results

chapter

Attribute reduction for Chinese question classification

Yuan Liwei, Su Lei, Shu Peng

2016 Chinese Control and Decision Conference (CCDC) > 5488 - 5492

With the rapid development of community-based question and answer services such as Sina Ask, Baidu Zhidao and Yahoo!, the community-based question answering service has become a new knowledge-sharing model characterized by interactivity and openness. Community sites provide high-quality service to meet clients' needs and encourage their active participation. In order to accurately...

chapter

Examining the performance of classification algorithms for imbalanced data sets in web author identification

Alisa A. Vorobeva

2016 18th Conference of Open Innovations Association and Seminar on Information Security and Protection of Information Technology (FRUCT-ISPIT) > 385 - 390

Individuals, criminals or even terrorist organizations can use web communication for criminal purposes; to avoid prosecution they try to hide their identity. To increase the level of safety on the Web, we have to improve the author (or web-user) identification and authentication procedures. In the field of web author identification, imbalanced data sets appear rather frequently, when the number...

chapter

A novel text mining approach based on TF-IDF and Support Vector Machine for news classification

Seyyed Mohammad Hossein Dadgar, Mohammad Shirzad Araghi, Morteza Mastery Farahani

2016 IEEE International Conference on Engineering and Technology (ICETECH) > 112 - 116

With the development of weblogs and social networks, many news providers share their news headlines on different websites and weblogs. One of the main text mining topics is how to classify news into different groups. This study aims to classify news into various groups so that users can identify the most popular news group in the desired country at any given time. Based on Term Frequency-Inverse Document...

chapter

Document classification with a weighted frequency pattern tree algorithm

Froila Helixia Dsouza, Ananthanarayana V.S.

2016 International Conference on Data Mining and Advanced Computing (SAPIENCE) > 29 - 34

Document classification can be defined as the task of automatically categorizing collections of electronic documents into their annotated classes, based on their contents. It is an important problem in data mining. Due to the exponential growth of documents on the Internet and the emergent need to organize them, developing an efficient document classification method to automatically manipulate web...

chapter

Text classification using KM-ELM classifier

K S Neethu, T S Jyothis, Jithin Dev

2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT) > 1 - 5

Classification systems adapt many machine learning techniques for quality performance in data classification. Neural networks have some unique characteristics and features that let them handle high-dimensional features and documents with noisy and contradictory data. Classification is important for assigning input text to the appropriate domains. This paper presents an approach towards classification...

chapter

Arabic text stemming: Comparative analysis

Rasha Mamoun, Mahmoud Ahmed

2016 Conference of Basic Sciences and Engineering Studies (SGCAC) > 88 - 93

Text classification is one of the most important research issues in the field of data mining. The main idea of using the stemming technique is to reduce the number of features that can be extracted from the document. Furthermore, stemming aims to enhance the accuracy of the classifier. This paper aims to study the effectiveness of using stemming techniques. The paper will use two popular word extractions:...

chapter

A technical study and analysis of text classification techniques in N - Lingual documents

Shalini Puri, S. P. Singh

2016 International Conference on Computer Communication and Informatics (ICCCI) > 1 - 6

In the current era, there is a high demand for accurate text identification and categorization methods for N-lingual non-scanned and scanned machine-printed documents, where N represents mono, bi, tri or multi mode. In this paper, a technical study and analysis are presented to show N-lingual document classification for normal text, printed and handwritten documents. Text classification for normal...

chapter

Stemming impact on Arabic text categorization performance: A survey

Fawaz S. Al-Anzi, Dia AbuZeina

2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA) > 1 - 7

The significant growth of online textual information has increased the demand for effective content-based Arabic text categorization methods. The categorization of Arabic texts has some challenges that need to be addressed, especially when using stemming. In the literature, we found a debate among researchers about the benefits of using stemming in Arabic text categorization. Hence, we performed a study...

chapter

Online analysis of sentiment on Twitter

Shokoufeh Salem Minab, Mehrdad Jalali, Mohammad Hossein Moattar

2015 International Congress on Technology, Communication and Knowledge (ICTCK) > 359 - 365

Social media such as Twitter create space to express thoughts and opinions on various topics and events. Millions of users can share their ideas on this microblog; therefore, Twitter has become a source for information exploration, decision making and sentiment analysis. There is a sense in all of the texts, but it is more important to provide strategies for obtaining suitable...

chapter

Text categorization with machine learning and hierarchical structures

M. Krendzelak, F. Jakab

2015 13th International Conference on Emerging eLearning Technologies and Applications (ICETA) > 1 - 5

Text categorization with machine learning algorithms usually assumes a flat set of categories. Such classifiers are very domain-specific and not reusable for other, more generic text classification tasks. It is quite possible that a hierarchically structured set of categories might have a higher impact on the way classifiers are used and built. As presented in this document, the list of most common...

chapter

Ranking in multi label classification of text documents using quantifiers

Rajni Jindal, Shweta Taneja

2015 IEEE International Conference on Control System, Computing and Engineering (ICCSCE) > 162 - 166

In today's world, many real-world examples are based on multi-label classification. A single document may belong to a set of class labels simultaneously. The process of ranking, i.e. strict ordering of class labels, is of great concern here. We have used the concept of quantifiers for ranking class labels. We have proposed eight new quantifiers, which calculate the degree of membership of class labels...

chapter

Automatic Indonesia's questions classification based on bloom's taxonomy using Natural Language Processing a preliminary study

Selvia Ferdiana Kusuma, Daniel Siahaan, Umi Laili Yuhana

2015 International Conference on Information Technology Systems and Innovation (ICITSI) > 1 - 6

Students' cognitive ability should be identified in order to gauge their understanding of what has been taught. The identification result will be the benchmark for choosing the basis of assessment. Cognitive ability can be identified by giving questions at certain difficulty levels. The appropriateness of the difficulty levels can be determined based on Bloom's taxonomy, introduced...

chapter

Improved Expected Cross Entropy Method for Text Feature Selection

Guohua Wu, Liuyang Wang, Nailiang Zhao, Hairong Lin

2015 International Conference on Computer Science and Mechanical Automation (CSMA) > 49 - 54

Feature selection plays an important role in text categorization, and contributes directly to the accuracy of the categorization. Because the traditional expected cross entropy algorithm does not take document frequency into account during feature selection, we first improve the traditional expected cross entropy formula, and then propose an improved text feature selection...

chapter

Optimized Approach of Feature Selection Based on Information Gain

Guohua Wu, Junjun Xu

2015 International Conference on Computer Science and Mechanical Automation (CSMA) > 157 - 161

Text feature selection is a key technology in text classification and text information retrieval. Information gain, a feature selection method, is extensively applied in text categorization. This paper theoretically analyzes the deficiency of information gain as a feature selection method, and then introduces two improvement factors, namely LDFWF (Limiting Document Frequency's Word Frequency)...

chapter

An Improved Information Gain Feature Selection Algorithm for SVM Text Classifier

Jiamin Xu, Hong Jiang

2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC) > 273 - 276

The feature selection algorithm has a great influence on the accuracy of text categorization. The traditional information gain (IG) feature selection algorithm usually selects features that rarely appear in the specified categories, but frequently appear in other categories. To overcome this drawback, on the basis of in-depth analysis of the related algorithms, an improved IG feature selection method...

chapter

A novel classifier based on meaning for text classification

Murat Can Ganiz, Melike Tutkan, Selim Akyokus

2015 International Symposium on Innovations in Intelligent SysTems and Applications (INISTA) > 1 - 5

Text classification is one of the key methods used in text mining. Generally, traditional classification algorithms from the machine learning field are used in text classification. These algorithms are primarily designed for structured data. In this paper, we propose a new classifier for textual data, called Supervised Meaning Classifier (SMC). The new SMC classifier uses a meaning measure, which is based...

chapter

A novel feature selection based on Tibetan grammar for Tibetan text classification

Tao Jiang, Hongzhi Yu

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 445 - 448

Feature selection is a strategy that aims at making text classifiers more efficient and accurate. In this paper, we propose a novel feature selection method based on Tibetan grammar for Tibetan text classification. The Tibetan language expresses grammatical meaning through function words and word order, and function words account for a large proportion. By analyzing the Tibetan grammar and distribution of part...

chapter

A web text classification technique for unlabeled training samples

Francois Tchiegue, Rui Li, Shilong Ma

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 437 - 440

Classification is commonly performed with supervised learning algorithms, which build classifiers by learning from labeled training samples. However, in practice, it is very costly to acquire class-labeled samples, because manually labeling documents requires a lot of time and effort from experts. This restrains text classification to a great extent. To solve the...

chapter

Graph-Based Web Query Classification

Chunwei Xia, Xin Wang

2015 12th Web Information System and Application Conference (WISA) > 241 - 244

Understanding Web users' search intent expressed by their queries is essential for a search engine to provide the appropriate answers. Web query classification (QC) algorithms have been widely studied to improve the accuracy and meet users' demands. Some QC algorithms convert queries into vectors and use SVM or CRF model as the classifier. However, with the volume of data increasing, the time consumed...

chapter

Sentiment analysis of a document using deep learning approach and decision trees

Arman S. Zharmagambetov, Alexandr A. Pak

2015 Twelve International Conference on Electronics Computer and Computation (ICECCO) > 1 - 4

This paper describes a modern approach to the task of sentiment analysis of movie reviews using deep learning recurrent neural networks and decision trees. These methods are based on statistical models, which are at the core of machine learning algorithms. A fertile area of research is the application of Google's Word2Vec algorithm, presented by Tomas Mikolov, Kai Chen, Greg Corrado and Jeffrey...

Keywords:
CLASSIFICATION ALGORITHMS
Publication type:
book

INFONA - science communication portal
