Search results

Items from 121 to 140 out of 474 results


chapter

Cloud Based Predictive Analytics: Text Classification, Recommender Systems and Decision Support

Klavdiya Hammond, Aparna S. Varde

2013 IEEE 13th International Conference on Data Mining Workshops > 607 - 612

2013 IEEE 13th International Conference on Data Mining Workshops (ICDMW)

This paper presents a detailed study of technologies based on Hadoop and MapReduce available over the cloud for large-scale data mining and predictive analytics. Although some studies may have shown that cloud technologies relying on the MapReduce framework do not perform as well as parallel database management systems, e.g., with ad hoc queries and interactive applications, MapReduce has still been...

chapter

A term weighting method for identifying emotions from text content

Jenomi De Silva, Prasanna S. Haddela

2013 IEEE 8th International Conference on Industrial and Information Systems > 381 - 386

2013 IEEE 8th International Conference on Industrial and Information Systems (ICIIS)

Since the inception of the concept of social networking, communication patterns have shifted drastically with the unmitigated trend in socializing over the Internet, especially when people began connecting via mobile devices. Nowadays people tend to use these modern communication systems to share their emotions with each other. Human emotions play a vital role in human relationships and people share...

chapter

A Simple Study of Webpage Text Classification Algorithms for Arabic and English Languages

Sumaia Mohammed Al-Ghuribi, Saleh Alshomrani

2013 International Conference on IT Convergence and Security (ICITCS) > 1 - 5

2013 International Conference on IT Convergence and Security (ICITCS)

Webpage text classification is an important problem that has been studied through different approaches and algorithms. It aims to assign a predefined category to a webpage based on its content and linguistic features. It has many applications, such as word sense disambiguation, document indexing, text filtering, hierarchical categorization of webpages and document organization. This study is a part of...

chapter

Research on Large Scale Hierarchical Classification Based on Candidate Search

Li He, Yan Jia, Zhaoyun Ding, Weihong Han

2013 10th Web Information System and Application Conference > 355 - 360

2013 10th Web Information System and Application Conference (WISA)

The large-scale hierarchical classification problem concerns how to classify web documents into the categories of a class hierarchy. Because the class hierarchy is very large, containing thousands or even tens of thousands of categories, classification performance remains low. While a reduce-and-conquer strategy has been proposed to make the problem tractable, candidate search is a bottleneck...

chapter

Research on Text Feature Selection Algorithm Based on Information Gain and Feature Relation Tree

Hong Zhang, Yong-gong Ren, Xue Yang

2013 10th Web Information System and Application Conference > 446 - 449

2013 10th Web Information System and Application Conference (WISA)

The classification performance of the previous IG algorithm may decline noticeably because of the uneven distribution of classes and features; to address this, an improved text feature selection method, UDsIG, is proposed. First, we select features by class to reduce the impact on feature selection when the classes are unevenly distributed. After that, we use feature equilibrium of distribution to decrease the...

chapter

Automatic text categorization of Marathi documents using clustering technique

Sushma R. Vispute, M. A. Potey

2013 15th International Conference on Advanced Computing Technologies (ICACT) > 1 - 5

2013 15th International Conference on Advanced Computing Technologies (ICACT)

The purpose of the present work is to create an intelligent system to retrieve desired documents in the Marathi language. The system also focuses on providing personalized documents in the Marathi language to end users based on their interests, as identified from their browsing history. This paper presents the automatic categorization of Marathi documents and the literature survey of the related work done...

chapter

An Improved Mutual Information-Based Feature Selection Algorithm for Text Classification

Jiang Xiaoyu, Jin Shui

2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics > 1 > 126 - 129

2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)

Feature selection plays an important role in text classification, and contributes directly to the accuracy of the classification. The mutual information-based feature selection method has defects: it tends to select rare words and words from small samples as features, and it can yield negative MI values. To correct these defects, this paper proposes a new improved feature evaluation function for automatic text...

chapter

Adaptive learning algorithm for pattern classification

Maohu Zhu, Nanfeng Jie, Tianzi Jiang

2013 IEEE International Conference on Information and Automation (ICIA) > 976 - 978

2013 IEEE International Conference on Information and Automation (ICIA)

In this paper, a pattern classification task is regarded as a sample selection problem in which a sparse subset of samples from the labeled training set is chosen. We propose an adaptive learning algorithm utilizing the least square function to address this problem. Using these selected samples, which we call informative vectors, a classifier capable of recognizing the test samples was established...

chapter

The Effect of Combining Different Feature Selection Methods on Arabic Text Classification

Abdulmohsen Al-Thubaity, Norah Abanumay, Sara Al-Jerayyed, Aljoharah Alrukban, et al.

2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing > 211 - 216

2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

Feature selection is one of several factors affecting text classification systems. Feature selection aims to choose a representative subset of all features to reduce the complexity of classification problems. Usually a single method is used for feature selection. For English, several attempts were reported examining the combination of different feature selection methods. To the best of our knowledge...

chapter

An Improved Density-Based Method for Reducing Training Data in KNN

Yongxia Jing, Heping Gou, Yaling Zhu

2013 International Conference on Computational and Information Sciences > 972 - 975

2013 Fifth International Conference on Computational and Information Sciences (ICCIS)

The k-Nearest Neighbor (KNN) algorithm is an efficient text categorization algorithm in terms of recall and accuracy, but its computational overhead is directly proportional to the sample size, so its classification speed is low on large-scale sample data. To address this problem, the paper presents a density-based method for reducing training data; the method clusters each class of sample data into...

chapter

Context extraction from reviews for Context Aware Recommendation using Text Classification techniques

Fatima Zahra Lahlou, Houda Benbrahim, Asmaa Mountassir, Ismail Kassou

2013 ACS International Conference on Computer Systems and Applications (AICCSA) > 1 - 4

2013 ACS International Conference on Computer Systems and Applications (AICCSA)

In this paper, we investigate the use of Text Classification techniques to extract contextual information from user reviews for Context Aware Recommendation. We conduct several experiments to identify the best Text Representation settings and the best classification algorithm for our dataset. We carry out our experiments on hotel reviews. We focus on extracting the trip type, as contextual information,...

chapter

The instructional design of Chinese text classification based on SVM

Sichao Wei, Jianyi Guo, Zhengtao Yu, Peng Chen, et al.

2013 25th Chinese Control and Decision Conference (CCDC) > 5114 - 5117

2013 25th Chinese Control and Decision Conference (CCDC)

To help graduate students overcome difficulties in understanding the theory and implementation of Chinese text classification in the “The principle and application of pattern recognition” curriculum, this paper introduces a Chinese text classification experiment into teaching practice. According to the text classification characteristics, we design the experiment scheme about Chinese text...

chapter

An approach to meta feature selection

JianLin Li

2013 26th IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 4

2013 26th IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)

Many methods, such as mutual information (MI), document frequency (DF), information gain (IG) and χ² statistics (CHI) algorithm, have been discussed and applied to the study of meta feature selection. This paper gives a brief review of the recent approaches on this topic. By summarizing and synthesizing these approaches, we propose a framework of the application of meta feature selections, where the...

chapter

The Novel k Nearest Neighbor Algorithm

Anjali Ganesh Jivani

2013 International Conference on Computer Communication and Informatics > 1 - 4

2013 International Conference on Computer Communication and Informatics (ICCCI)

In the field of Text Classification/Categorization, the k Nearest Neighbor algorithm (kNN) has been to date one of the oldest and most popular methods. It has been experimented upon, implemented and tested by many researchers all over the world. There have been variations in the implementation of this algorithm and I have in this paper done the same. As the name suggests the method is dependent on...

chapter

An ontology-based dimensionality reduction algorithm for biomedical literature classification

Jing Wang, Gongqing Wu, Xuegang Hu

IEEE Conference Anthology > 1 - 5

2013 IEEE Conference Anthology

Dimension reduction is an important component in automatic text categorization, especially biomedical literature classification. Many studies have shown that statistics-based dimension reduction algorithms, like Information Gain (IG), are very effective in document categorization. However, these algorithms still suffer from major drawbacks. One facet is that they tend to use all the words as features...

chapter

Extraction of Strong Associations in Classes of Similarities

Ismail Biskri, Louis Rompre, Steve Descoteaux, Abdelghani Achouri, et al.

2012 11th International Conference on Machine Learning and Applications > 2 > 56 - 61

2012 Eleventh International Conference on Machine Learning and Applications (ICMLA)

Several algorithms are proposed to support the process of automated classification of textual documents. Each of these algorithms has characteristics that influence the classification result. Depending on the amount and nature of the data submitted, the quality of results may vary considerably from one algorithm to another. The generated classes are often noisy. In addition, the number of classes...

chapter

Feature Reduction for Text Categorization Using Cluster-Based Discriminant Coefficient

Li-Ju Gao, Been-Chian Chien

2012 Conference on Technologies and Applications of Artificial Intelligence > 137 - 142

2012 Conference on Technologies and Applications of Artificial Intelligence (TAAI)

Text classification is an important research topic for managing numerous electronic documents. Feature reduction is the key issue for text classification with high-dimensional keywords. A document analysis method called discriminant coefficient was proposed to reduce features and achieve high-precision text classification. However, the main problem of the discriminant based feature reduction method...

chapter

Weirdness Coefficient as a Feature Selection Method for Arabic Special Domain Text Classification

Abdulmohsen Al-Thubaity, Albandari Alanazi, Itisam Hazzaa, Haya Al-Tuwaijri

2012 International Conference on Asian Language Processing > 69 - 72

2012 International Conference on Asian Language Processing (IALP)

Given the importance of organizing and managing the rapid growth in knowledge of Arabic electronic content, this study introduces the Weirdness Coefficient (W) as a new feature selection method for Arabic special domain text classification. The proposed method was used to classify a dataset comprising five Islamic topics using Naïve Bayes (NB) and K-nearest neighbor (K-NN) classifiers, and three representation...

chapter

Dynamic feature selection strategy in incremental Chinese text classification

Dan Yang, Xinghua Fan

2012 2nd International Conference on Applied Robotics for the Power Industry (CARPI) > 1123 - 1126

2012 2nd International Conference on Applied Robotics for the Power Industry (CARPI 2012)

In the field of Chinese text classification, the content and size of the feature space have a decisive impact on accuracy and efficiency. These two kinds of feature information from incremental unlabeled training samples are ignored in current incremental learning research. For large-scale, high-dimensional Chinese texts, this paper presents a flexible, effective and universal feature selection strategy. In this...

chapter

Text associative classification approach for mining Arabic data set

Abdullah S. Ghareb, Abdul Razak Hamdan, Azuraliza Abu Bakar

2012 4th Conference on Data Mining and Optimization (DMO) > 114 - 120

2012 4th Conference on Data Mining and Optimization (DMO)

The text classification problem has received a lot of research based on machine learning, statistical, and information retrieval techniques. In the last decade, associative classification algorithms, which depend on pure data mining techniques, have appeared as an effective method for classification. In this paper, we examine the associative classification approach on the Arabic language to mine knowledge...


Keywords:
CLASSIFICATION ALGORITHMS
Publication type:
book


INFONA - science communication portal
