Search results

Items from 81 to 100 out of 474 results

chapter

Parallelized text classification algorithm for processing large scale TCM clinical data with MapReduce

Xianju Fei, XiaoFang Li, Chunti Shen

2015 IEEE International Conference on Information and Automation > 1983 - 1986

2015 IEEE International Conference on Information and Automation (ICIA)

There are many opportunities and challenges in data analytic research for TCM (Traditional Chinese Medicine) in advent of big data era, like various clinical record sources, different symptom descriptions, lots of collected clinical symptoms, more than one syndrome attached to one clinical record and etc. Novel methods on support vector machines, ensemble learning, feature selection, multi-label learning...

chapter

Classification and clustering for neuroinformatics: Assessing the efficacy on reverse-mapped NeuroNLP data using standard ML techniques

Nidheesh Melethadathil, Priya Chellaiah, Bipin Nair, Shyam Diwakar

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1065 - 1070

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

NeuroinformaticsNatural Language Processing (NeuroNLP) relies on clustering and classification for information categorization of biologically relevant extraction targets and for interconnections to knowledge-related patterns in event and text mined datasets. The accuracy of machine learning algorithms depended on quality of text-mined data while efficacy relied on the context of the choice of techniques...

chapter

Handling imbalanced dataset in multi-label text categorization using Bagging and Adaptive Boosting

Genta Indra Winata, Masayu Leylia Khodra

2015 International Conference on Electrical Engineering and Informatics (ICEEI) > 500 - 505

2015 International Conference on Electrical Engineering and Informatics (ICEEI)

Imbalanced dataset is occurred due to uneven distribution of data available in the real world such as disposition of complaints on government offices in Bandung. Consequently, multi-label text categorization algorithms may not produce the best performance because classifiers tend to be weighed down by the majority of the data and ignore the minority. In this paper, Bagging and Adaptive Boosting algorithms...

chapter

A New SVM Method for Short Text Classification Based on Semi-Supervised Learning

Chunyong Yin, Jun Xiang, Hui Zhang, Jin Wang, more

2015 4th International Conference on Advanced Information Technology and Sensor Application (AITS) > 100 - 103

2015 4th International Conference on Advanced Information Technology and Sensor Application (AITS)

Short text is a popular text form, which is widely used in short commentary, micro-blog and many other fields. With the development of the social software and movie websites, the size of data is also becoming larger and larger. Most data is useless for us while other data is important for us. Therefore, it is very necessary for us to extract the useful short text from the big data. However, there...

chapter

Based on Rough Sets and the Associated Analysis of KNN Text Classification Research

Guo Aizhang, Yang Tao

2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES) > 485 - 488

2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)

With the rapid development of network information technology, the text is as a basic information carrier and begins to present exponential growth. The existing text classification methods haven't got information from the vast amounts of information resources timely and accurately. In order to solve the problem, the paper puts forward a new method about text categorization. It is a KNN algorithm based...

chapter

Microblog Sentiment Analysis Algorithm Research and Implementation Based on Classification

Yanxia Yang, Fengli Zhou

2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES) > 288 - 291

2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)

Under the background of today's information age, micro blog obtains a rapid development. With the news on the micro blog updating, in order to avoid the users getting lost in the ocean of information, emotion analysis of the information becomes urgent and important. This paper based on the implementation of micro blog emotion mining of Bayesian classifier and SVM classification algorithm, making comparison...

chapter

Classification of Chinese-to-English translated social network timelines using naive Bayes

Xiang-Ru Yu, Zhong-Liang Xiang, Dae-Ki Kang

2015 17th International Conference on Advanced Communication Technology (ICACT) > 296 - 299

2015 17th International Conference on Advanced Communication Technology (ICACT)

This study proposes a method that classifies Chinese social network positive-negative comments (Weibo) using naive Bayes algorithm trained from English social network (Twitter) corpus. We train our text classifier using Twitter corpus (in English language), and use this classifier to classify Chinese text. In the previous research, Chinese sentences are processed using Chinese word segmentation algorithms...

chapter

Comparison of Four Text Classifiers on Movie Reviews

Yaguang Wang, Wenlong Fu, Aina Sui, Yuqing Ding

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence > 495 - 498

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI)

Text Categorization plays an important role in the fields of information retrieval, machine learning, natural language processing, data mining and others. With the development of computer and information technology, there have been many classification algorithms. Each text classification algorithms will get result at differing speeds and efficiency due to the various feature of test text. It has been...

chapter

A k-Highest Expert Text Classification Algorithm Based on Choquet Integral

Shuchao Feng, Wenqian Shang, Yuqi Wang

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence > 499 - 503

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI)

In recent years, the research on text classification algorithm is still a hot topic in text mining. The KNN is a classic text classification algorithm. The rule of finding the nearest neighbors directly affects the performance and precision of categorization. In this paper, we mainly focus on distance measure and similarity. We propose a new text classification algorithm which combines KNN and Choquet...

chapter

A Text Classifier of English Movie Reviews Based on Information Gain

Lianjing Jin, Wei Gong, Wenlong Fu, Hongbin Wu

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence > 454 - 457

2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI)

Text classification is the foundation and core of text mining. Naive Bayes is an effective method for text classification. This paper improves the accuracy of Naive Bayes classification using improved information gain, one of methods of feature extraction, by reducing the impact of low-frequency words. In this paper, we use a widely corpus of NLTK. According to the test results, The accuracy of the...

chapter

A term weighting scheme based on the measure of relevance and distinction for text categorization

Jieming Yang, Jing Wang, Zhiying Liu, Zhaoyang Qu

2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 1 - 6

2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

Feature selection is often considered as a key step in text categorization. In this paper, we proposed a new feature selection algorithm, named AD, which comprehensively measures the degree of relevance and distinction of terms occur in document set. We evaluated AD on three benchmark document collections, 20-Newsgroups, Reuters-21578 and WebKB, using two classification algorithms, Naive Bayes and...

chapter

Hierarchical approach for scientific document classification

Arlina D'cunha, A. K. Sen

International Conference on Computing, Communication & Automation > 100 - 104

2015 International Conference on Computing, Communication & Automation (ICCCA)

Classification is the grouping of information or objects in predefined labeled categories based on similarities. Exponential growth rates of scientific document collection leads to unmanageable manual classification. Feature extraction is the central prerequisite of automatic document classification. TF-IDF (term frequency-inverse document frequency) is commonly used to express the text feature weight...

chapter

Performance of using LDA for Chinese news text classification

Xiaojun Wu, Liying Fang, Pu Wang, Nan Yu

2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE) > 1260 - 1264

2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE)

Chinese text classification is always challenging, especially when data are high dimensional and sparse. In this paper, we are interested in the way of text representation and dimension reduction in Chinese text classification. First, we introduces a topic model — Latent Dirichlet Allocation(LDA), which is uses LDA model as a dimension reduction method. Second, we choose Support Vector Machine(SVM)...

chapter

Data classification with k-NN using novel character frequency-direct word frequency (CF-DWF) similarity formula

Munwar Ali Zardari, Low Tang Jung

2015 International Symposium on Mathematical Sciences and Computing Research (iSMSC) > 280 - 285

2015 International Symposium on Mathematical Sciences and Computing Research (iSMSC)

The k-NN is one of the most popular and easy in implementation algorithm to classify the data. The best thing about k-NN is that it accepts changes with improved version. Despite many advantages of the k-NN, it is also facing many issues. These issues are: distance/similarity calculation complexity, training dataset complexity at classification phase, proper selection of k, and get duplicate values...

chapter

Predicting the Popularity of Trending Arabic Wikipedia Articles Based on External Stimulants Using Data/Text Mining Techniques

Hanadi Muqbil Al-Mutairi, Mohammad Badruddin Khan

2015 International Conference on Cloud Computing (ICCC) > 1 - 6

2015 International Conference on Cloud Computing (ICCC)

Wikipedia is considered to be one of the most famous online encyclopedias. We study the issues related to trending articles on Arabic Wikipedia and how it is influenced by certain external stimulants: for example, breaking news, celebrities' tweets, special events from the past, instant messages on any social media application or any other reasons that could affect the Arabic Wikipedia articles in...

chapter

Comparison of multilabel problem transformation methods for text mining

Ziad Abdallah, Ali El-Zaart, Mohamad Oueidat

2015 Fifth International Conference on Digital Information and Communication Technology and its Applications (DICTAP) > 115 - 118

2015 Fifth International Conference on Digital Information and Communication Technology and its Applications (DICTAP)

Primarily, the need for automatic text categorization and medical diagnosis was the start of Multi-label classification. Multi-label classification received a great attention and used in several real world applications The demand of its applications increased to cover additional fields like functional genomics, music, biology, scene, video etc. For example, a text document may belong to many subjects...

chapter

Personalised book recommendation system based on opinion mining technique

Kumari Priyanka, Anand Shanker Tewari, Asim Gopal Barman

2015 Global Conference on Communication Technologies (GCCT) > 285 - 289

2015 Global Conference on Communication Technologies (GCCT)

Recommendation systems are tools in e-commerce websites which helps user to find the most suitable products. From the huge number of books, it is really difficult to choose a particular book. So, the recommendation system technique plays very important role and helps user to get books according to their need and interest. This paper presents online book recommendation system for users who purchase...

chapter

Parallel Processing System for Marathi Content Generation

Sushma R. Vispute, Shrikant Patil, Sagar Sangale, Akshay Padwal, more

2015 International Conference on Computing Communication Control and Automation > 575 - 579

2015 International Conference on Computing Communication Control and automation(ICCUBEA)

The objective of the present work is to design a HADOOP based parallel Marathi content retrieval system using clustering technique to get the efficient and optimized result than existing systems. The system also focuses on providing the personalized documents in Marathi language to the end user based on their interests identified from the browsing history and using time session mechanism for re ranking...

chapter

Improved Comprehensive Measurement Feature Selection Method for Text Categorization

LiZhou Feng, WanLi Zuo, YouWei Wang

2015 International Conference on Network and Information Systems for Computers > 125 - 128

2015 International Conference on Network and Information Systems for Computers (ICNISC)

Text categorization plays an important role in applications where information is filtered, monitored, personalized, categorized, organized or searched. Feature selection remains as an effective and efficient technique in text categorization. Traditional feature selections ignored the effects of unbalanced categories and the distribution of a term in different categories. On this basis, we improved...

chapter

Application of Cooperative Algorithm in Text Feature Acquiring

Mengli Qiao

2014 Seventh International Symposium on Computational Intelligence and Design > 1 > 344 - 346

2014 7th International Symposium on Computational Intelligence and Design (ISCID)

Text feature acquiring is the key to construct the classifier to classify the text, According to the problem that the text dimension of the original feature vector is reduced and accurate, put forward a text feature acquiring algorithm based on co evolution, the algorithm uses genetic algorithm optimization performance and co evolution can implement multiple population mutual evaluation and competition,...

Keywords:
CLASSIFICATION ALGORITHMS
Publication type:
book

Publication date

Set your own date range

Content availability

Available (469)
None (5)

Keywords

TRAINING (252)
TEXT ANALYSIS (247)
SUPPORT VECTOR MACHINES (165)
TEXT CLASSIFICATION (145)
FEATURE EXTRACTION (137)
PATTERN CLASSIFICATION (120)
ACCURACY (114)
ALGORITHM DESIGN AND ANALYSIS (94)
SUPPORT VECTOR MACHINE CLASSIFICATION (89)
CLASSIFICATION (88)
MACHINE LEARNING (87)
DATA MINING (76)
FEATURE SELECTION (68)
LEARNING (ARTIFICIAL INTELLIGENCE) (59)
SUPPORT VECTOR MACHINE (46)
INTERNET (45)
NATURAL LANGUAGE PROCESSING (43)
BAYES METHODS (40)
SVM (38)
CLUSTERING ALGORITHMS (37)
INFORMATION RETRIEVAL (37)
COMPUTERS (34)
MACHINE LEARNING ALGORITHMS (33)
TEXT MINING (32)
TESTING (30)
SEMANTICS (29)
ENTROPY (27)
NIOBIUM (26)
KERNEL (24)
VECTOR SPACE MODEL (24)
COMPUTATIONAL MODELING (23)
KNN (22)
WEB PAGES (21)
ARTIFICIAL NEURAL NETWORKS (20)
DECISION TREES (20)
TRAINING DATA (20)
PROBABILITY (19)
DATABASES (18)
FILTERING (18)
MUTUAL INFORMATION (18)
STATISTICAL ANALYSIS (18)
MATHEMATICAL MODEL (17)
VECTORS (17)
BAYESIAN METHODS (16)
CORRELATION (16)
DICTIONARIES (16)
PATTERN CLUSTERING (16)
CLASSIFICATION TREE ANALYSIS (15)
PREDICTION ALGORITHMS (15)
COMPUTER SCIENCE (14)
GENETIC ALGORITHMS (14)
INDEXING (14)
INFORMATION GAIN (14)
NAIVE BAYES (14)
EDUCATIONAL INSTITUTIONS (13)
INFORMATION FILTERING (13)
DOCUMENT HANDLING (12)
INDEXES (12)
ROUGH SET THEORY (12)
SEMI-SUPERVISED LEARNING (12)
SENTIMENT ANALYSIS (12)
VOCABULARY (12)
WORD PROCESSING (12)
DISTANCE MEASUREMENT (11)
EQUATIONS (11)
NEAREST NEIGHBOR SEARCHES (11)
ONTOLOGIES (11)
WEB SITES (11)
CONTEXT (10)
DATA MODELS (10)
DECISION TREE (10)
ELECTRONIC MAIL (10)
ENCODING (10)
NOISE (10)
ROUGH SET (10)
TEXT CLASSIFICATION ALGORITHM (10)
CHINESE TEXT CATEGORIZATION (9)
CLUSTERING (9)
DIMENSION REDUCTION (9)
FUZZY SET THEORY (9)
MATRIX DECOMPOSITION (9)
NAIVE BAYES CLASSIFIER (9)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (9)
OPTIMIZATION (9)
SEARCH ENGINES (9)
SET THEORY (9)
ARTIFICIAL INTELLIGENCE (8)
COMPLEXITY THEORY (8)
DECISION MAKING (8)
DOCUMENT CLASSIFICATION (8)
FEATURE SELECTION METHOD (8)
FILTERING ALGORITHMS (8)
GAIN (8)
GENETIC ALGORITHM (8)
K-NEAREST NEIGHBOR (8)
KNN ALGORITHM (8)
KNOWLEDGE ENGINEERING (8)
NAïVE BAYES (8)
more

INFONA - science communication portal

Search results

Parallelized text classification algorithm for processing large scale TCM clinical data with MapReduce

Classification and clustering for neuroinformatics: Assessing the efficacy on reverse-mapped NeuroNLP data using standard ML techniques

Handling imbalanced dataset in multi-label text categorization using Bagging and Adaptive Boosting

A New SVM Method for Short Text Classification Based on Semi-Supervised Learning

Based on Rough Sets and the Associated Analysis of KNN Text Classification Research

Microblog Sentiment Analysis Algorithm Research and Implementation Based on Classification

Classification of Chinese-to-English translated social network timelines using naive Bayes

Comparison of Four Text Classifiers on Movie Reviews

A k-Highest Expert Text Classification Algorithm Based on Choquet Integral

A Text Classifier of English Movie Reviews Based on Information Gain

A term weighting scheme based on the measure of relevance and distinction for text categorization

Hierarchical approach for scientific document classification

Performance of using LDA for Chinese news text classification

Data classification with k-NN using novel character frequency-direct word frequency (CF-DWF) similarity formula

Predicting the Popularity of Trending Arabic Wikipedia Articles Based on External Stimulants Using Data/Text Mining Techniques

Comparison of multilabel problem transformation methods for text mining

Personalised book recommendation system based on opinion mining technique

Parallel Processing System for Marathi Content Generation

Improved Comprehensive Measurement Feature Selection Method for Text Categorization

Application of Cooperative Algorithm in Text Feature Acquiring

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options