Search results

Items from 101 to 120 out of 474 results

1 ...
3
4
5
6
7
8
9

chapter

Augmenting the novice-expert overlay model in an intelligent tutoring system: Using confidence-weighted linear classifiers

Tenzin Doleck, Ram B. Basnet, Eric Poitras, Susanne Lajoie

2014 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2014 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

In BioWorld, a medical intelligent tutoring system, novice physicians are tasked with solving virtual patient cases. Whilst the importance of modeling and predicting clinical reasoning is recognized, an important aspect of the learner contribution remains unexplored — the written case summary prepared by the learner. The premise of investigating the case summaries is that it captures the thought and...

chapter

News classification based on their headlines: A review

Mazhar Iqbal Rana, Shehzad Khalid, Muhammad Usman Akbar

17th IEEE International Multi Topic Conference 2014 > 211 - 216

2014 IEEE 17th International Multi-Topic Conference (INMIC)

For the last few years, text mining has been gaining significant importance. Since Knowledge is now available to users through variety of sources i.e. electronic media, digital media, print media, and many more. Due to huge availability of text in numerous forms, a lot of unstructured data has been recorded by research experts and have found numerous ways in literature to convert this scattered text...

chapter

Learning to Rank with Only Positive Examples

Mingzhu Zhu, Wei Xiong, Yi-Fang Brook Wu

2014 13th International Conference on Machine Learning and Applications > 87 - 92

2014 13th International Conference on Machine Learning and Applications (ICMLA)

Search By Multiple Examples (SBME) is a new search paradigm that allows users to specify their information needs as a set of relevant documents rather than as a set of keywords. In this study, we propose a Transductive Positive Unlabeled learning (TPU learning) based framework for SBME. The framework consists of two steps: 1) identifying potential relevant documents for searching space reduction,...

chapter

An Improved Text Categorization Algorithm Based on VSM

Ji Geng, Yunling Lu, Wei Chen, Zhiguang Qin

2014 IEEE 17th International Conference on Computational Science and Engineering > 1701 - 1706

2014 IEEE 17th International Conference on Computational Science and Engineering (CSE)

With the advent of the information age, various kinds of information have been spread on the Internet. The amount of junk information affects people's lives seriously. In order to filter the harmful Web pages efficiently and effectively, we have suggested a novel text classification algorithm based on Vector Space Model in this paper. This algorithm has adopted the modularized processing mode to deal...

chapter

A Hierarchy Method Based on LDA and SVM for News Classification

Limeng Cui, Fan Meng, Yong Shi, Minqiang Li, more

2014 IEEE International Conference on Data Mining Workshop > 60 - 64

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

He growth of the online data provides the user a access to information on the Internet but also creates the challenges to obtain the valuable knowledge. In this paper we focus on news text classification, which is meaningful for information provider to organize and display the news but also for the users to reach the valuable information easily. A hierarchy method based on LDA and SVM is proposed...

chapter

Research on energy-efficient text classification

Hao Lin

Proceedings of 2nd International Conference on Information Technology and Electronic Commerce > 257 - 261

2014 2nd International Conference on Information Technology and Electronic Commerce (ICITEC)

People rely on data mining techniques like text categorization more and more to explore valuable information, due to the ever-increasing electronic documents produced. Although the energy consumed by text categorization increases with the data, people usually pay attention to its effectiveness and there is little research about its energy cost. In this paper, we evaluate the energy cost of different...

chapter

Text classification based on a novel ensemble multi-label learning method

Tao Zhang, Jiansheng Wu, Haifeng Hu

The 2014 2nd International Conference on Systems and Informatics (ICSAI 2014) > 964 - 968

2014 2nd International Conference on Systems and Informatics (ICSAI)

Text classification is one of the most significant contents in Natural Language Processing research field. In most real cases, text classification is usually a multi-label learning task. Currently, there are three mainstream attribute measures (i.e., information gain, document frequency and chi-square test values) which are often used to describe documents. The three attribute measures have been applied...

chapter

Towards Reliable Clustering of English Text Documents Using Correlation Coefficient

Hrishikesh Bhaumik, Anirban Mukherjee, Siddhartha Bhattacharyya, Manojit Chattopadhyay

2014 International Conference on Computational Intelligence and Communication Networks > 530 - 535

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

This paper proposes a new approach for clustering English text documents, based on finding the pair wise correlation of documents in a given set of text documents. The correlation coefficient for each pair of documents is calculated on the basis of ranks given to the words in the documents. The ranking of the words occurring in a document is computed on the basis of weights of the words calculated...

chapter

A new feature selection method for text categorization based on information gain and particle swarm optimization

Ferruh Yigit, Omer Kaan Baykan

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems > 523 - 529

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems (CCIS)

Rapid increases of the documents which are created in digital media necessitate analyze and classify of these documents automatically. Feature extraction, feature selection and classifier selection in the analysis of documents and classification affects performance. In text document categorization, it is a fundamental problem that the numbers of extracted features are a lot of. In this study, by using...

chapter

Application of knowledge gain on multi-type feature space in microblog user classification

Xu Yan

2014 IEEE International Conference on Granular Computing (GrC) > 340 - 345

2014 IEEE International Conference on Granular Computing (GrC)

Feature selection plays an important role in text categorization. Classic feature selection methods such as document frequency (DF), information gain (IG), mutual information (MI) are commonly applied in text categorization. But usually they only take plain text into account. Knowledge Gain (KG) is a new feature selection method which is proposed in my previous paper. It measures attribute's importance...

chapter

Active learning for text classification: Using the LSI Subspace Signature Model

Weizhong Zhu, Robert B. Allen

2014 International Conference on Data Science and Advanced Analytics (DSAA) > 149 - 155

2014 International Conference on Data Science and Advanced Analytics (DSAA)

Supervised learning methods rely on large sets of labeled training examples. However, large training sets are rare and making them is expensive. In this research, Latent Semantic Indexing Subspace Signature Model (LSISSM) is applied to labeling for active learning of unstructured text. Based on Singular Value Decomposition (SVD), LSISSM represents terms and documents as semantic signatures by the...

chapter

Automated document classification for news article in Bahasa Indonesia based on term frequency inverse document frequency (TF-IDF) approach

Ari Aulia Hakim, Alva Erwin, Kho I Eng, Maulahikmah Galinium, more

2014 6th International Conference on Information Technology and Electrical Engineering (ICITEE) > 1 - 4

2014 6th International Conference on Information Technology and Electrical Engineering (ICITEE)

The exponential growth of the data may lead us to the information explosion era, an era where most of the data cannot be managed easily. Text mining study is believed to prevent the world from entering that era. One of the text mining studies that may prevent the explosion era is text classification. It is a way to classify articles into several predefined categories. In this research, the classifier...

chapter

An opinion mining approach for Romanian language

Roxana Monica Russu, Mihaela Dinsoreanu, Oana Luminita Vlad, Rodica Potolea

2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP) > 43 - 46

2014 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

The paper proposes a solution for document and aspect levels sentiment analysis for unstructured documents written in the Romanian language. The opinion extraction relies on two different approaches for polarity identification. At the aspect level we propose a rule-based approach. For the document level we consider supervised learning techniques, based on features extracted and filtered in different...

chapter

A Clique Based Web Page Classification Corrective Approach

Belmouhcine Abdelbadie, Benkhalifa Mohammed

2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) > 2 > 467 - 473

2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT)

Nowadays, the web is the most relevant data source. Its size does not stop growing day by day. Web page classification becomes crucial due to this overwhelming amount of data. Web pages contain many noisy contents that bias textual classifiers and lead them to lose focus on their main subject. Web pages are related to each other either implicitly by users' intuitive judgments or explicitly by hyperlinks...

chapter

A method of pre-sentence text based on Map/Reduce storage and indexing classification

Wu Qing, Yu Yue, Yao Yi, Wu Liang

2014 IEEE 5th International Conference on Software Engineering and Service Science > 195 - 199

2014 5th IEEE International Conference on Software Engineering and Service Science (ICSESS)

Today, as more and more businesses and individuals into the study of cloud computing, data storage in the cloud platform is also growing. So how cloud environment quickly and effectively store, manage and use these data has become a very important and challenging issues. This paper mainly discusses the storage model based on Map/Reduce text categorization, at the same time combining forecasting data...

chapter

Research on text classification based on SVM-KNN

Yun Lin, Jie Wang

2014 IEEE 5th International Conference on Software Engineering and Service Science > 842 - 844

2014 5th IEEE International Conference on Software Engineering and Service Science (ICSESS)

A new text classification algorithm has been put forward based on basic support vector machine algorithm. The SVM-KNN algorithm for text classification has been proposed which combined SVM algorithm and KNN algorithm. The SVM-KNN algorithm can improve the performance of classifier by the feedback and improvement of classifying prediction probability. The actual effect of SVM-KNN algorithm is tested...

chapter

Public Opinion Analysis of Microblog Content

Yonghe Lu, Jianhua Chen

2014 International Conference on Information Science & Applications (ICISA) > 1 - 5

2014 International Conference on Information Science and Applications (ICISA)

In this paper, a public opinion analysis system is built up. It consists of a crawler used to retrieve online microblog content and a text classifier for distinguishing sentimental content. This system is used to identify public opinions towards certain topics. Microblogs are divided into three categories based on their emotional tendency, namely "positive", "negative" and "objective",...

chapter

A short message classification algorithm for tweet classification

P. Selvaperumal, A. Suruliandi

2014 International Conference on Recent Trends in Information Technology > 1 - 3

2014 Fourth International Conference on Recent Trends in Information Technology (ICRTIT)

Twitter users tweet their views in the form of short text messages. Twitter topic classification is classifying the tweets in to a set of predefined classes. In this work, a new tweet classification Method that makes use of tweet features like URL's in the tweet, retweeted tweets and influential users tweet is proposed. Experiments were carried out with extensive tweet data set. The performance of...

chapter

A high performance hybrid algorithm for text classification

Prema Nedungadi, Haripriya Harikumar, Maneesha Ramesh

The Fifth International Conference on the Applications of Digital Information and Web Technologies (ICADIWT 2014) > 118 - 123

2014 Fifth International Conference on the Applications of Digital Information and Web Technologies (ICADIWT)

The high computational complexity of text classification is a significant problem with the growing surge in text data. An effective but computationally expensive classification is the k-nearest-neighbor (kNN) algorithm. Principal Component Analysis (PCA) has commonly been used as a preprocessing phase to reduce the dimensionality followed by kNN. However, though the dimensionality is reduced, the...

chapter

Large-Scale Web Page Classification

Sathi T. Marath, Michael Shepherd, Evangelos Milios, Jack Duffy

2014 47th Hawaii International Conference on System Sciences > 1813 - 1822

2014 47th Hawaii International Conference on System Sciences (HICSS)

This research investigates the design of a unified framework for the content-based classification of highly imbalanced hierarchical datasets, such as web directories. In an imbalanced dataset, the prior probability distribution of a category indicates the presence or absence of class imbalance. This may include the lack of positive training instances (rarity) or an overabundance of positive instances...

1 ...
3
4
5
6
7
8
9

Keywords:
CLASSIFICATION ALGORITHMS
Publication type:
book

Publication date

Set your own date range

Content availability

Available (469)
None (5)

Keywords

TRAINING (252)
TEXT ANALYSIS (247)
SUPPORT VECTOR MACHINES (165)
TEXT CLASSIFICATION (145)
FEATURE EXTRACTION (137)
PATTERN CLASSIFICATION (120)
ACCURACY (114)
ALGORITHM DESIGN AND ANALYSIS (94)
SUPPORT VECTOR MACHINE CLASSIFICATION (89)
CLASSIFICATION (88)
MACHINE LEARNING (87)
DATA MINING (76)
FEATURE SELECTION (68)
LEARNING (ARTIFICIAL INTELLIGENCE) (59)
SUPPORT VECTOR MACHINE (46)
INTERNET (45)
NATURAL LANGUAGE PROCESSING (43)
BAYES METHODS (40)
SVM (38)
CLUSTERING ALGORITHMS (37)
INFORMATION RETRIEVAL (37)
COMPUTERS (34)
MACHINE LEARNING ALGORITHMS (33)
TEXT MINING (32)
TESTING (30)
SEMANTICS (29)
ENTROPY (27)
NIOBIUM (26)
KERNEL (24)
VECTOR SPACE MODEL (24)
COMPUTATIONAL MODELING (23)
KNN (22)
WEB PAGES (21)
ARTIFICIAL NEURAL NETWORKS (20)
DECISION TREES (20)
TRAINING DATA (20)
PROBABILITY (19)
DATABASES (18)
FILTERING (18)
MUTUAL INFORMATION (18)
STATISTICAL ANALYSIS (18)
MATHEMATICAL MODEL (17)
VECTORS (17)
BAYESIAN METHODS (16)
CORRELATION (16)
DICTIONARIES (16)
PATTERN CLUSTERING (16)
CLASSIFICATION TREE ANALYSIS (15)
PREDICTION ALGORITHMS (15)
COMPUTER SCIENCE (14)
GENETIC ALGORITHMS (14)
INDEXING (14)
INFORMATION GAIN (14)
NAIVE BAYES (14)
EDUCATIONAL INSTITUTIONS (13)
INFORMATION FILTERING (13)
DOCUMENT HANDLING (12)
INDEXES (12)
ROUGH SET THEORY (12)
SEMI-SUPERVISED LEARNING (12)
SENTIMENT ANALYSIS (12)
VOCABULARY (12)
WORD PROCESSING (12)
DISTANCE MEASUREMENT (11)
EQUATIONS (11)
NEAREST NEIGHBOR SEARCHES (11)
ONTOLOGIES (11)
WEB SITES (11)
CONTEXT (10)
DATA MODELS (10)
DECISION TREE (10)
ELECTRONIC MAIL (10)
ENCODING (10)
NOISE (10)
ROUGH SET (10)
TEXT CLASSIFICATION ALGORITHM (10)
CHINESE TEXT CATEGORIZATION (9)
CLUSTERING (9)
DIMENSION REDUCTION (9)
FUZZY SET THEORY (9)
MATRIX DECOMPOSITION (9)
NAIVE BAYES CLASSIFIER (9)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (9)
OPTIMIZATION (9)
SEARCH ENGINES (9)
SET THEORY (9)
ARTIFICIAL INTELLIGENCE (8)
COMPLEXITY THEORY (8)
DECISION MAKING (8)
DOCUMENT CLASSIFICATION (8)
FEATURE SELECTION METHOD (8)
FILTERING ALGORITHMS (8)
GAIN (8)
GENETIC ALGORITHM (8)
K-NEAREST NEIGHBOR (8)
KNN ALGORITHM (8)
KNOWLEDGE ENGINEERING (8)
NAïVE BAYES (8)
more

INFONA - science communication portal

Search results

Augmenting the novice-expert overlay model in an intelligent tutoring system: Using confidence-weighted linear classifiers

News classification based on their headlines: A review

Learning to Rank with Only Positive Examples

An Improved Text Categorization Algorithm Based on VSM

A Hierarchy Method Based on LDA and SVM for News Classification

Research on energy-efficient text classification

Text classification based on a novel ensemble multi-label learning method

Towards Reliable Clustering of English Text Documents Using Correlation Coefficient

A new feature selection method for text categorization based on information gain and particle swarm optimization

Application of knowledge gain on multi-type feature space in microblog user classification

Active learning for text classification: Using the LSI Subspace Signature Model

Automated document classification for news article in Bahasa Indonesia based on term frequency inverse document frequency (TF-IDF) approach

An opinion mining approach for Romanian language

A Clique Based Web Page Classification Corrective Approach

A method of pre-sentence text based on Map/Reduce storage and indexing classification

Research on text classification based on SVM-KNN

Public Opinion Analysis of Microblog Content

A short message classification algorithm for tweet classification

A high performance hybrid algorithm for text classification

Large-Scale Web Page Classification

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options