Search results

Items from 1 to 20 out of 40 results

chapter

An Intelligent Tutoring System for Argument-Making in Higher Education: A Pilot Study

Ching-Hua Chuan, Daniel Dinsmore, Joseph Schmuller, Tyler Morris

2014 13th International Conference on Machine Learning and Applications > 553 - 556

2014 13th International Conference on Machine Learning and Applications (ICMLA)

This paper presents a pilot study on an intelligent tutoring system for domain-independent argument making. Students' responses to an open-ended question were collected as the instances for supervised text classification based on the grade given by the instructor using structured outcome of the learning observation taxonomy. The responses were processed using Coh-Metrix as well as n-gram models to...

chapter

Text classification based on a novel ensemble multi-label learning method

Tao Zhang, Jiansheng Wu, Haifeng Hu

The 2014 2nd International Conference on Systems and Informatics (ICSAI 2014) > 964 - 968

2014 2nd International Conference on Systems and Informatics (ICSAI)

Text classification is one of the most significant topics in the Natural Language Processing research field. In most real cases, text classification is a multi-label learning task. Currently, there are three mainstream attribute measures (i.e., information gain, document frequency and chi-square test values) which are often used to describe documents. The three attribute measures have been applied...

chapter

An opinion mining approach for Romanian language

Roxana Monica Russu, Mihaela Dinsoreanu, Oana Luminita Vlad, Rodica Potolea

2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP) > 43 - 46

2014 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

The paper proposes a solution for document and aspect levels sentiment analysis for unstructured documents written in the Romanian language. The opinion extraction relies on two different approaches for polarity identification. At the aspect level we propose a rule-based approach. For the document level we consider supervised learning techniques, based on features extracted and filtered in different...

chapter

A new feature selection method in fishery information processing

Jun Gu, Nan He

2014 10th International Conference on Natural Computation (ICNC) > 834 - 838

2014 10th International Conference on Natural Computation (ICNC)

Fishery information processing can help fishery researchers obtain the needed information easily and quickly. The current information processing techniques have not solved the problem of high dimensional features in fishery information processing. In this paper, a feature selection method for fishery texts based on SVM-RFE was put forward in view of the characteristics of fishery texts. It removed...

chapter

Word clustering based on word2vec and semantic similarity

Jie Luo, Qinglin Wang, Yuan Li

Proceedings of the 33rd Chinese Control Conference > 517 - 521

2014 33rd Chinese Control Conference (CCC)

Domain word clustering has important theoretical and practical significance in text categorization, ontology research, machine learning and many other research areas. The domain word clustering method in this article is based on word2vec and semantic similarity computation. First, we obtain the candidate word set by using word2vec tools for a preliminary clustering of words. Then we tectonic...

chapter

Public Opinion Analysis of Microblog Content

Yonghe Lu, Jianhua Chen

2014 International Conference on Information Science & Applications (ICISA) > 1 - 5

2014 International Conference on Information Science and Applications (ICISA)

In this paper, a public opinion analysis system is built up. It consists of a crawler used to retrieve online microblog content and a text classifier for distinguishing sentimental content. This system is used to identify public opinions towards certain topics. Microblogs are divided into three categories based on their emotional tendency, namely "positive", "negative" and "objective",...

chapter

Non-standard words as features for text categorization

Slobodan Beliga, Sanda Martincic-Ipsic

2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) > 1165 - 1169

2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)

This paper presents categorization of Croatian texts using Non-Standard Words (NSW) as features. Non-Standard Words include numbers, dates, acronyms, abbreviations, currency, etc. NSWs in the Croatian language are determined according to the Croatian NSW taxonomy. For the purpose of this research, 390 text documents were collected and formed the SKIPEZ collection with 6 classes: official, literary, informative,...

chapter

A better indicator for genre classification: Topic word or surface text feature: A case study of recognition of brief biography

Wenxin Xiong

2014 International Conference on Information Science, Electronics and Electrical Engineering > 3 > 2015 - 2019

2014 International Conference on Information Science, Electronics and Electrical Engineering (ISEEE)

Classification based on topic (content) rather than genre (form) prevails in the text data mining and search engine circle. To simplify this work, a BOW (Bag of Words) strategy, counting topic-related words as features, is comprehensively utilized to make a final decision. Indeed, texts can be categorized by expression styles rather than their themes. Brief biography is a typical text class which...

chapter

Effective categorization of text in practical design

S. Ravi, M. Sambath, K. Ramesh Kumar

International Conference on Information Communication and Embedded Systems (ICICES2014) > 1 - 5

2014 International Conference on Information Communication and Embedded Systems (ICICES)

Data mining extracts novel and useful knowledge from large repositories of data and has become an effective analysis and decision means in corporations. In many information processing tasks, labels are usually expensive and the unlabeled data points are abundant. To reduce the cost of collecting labels, it is crucial to predict which unlabeled examples are the most informative, i.e., improve the classifier...

chapter

Language identification: A new fast algorithm to identify the language of a text in a multilingual corpus

Said Gadri, Abdelouahab Moussaoui, Linda Belabdelouahab-Fernini

2014 International Conference on Multimedia Computing and Systems (ICMCS) > 321 - 326

2014 International Conference on Multimedia Computing and Systems (ICMCS)

Identifying the language of a text is a very important preliminary phase in the categorization of multilingual documents and in information retrieval. This phase becomes difficult if we consider only the word as the basic unit of information in texts, because that may be possible for some languages, such as French or English, but is very difficult for others, such as German, Chinese and Arabic. In...

chapter

Text classification based on semi-supervised learning

Vo Duy Thanh, Pham Minh Tuan, Vo Trung Hung, Doan Van Ban

2013 International Conference on Soft Computing and Pattern Recognition (SoCPaR) > 232 - 236

2013 International Conference of Soft Computing and Pattern Recognition (SoCPaR)

In this paper, we present our solution and experimental results for applying semi-supervised machine learning techniques and an improved SVM algorithm to build text classification applications. First, we create a feature model based on labeled data, and then we improve it using the unlabeled data. The technique for adding a label to new data is based on...

chapter

Research on Text Feature Selection Algorithm Based on Information Gain and Feature Relation Tree

Hong Zhang, Yong-gong Ren, Xue Yang

2013 10th Web Information System and Application Conference > 446 - 449

2013 10th Web Information System and Application Conference (WISA)

The classification performance of previous IG algorithm may decline obviously because of the maldistribution of classes and features, due to which an improved text feature selection method UDsIG is proposed. First, we select features by classes to reduce the impact on feature selection when the classes are unevenly distributed. After that, we use feature equilibrium of distribution to decrease the...

chapter

An effective method to recognize the language of a text in a collection of multilingual documents

Said Kadri, Abdelouahab Moussaoui

2013 International Conference on Electronics, Computer and Computation (ICECCO) > 208 - 211

2013 International Conference on Electronics, Computer and Computation (ICECCO)

Identifying the language of a text means that we assign this text to a language in which it is written. This identification becomes important because of the increased diversity of textual data in different languages on the web. A real recognition of the text language is not possible if we just consider the word as a basic unit of information. It could be possible in some languages but very difficult...

chapter

A Document Image Segmentation System Using Analysis of Connected Components

F. Zirari, A. Ennaji, S. Nicolas, D. Mammass

2013 12th International Conference on Document Analysis and Recognition > 753 - 757

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

Page segmentation into text and non-text elements is an essential preprocessing step before optical character recognition (OCR) operation. In case of poor segmentation, an OCR classification engine produces garbage characters due to the presence of non-text elements. This paper presents a method to separate the textual and non textual components in document images using a graph-based modeling and...

chapter

The Effect of Combining Different Feature Selection Methods on Arabic Text Classification

Abdulmohsen Al-Thubaity, Norah Abanumay, Sara Al-Jerayyed, Aljoharah Alrukban, et al.

2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing > 211 - 216

2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

Feature selection is one of several factors affecting text classification systems. Feature selection aims to choose a representative subset of all features to reduce the complexity of classification problems. Usually a single method is used for feature selection. For English, several attempts were reported examining the combination of different feature selection methods. To the best of our knowledge...

chapter

A simple text/graphic separation method for document image segmentation

F. Zirari, A. Ennaji, S. Nicolas, D. Mammass

2013 ACS International Conference on Computer Systems and Applications (AICCSA) > 1 - 4

2013 ACS International Conference on Computer Systems and Applications (AICCSA)

chapter

An ontology-based dimensionality reduction algorithm for biomedical literature classification

Jing Wang, Gongqing Wu, Xuegang Hu

IEEE Conference Anthology > 1 - 5

2013 IEEE Conference Anthology

Dimension reduction is an important component in automatic text categorization, especially biomedical literature classification. Many studies have shown that statistic-based dimension reduction algorithms, like Information Gain (IG), are very effective in document categorization. However, these algorithms still suffer from major drawbacks. One facet is that they tend to use all the words as features...

chapter

Performance analysis and improvement of naïve Bayes in text classification application

Wei Zhang, Feng Gao

IEEE Conference Anthology > 1 - 4

2013 IEEE Conference Anthology

The naïve Bayes classifier is widely used in machine learning for its simplicity and efficiency. However, most of the existing work on naïve Bayes has focused on improving the Bayes model itself or on whether the “naïve assumption” is satisfied. In this paper, the performance of naïve Bayes in text classification is analyzed and the corresponding results from different points of view are presented, then an improving...

chapter

RLS-MARS: An Effective Feature Selection Tool for Text Classification

Li Xi, Dai Hang, Wang Mingwen

2012 Fourth International Conference on Multimedia Information Networking and Security > 254 - 257

2012 4th International Conference on Multimedia Information Networking and Security (MINES)

The RLS-MARS (Regularized Least Squares-Multi Angle Regression and Shrinkage) feature selection model is used to select the relevant information, in which both, the keeping and the leaving-out of the regularizer are present. The RLS-MARS model is to find a series of directions in multidimensional space, leading the gradient vectors to change along those directions which would make the gradient matrix's...

chapter

Weirdness Coefficient as a Feature Selection Method for Arabic Special Domain Text Classification

Abdulmohsen Al-Thubaity, Albandari Alanazi, Itisam Hazzaa, Haya Al-Tuwaijri

2012 International Conference on Asian Language Processing > 69 - 72

2012 International Conference on Asian Language Processing (IALP)

Given the importance of organizing and managing the rapid growth in knowledge of Arabic electronic content, this study introduces the Weirdness Coefficient (W) as a new feature selection method for Arabic special domain text classification. The proposed method was used to classify a dataset comprising five Islamic topics using Naïve Bayes (NB) and K-nearest neighbor (K-NN) classifiers, and three representation...

Keywords:
EDUCATIONAL INSTITUTIONS


INFONA - science communication portal
