Search results

Items from 1 to 20 out of 27 results

chapter

Deep learning algorithms based text classifier

Arthi Venkataraman

2016 2nd International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT) > 220 - 224

2016 2nd International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT)

There exists a base classification system for classification of problem tickets in the Enterprise domain. Different deep learning algorithms (Gated Recursive Unit and Long Short Term Memory) were investigated for solving the classification problem. Experiments were conducted for different parameters and layers for these algorithms. Paper brings out the architectures tried, results obtained, our conclusions...

chapter

Augmenting a classifier ensemble with automatically generated class level patterns for higher accuracy

Arthi Venkataraman

2015 Conference on Technologies and Applications of Artificial Intelligence (TAAI) > 266 - 272

2015 Conference on Technologies and Applications of Artificial Intelligence (TAAI)

Different types of classifiers were investigated in the context of classification of problem tickets in the Enterprise domain. There were still challenges in building an accurate classifier post data cleaning and other accuracy improving pre-processing techniques. Creating an ensemble of classifiers gave better accuracy than individual classifiers. The maximum accuracy was got by enhancing the ensemble...

chapter

Using a thesaurus-based approach for the categorisation of web sites

Sameerchand Pudaruth, Youven Ankiah, Keshav Sembhoo

2014 Seventh International Conference on Contemporary Computing (IC3) > 624 - 628

2014 Seventh International Conference on Contemporary Computing (IC3)

With the increasing number of Mauritian-owned websites on the internet, the need for classification is becoming highly important. Our objective in this research is to classify a list of websites into seven broad categories namely education, entertainment, government, health, tourism, sports and shopping. The homepage of three hundred and nineteen websites have been used in this study. We have exploited...

chapter

Identifying temporal relations between main events in new articles

Ines Berrazega, Rim Faiz

2013 ACS International Conference on Computer Systems and Applications (AICCSA) > 1 - 4

2013 ACS International Conference on Computer Systems and Applications (AICCSA)

With the expansion of the Web 2.0, daily huge amount of data is produced everywhere, namely new articles. These contents need to be exploited in order to extract relevant information and to build knowledge databases. In this concern, processing the temporal dimension of language and extracting temporal information from electronic news articles is becoming a prominent task. In this concern, we propose...

chapter

A Pointwise Approach for Vietnamese Diacritics Restoration

Tuan Anh Luu, Kazuhide Yamamoto

2012 International Conference on Asian Language Processing > 189 - 192

2012 International Conference on Asian Language Processing (IALP)

The automatic insertion of diacritics in electronic texts is necessary for a number of languages, including French, Romanian, Croatian, Sindhi, Vietnamese, etc. When diacritics are removed from a word and the resulting string of characters is not a word, it is easy to recover the diacritics. However, sometimes the resulting string is also a word, possibly with different grammatical properties or a...

chapter

Classifying Natural Language Sentences for Policy

John Slankas, Laurie Williams

2012 IEEE International Symposium on Policies for Distributed Systems and Networks > 33 - 36

2012 IEEE International Symposium on Policies for Distributed Systems and Networks - POLICY

Organizations derive policies from a wide variety of sources, such business plans, laws, regulations, and contracts. However, an efficient process does not yet exist for quickly finding or automatically deriving policies from uncontrolled natural language sources. The goal of our research is to assure compliance with established policies by ensuring policies in existing natural language texts are...

chapter

Chinese question classification in community question answering

Yunqi Lei, Yiyuan Jiang

2010 IEEE International Conference on Service-Oriented Computing and Applications (SOCA) > 1 - 6

2010 IEEE International Conference on Service-Oriented Computing and Applications (SOCA 2010)

Community Question Answering (CQA) has become a popular and effective mean for seeking information on the Web. It is now possible and effective to post a question asked in natural language on a popular community Question Answering (QA) portal, and to rely on other users to provide answers. These online collaborative services are attracting users and questions at an explosive rate, while how to correctly...

chapter

English and Taiwanese text categorization using N-gram based on Vector Space Model

M Suzuki, N Yamagishi, Yi-Ching Tsai, T Ishida, more

2010 International Symposium On Information Theory&Its Applications > 106 - 111

2010 International Symposium On Information Theory & Its Applications (ISITA 2010)

In this paper, we present a new mathematical model based on a “Vector Space Model” and consider its implications. The proposed method is evaluated by performing several experiments. In these experiments, we classify newspaper articles from the English Reuters-21578 data set, and Taiwanese China Times 2005 data set using the proposed method. The Reuters-21578 data set is a benchmark data set for automatic...

chapter

A comparative study of Neural networks architectures on Arabic text categorization using feature extraction

F Harrag, Abdul Malik Salman Al-Salman, M BenMohammed

2010 International Conference on Machine and Web Intelligence > 102 - 107

International Conference on Machine and Web Intelligence (ICMWI 2010)

In this paper, we present a model based on the Neural Network (NN) for classifying Arabic texts. We propose the use of Singular Value Decomposition (SVD) as a preprocessor of NN with the aim of further reducing data in terms of both size and dimensionality. Indeed, the use of SVD makes data more amenable to classification and the convergence training process faster. Specifically, the effectiveness...

chapter

Automatic positive sentiment word extraction for Chinese text classification

Zhen'gang Yu, Ning Zhen, Ming Xu

2010 International Conference On Computer Design and Applications > 1 > V1-250 - V1-255

2010 International Conference on Computer Design and Applications (ICCDA 2010)

Sentiment analysis aims to predict sentiment tendency automatically. Traditional methods tackling this problem are mostly based on supervised learning,but it is time-consuming and uneasy to extendable. In this paper,we provide a novel method of sentiment analysis based on un-supervised learning together with some language rules. It is no necessary to have a positive sentiment dictionary beforehand...

chapter

Classification of voice disorders in children with cochlear implantation and hearing aid using multiple classifier fusion

Z Mahmoudi, S Rahati, M M Ghasemi, V Asadpour, more

10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010) > 304 - 307

2010 10th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA 2010)

Speech production and speech phonetic features gradually improve in children by obtaining audio feedback after cochlear implantation or using hearing aid. In this study, voice disorders in children with cochlear implantation and hearing aid are classified. 30 Persian children participated in the study, including 6 children in levels 1 to 3 and 12 in level 4. Voice samples of 5 isolated Persian words...

chapter

Text categorization algorithms representations based on inductive learning

Cao Jian-fang, Wang Hong-bin

2010 2nd IEEE International Conference on Information Management and Engineering > 352 - 355

2010 2nd IEEE International Conference on Information Management and Engineering (ICIME 2010)

Text categorization-assignment of natural language texts to one or more predefined categories based on their content-is an important component in many information organization and management tasks. Categorization algorithm is the most critical factor to text categorization system performance. The inductive learning classifiers are put forward. Very accurate text categorization result can be learned...

chapter

SVM Based Part of Speech Tagger for Malayalam

P J Antony, Santhanu P Mohan, K P Soman

2010 International Conference on Recent Trends in Information, Telecommunication and Computing > 339 - 341

2010 International Conference on Recent Trends in Information, Telecommunication and Computing (ITC 2010)

This paper presents the building of part-of-speech Tagger for Malayalam Language using Support Vector Machine (SVM). POS tagger plays an important role in Natural language applications like speech recognition, natural language parsing, information retrieval and information extraction. This supervised machine learning POS tagging approach requires a large amount of annotated training corpus to tag...

chapter

A New Method of Training Sample Selection in Text Classification

Yixing Liao, Xuezeng Pan

2010 Second International Workshop on Education Technology and Computer Science > 1 > 211 - 214

2010 2nd International Workshop on Education Technology and Computer Science (ETCS)

Aiming to noise samples in the training dataset, a new method for reducing the amount of training dataset is proposed in the paper which is applicable to text classification. This method describes the distribution of training dataset according to the representativeness score of samples in the class they belong to, so as to show representative samples and noise samples in each class. The new method...

chapter

Study on Method of Word Segmentation in Feature Selection in Chinese Text Categorization

Huang Wei, Liu Yi, Gao Bing, Yang Ke-wei

2010 Third International Conference on Knowledge Discovery and Data Mining > 411 - 415

2010 3rd International Conference on Knowledge Discovery and Data Mining (WKDD 2010)

Since the automatic word segmentation of Chinese text will bring the lack of information, method of word segmentation according to lexical chunk as segmentation unit are proposed. Use traditional segmentation method segment Chinese text based calculate mutual information between two lexical entries and adjacent frequency of two or more lexical entries, according to this calculated value judge and...

chapter

A discriminative learning approach for orientation detection of Urdu document images

S.F. Rashid, S.S. Bukhari, F. Shafait, T.M. Breuel

2009 IEEE 13th International Multitopic Conference > 1 - 5

2009 IEEE 13th International Multitopic Conference (INMIC 2009)

Orientation detection is an important preprocessing step for accurate recognition of text from document images. Many existing orientation detection techniques are based on the fact that in Roman script text ascenders occur more likely than descenders, but this approach is not applicable to document of other scripts like Urdu, Arabic, etc. In this paper, we propose a discriminative learning approach...

chapter

The Automatic Categorization of Arabic Documents by Boosting Decision Trees

S Raheel, J Dichy, M Hassoun

2009 Fifth International Conference on Signal Image Technology and Internet Based Systems > 294 - 301

2009 Fifth International Conference on Signal-Image Technology & Internet-Based Systems (SITIS 2009)

Automatic document classification has been subject to research since the early 1960s. However, additional research is still required and possible because the results obtained until now remain subject to further enhancement and refinement. Although a lot of literature has been written on the subject, very little research was reported on the automatic classification of Arabic documents none of which...

chapter

Dealing with Chinese Overlapping Ambiguity Based on Type Functional Application

Dongping Gao, Jiahong Guo

2009 International Conference on Artificial Intelligence and Computational Intelligence > 3 > 67 - 71

2009 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2009)

The method of type functional application is employed attempting to resolve Chinese overlapping ambiguity in the area of Chinese word segmentation. Instead of traditional methods which treat Chinese overlapping ambiguity as classification problems, the proposed approach regards this task as a sentence type calculus problem. The method is based on type theory and the benefit of this method is that...

chapter

Chinese Text Classification Using Key Characters String Kernel

Shiqiang Zheng, Yujiu Yang, Haiping Wu, Wenhuang Liu

2009 Fifth International Conference on Semantics, Knowledge and Grid > 113 - 119

2009 Fifth International Conference on Semantics, Knowledge and Grid (SKG 2009)

Most Chinese text classification methods are based on Chinese word segmentation and bag of words (BOW). The classification performance largely relies on the accuracy of segmentation. Unfortunately, perfect precision and disambiguation of segmentation cannot be reached. In order to solve this problem, a novel Chinese text classification method using string kernel is presented. String kernel computes...

chapter

A Chinese Classifier Research for Query Intention and Non-query Intention

Wu Xiaohui, G. Allan, Song Pingping, Zhang Rongxin

2009 Fifth International Conference on Semantics, Knowledge and Grid > 366 - 370

2009 Fifth International Conference on Semantics, Knowledge and Grid (SKG 2009)

Previous research to improve the performance of Internet search engines has focused on classifying questions, sentences and user-goals but not the classification of sentences and phrases based on query intention and non-query intention. This paper investigates a classification system of query intention and non-query intention of sentences and phrases by firstly analyzing previous work and based on...

Keywords:
CLASSIFICATION
NATURAL LANGUAGE PROCESSING

Publication date

Set your own date range

Publication type

book (26)
article (1)

Keywords

TRAINING (14)
TEXT ANALYSIS (12)
CLASSIFICATION ALGORITHMS (11)
FEATURE EXTRACTION (9)
DATA MINING (8)
TEXT CATEGORIZATION (8)
MACHINE LEARNING (7)
LEARNING (ARTIFICIAL INTELLIGENCE) (6)
SUPPORT VECTOR MACHINES (6)
ARTIFICIAL NEURAL NETWORKS (4)
DICTIONARIES (4)
INTERNET (4)
SUPPORT VECTOR MACHINE CLASSIFICATION (4)
NATURAL LANGUAGES (3)
NEURAL NETS (3)
NEURAL NETWORK (3)
SEMANTICS (3)
SPEECH (3)
SUPPORT VECTOR MACHINE (3)
TEXT CLASSIFICATION (3)
TEXT MINING (3)
ARABIC CORPUS (2)
ARABIC TEXT CATEGORIZATION (2)
BOOK REVIEWS (2)
CHINESE TEXT CLASSIFICATION (2)
CLASSIFICATION TREE ANALYSIS (2)
CORRELATION (2)
DECISION TREES (2)
DOCUMENT HANDLING (2)
ERROR ANALYSIS (2)
FEATURE SELECTION (2)
GOLD (2)
HIDDEN MARKOV MODELS (2)
INFORMATION PROCESSING (2)
INFORMATION RETRIEVAL (2)
KERNEL (2)
MATRIX DECOMPOSITION (2)
PATTERN CLASSIFICATION (2)
PROBABILITY (2)
RADIAL BASIS FUNCTION NETWORKS (2)
RBF (2)
SENTIMENT ANALYSIS (2)
SINGULAR VALUE DECOMPOSITION (2)
SPEECH PROCESSING (2)
SVM (2)
TAGGING (2)
TRAINING DATA (2)
WORD PROCESSING (2)
ADADELTA (1)
ADAGRAD (1)
ADAM (1)
AMBIGUOUS STRINGS (1)
ANN (1)
ANN CLASSIFIER (1)
ARABIC DOCUMENT CLASSIFICATION (1)
ARABIC DOCUMENT CLASSIFICATION CATEGORIZATION (1)
ARABIC LANGUAGE (1)
ARABIC LANGUAGE DOCUMENTS (1)
ARABIC TEXT CLASSIFICATION (1)
ARABIC TEXT DOCUMENT CLASSIFICATION (1)
ARRAYS (1)
ARTIFICIAL NEURAL NETWORK (1)
AUDIO FEEDBACK (1)
AUTOMATIC CLASSIFICATION (1)
AUTOMATIC CLASSIFIER (1)
AUTOMATIC DIACRITIC RESTORATION (1)
AUTOMATIC DOCUMENT CATEGORIZATION (1)
AUTOMATIC DOCUMENT CLASSIFICATION (1)
AUTOMATIC POSITIVE SENTIMENT WORD EXTRACTION (1)
AUTOMATIC TEXT CATEGORIZATION (1)
BAYES CRITERION (1)
BAYES METHODS (1)
BOOKS (1)
BOOSTING (1)
CALL CENTERS (1)
CALL CENTRES (1)
CALL ROUTING (1)
CHAOS (1)
CHINESE (1)
CHINESE CLASSIFIER RESEARCH (1)
CHINESE OVERLAPPING AMBIGUITY (1)
CHINESE QUESTION CLASSIFICATION (1)
CHINESE TEXT CATEGORIZATION (1)
CHINESE WORD SEGMENTATION (1)
CLASSIFICATION PROBLEMS (1)
CLASSIFICATION SYSTEM (1)
COCHLEAR IMPLANTATION (1)
COCHLEAR IMPLANTS (1)
COMBINATION STRATEGY (1)
COMMITTEE-BASED APPROACH (1)
COMMUNITIES (1)
COMMUNITY QUESTION ANSWERING (1)
COMMUNITY QUESTION ANSWERING PORTAL (1)
COMPOUNDS (1)
COMPUTATIONAL MODELING (1)
COMPUTER ARCHITECTURE (1)
COMPUTER SCIENCE (1)
more

INFONA - science communication portal

Search results

Deep learning algorithms based text classifier

Augmenting a classifier ensemble with automatically generated class level patterns for higher accuracy

Using a thesaurus-based approach for the categorisation of web sites

Identifying temporal relations between main events in new articles

A Pointwise Approach for Vietnamese Diacritics Restoration

Classifying Natural Language Sentences for Policy

Chinese question classification in community question answering

English and Taiwanese text categorization using N-gram based on Vector Space Model

A comparative study of Neural networks architectures on Arabic text categorization using feature extraction

Automatic positive sentiment word extraction for Chinese text classification

Classification of voice disorders in children with cochlear implantation and hearing aid using multiple classifier fusion

Text categorization algorithms representations based on inductive learning

SVM Based Part of Speech Tagger for Malayalam

A New Method of Training Sample Selection in Text Classification

Study on Method of Word Segmentation in Feature Selection in Chinese Text Categorization

A discriminative learning approach for orientation detection of Urdu document images

The Automatic Categorization of Arabic Documents by Boosting Decision Trees

Dealing with Chinese Overlapping Ambiguity Based on Type Functional Application

Chinese Text Classification Using Key Characters String Kernel

A Chinese Classifier Research for Query Intention and Non-query Intention

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options