Advanced search

Advanced search in people

From:

To:

Items from 1 to 20 out of 34 results

chapter

Myanmar to English verb translation disambiguation approach based on Naïve Bayesian classifier

Phyo Phyo Wai

2011 3rd International Conference on Computer Research and Development > 3 > 6 - 9

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

Natural Language processing (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages. Ambiguity is one of these problems which have been a great challenge for computational linguists. This paper concentrates on the problem of target word selection in Myanmar to English machine translation, for which the approach is directly...

chapter

Domain Independent Sentiment Classification with Many Lexicons

B Ohana, B Tierney, S Delany

2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications > 632 - 637

2011 25th IEEE International Conference on Advanced Information Networking and Applications Workshops (WAINA 2011)

Sentiment lexicons are language resources widely used in opinion mining and important tools in unsupervised sentiment classification. We present a comparative study of sentiment classification of reviews on six different domains using sentiment lexicons from different sources. Our results highlight the tendency of a lexicon's performance to be imbalanced towards one class, and indicate lexicon accuracy...

chapter

Sentence-Level and Document-Level Sentiment Mining for Arabic Texts

N Farra, E Challita, R A Assi, H Hajj

2010 IEEE International Conference on Data Mining Workshops > 1114 - 1119

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In this work, we investigate sentiment mining of Arabic text at both the sentence level and the document level. Existing research in Arabic sentiment mining remains very limited. For sentence-level classification, we investigate two approaches. The first is a novel grammatical approach that employs the use of a general structure for the Arabic sentence. The second approach is based on the semantic...

chapter

Automatic lexical stress detection for Chinese learners' of English

Jin-Yu Chen, Lan Wang

2010 7th International Symposium on Chinese Spoken Language Processing > 407 - 411

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

This paper investigates lexical stress detection for Chinese learners of English, where a combined differential acoustic feature is developed to represent the lexical stress of polysyllabic words in continuous speech. The use of frame-averaged feature and the contextual information intra-word can be input to the classifiers without normalization. The word-based stress detection method proposed in...

chapter

Online Handwritten Kannada Word Recognizer with Unrestricted Vocabulary

R Kunwar, K Shashikiran, A G Ramakrishnan

2010 12th International Conference on Frontiers in Handwriting Recognition > 611 - 616

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

In this paper, we propose a novel heuristic approach to segment recognizable symbols from online Kannada word data and perform recognition of the entire word. Two different estimates of first derivative are extracted from the preprocessed stroke groups and used as features for classification. Estimate 2 proved better resulting in 88% accuracy, which is 3% more than that achieved with estimate 1. Classification...

chapter

A Hybrid Model for Recognition of Online Handwriting in Indian Scripts

Amit Arora, Anoop M Namboodiri

2010 12th International Conference on Frontiers in Handwriting Recognition > 433 - 438

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

We present a complete online handwritten character recognition system for Indian languages that handles the ambiguities in segmentation as well as recognition of the strokes. The recognition is based on a generative model of handwriting formation, coupled with a discriminative model for classification of strokes. Such an approach can seamlessly integrate language and script information in the generative...

chapter

Study on Multiple Classifiers for Chinese Word Sense Disambiguation

Guo Jiang, Zhang Yangsen

2010 International Conference on Artificial Intelligence and Computational Intelligence > 1 > 433 - 437

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

In this paper, a new method of multiple layer classifiers integration based on single classifier is proposed which called Auto Weight Adjust. In the most used classifiers, Maximum Entropy (ME) model has excellent performance, and Naïve Bayesian (NB) is preferred by researchers for it's simple and useful. So in our experiments we chose ME and NB as single classifiers and use the ME classifier result...

chapter

English and Taiwanese text categorization using N-gram based on Vector Space Model

M Suzuki, N Yamagishi, Yi-Ching Tsai, T Ishida, more

2010 International Symposium On Information Theory&Its Applications > 106 - 111

2010 International Symposium On Information Theory & Its Applications (ISITA 2010)

In this paper, we present a new mathematical model based on a “Vector Space Model” and consider its implications. The proposed method is evaluated by performing several experiments. In these experiments, we classify newspaper articles from the English Reuters-21578 data set, and Taiwanese China Times 2005 data set using the proposed method. The Reuters-21578 data set is a benchmark data set for automatic...

chapter

Realization of a high performance bilingual OCR system for Thai-English printed documents

S Tangwongsan, B Suvacharakulton

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 6

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

This paper presents a high performance bilingual OCR system for printed Thai and English text. With the complex nature of both Thai and English languages, the first stage is to identify languages within different zones by using geometric properties for differentiation. The second stage is the process of character recognition, in which the technique developed includes a feature extractor and a classifier...

chapter

A Study of Designing Compact Recognizers of Handwritten Chinese Characters Using Multiple-Prototype Based Classifiers

Yongqiang Wang, Qiang Huo

2010 20th International Conference on Pattern Recognition > 1872 - 1875

2010 20th International Conference on Pattern Recognition (ICPR 2010)

We present a study of designing compact recognizers of handwritten Chinese characters using multiple-prototype based classifiers. A modified Quick prop algorithm is proposed to optimize a sample-separation-margin based minimum classification error objective function. Split vector quantization technique is used to compress classifier parameters. Benchmark results are reported for classifiers with different...

chapter

Comparison of HMM and SDTW for Tamil handwritten character recognition

K Shashikiran, K S Prasad, R Kunwar, A G Ramakrishnan

2010 International Conference on Signal Processing and Communications (SPCOM) > 1 - 4

2010 International Conference on Signal Processing and Communications (SPCOM 2010)

In this paper, we compare the experimental results for Tamil online handwritten character recognition using HMM and Statistical Dynamic Time Warping (SDTW) as classifiers. HMM was used for a 156-class problem. Different feature sets and values for the HMM states & mixtures were tried and the best combination was found to be 16 states & 14 mixtures, giving an accuracy of 85%. The features used...

chapter

Incremental Bayesian classification for Chinese question sentences based on fuzzy feedback

Shuling Di, Hui Li, Pilian He

2010 2nd International Conference on Future Computer and Communication > 1 > V1-401 - V1-404

2010 2nd International Conference on Future Computer and Communication (ICFCC 2010)

Aiming at problems such as fixed training set and lacking of completed information in traditional Bayesian classification, incremental learning mechanism is introduced. Combining with the characteristics of question sentences in Chinese question answering system, Semi-Naive Bayesian model is used to construct classifier. In order to make prior distribution of samples lean to even distribution, samples...

chapter

Text categorization algorithms representations based on inductive learning

Cao Jian-fang, Wang Hong-bin

2010 2nd IEEE International Conference on Information Management and Engineering > 352 - 355

2010 2nd IEEE International Conference on Information Management and Engineering (ICIME 2010)

Text categorization-assignment of natural language texts to one or more predefined categories based on their content-is an important component in many information organization and management tasks. Categorization algorithm is the most critical factor to text categorization system performance. The inductive learning classifiers are put forward. Very accurate text categorization result can be learned...

chapter

Error corrective classifier fusion for spoken Language Recognition

Omid Dehzangi, Bin Ma, Eng Siong Chng, Haizhou Li

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 1994 - 1997

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

A number of effective classification algorithms have been developed for spoken language recognition, and it has been a common practice in the NIST Language Recognition Evaluations (LREs) that an information fusion is applied to boost the performance of the recognition system. This paper investigates the fusion of multiple output scores generated using different classifiers that complement to further...

chapter

Authorship attribution of web forum posts

S R Pillay, T Solorio

2010 eCrime Researchers Summit > 1 - 7

2010 eCrime Researchers Summit (eCrime 2010)

Extracting useful information from user generated text on the web is an important ongoing research in natural language processing, machine learning, and data mining. Online tools like emails, news groups, blogs, and web forums provide an effective communication platform for millions of users around the globe and also provide an added advantage of anonymity. Millions of people post information on different...

chapter

A Bayesian classifier for the identification of non-referential pronouns in Arabic

Souha Mezghani Hammami, Rahma Sallemi, Lamia Hadrich Belguith

2010 The 7th International Conference on Informatics and Systems (INFOS) > 1 - 6

2010 7th International Conference on Informatics and Systems (INFOS 2010)

Machine learning has become the predominant problem-solving strategy for computational linguistics problems in the last decade. In this paper, we present an implemented machine learning system for the automatic identification of non-referential pronouns in Arabic texts. Our system is based on a Bayesian network which has shown its efficiency for modeling NLP problems. We have evaluated our approach...

chapter

Research of Chinese Text Classification Methods Based on Semantic Vector and Semantic Similarity

Xin Song, Jia Huang, Jing-min Zhou, Xi Chen

2009 International Forum on Computer Science-Technology and Applications > 2 > 187 - 190

2009 International Forum on Computer Science-Technology and Applications (IFCSTA 2009)

To overcome the limitations of traditional text classification approaches based on bag-of-words representation and to effectively incorporate linguistic knowledge and conceptual index into text vector space model, based on two thesaurus HowNet and Tongyici Cilin (hereinafter referred to Cilin), we use semantic vector to describe a document instead of traditional keywords vector, which is based on...

chapter

Automated classification of radiology reports for acute lung injury: Comparison of keyword and machine learning based natural language processing approaches

I. Solti, C.R. Cooke, Fei Xia, M.M. Wurfel

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop > 314 - 319

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop, BIBMW

This paper compares the performance of keyword and machine learning-based chest x-ray report classification for Acute Lung Injury (ALI). ALI mortality is approximately 30 percent. High mortality is, in part, a consequence of delayed manual chest x-ray classification. An automated system could reduce the time to recognize ALI and lead to reductions in mortality. For our study, 96 and 857 chest x-ray...

chapter

Automated Identification of LTL Patterns in Natural Language Requirements

A.P. Nikora, G. Balcom

2009 20th International Symposium on Software Reliability Engineering > 185 - 194

2009 20th International Symposium on Software Reliability Engineering (ISSRE 2009)

Analyzing requirements for consistency and checking them for correctness can require significant effort, particularly if they have not been maintained with a requirements management tool (e.g., DOORS) or specified in a machine-readable notation. By restricting the number of requirements being analyzed, fewer opportunities exist for introducing errors into the analysis. This can be accomplished by...

chapter

Are SentiWordNet scores suited for multi-domain sentiment classification?

K. Denecke

2009 Fourth International Conference on Digital Information Management > 1 - 6

2009 Fourth International Conference on Digital Information Management

Motivated by the numerous applications of analysing opinions in multi-domain scenarios, this paper studies the potential of a still rarely considered approach to the problem of multi-domain sentiment analysis based on Senti-WordNet as lexical resource. SentiWordNet scores are exploited together with additional features to assign a polarity to a text using machine learning. On the other hand, a rule-based...

Keywords:
ACCURACY
PATTERN CLASSIFICATION
NATURAL LANGUAGE PROCESSING

Publication date

Set your own date range

Content availability

Available (33)
None (1)

Keywords

TRAINING (18)
DATA MINING (15)
LEARNING (ARTIFICIAL INTELLIGENCE) (14)
FEATURE EXTRACTION (13)
CLASSIFICATION ALGORITHMS (12)
MACHINE LEARNING (12)
TEXT ANALYSIS (12)
SUPPORT VECTOR MACHINES (6)
TEXT CATEGORIZATION (6)
CHARACTER RECOGNITION (5)
CONTEXT (5)
HANDWRITING RECOGNITION (5)
SPEECH (5)
HIDDEN MARKOV MODELS (4)
OPINION MINING (4)
SUPPORT VECTOR MACHINE CLASSIFICATION (4)
SYNTACTICS (4)
ALGORITHM DESIGN AND ANALYSIS (3)
BAYES METHODS (3)
BAYESIAN METHODS (3)
COMPUTERS (3)
DICTIONARIES (3)
ENTROPY (3)
TESTING (3)
ARTIFICIAL NEURAL NETWORKS (2)
BOOK REVIEWS (2)
CHINESE TEXT CLASSIFICATION (2)
CLASSIFICATION (2)
CLASSIFICATION TASK (2)
DATABASES (2)
FUZZY SET THEORY (2)
GRAMMARS (2)
HANDWRITTEN CHARACTER RECOGNITION (2)
LEARNING SYSTEMS (2)
MACHINE LEARNING ALGORITHMS (2)
MOTION PICTURES (2)
PATTERN CLUSTERING (2)
PATTERN RECOGNITION (2)
PREDICTIVE MODELS (2)
PROBABILITY (2)
SEMANTICS (2)
SENTIMENT CLASSIFICATION (2)
STATISTICAL ANALYSIS (2)
SUPERVISED LEARNING (2)
TAGGING (2)
TEXT CLASSIFICATION (2)
TEXT MINING (2)
THESAURI (2)
VECTOR SPACE MODEL (2)
WEIGHT MEASUREMENT (2)
WRITING (2)
ACOUSTICS (1)
ACUTE LUNG INJURY (1)
ANALYTICAL MODELS (1)
ANAPHORA RESOLUTION (1)
ANNOTATION (1)
ARABIC (1)
ARABIC LANGUAGE PROCESSING (1)
ARABIC MORPHOSYNTACTIC DISAMBIGUATION (1)
ARABIC TEXT (1)
ARABIC TEXTS (1)
ARTIFICIAL INTELLIGENCE (1)
AUTHORISATION (1)
AUTHORSHIP ATTRIBUTION (1)
AUTO WEIGHT ADJUST (1)
AUTOMATED CONSISTENCY CHECKING (1)
AUTOMATED IDENTIFICATION (1)
AUTOMATIC LEXICAL STRESS DETECTION (1)
AUTOMATIC STOP WORDS IDENTIFICATION (1)
AUTOMATIC SUBJECTIVITY JUDGMENT (1)
AUTOMATIC TEXT CATEGORIZATION (1)
AUTOMATION (1)
BACK PROPAGATION NETWORK (1)
BACKPROPAGATION (1)
BAG-OF-WORDS REPRESENTATION (1)
BAM-VOTE BOX-BASED FRAMEWORK (1)
BAYESIAN CLASSIFICATION (1)
BAYESIAN CLASSIFIER (1)
BAYESIAN NETWORK (1)
BELIEF NETWORKS (1)
BILINGUAL OCR (1)
BILINGUAL OCR SYSTEM (1)
BOOKS (1)
BVB MODEL (1)
BVB-BASED COMPOSITE MODEL (1)
CAMERAS (1)
CHARACTER N-GRAMS (1)
CHEST X-RAY REPORT CLASSIFICATION (1)
CHINESE INTELLIGENT QUESTION ANSWERING SYSTEM IMPLEMENTATION (1)
CHINESE INTELLIGENT QUESTION ANSWERING SYSTEM STUDIES (1)
CHINESE LANGUAGE (1)
CHINESE LEARNER (1)
CHINESE QUESTION ANSWERING SYSTEM (1)
CHINESE QUESTION SENTENCES (1)
CHINESE REVIEWS (1)
CHINESE SENTIMENT ANALYSIS (1)
CHINESE SUBCATEGORIZATION ANNOTATION (1)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Myanmar to English verb translation disambiguation approach based on Naïve Bayesian classifier

Domain Independent Sentiment Classification with Many Lexicons

Sentence-Level and Document-Level Sentiment Mining for Arabic Texts

Automatic lexical stress detection for Chinese learners' of English

Online Handwritten Kannada Word Recognizer with Unrestricted Vocabulary

A Hybrid Model for Recognition of Online Handwriting in Indian Scripts

Study on Multiple Classifiers for Chinese Word Sense Disambiguation

English and Taiwanese text categorization using N-gram based on Vector Space Model

Realization of a high performance bilingual OCR system for Thai-English printed documents

A Study of Designing Compact Recognizers of Handwritten Chinese Characters Using Multiple-Prototype Based Classifiers

Comparison of HMM and SDTW for Tamil handwritten character recognition

Incremental Bayesian classification for Chinese question sentences based on fuzzy feedback

Text categorization algorithms representations based on inductive learning

Error corrective classifier fusion for spoken Language Recognition

Authorship attribution of web forum posts

A Bayesian classifier for the identification of non-referential pronouns in Arabic

Research of Chinese Text Classification Methods Based on Semantic Vector and Semantic Similarity

Automated classification of radiology reports for acute lung injury: Comparison of keyword and machine learning based natural language processing approaches

Automated Identification of LTL Patterns in Natural Language Requirements

Are SentiWordNet scores suited for multi-domain sentiment classification?

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options