Search results

Items from 101 to 120 out of 708 results

1 ...
3
4
5
6
7
8
9

chapter

Compression-based arabic text classification

Haneen Ta'amneh, Ehsan Abu Keshek, Manar Bani Issa, Mahmoud Al-Ayyoub, more

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 594 - 600

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)

Text classification (TC) is one of the fundamental problems in text mining. Plenty of works exist on TC with interesting approaches and excellent results; however, most of these works follow a word-based approach for feature extraction. In this work, we are interested in an alternative (byte-based or character-based) approach known as compression-based TC (CTC). CTC has been used for some languages...

chapter

Chunking Arabic texts using Conditional Random Fields

Nabil Khoufi, Chafik Aloulou, Lamia Hadrich Belguith

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 428 - 432

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)

Chunking or shallow syntactic parsing is proving to be a task of interest to many natural language processing applications. The problem gets worse for the Arabic language because of its specific features that make it quite different and even more ambiguous than other natural languages when processed. In this paper, we present a method for chunking Arabic texts based on supervised learning. We use...

chapter

Comparison of SVM classification method and semantic similarity method for sentiment classification

Changqin Quan, Xiquan Wei, Fuji Ren

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems > 23 - 28

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems (CCIS)

With the growth of the Internet and electronic commerce, there is more and more review data on the Internet. Quite a lot of Internet users refer to related comments of a product before they make a decision, which can teach them about the quality and reputation of the product and help them decide whether to buy it. A system that can automatically classify the polarity of a given text would be a great...

chapter

A framework for multilingual real-time spoken dialogue agents

Arnaud Jordan, Kenji Araki

2014 IEEE 6th International Conference on Awareness Science and Technology (iCAST) > 1 - 7

2014 IEEE 6th International Conference on Awareness Science and Technology (iCAST)

In this paper, we propose a framework for a spoken dialogue agent that is not dependent on any specific language; it takes some dialogues and sentences as training sets and uses them to acquire knowledge about the target language, then it uses this knowledge to generate several possible responses corresponding to the user input and finally it uses a simple score method to select the best one to show...

chapter

Syllabic Markov models of Arabic HMMs of spoken Arabic using CV units

Michael Ingleby, Fatmah Baothman

2014 Third IEEE International Colloquium in Information Science and Technology (CIST) > 254 - 259

2014 Third IEEE International Colloquium in Information Science and Technology (CIST)

We survey evidence — orthographic distributional phonological and psycholinguistic — in favor of a model of Arabic speech sounds based on the CV unit and extensive use of the silent sukuun vowel. We then construct a small-vocabulary multi-speaker CV HMM similar to the phonemic HMMs based on tied triphones that are widely used in speech recognizers for English and other European languages. Using experimental...

chapter

A modified technique for Word Sense Disambiguation using Lesk algorithm in Hindi language

Radhike Sawhney, Arvinder Kaur

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 2745 - 2749

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Word Sense Disambiguation (WSD) is a key factor in written and verbal communication of natural language processing. It is a method of selecting the appropriate sense of an ambiguous word in the given context. This paper aims at determining the correct sense of the given ambiguous word in Hindi language. A modified Lesk approach is used which uses the concept of dynamic context window. Dynamic context...

chapter

Authorship Analysis of Inspire Magazine through Stylometric and Psychological Features

Jennifer Sikos, Peter David, Nizar Habash, Reem Faraj

2014 IEEE Joint Intelligence and Security Informatics Conference > 33 - 40

2014 IEEE Joint Intelligence and Security Informatics Conference (JISIC)

When we read a piece of writing, the meaning we derive from that text often includes information about the authors themselves. Clues to their identity, worldview, and even psychological states are encoded in features such as word choice and sentence structure. This work describes how writing style features can be used to analyze the authorship of extreme jihadist writing. Inspire magazine is an online,...

chapter

REEL: A Relation Extraction Learning framework

Pablo Barrio, Goncalo Simoes, Helena Galhardas, Luis Gravano

IEEE/ACM Joint Conference on Digital Libraries > 455 - 456

2014 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

We introduce the REEL (RElation Extraction Learning) framework, an open source framework that facilitates the development and evaluation of relation extraction systems over text collections. To define a relation extraction system for a new relation and text collection, users only need to specify the parsers to load the collection, the relation and its constraints, and the learning and extraction techniques...

chapter

Joint layer based deep learning framework for bilingual machine transliteration

Sanjanaashree P, Anand Kumar M

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1737 - 1743

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Between the growth of Internet or World Wide Web (WWW) and the emersion of the social networking site like Friendster, Myspace etc., information society started facing exhilarating challenges in language technology applications such as Machine Translation (MT) and Information Retrieval (IR). Nevertheless, there were researchers working in Machine Translation that deal with real time information for...

chapter

Using topic models in domain adaptation

Samira Tofighi Zahabi, Somayeh Bakhshaei, Shahram Khadivi

7'th International Symposium on Telecommunications (IST'2014) > 539 - 543

2014 7th International Symposium on Telecommunications (IST)

An important factor of a corpus is its domain, usually the quality of a SMT system trained on an in-domain corpus increases by adding out-of-domain sentences to its training corpus. In this paper we have shown out-of-domain corpora may also contains sentences which are proper for improving the quality of in-domain corpus. These sentences have words and phrases that occur in indomain corpora so, their...

chapter

A novel unsupervised method for named-entity identification in resource-poor languages using bilingual corpus

Ramtin Mehdizadeh Seraj, Fattaneh Jabbari, Shahram Khadivi

7'th International Symposium on Telecommunications (IST'2014) > 519 - 523

2014 7th International Symposium on Telecommunications (IST)

We propose a new unsupervised method to identify Named Entities (NE) in resource-poor languages. The idea is to transfer the knowledge of NEs from a resource-rich language to a resource-poor one by using a bilingual parallel corpus of this language pair. After extracting all NE pair candidates and filtering these candidates (includes lexical and contextual filters) to obtain a high precision seed...

chapter

Rule-based machine translation from English to Telugu with emphasis on prepositions

Keerthi Lingam, E. Rama Lakshmi, L Ravi Theja

2014 First International Conference on Networks & Soft Computing (ICNSC2014) > 183 - 187

2014 International Conference on Networks & Soft Computing (ICNSC)

This paper deals with adaptive rule based machine translation from English to Telugu. This is a proposed approach and it is a rule-based methodology. Set of production rules, training set for English and Telugu sentences and English to telugu dictionary are developed for this purpose. In the process of machine translation, handling prepositions is the main issue. There are many different kinds of...

chapter

A Vector Space Model Based Education Resources Automatic Classifier

Tian Xia

2014 Enterprise Systems Conference > 323 - 326

2014 Enterprise Systems Conference (ES)

Along with the rapid improvements of informational technology, educational data grows quickly. Such data become massive and raw data. Researchers develop educational standards to regular such data. However, the standards are multiple and the education resources based on different education standards have different structure, which is hard to be shared. Most of them have become Information Islands...

chapter

A Neurobiologically Plausible Vector Symbolic Architecture

Daniel E. Padilla, Mark D. McDonnell

2014 IEEE International Conference on Semantic Computing > 242 - 245

2014 IEEE International Conference on Semantic Computing (ICSC)

Vector Symbolic Architectures (VSA) are approaches to representing symbols and structured combinations of symbols as high-dimensional vectors. They have applications in machine learning and for understanding information processing in neurobiology. VSAs are typically described in an abstract mathematical form in terms of vectors and operations on vectors. In this work, we show that a machine learning...

chapter

Improved Chinese-Japanese phrase-based MT quality using an extended quasi-parallel corpus

Hao Wang, Wei Yang, Yves Lepage

2014 IEEE International Conference on Progress in Informatics and Computing > 6 - 10

2014 International Conference on Progress in Informatics and Computing (PIC)

State-of-the-art phrase-based machine translation (MT) systems usually demand large parallel corpora in the step of training. The quality and the quantity of the training data exert a direct influence on the performance of such translation systems. The lack of open-source bilingual corpora for a particular language pair results in lower translation scores reported for such a language pair. This is...

chapter

Using Continuous Integration to organize and monitor the annotation process of domain specific corpora

Marc Schreiber, Kai Barkschat, Bodo Kraft

2014 5th International Conference on Information and Communication Systems (ICICS) > 1 - 6

2014 5th International Conference on Information and Communication Systems (ICICS)

Applications in the World Wide Web aggregate vast amounts of information from different data sources. The aggregation process is often implemented with Extract, Transform and Load (ETL) processes. Usually ETL processes require information for aggregation available in structured formats, e. g. XML or JSON. In many cases the information is provided in natural language text which makes the application...

chapter

An integrated approach to spam classification on Twitter using URL analysis, natural language processing and machine learning techniques

Kamalanathan Kandasamy, Preethi Koroth

2014 IEEE Students' Conference on Electrical, Electronics and Computer Science > 1 - 5

2014 IEEE Students' Conference on Electrical, Electronics and Computer Science (SCEECS)

In the present day world, people are so much habituated to Social Networks. Because of this, it is very easy to spread spam contents through them. One can access the details of any person very easily through these sites. No one is safe inside the social media. In this paper we are proposing an application which uses an integrated approach to the spam classification in Twitter. The integrated approach...

chapter

Event Causality Identification Using Conditional Random Field in Geriatric Care Domain

Saeed Mehrabi, Anand Krishnan, Eric Tinsley, Jon Sligh, more

2013 12th International Conference on Machine Learning and Applications > 1 > 339 - 343

2013 12th International Conference on Machine Learning and Applications (ICMLA)

Event extraction is a key step in many text-mining applications such as question-answering, information extraction and summarization systems. In this study we used conditional random field (CRF) to extract causal events from PubMed articles related to Geriatric care. Abstracts of geriatric care domain were manually reviewed and categorized into 42 different sub domains. There are a total of 19, 677...

chapter

A Hybrid Approach Using Maximum Entropy Model and Rules to Identify Tibetan Person Names

Yangji Jia, Jing Jiang, Hongzhi Yu

2013 International Conference on Computer Sciences and Applications > 377 - 380

2013 International Conference on Computer Sciences and Applications (CSA)

Tibetan person name recognition is one of the most difficult tasks in the area of Tibetan information processing, and the effect of recognition impacts directly on the precision of Tibetan word segmentation and the performance of relative application systems, which include Tibetan-Chinese machine translation, Tibetan information search, text categorization, etc. Based on the analysis of wording rules...

chapter

Intelligent Classroom System for Qualitative Analysis of Students' Conceptual Understanding

Jannat Talwar, Shree Ranjani, Anwaya Aras, Mangesh Bedekar

2013 6th International Conference on Emerging Trends in Engineering and Technology > 25 - 29

2013 6th International Conference on Emerging Trends in Engineering and Technology (ICETET)

With the increase of ubiquitous data all over the internet, intelligent classroom systems that integrate traditional learning techniques with modern e-learning tools have become quite popular and necessary today. Although a substantial amount of work has been done in the field of e-learning, specifically in automation of objective question and answer evaluation, personalized learning, adaptive evaluation...

1 ...
3
4
5
6
7
8
9

Keywords:
TRAINING
NATURAL LANGUAGE PROCESSING

Publication date

Set your own date range

Content availability

Available (697)
None (11)

Keywords

DATA MINING (194)
FEATURE EXTRACTION (175)
HIDDEN MARKOV MODELS (175)
ACCURACY (155)
SPEECH (151)
TEXT ANALYSIS (134)
SPEECH RECOGNITION (124)
SUPPORT VECTOR MACHINES (110)
LEARNING (ARTIFICIAL INTELLIGENCE) (91)
MACHINE LEARNING (90)
CONTEXT (81)
TAGGING (80)
DICTIONARIES (78)
CLASSIFICATION ALGORITHMS (76)
COMPUTATIONAL MODELING (73)
ARTIFICIAL NEURAL NETWORKS (69)
SEMANTICS (69)
DATA MODELS (64)
TESTING (64)
TRAINING DATA (62)
COMPUTATIONAL LINGUISTICS (59)
PATTERN CLASSIFICATION (59)
STATISTICAL ANALYSIS (59)
SPEECH PROCESSING (51)
INFORMATION RETRIEVAL (49)
LANGUAGE TRANSLATION (49)
ENTROPY (47)
PROBABILITY (47)
ACOUSTICS (46)
MATHEMATICAL MODEL (41)
TEXT CATEGORIZATION (39)
VOCABULARY (39)
DATABASES (38)
LABELING (38)
CHARACTER RECOGNITION (36)
HIDDEN MARKOV MODEL (36)
INTERNET (36)
SUPPORT VECTOR MACHINE CLASSIFICATION (35)
ADAPTATION MODEL (34)
SYNTACTICS (34)
COMPUTERS (33)
GRAMMARS (32)
WORD PROCESSING (32)
SUPPORT VECTOR MACHINE (31)
CLASSIFICATION (30)
EDUCATIONAL INSTITUTIONS (30)
HMM (28)
ALGORITHM DESIGN AND ANALYSIS (27)
KERNEL (27)
NATURAL LANGUAGES (27)
NEURAL NETS (27)
HUMANS (26)
DECODING (25)
HANDWRITING RECOGNITION (24)
NEURAL NETWORKS (24)
LINGUISTICS (23)
MACHINE TRANSLATION (23)
CONFERENCES (21)
CONTEXT MODELING (21)
KNOWLEDGE BASED SYSTEMS (21)
CONDITIONAL RANDOM FIELDS (20)
ERROR ANALYSIS (20)
HANDWRITTEN CHARACTER RECOGNITION (20)
SPEECH SYNTHESIS (20)
GAUSSIAN PROCESSES (19)
PROBABILITY DENSITY FUNCTION (19)
RANDOM PROCESSES (19)
CONDITIONAL RANDOM FIELD (18)
FEATURE SELECTION (18)
NAMED ENTITY RECOGNITION (18)
PRAGMATICS (18)
SENTIMENT ANALYSIS (18)
WORD SENSE DISAMBIGUATION (18)
CRF (17)
LANGUAGE MODEL (17)
NIST (17)
ORGANIZATIONS (17)
TEXT CLASSIFICATION (17)
BAYES METHODS (16)
STATISTICAL MACHINE TRANSLATION (16)
SVM (16)
TEXT MINING (16)
DOCUMENT HANDLING (15)
INFORMATION EXTRACTION (15)
MAXIMUM ENTROPY METHODS (15)
NEURONS (15)
PATTERN CLUSTERING (15)
CHINESE WORD SEGMENTATION (14)
CORRELATION (14)
PATTERN RECOGNITION (14)
PREDICTIVE MODELS (14)
SEARCH ENGINES (14)
SPEAKER RECOGNITION (14)
STANDARDS (14)
AUTOMATIC SPEECH RECOGNITION (13)
CLUSTERING ALGORITHMS (13)
DECISION TREES (13)
EQUATIONS (13)
more

INFONA - science communication portal

Search results

Compression-based arabic text classification

Chunking Arabic texts using Conditional Random Fields

Comparison of SVM classification method and semantic similarity method for sentiment classification

A framework for multilingual real-time spoken dialogue agents

Syllabic Markov models of Arabic HMMs of spoken Arabic using CV units

A modified technique for Word Sense Disambiguation using Lesk algorithm in Hindi language

Authorship Analysis of Inspire Magazine through Stylometric and Psychological Features

REEL: A Relation Extraction Learning framework

Joint layer based deep learning framework for bilingual machine transliteration

Using topic models in domain adaptation

A novel unsupervised method for named-entity identification in resource-poor languages using bilingual corpus

Rule-based machine translation from English to Telugu with emphasis on prepositions

A Vector Space Model Based Education Resources Automatic Classifier

A Neurobiologically Plausible Vector Symbolic Architecture

Improved Chinese-Japanese phrase-based MT quality using an extended quasi-parallel corpus

Using Continuous Integration to organize and monitor the annotation process of domain specific corpora

An integrated approach to spam classification on Twitter using URL analysis, natural language processing and machine learning techniques

Event Causality Identification Using Conditional Random Field in Geriatric Care Domain

A Hybrid Approach Using Maximum Entropy Model and Rules to Identify Tibetan Person Names

Intelligent Classroom System for Qualitative Analysis of Students' Conceptual Understanding

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options