Search results

Items from 1 to 8 out of 8 results

chapter

Managed N-gram language model based on Hadoop framework and a Hbase tables

Tahani Mahmoud Allam, Alsayed Abdelhameed Sallam, Hatem M. Abdullkader

2014 9th International Conference on Informatics and Systems > PDC-58 - PDC-63

2014 9th International Conference on Informatics and Systems (INFOS)

N-grams are a building block in natural language processing and information retrieval. It is a sequence of a string data like contiguous words or other tokens in text documents. In this work, we study how N-gram can be computed efficiently using a MapReduce for distributed data processing and a distributed database named Hbase This technique is applied to construct the training and testing processes...

chapter

Using topic models in domain adaptation

Samira Tofighi Zahabi, Somayeh Bakhshaei, Shahram Khadivi

7'th International Symposium on Telecommunications (IST'2014) > 539 - 543

2014 7th International Symposium on Telecommunications (IST)

An important factor of a corpus is its domain, usually the quality of a SMT system trained on an in-domain corpus increases by adding out-of-domain sentences to its training corpus. In this paper we have shown out-of-domain corpora may also contains sentences which are proper for improving the quality of in-domain corpus. These sentences have words and phrases that occur in indomain corpora so, their...

chapter

Sentence Similarity-Based Source Context Modelling in PBSMT

R Haque, S K Naskar, A Way, M R Costa-jussa, more

2010 International Conference on Asian Language Processing > 257 - 260

2010 International Conference on Asian Language Processing (IALP 2010)

Target phrase selection, a crucial component of the state-of-the-art phrase-based statistical machine translation(PBSMT) model, plays a key role in generating accurate translation hypotheses. Inspired by context-rich word-sense disambiguation techniques, machine translation (MT) researchers have successfully integrated various types of source language context into the PBSMT model to improve target...

chapter

Can Information Retrieval techniques automatic assessment challenges?

M.M. Hasan

2009 12th International Conference on Computers and Information Technology > 333 - 338

2009 12th International Conference on Computer and Information Technology (ICCIT 2009)

In Information Retrieval (IR), the similarity scores between a query and a set of documents are calculated, and the relevant documents are ranked based on their similarity scores. IR systems often consider queries as short documents containing only a few words in calculating document similarity score. In Computer Aided Assessment (CAA) of narrative answers, when model answers are available, the similarity...

chapter

Computing Word Similarity on Large-Scale Corpus

Tao Xu, Weiguang Qu, Xuri Tang, Dexin Ding, more

2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC) > 1076 - 1079

2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC 2009)

This paper proposes a novel approach for word similarity computation based on word sense vectors. The word sense vector is built using HIT-IR Tongyici Cilin (extended) for concept generalization and is further modified by the use of relative and absolute frequency filters. Experiments show that the approach not only overcomes the problem of similarity computation of unseen words but also yields a...

chapter

Automatic Choosing of English Rhymes in Translation of Chinese Ancient Poems

Miao Fang, Xin Jiang, Qi Zhao, Yi Jiang

2009 Second International Symposium on Knowledge Acquisition and Modeling > 1 > 434 - 437

2009 Second International Symposium on Knowledge Acquisition and Modeling (KAM 2009)

Translating Chinese ancient poem is a valuable but hard thing. Automatic choosing of English rhymes in translation of Chinese ancient poems would do translators a favor. This paper extracts three important factors that influence English rhymes, and presents a set of statistical models based on these factors, and then trains these models and acquires their parameters, which at last are used to recommend...

chapter

Research on Prosodic Features and Their Prediction Issues in Uyghur Text-to-Speech System

A. Hamdulla, A. Rozi, G. Eli, D. Tursun

2009 Pacific-Asia Conference on Circuits, Communications and Systems > 257 - 260

2009 Pacific-Asia Conference on Circuits, Communications and Systems (PACCS 2009)

As one of the core technologies of minority language information processing, in recent years, the Uyghur speech synthesis technology has made great progress, but in TTS (text to speech) systems, prosodic phrases are not predicted with high accuracy which slows down the improvement of naturalness of synthesized speech. In this paper, Uyghur prosodic features was studied and the context features which...

chapter

Word-Sense Disambiguation using maximum entropy model

N. Chatterjee, R. Misra

2009 Proceeding of International Conference on Methods and Models in Computer Science (ICM2CS) > 1 - 4

2009 International Conference on Methods and Models in Computer Science (ICM2CS)

Natural languages are typically replete with homographs, words which have more than one meaning. Consequently, machine understanding of natural language sentences sometimes suffers from certain ambiguities in getting the correct sense of a word in a given sentence. In this work we present a trainable model for word sense disambiguation (WSD) for resolving this ambiguity. The proposed model applies...

Filter options

Data set:
ieee
Keywords:
COMPUTATIONAL MODELING
CONTEXT
TRAINING
NATURAL LANGUAGE PROCESSING

Publication date

Set your own date range

Content availability

Available (7)
None (1)

Keywords

LANGUAGE TRANSLATION (3)
CONTEXT MODELING (2)
DATA MINING (2)
DICTIONARIES (2)
HUMANS (2)
STATISTICAL MACHINE TRANSLATION (2)
ABSOLUTE FREQUENCY FILTERS (1)
ACCURACY (1)
ADAPTATION MODELS (1)
BILINGUAL TRAINING SENTENCES (1)
BLEU SCORE (1)
CHINESE ANCIENT POEMS TRANSLATION (1)
COMPUTATIONAL LINGUISTICS (1)
COMPUTER AIDED ASSESSMENT (1)
COMPUTER AIDED INSTRUCTION (1)
COMPUTERS (1)
CONFERENCES (1)
CONTEXT FEATURE (1)
CONTEXT-RICH WORD-SENSE DISAMBIGUATION TECHNIQUES (1)
CORRELATION (1)
DISTRIBUTED COMPUTING (1)
DOCUMENT SIMILARITY SCORES (1)
DOCUMENT VECTORS (1)
DOMAIN ADAPTATION (1)
ENGLISH RHYMES (1)
ENGLISH-TO-CHINESE TRANSLATION TASK (1)
ENTROPY (1)
EQUATIONS (1)
FEATURE EXTRACTION (1)
GRAMMAR (1)
GRAMMATICAL STRUCTURE (1)
HADOOP FRAMEWORK (1)
HBASE TABLES (1)
HOMOGRAPHS (1)
IN-DOMAIN PARALLEL CORPUS (1)
INFORMATICS (1)
INFORMATION RETRIEVAL (1)
INFORMATION THEORY (1)
INTELLIGENT TEXT ANALYSIS (1)
KEYPHRASE EXTRACTION (1)
LARGE-SCALE CORPUS (1)
LEXICAL SYNTACTIC DESCRIPTIONS (1)
LITERATURE (1)
LONG-RANGE WORD-TO-WORD DEPENDENCIES (1)
MAPREDUCE (1)
MATHEMATICAL MODEL (1)
MAXIMUM ENTROPY METHODS (1)
MAXIMUM ENTROPY MODEL (1)
MODEL ANSWER (1)
N-GRAM MODEL (1)
N-GRAM WORD SEQUENCES (1)
NATURAL LANGUAGES (1)
OUT-OF-DOMAIN SENTENCES (1)
PBSMT (1)
PHRASE-BASED STATISTICAL MACHINE TRANSLATION MODEL (1)
PREDICTION (1)
PREDICTIVE MODELS (1)
PROBABILITY (1)
RELATIVE FILTERS (1)
RELEVANT DOCUMENTS (1)
RHYME (1)
RHYTHM (1)
SENTENCE EXTRACTION (1)
SENTENCE LENGTH (1)
SENTENCE SIMILARITY (1)
SENTENCE SIMILARITY-BASED SOURCE CONTEXT MODELLING (1)
SENTENCE-SIMILARITY FEATURES (1)
SHORT DOCUMENTS (1)
SMOOTHING METHODS (1)
SMT SYSTEM (1)
SOURCE CONTEXT INFORMATION (1)
SPEECH (1)
SPEECH SYNTHESIS (1)
STATISTICAL ANALYSIS (1)
STATISTICAL MODEL (1)
STATISTICAL MODELS (1)
STRESS (1)
SUPER TAG-BASED FEATURES (1)
SYNONYM RESOLUTION (1)
SYNTACTICS (1)
TARGET PHRASE SELECTION (1)
TESTING (1)
TEXT ANALYSIS (1)
TEXT-TO-SPEECH (1)
TOPIC MODEL (1)
TOPIC MODELS (1)
TRANSLATION (1)
TRANSLATION HYPOTHESES (1)
TRANSLATION MODEL (1)
UYGHUR LANGUAGE (1)
UYGHUR PROSODIC FEATURE (1)
UYGHUR PROSODIC FEATURES (1)
UYGHUR SPEECH SYNTHESIS (1)
UYGHUR TEXT-TO-SPEECH SYSTEM (1)
VECTOR SPACE FRAMEWORK (1)
WEB BASED AUTOMATIC ASSESSMENT SYSTEM (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options