2010 International Conference on Asian Language Processing

Items from 1 to 20 out of 49 results

chapter

Title Page i

2010 International Conference on Asian Language Processing > i

2010 International Conference on Asian Language Processing (IALP 2010)

The following topics are dealt with:language lexicon , morphology, syntax and parsing; information extraction; text understanding and summarization; machine translation; language resources; semantics; and spoken language processing.

chapter

A Survey on Rendering Traditional Mongolian Script

Biligsaikhan Batjargal, Fuminori Kimura, Akira Maeda

2010 International Conference on Asian Language Processing > 3 - 6

2010 International Conference on Asian Language Processing (IALP 2010)

This paper discusses the rendering issues of complex text layout - traditional Mongolian script. The traditional Mongolian script has been standardized in Unicode. We analyzed existing Open Type fonts and their rendering schemes for traditional Mongolian script. We found some errors, and discovered grammatical rules, which are not documented in international standards. None of the existing Open Type...

chapter

A Combination of Statistical and Rule-Based Approach for Mongolian Lexical Analysis

Lili Zhao, Jia Men, Congpin Zhang, Qun Liu, more

2010 International Conference on Asian Language Processing > 7 - 10

2010 International Conference on Asian Language Processing (IALP 2010)

Mongolian lexical analysis is the first step in Mongolian information processing such as Chinese-Mongolian machine translation. In this paper, we introduce a statistic and rule based approach to solving the Mongolian word segmentation & POS tagging all at once. In this method, we use tree frame as basic statistical model. And then we combine the model with some rules to improve the lexical analysis...

chapter

A Letter Tagging Approach to Uyghur Tokenization

B Aisha

2010 International Conference on Asian Language Processing > 11 - 14

2010 International Conference on Asian Language Processing (IALP 2010)

In this paper, we present a letter tagging approach(LTA) to Uyghur tokenization. Experiments show that the problem with label bias (rich and complex suffixes) problem to be resolved using LTA combined with CRFs, so it is more effective than previous work, the accuracy of word tokenization reaches 93.3%. In future our tokenization research will be very useful to other Altaic languages information processing.

chapter

Development of Analysis Rules for Bangla Root and Primary Suffix for Universal Networking Language

M N Y Ali, S A Noor, M Z Hossain, J K Das

2010 International Conference on Asian Language Processing > 15 - 18

2010 International Conference on Asian Language Processing (IALP 2010)

This paper describes a method for the development of Bangla Enconversion within the framework of the Universal Networking Language (UNL). We also discuss some issues and problems related to the UNL representation that affect the quality of generation. Additionally, the ling ware engineering is introduced as a technique to enhance the quality and increase the development efficiency. In this paper a...

chapter

A Suffix-Based Noun and Verb Classifier for an Inflectional Language

N Saharia, U Sharma, J Kalita

2010 International Conference on Asian Language Processing > 19 - 22

2010 International Conference on Asian Language Processing (IALP 2010)

Nouns and verbs pose the major challenge in part-of-speech tagging exercises. In this paper we present a suffix based noun and verb classifier for Assamese, an inflectional, relatively free word order Indic language. We used a tiny dictionary of frequent words to increase the accuracy. We obtained F-score of around 85%.

chapter

Behavior of Word 'kaa' in Urdu Language

M K Malik, A Ali, S Siddiq

2010 International Conference on Asian Language Processing > 23 - 26

2010 International Conference on Asian Language Processing (IALP 2010)

This paper discusses the behavior of `kaa' and suggests the selection of Part of Speech (POS) on the basis of linguistic evidence. It also suggests some tests that can be used for correct classification of `kaa'. The selection of correct POS is important for computational processing, including parsing, generation, and identification of grammatical relations.

chapter

Methods to Divide Uygur Morphemes and Treatments for Exceptions

Pu Li, Hao Zhao

2010 International Conference on Asian Language Processing > 27 - 30

2010 International Conference on Asian Language Processing (IALP 2010)

Based on necessity of the establishment of modern Uygur morphemes database, the paper studies the principle and the method to define Uygur morphemes and focuses on some special conditions including syllabic of morphemes, dual-part-words, morpheme cluster and compound morphemes. It is a basic study of the establishment of Uygur morphemes database.

chapter

Rules for Morphological Analysis of Bangla Verbs for Universal Networking Language

M N Y Ali, M Z H Sarker, G F Ahmed, J K Das

2010 International Conference on Asian Language Processing > 31 - 34

2010 International Conference on Asian Language Processing (IALP 2010)

The Universal Networking Language (UNL) deals with the communication across nations of different languages and involves with many different related discipline such as linguistics, epistemology, computer science etc. It helps to overcome the language barrier among people of different nations to solve problems emerging from current globalization trends and geopolitical interdependence. Morphological...

chapter

Discussion on Collation of Tibetan Syllable

Heming Huang, Feipeng Da

2010 International Conference on Asian Language Processing > 35 - 38

2010 International Conference on Asian Language Processing (IALP 2010)

Based on the general syllable structure, a syllable's component letters should be expanded orderly into the series of basic consonant, prefix consonant, head consonant... and the second suffix consonant. If there is no letter in a syllable's particular position, a special character, whose collation element is less than that of any Tibetan letter, should be used in the corresponding position of the...

chapter

Development of Templates for Dictionary Entries of Bangla Roots and Primary Suffixes for Universal Networking Language

M Z Hossain, Md Nawab Yousuf Ali, Shaikh Muhammad Allayear, J K Das

2010 International Conference on Asian Language Processing > 43 - 46

2010 International Conference on Asian Language Processing (IALP 2010)

The Universal Networking Language (UNL) is a world wide generalizes form of human interactive language in a machine independent digital platform for defining, recapitulating, amending, storing and dissipating knowledge or information among people of different affiliations. The theoretical and applied research associated with this interdisciplinary endeavor facilitates in a number of practical applications...

chapter

Improving Dependency Parsing Using Punctuation

Zhenghua Li, Wanxiang Che, Ting Liu

2010 International Conference on Asian Language Processing > 53 - 56

2010 International Conference on Asian Language Processing (IALP 2010)

The high-order graph-based dependency parsing model achieves state-of-the-art accuracy by incorporating rich feature representations. However, its parsing efficiency and accuracy degrades dramatically when the input sentence gets longer. This paper presents a novel two-stage method to improve high-order graph-based parsing, which uses punctuation, such as commas and semicolons, to segment the input...

chapter

A Tree Probability Generation Using VB-EM for Thai PGLR Parser

Kanokorn Trakultaweekoon, Taneth Ruangrajitpakorn, Prachya Boonkwan, Thepchai Supnithi

2010 International Conference on Asian Language Processing > 57 - 60

2010 International Conference on Asian Language Processing (IALP 2010)

In this paper, we applied VB-EM algorithm to generate a probability of constituent combination for PGLR parser. Three linguistic features which are simple PCFG, head-outward dependency and head-emission were calculated. The probabilities were used in a parsing process to find the best probable output tree. From our experiment, the parsing result from a combination of all features for first path and...

chapter

Research on Verb Subcategorization-Based Syntactic Parsing Postprocess for Chinese Language

Jinyong Wang, Xiwu Han

2010 International Conference on Asian Language Processing > 61 - 64

2010 International Conference on Asian Language Processing (IALP 2010)

In this paper, we propose a simple approach to use verb sub categorization-based pattern matching method to rerank the output of a baseline parsing system. A baseline parser first provides a set of n-best candidate parsing trees. Then we extract various features of verb sub categorization from train corpora. And use those features of verb sub categorization extracted from train corpus to rerank the...

chapter

Identification of Maximal-Length Noun Phrases Based on Maximal-Length Preposition Phrases in Chinese

Guiping Zhang, Wenjing Lang, Qiaoli Zhou, Dongfeng Cai

2010 International Conference on Asian Language Processing > 65 - 68

2010 International Conference on Asian Language Processing (IALP 2010)

The paper proposes an identification method of Maximal-Length Noun Phrase (MNP) based on Maximal-Length Preposition Phrase (MPP). We identify MNP utilizing the mutual restricting characteristic of MNP and adverbial MPP. We employ Conditional Random Fields (CRFs) model in identification processing, and use new tags and above long-distance word as features. Experimental result shows a high quality performance...

chapter

Urdu Noun Phrase Chunking - Hybrid Approach

Shahid Siddiq, Sarmad Hussain, Aasim Ali, Kamran Malik, more

2010 International Conference on Asian Language Processing > 69 - 72

2010 International Conference on Asian Language Processing (IALP 2010)

In this work, chunking is used to mark the noun phrases of Urdu sentences. The approach used in this work is hybrid that combines statistical method and hand crafted rules. The statistical model used in this work is HMM along with IOB chunk annotation. From a POS tagged corpus of 100,000 words, around 90,000 word tokens are used for training and 10,000 word tokens for testing. Several experiments...

chapter

Problems and Review of Statistical Parsing Language Model

Faguo Zhou, Fan Zhang, Bingru Yang

2010 International Conference on Asian Language Processing > 77 - 80

2010 International Conference on Asian Language Processing (IALP 2010)

The lexical language model is recently the hotspot in grammar research, which is promoted by incorporating the phrase head with statistics. This paper summarizes about four improving language models which belong to this kind of model: they have utilized heads that is extracted by CFG and calculated the probability between the heads or inside CFG. Different from N-gram and SCFG, the probability calculation...

chapter

Finding Semantic Similarity in Vietnamese

Dat Tien Nguyen, Son Bao Pham

2010 International Conference on Asian Language Processing > 91 - 94

2010 International Conference on Asian Language Processing (IALP 2010)

Finding semantic similarity is an important task in many natural language processing applications. Despite numerous works for popular languages, there is still limited research done for Vietnamese. In this paper, we tackle the problem of finding semantic similarity for Vietnamese using Random Indexing and Hyperspace Analogue to Language to represent the semantics of words and documents. We build a...

chapter

Event Entailment Extraction Based on EM Iteration

Zhen Li, Hanjing Li, Mo Yu, Tiejun Zhao, more

2010 International Conference on Asian Language Processing > 101 - 104

2010 International Conference on Asian Language Processing (IALP 2010)

In the research and development of various natural language processing systems, like Q&A system and text-to-scene conversation system, we realize that knowledge of text entailment helps a lot in improving the performance of the system. Systems with text entailment knowledge will be smarter than those who without entailment knowledge. Currently many research teams are focusing on text entailment,...

chapter

On the Semantic Orientation and Computer Identification of the Adverb "Jiù"

Lin He, Jiaqin Wu

2010 International Conference on Asian Language Processing > 105 - 109

2010 International Conference on Asian Language Processing (IALP 2010)

The recognition of the semantic orientation of the adverb on the computer is a new temptation to discuss sentence processing starting from semantic. In this paper, in order to reach computer automatic identification of the adverb “Jiù”, the rules and principles of the semantic orientation of this type are summarized and proposed respectively according to its sentence structure. On the basis of these,...

Keywords:
NATURAL LANGUAGE PROCESSING

Publication date

Set your own date range

Keywords

SEMANTICS (23)
SYNTACTICS (16)
DICTIONARIES (14)
PRAGMATICS (14)
COMPUTATIONAL LINGUISTICS (12)
CONTEXT (12)
ACCURACY (10)
TRAINING (9)
COMPUTATIONAL MODELING (8)
LANGUAGE TRANSLATION (8)
PRESSES (7)
TEXT ANALYSIS (7)
WORD PROCESSING (7)
GRAMMAR (6)
HIDDEN MARKOV MODELS (6)
KNOWLEDGE BASED SYSTEMS (6)
SPEECH (6)
COMPOUNDS (5)
COMPUTERS (5)
LINGUISTICS (5)
TAGGING (5)
ACOUSTICS (4)
BOOKS (4)
BUILDINGS (4)
DATABASES (4)
FEATURE EXTRACTION (4)
GRAMMARS (4)
INTERNET (4)
TONE (4)
UNIVERSAL NETWORKING LANGUAGE (4)
ACOUSTIC SPACE (3)
ARTIFICIAL NEURAL NETWORKS (3)
HUMANS (3)
INFORMATION EXTRACTION (3)
MORPHOLOGY (3)
NAMED ENTITY RECOGNITION (3)
PITCH (3)
PROBABILITY (3)
SPEECH PROCESSING (3)
STATISTICAL ANALYSIS (3)
TESTING (3)
ANALYTICAL MODELS (2)
AUTOMATIC EXTRACTION (2)
BANGLA ROOTS (2)
BUSINESS (2)
CITIES AND TOWNS (2)
CLASSIFICATION ALGORITHMS (2)
COMPUTER SCIENCE (2)
CONDITIONAL RANDOM FIELD (2)
CONDITIONAL RANDOM FIELDS (2)
CONFERENCES (2)
CRF (2)
DATA MODELS (2)
DICTIONARY (2)
EDUCATIONAL INSTITUTIONS (2)
ENCONVERTER (2)
FORMANT FREQUENCIES (2)
GOOGLE (2)
HELIUM (2)
IDENTIFICATION TECHNOLOGY (2)
INFORMATION PROCESSING (2)
INFORMATION RETRIEVAL (2)
INSTRUMENTS (2)
KRIT PROTTOY (2)
LABELING (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
MACHINE TRANSLATION (2)
MAGNETIC HEADS (2)
MARINE ANIMALS (2)
MATHEMATICAL MODEL (2)
MEDICAL SERVICES (2)
MOBILE HANDSETS (2)
MORPHOLOGICAL ANALYSIS (2)
MORPHOLOGICAL RULES (2)
ONTOLOGIES (2)
PART OF SPEECH (2)
PERIPHERAL VOWEL (2)
POS TAGGING (2)
RANDOM PROCESSES (2)
SENTENCE SIMILARITY (2)
SPEECH RECOGNITION (2)
STATISTICAL MACHINE TRANSLATION (2)
TAMIL (2)
TEXT SUMMARIZATION (2)
THAI LANGUAGE (2)
TRAINING DATA (2)
VIETNAMESE (2)
VOCABULARY (2)
WRITING (2)
'KAY' (1)
'KEE' (1)
ACCEPTABLE WORD (1)
ACOUSTIC ANALYSIS (1)
ACOUSTICAL SIMULATION (1)
ADVERB (1)
AEROSPACE ENGINEERING (1)
AGGLUTINATIVE LANGUAGE (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ALTAIC LANGUAGE INFORMATION PROCESSING (1)
more

INFONA - science communication portal

2010 International Conference on Asian Language Processing

Title Page i

A Survey on Rendering Traditional Mongolian Script

A Combination of Statistical and Rule-Based Approach for Mongolian Lexical Analysis

A Letter Tagging Approach to Uyghur Tokenization

Development of Analysis Rules for Bangla Root and Primary Suffix for Universal Networking Language

A Suffix-Based Noun and Verb Classifier for an Inflectional Language

Behavior of Word 'kaa' in Urdu Language

Methods to Divide Uygur Morphemes and Treatments for Exceptions

Rules for Morphological Analysis of Bangla Verbs for Universal Networking Language

Discussion on Collation of Tibetan Syllable

Development of Templates for Dictionary Entries of Bangla Roots and Primary Suffixes for Universal Networking Language

Improving Dependency Parsing Using Punctuation

A Tree Probability Generation Using VB-EM for Thai PGLR Parser

Research on Verb Subcategorization-Based Syntactic Parsing Postprocess for Chinese Language

Identification of Maximal-Length Noun Phrases Based on Maximal-Length Preposition Phrases in Chinese

Urdu Noun Phrase Chunking - Hybrid Approach

Problems and Review of Statistical Parsing Language Model

Finding Semantic Similarity in Vietnamese

Event Entailment Extraction Based on EM Iteration

On the Semantic Orientation and Computer Identification of the Adverb "Jiù"

Filter options

Publication date

Keywords

INFONA - science communication portal

2010 International Conference on Asian Language Processing $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2010 International Conference on Asian Language Processing