Asian Language Processing (IALP), 2010 International Conference on

chapter

Cover Art

2010 International Conference on Asian Language Processing > C1

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

Title Page i

2010 International Conference on Asian Language Processing > i

2010 International Conference on Asian Language Processing (IALP 2010)

The following topics are dealt with:language lexicon , morphology, syntax and parsing; information extraction; text understanding and summarization; machine translation; language resources; semantics; and spoken language processing.

chapter

Title Page iii

2010 International Conference on Asian Language Processing > iii

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

Copyright Page

2010 International Conference on Asian Language Processing > iv

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

Message from Program Chairs

2010 International Conference on Asian Language Processing > xiii

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

Conference Committees

2010 International Conference on Asian Language Processing > xiv

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

Invited Talks

Francis Bond

2010 International Conference on Asian Language Processing > xix

2010 International Conference on Asian Language Processing (IALP 2010)

Summary form only given. In this talk, the speaker will measure the reduction in ambiguity that can be gained by using translated text to constrain meanings. Instead of using the translation itself to determine senses, they use a shared hierarchy of word senses: WordNet. Experiments with aligned Chinese, English and Japanese text show a substantial reduction in ambiguity for each language.

chapter

Message from General Chairs

2010 International Conference on Asian Language Processing > xi - xii

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

Organizers and Sponsors

2010 International Conference on Asian Language Processing > xvii - xviii

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

Program Committee

2010 International Conference on Asian Language Processing > xv - xvi

2010 International Conference on Asian Language Processing (IALP 2010)

chapter

A Survey on Rendering Traditional Mongolian Script

Biligsaikhan Batjargal, Fuminori Kimura, Akira Maeda

2010 International Conference on Asian Language Processing > 3 - 6

2010 International Conference on Asian Language Processing (IALP 2010)

This paper discusses the rendering issues of complex text layout - traditional Mongolian script. The traditional Mongolian script has been standardized in Unicode. We analyzed existing Open Type fonts and their rendering schemes for traditional Mongolian script. We found some errors, and discovered grammatical rules, which are not documented in international standards. None of the existing Open Type...

chapter

A Combination of Statistical and Rule-Based Approach for Mongolian Lexical Analysis

Lili Zhao, Jia Men, Congpin Zhang, Qun Liu, more

2010 International Conference on Asian Language Processing > 7 - 10

2010 International Conference on Asian Language Processing (IALP 2010)

Mongolian lexical analysis is the first step in Mongolian information processing such as Chinese-Mongolian machine translation. In this paper, we introduce a statistic and rule based approach to solving the Mongolian word segmentation & POS tagging all at once. In this method, we use tree frame as basic statistical model. And then we combine the model with some rules to improve the lexical analysis...

chapter

A Letter Tagging Approach to Uyghur Tokenization

B Aisha

2010 International Conference on Asian Language Processing > 11 - 14

2010 International Conference on Asian Language Processing (IALP 2010)

In this paper, we present a letter tagging approach(LTA) to Uyghur tokenization. Experiments show that the problem with label bias (rich and complex suffixes) problem to be resolved using LTA combined with CRFs, so it is more effective than previous work, the accuracy of word tokenization reaches 93.3%. In future our tokenization research will be very useful to other Altaic languages information processing.

chapter

Development of Analysis Rules for Bangla Root and Primary Suffix for Universal Networking Language

M N Y Ali, S A Noor, M Z Hossain, J K Das

2010 International Conference on Asian Language Processing > 15 - 18

2010 International Conference on Asian Language Processing (IALP 2010)

This paper describes a method for the development of Bangla Enconversion within the framework of the Universal Networking Language (UNL). We also discuss some issues and problems related to the UNL representation that affect the quality of generation. Additionally, the ling ware engineering is introduced as a technique to enhance the quality and increase the development efficiency. In this paper a...

chapter

A Suffix-Based Noun and Verb Classifier for an Inflectional Language

N Saharia, U Sharma, J Kalita

2010 International Conference on Asian Language Processing > 19 - 22

2010 International Conference on Asian Language Processing (IALP 2010)

Nouns and verbs pose the major challenge in part-of-speech tagging exercises. In this paper we present a suffix based noun and verb classifier for Assamese, an inflectional, relatively free word order Indic language. We used a tiny dictionary of frequent words to increase the accuracy. We obtained F-score of around 85%.

chapter

Behavior of Word 'kaa' in Urdu Language

M K Malik, A Ali, S Siddiq

2010 International Conference on Asian Language Processing > 23 - 26

2010 International Conference on Asian Language Processing (IALP 2010)

This paper discusses the behavior of `kaa' and suggests the selection of Part of Speech (POS) on the basis of linguistic evidence. It also suggests some tests that can be used for correct classification of `kaa'. The selection of correct POS is important for computational processing, including parsing, generation, and identification of grammatical relations.

chapter

Methods to Divide Uygur Morphemes and Treatments for Exceptions

Pu Li, Hao Zhao

2010 International Conference on Asian Language Processing > 27 - 30

2010 International Conference on Asian Language Processing (IALP 2010)

Based on necessity of the establishment of modern Uygur morphemes database, the paper studies the principle and the method to define Uygur morphemes and focuses on some special conditions including syllabic of morphemes, dual-part-words, morpheme cluster and compound morphemes. It is a basic study of the establishment of Uygur morphemes database.

chapter

Rules for Morphological Analysis of Bangla Verbs for Universal Networking Language

M N Y Ali, M Z H Sarker, G F Ahmed, J K Das

2010 International Conference on Asian Language Processing > 31 - 34

2010 International Conference on Asian Language Processing (IALP 2010)

The Universal Networking Language (UNL) deals with the communication across nations of different languages and involves with many different related discipline such as linguistics, epistemology, computer science etc. It helps to overcome the language barrier among people of different nations to solve problems emerging from current globalization trends and geopolitical interdependence. Morphological...

chapter

Discussion on Collation of Tibetan Syllable

Heming Huang, Feipeng Da

2010 International Conference on Asian Language Processing > 35 - 38

2010 International Conference on Asian Language Processing (IALP 2010)

Based on the general syllable structure, a syllable's component letters should be expanded orderly into the series of basic consonant, prefix consonant, head consonant... and the second suffix consonant. If there is no letter in a syllable's particular position, a special character, whose collation element is less than that of any Tibetan letter, should be used in the corresponding position of the...

INFONA - science communication portal

2010 International Conference on Asian Language Processing

Cover Art

Title Page i

Title Page iii

Copyright Page

Table of Contents

Message from Program Chairs

Conference Committees

Invited Talks

Message from General Chairs

Organizers and Sponsors

Program Committee

A Survey on Rendering Traditional Mongolian Script

A Combination of Statistical and Rule-Based Approach for Mongolian Lexical Analysis

A Letter Tagging Approach to Uyghur Tokenization

Development of Analysis Rules for Bangla Root and Primary Suffix for Universal Networking Language

A Suffix-Based Noun and Verb Classifier for an Inflectional Language

Behavior of Word 'kaa' in Urdu Language

Methods to Divide Uygur Morphemes and Treatments for Exceptions

Rules for Morphological Analysis of Bangla Verbs for Universal Networking Language

Discussion on Collation of Tibetan Syllable

Filter options

Publication date

Keywords

INFONA - science communication portal

2010 International Conference on Asian Language Processing $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2010 International Conference on Asian Language Processing