2016 International Conference on Asian Language Processing (IALP)

chapter

Syntactic characteristics and similarities of Japanese authors' writing styles: A kernel-based approach

Eriko Kanagawa, Takeshi Okadome

2016 International Conference on Asian Language Processing (IALP) > 59 - 62

The subtree kernel and the information tree kernel proposed here measure the syntactic similarity of sentences. For two syntactic trees, these kernels are defined, respectively, as the total number of common subtrees in the syntactic trees and the total information content contained in their common subtrees, where the information content of a common subtree is calculated using its probability. Analyses...
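The two kernel definitions in this abstract can be illustrated with a minimal sketch. It rests on assumptions not confirmed by the abstract: a "subtree" is taken to be a node together with all of its descendants, repeated subtrees are counted with multiplicity, and information content is taken as the negative base-2 logarithm of a subtree's probability. The tuple encoding of trees, the function names, and the `prob` table are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def subtrees(tree):
    """Yield every complete subtree (a node plus all its descendants).

    A tree is a nested tuple: (label, child, child, ...); a leaf is (label,).
    """
    yield tree
    for child in tree[1:]:
        yield from subtrees(child)


def subtree_kernel(t1, t2):
    """Count pairs of identical complete subtrees shared by t1 and t2."""
    c1, c2 = Counter(subtrees(t1)), Counter(subtrees(t2))
    return sum(c1[s] * c2[s] for s in c1.keys() & c2.keys())


def information_tree_kernel(t1, t2, prob):
    """Sum the information content -log2 p(s) over shared subtrees.

    `prob` is a hypothetical table mapping each subtree to its probability,
    for example estimated from a parsed corpus.
    """
    c1, c2 = Counter(subtrees(t1)), Counter(subtrees(t2))
    return sum(
        c1[s] * c2[s] * -math.log2(prob[s]) for s in c1.keys() & c2.keys()
    )


# Two toy syntactic trees (assumed encoding, not taken from the paper).
t1 = ("S", ("NP", ("N",)), ("VP", ("V",), ("NP", ("N",))))
t2 = ("S", ("NP", ("N",)), ("VP", ("V",)))

# Made-up probabilities for the subtrees the two trees share.
prob = {("N",): 0.5, ("V",): 0.25, ("NP", ("N",)): 0.125}

count = subtree_kernel(t1, t2)
info = information_tree_kernel(t1, t2, prob)
```

Under these assumptions, rarer shared subtrees contribute more to the information tree kernel than frequent ones, whereas the subtree kernel weights every match equally.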

chapter

Japanese orthographical normalization does not work for statistical machine translation

Kazuhide Yamamoto, Kanji Takahashi

2016 International Conference on Asian Language Processing (IALP) > 133 - 136

We have investigated the effect of normalizing Japanese orthographical variants into a uniform orthography on statistical machine translation (SMT) between Japanese and English. In Japanese, reportedly 10% of words have more than one orthographical variant, which suggests that normalizing these variants could improve translation quality. However, the results show that SMT...

chapter

Developing learner corpus annotation for Chinese grammatical errors

Lung-Hao Lee, Li-Ping Chang, Yuen-Hsien Tseng

2016 International Conference on Asian Language Processing (IALP) > 254 - 257

This study describes the construction of the TOCFL (Test Of Chinese as a Foreign Language) learner corpus, including the collection and grammatical error annotation of 2,837 essays written by Chinese language learners originating from a total of 46 different mother-tongue languages. We propose hierarchical tagging sets to manually annotate grammatical errors, resulting in 33,835 inappropriate usages...

chapter

Improved Arabic characters recognition by combining multiple machine learning classifiers

Maytham Alabbas, Raidah S. Khudeyer, Sardar Jaf

2016 International Conference on Asian Language Processing (IALP) > 262 - 265

In this paper, we investigate a range of strategies for combining multiple machine learning techniques for recognizing Arabic characters, where we are faced with imperfect and dimensionally variable input characters. Experimental results show that combined confidence-based backoff strategies can produce more accurate results than each technique produces by itself and even the ones exhibited by the...

chapter

A machine learning approach for authorship attribution for Bengali blogs

Shanta Phani, Shibamouli Lahiri, Arindam Biswas

2016 International Conference on Asian Language Processing (IALP) > 271 - 274

In this paper, we describe an authorship attribution system for Bengali blog texts. We present a new Bengali blog corpus of 3000 passages written by three authors. Our study proposes a text classification system based on lexical features such as character bigrams and trigrams, word n-grams (n = 1, 2, 3), and stop words, using four classifiers. We achieve the best results (more than 99%) on the...
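The lexical features named in this abstract (character bigrams and trigrams, word n-grams) could be extracted roughly as follows. This is a minimal sketch, not the authors' pipeline: the function names, the whitespace tokenization, and the sample string are assumptions made for illustration, and the abstract does not say how the features are weighted or which classifiers consume them.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count overlapping character n-grams in text."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def word_ngrams(text, n):
    """Count overlapping word n-grams, splitting on whitespace (an assumption)."""
    words = text.split()
    return Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))


def lexical_features(text):
    """Collect the n-gram counts into one dictionary keyed by feature type."""
    features = {}
    for n in (2, 3):
        features[f"char_{n}"] = char_ngrams(text, n)
    for n in (1, 2, 3):
        features[f"word_{n}"] = word_ngrams(text, n)
    return features


# Placeholder passage, not drawn from the corpus described above.
sample = "a b a b"
features = lexical_features(sample)
```

Because the counting works on Python strings, which are sequences of Unicode code points, the same functions apply to Bengali text, although code points do not always correspond to the characters a reader perceives.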
