2016 International Conference on Asian Language Processing (IALP)

chapter

Front matter

2016 International Conference on Asian Language Processing (IALP) > i - xx

chapter

Comparison on Neural Network based acoustic model in Mongolian speech recognition

Hongwei Zhang, Feilong Bao, Guanglai Gao, Hui Zhang

2016 International Conference on Asian Language Processing (IALP) > 1 - 5

2016 International Conference on Asian Language Processing (IALP)

Deep Neural Networks (DNNs) beat the Gaussian Mixture Models (GMMs), and become the state-of-the-art techniques for acoustic model. Then various neural networks based acoustic models are proposed to make the speech recognition systems better and better. However these successes are not adopted in the researches of Mongolian speech recognition. This study fills in this gap. We study a series of neural...

chapter

Dialog State Tracking and action selection using deep learning mechanism for interview coaching

Ming-Hsiang Su, Kun-Yi Huang, Tsung-Hsien Yang, Kuan-Jung Lai, more

2016 International Conference on Asian Language Processing (IALP) > 6 - 9

2016 International Conference on Asian Language Processing (IALP)

The best way to prepare for an interview is to review the different types of possible interview questions you will be asked during an interview and practice responding to questions. An interview coaching system tries to simulate an interviewer to provide mock interview practice simulation sessions for the users. The traditional interview coaching systems provide some feedbacks, including facial preference,...

chapter

Policy optimization of dialogue management in spoken dialogue system for out-of-domain utterances

Yuhong Xu, Peijie Huang, Jiecong Tang, Qiangjia Huang, more

2016 International Conference on Asian Language Processing (IALP) > 10 - 13

2016 International Conference on Asian Language Processing (IALP)

This paper addresses the policy optimization of a dialogue management scheme based on partially observable Markov decision processes (POMDP), which is designed for out-of-domain (OOD) utterances processing in spoken dialogue system. First, POMDP-Based DM Modeling for OOD Utterances is proposed, together with detail of some principal elements. Then, joint state transition exploration and dialogue policy...

chapter

Dialogue act recognition for Chinese out-of-domain utterances using hybrid CNN-RF

Jundong Wang, Peijie Huang, Qiangjia Huang, Zixuan Ke, more

2016 International Conference on Asian Language Processing (IALP) > 14 - 17

2016 International Conference on Asian Language Processing (IALP)

Due to the short length, diversity, openness and colloquialism characteristic of out-of-domain (OOD) utterances, dialogue act (DA) recognition for OOD utterances in restricted domain spoken dialogue system remains a great challenge. This paper tackles this problem by proposing an effective DA recognition method using hybrid convolutional neural network (CNN) and random forest (RF). CNN acts as a feature...

chapter

Annotating Chinese noun phrases based on Semantic Dependency graph

Yimeng Li, Yanqiu Shao

2016 International Conference on Asian Language Processing (IALP) > 18 - 21

2016 International Conference on Asian Language Processing (IALP)

Annotating complicated noun phrases is a difficulty in semantic analysis. In this paper we investigate the annotation methods of noun phrases in Nombank, Chinese Nombank and Sinica Treebank trying to propose an annotation scheme based on semantic dependency graph for noun phrases.

chapter

Sense Annotated Hindi Corpus

Satyendr Singh, Tanveer J. Siddiqui

2016 International Conference on Asian Language Processing (IALP) > 22 - 25

2016 International Conference on Asian Language Processing (IALP)

This paper describes about the development and details of a linguistic resource, Sense Annotated Hindi Corpus. Word Sense Disambiguation (WSD) is an important task in Natural Language Processing. Sense annotated Hindi Corpus was developed for Lexical Sample WSD task for Hindi language. It consists of 60 polysemous Hindi nouns. The sense inventory for sense annotated Hindi corpus was derived from Hindi...

chapter

Towards building a standard dataset for Arabic keyphrase extraction evaluation

Muhammad Helmy, Marco Basaldella, Eddy Maddalena, Stefano Mizzaro, more

2016 International Conference on Asian Language Processing (IALP) > 26 - 29

2016 International Conference on Asian Language Processing (IALP)

Keyphrases are short phrases that best represent a document content. They can be useful in a variety of applications, including document summarization and retrieval models. In this paper, we introduce the first dataset of keyphrases for an Arabic document collection, obtained by means of crowdsourcing. We experimentally evaluate different crowdsourced answer aggregation strategies and validate their...

chapter

Semantic annotation for Mandarin verbal lexicon

Mei-chun Liu, Jui-ching Chang

2016 International Conference on Asian Language Processing (IALP) > 30 - 36

2016 International Conference on Asian Language Processing (IALP)

This study examines the challenging issues in the semantic annotation of the characteristics of verbal information of Mandarin Chinese. It proposes a frame-based constructional approach that aligns with linguistic premises in Frame Semantics, Construction Grammar and Cognitive Grammar. Given that semantic processing has a lot to do with human cognitive capacities, semantic transfer and profile on...

chapter

Information extraction and text mining of Ancient Vattezhuthu characters in historical documents using image zoning

E.K. Vellingiriraj, M. Balamurugan, P. Balasubramanie

2016 International Conference on Asian Language Processing (IALP) > 37 - 40

2016 International Conference on Asian Language Processing (IALP)

The aim of this paper is to develop a system that involves character recognition of Brahmi, Grantha and Vattezuthu characters from palm manuscripts of historical Tamil ancient documents, analyzed the text and machine translated the present Tamil digital text format. Though many researchers have implemented various algorithms and techniques for character recognition in different languages, ancient...

chapter

Content-based approach for Vietnamese spam SMS filtering

Thai-Hoang Pham, Phuong Le-Hong

2016 International Conference on Asian Language Processing (IALP) > 41 - 44

2016 International Conference on Asian Language Processing (IALP)

Short Message Service (SMS) spam is a serious problem in Vietnam because of the availability of very cheap prepaid SMS packages. There are some systems to detect and filter spam messages for English, most of which use machine learning techniques to analyze the content of messages and classify them. For Vietnamese, there is some research on spam email filtering but none focused on SMS. In this work,...

chapter

Detecting representative web articles using heterogeneous graphs

Richeng Xuan, Sang-goo Lee

2016 International Conference on Asian Language Processing (IALP) > 45 - 48

2016 International Conference on Asian Language Processing (IALP)

With the rapid growth of on-line news media, guarding against malicious news articles is becoming an essential requirement for on-line news service providers. Near duplicate articles are one of the most common types of malicious news articles. However, previous research has concentrated on how to improve the effectiveness and accuracy of finding near-duplicate article pairs or clusters, and not so...

chapter

Word clustering for parallelism in Classical Chinese poems

John Lee, Mengqi Luo

2016 International Conference on Asian Language Processing (IALP) > 49 - 52

2016 International Conference on Asian Language Processing (IALP)

This paper explores the use of statistical methods to describe the phenomenon of parallelism in Classical Chinese poems. We apply a graph-based clustering method to automatically induce word clusters from a corpus of poems. We describe several methods for computing similarity scores. We compare these methods by evaluating the quality of the induced clusters, with respect to a semantic taxonomy for...

chapter

Annotation scheme for legal discourse information and hierarchical levels

Hong Wang, Yunfeng Ge

2016 International Conference on Asian Language Processing (IALP) > 53 - 58

2016 International Conference on Asian Language Processing (IALP)

Corpus annotation at discourse level requires modeling the entire structure of a discourse. The existing methods have difficulties in differentiate macro- and microstructure of a discourse. Taking account of this, discourse information theory (DIT) provides the theoretical basis for establishing discourse information annotation tagsets and practical annotation methods. Having set up an equation between...

chapter

Syntactic characteristics and similarities of Japanese authors' writing styles: A kernel-based approach

Eriko Kanagawa, Takeshi Okadome

2016 International Conference on Asian Language Processing (IALP) > 59 - 62

2016 International Conference on Asian Language Processing (IALP)

The subtree kernel and the information tree kernel proposed here measure the syntactic similarity of sentences. For two syntactic trees, these kernels are defined, respectively, as the total number of common subtrees in the syntactic trees and the total information content contained in their common subtrees, where the information content of a common subtree is calculated using its probability. Analyses...

chapter

An initial study of Indonesian semantic role labeling and its application on event extraction

Ade Romadhony, Ayu Purwarianti, Lisa Madlberger

2016 International Conference on Asian Language Processing (IALP) > 63 - 66

2016 International Conference on Asian Language Processing (IALP)

Semantic role labeling (SRL) is a task to assign semantic role labels to sentence elements. This paper describes the initial development of an Indonesian semantic role labeling system and its application to extract event information from Tweets. We compare two feature types when designing the SRL systems: Word-to-Word and Phrase-to-Phrase. Our experiments showed that the Word-to-Word feature approach...

chapter

Generating Manipuri English pronunciation dictionary using sequence labelling problem

Rajlakshmi Saikia, Sanasam Ranbir Singh

2016 International Conference on Asian Language Processing (IALP) > 67 - 70

2016 International Conference on Asian Language Processing (IALP)

Creating a highly accurate pronunciation dictionary plays an important role in building English TTS system to produce high quality synthesised speech. Majority of the existing studies related to building Indian English TTS systems adapt CMU pronunciation dictionary to corresponding target Indian accent. Majority of these studies use hand-crafted rule-based approaches to adapt to the target language...

chapter

Recurrent neural network-based language models with variation in net topology, language, and granularity

Tzu-Hsuan Yang, Tzu-Hsuan Tseng, Chia-Ping Chen

2016 International Conference on Asian Language Processing (IALP) > 71 - 74

2016 International Conference on Asian Language Processing (IALP)

In this paper, we study language models based on recurrent neural networks on three databases in two languages. We implement basic recurrent neural networks (RNN) and refined RNNs with long short-term memory (LSTM) cells. We use the corpora of Penn Tree Bank (PTB) and AMI in English, and the Academia Sinica Balanced Corpus (ASBC) in Chinese. On ASBC, we investigate word-based and character-based language...

chapter

Verifying the long-range dependency of RNN language models

Tzu-Hsuan Tseng, Tzu-Hsuan Yang, Chia-Ping Chen

2016 International Conference on Asian Language Processing (IALP) > 75 - 78

2016 International Conference on Asian Language Processing (IALP)

It has been argued that recurrent neural network language models are better in capturing long-range dependency than n-gram language models. In this paper, we attempt to verify this claim by investigating the prediction accuracy and the perplexity of these language models as a function of word position, i.e., the position of a word in a sentence. It is expected that as word position increases, the...

chapter

The effect of shallow segmentation on English-Tigrinya statistical machine translation

Yemane Tedla, Kazuhide Yamamoto

2016 International Conference on Asian Language Processing (IALP) > 79 - 82

2016 International Conference on Asian Language Processing (IALP)

This paper presents initial research on English-to-Tigrinya statistical machine translation (SMT). Tigrinya is a highly inflected Semitic language spoken in Eritrea and Ethiopia. Translation involving morphologically complex languages is challenged by factors including data sparseness, word alignment and language model. We try to address these problems through morphological segmentation of Tigrinya...

INFONA - science communication portal

2016 International Conference on Asian Language Processing (IALP)

Front matter

Comparison on Neural Network based acoustic model in Mongolian speech recognition

Dialog State Tracking and action selection using deep learning mechanism for interview coaching

Policy optimization of dialogue management in spoken dialogue system for out-of-domain utterances

Dialogue act recognition for Chinese out-of-domain utterances using hybrid CNN-RF

Annotating Chinese noun phrases based on Semantic Dependency graph

Sense Annotated Hindi Corpus

Towards building a standard dataset for Arabic keyphrase extraction evaluation

Semantic annotation for Mandarin verbal lexicon

Information extraction and text mining of Ancient Vattezhuthu characters in historical documents using image zoning

Content-based approach for Vietnamese spam SMS filtering

Detecting representative web articles using heterogeneous graphs

Word clustering for parallelism in Classical Chinese poems

Annotation scheme for legal discourse information and hierarchical levels

Syntactic characteristics and similarities of Japanese authors' writing styles: A kernel-based approach

An initial study of Indonesian semantic role labeling and its application on event extraction

Generating Manipuri English pronunciation dictionary using sequence labelling problem

Recurrent neural network-based language models with variation in net topology, language, and granularity

Verifying the long-range dependency of RNN language models

The effect of shallow segmentation on English-Tigrinya statistical machine translation

Filter options

Publication date

Keywords

INFONA - science communication portal

2016 International Conference on Asian Language Processing (IALP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2016 International Conference on Asian Language Processing (IALP)