Language model adaptation based on Machine Translation (MT) is a recently proposed approach to improving the Automatic Speech Recognition (ASR) of spoken translations. Unlike approaches based on rescoring, it does not suffer from a common problem of that family, namely that errors made during recognition cannot be recovered by the MT system. In previous work we presented an efficient implementation for MT-based language model...
Due to their advantages over conventional n-gram language models, recurrent neural network language models (RNNLMs) have recently attracted a fair amount of research attention in the speech recognition community. In this paper, we explore one advantage of RNNLMs, namely the ease with which they allow the integration of additional knowledge sources. We concentrate on features that provide complementary...
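The ease of integration mentioned above comes from the fact that an RNNLM's input is simply a vector, so extra knowledge sources can be appended to the word embedding at each time step. The following is a minimal sketch of that idea, not the paper's model; all dimensions, weight names, and the zero feature vector are illustrative assumptions.

```python
import numpy as np

# Hypothetical sketch: a simple RNN language model whose input at each step
# is the word embedding concatenated with an auxiliary feature vector
# (e.g. topic or morphological features) -- the kind of complementary
# knowledge source an RNNLM can absorb directly.

rng = np.random.default_rng(0)
V, E, F, H = 10, 8, 4, 16      # vocab, embedding, feature, hidden sizes (toy values)

emb = rng.normal(scale=0.1, size=(V, E))
W_in = rng.normal(scale=0.1, size=(H, E + F))  # (embedding + features) -> hidden
W_hh = rng.normal(scale=0.1, size=(H, H))      # hidden -> hidden (recurrence)
W_out = rng.normal(scale=0.1, size=(V, H))     # hidden -> vocabulary logits

def step(word_id, feat, h):
    """One RNNLM step: embed the word, append the auxiliary features,
    update the hidden state, and return the next-word distribution."""
    x = np.concatenate([emb[word_id], feat])
    h = np.tanh(W_in @ x + W_hh @ h)
    logits = W_out @ h
    p = np.exp(logits - logits.max())  # softmax, numerically stabilised
    return p / p.sum(), h

h = np.zeros(H)
feat = np.zeros(F)                 # placeholder auxiliary feature vector
probs, h = step(3, feat, h)        # distribution over the next word
```

Because the feature vector enters the same input layer as the embedding, no change to the training procedure is needed; an n-gram model, by contrast, would require a separate factored or class-based extension for each new knowledge source.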
Compounding is one of the most productive word-formation processes in many languages and is therefore a major source of data sparsity in language modeling. Many solutions have been suggested for modeling compound words, most of which break the compound into its constituents and train a new model on them. In earlier work, we argued that this approach is suboptimal and presented a novel technique that...
In this paper we present a novel clustering technique for compound words. By mapping compounds onto their semantic heads, the technique is able to estimate n-gram probabilities for unseen compounds. We argue that compounds are well represented by their heads, which allows the clustering of rare words and reduces the risk of over-generalization. The semantic heads are obtained by a two-step process...
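The head-mapping idea can be illustrated with a toy bigram model: an unseen compound is mapped onto its head, and the head's n-gram statistics stand in for the compound. This sketch is purely illustrative; the longest-known-suffix rule below is a hypothetical stand-in for the paper's two-step head-extraction process, and the counts are invented.

```python
from collections import Counter

# Toy corpus statistics (invented for illustration).
bigrams = Counter({("the", "chair"): 4, ("the", "table"): 2})
unigrams = Counter({"the": 10, "chair": 5, "table": 3})

def head_of(word, known):
    """Map a compound to its (assumed) semantic head.
    Toy rule: the longest known suffix of the word acts as its head."""
    if word in known:
        return word
    for i in range(1, len(word)):
        if word[i:] in known:
            return word[i:]
    return word  # no known head found; fall back to the word itself

def bigram_prob(prev, word):
    """Bigram probability, with unseen compounds backed off to their head."""
    head = head_of(word, unigrams)
    return bigrams[(prev, head)] / unigrams[prev]

# "armchair" never occurred in the toy corpus; its head "chair"
# supplies the n-gram statistics instead.
p = bigram_prob("the", "armchair")
```

Since the head carries most of the compound's distributional behavior (an "armchair" occurs in roughly the same contexts as a "chair"), sharing the head's counts is less prone to over-generalization than pooling all rare words into a single unknown-word class.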
In this paper we investigate whether a layered architecture that has already proven its value on small tasks also works for a system with large lexica (400k words) and language models (5-grams). The architecture was designed to decouple phone and word recognition, which allows for the integration of more complex linguistic components, especially at the sub-word level. It was tested on the Dutch...