Search results for: Peter Bell

Items from 1 to 4 out of 4 results

chapter

Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features

Ondrej Klejch, Peter Bell, Steve Renals

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5700 - 5704

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we present an extension of our previously described neural machine translation based system for punctuated transcription. This extension allows the system to map from per frame acoustic features to word level representations by replacing the traditional encoder in the encoder-decoder architecture with a hierarchical encoder. Furthermore, we show that a system combining lexical and acoustic...

article

Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models

Peter Bell, Pawel Swietojanski, Steve Renals

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 2 > 238 - 247

This paper investigates the use of multitask learning to improve context-dependent deep neural network (DNN) acoustic models. The use of hybrid DNN systems with clustered triphone targets is now standard in automatic speech recognition. However, we suggest that using a single set of DNN targets in this manner may not be the most effective choice, since the targets are the result of a somewhat arbitrary...

chapter

The MGB-2 challenge: Arabic multi-dialect broadcast media recognition

Ahmed Ali, Peter Bell, James Glass, Yacine Messaoui, more

2016 IEEE Spoken Language Technology Workshop (SLT) > 279 - 284

2016 IEEE Spoken Language Technology Workshop (SLT)

This paper describes the Arabic Multi-Genre Broadcast (MGB-2) Challenge for SLT-2016. Unlike last year's English MGB Challenge, which focused on recognition of diverse TV genres, this year, the challenge has an emphasis on handling the diversity in dialect in Arabic speech. Audio data comes from 19 distinct programmes from the Aljazeera Arabic TV channel between March 2005 and December 2015. Programmes...

chapter

Grapheme-to-phoneme conversion methods for minority language conditions

Mengxue Cao, Steve Renals, Peter Bell, Aijun Li, more

2012 International Conference on Speech Database and Assessments > 151 - 156

2012 Oriental COCOSDA 2012 - International Conference on Speech Database and Assessments

This study attempts to investigate the grapheme-to-phoneme conversion approaches for minority language conditions. Instead of isolated-word data for major languages, sentence-form data is defined to be a proper form of training data for minority languages. Joint-multigram Model and Hidden Markov Model were examined in this study. The “treat-sentence-as-word” training method and the forced-alignment...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Publication type

book (3)
article (1)

INFONA - science communication portal

Search results for: Peter Bell

Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features

Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models

The MGB-2 challenge: Arabic multi-dialect broadcast media recognition

Grapheme-to-phoneme conversion methods for minority language conditions

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options