ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 1 to 6 out of 6 results

chapter

Reduction of acoustic model training time and required data passes via stochastic approaches to maximum likelihood and discriminative training

Petr Novak, Roman Otec, Antonio Lee, Vaibhava Goel

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5577 - 5581

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recent boom in use of speech recognition technology has made the access to potentially large amounts of training data easier. This, however, also constitutes a challenge in processing such large, continuously growing amount of information. Here we present a stochastic modification of traditional iterative training approach which leads to the same or even better accuracy of acoustic models and...

chapter

RASR/NN: The RWTH neural network toolkit for speech recognition

Simon Wiesler, Alexander Richard, Pavel Golik, Ralf Schluter, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3281 - 3285

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes the new release of RASR — the open source version of the well-proven speech recognition toolkit developed and used at RWTH Aachen University. The focus is put on the implementation of the NN module for training neural network acoustic models. We describe code design, configuration, and features of the NN module. The key feature is a high flexibility regarding the network topology,...

chapter

Improving deep neural network acoustic models using generalized maxout networks

Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 215 - 219

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently, maxout networks have brought significant improvements to various speech recognition and computer vision tasks. In this paper we introduce two new types of generalized maxout units, which we call p-norm and soft-maxout. We investigate their performance in Large Vocabulary Continuous Speech Recognition (LVCSR) tasks in various languages with 10 hours and 60 hours of data, and find that the...

chapter

Asynchronous stochastic optimization for sequence training of deep neural networks

Georg Heigold, Erik McDermott, Vincent Vanhoucke, Andrew Senior, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5587 - 5591

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper explores asynchronous stochastic optimization for sequence training of deep neural networks. Sequence training requires more computation than frame-level training using pre-computed frame data. This leads to several complications for stochastic optimization, arising from significant asynchrony in model updates under massive parallelization, and limited data shuffling due to utterance-chunked...

chapter

Joint training of convolutional and non-convolutional neural networks

Hagen Soltau, George Saon, Tara N. Sainath

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5572 - 5576

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We describe a simple modification of neural networks which consists in extending the commonly used linear layer structure to an arbitrary graph structure. This allows us to combine the benefits of convolutional neural networks with the benefits of regular networks. The joint model has only a small increase in parameter size and training and decoding time are virtually unaffected. We report significant...

chapter

Context dependent state tying for speech recognition using deep neural network acoustic models

Michiel Bacchiani, David Rybach

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 230 - 234

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes an algorithm to design a tied-state inventory for a context dependent, neural network-based acoustic model for speech recognition. Rather than relying on a GMM/HMM system that operates on a different feature space and is of a different model family, the proposed algorithm optimizes state tying on the activation vectors of the neural network directly. Experiments show the viability...

Filter options

Keywords:
ACOUSTIC MODELING

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (4)
NEURAL NETWORKS (3)
ASYNCHRONOUS STOCHASTIC OPTIMIZATION (1)
CNN (1)
CONTEXT MODELING (1)
DEEP LEARNING (1)
DEEP NEURAL NETWORKS (1)
DISCRIMINATIVE TRAINING (1)
GPU (1)
MAXOUT NETWORKS (1)
MLP (1)
OPEN SOURCE (1)
RASR (1)
SEQUENCE TRAINING (1)
STATE TYING (1)
STOCHASTIC TRAINING (1)
more

INFONA - science communication portal

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Reduction of acoustic model training time and required data passes via stochastic approaches to maximum likelihood and discriminative training

RASR/NN: The RWTH neural network toolkit for speech recognition

Improving deep neural network acoustic models using generalized maxout networks

Asynchronous stochastic optimization for sequence training of deep neural networks

Joint training of convolutional and non-convolutional neural networks

Context dependent state tying for speech recognition using deep neural network acoustic models

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)