Comparison of feedforward and recurrent neural network language models

M. Sundermeyer; I. Oparin; J.-L. Gauvain; B. Freiberg; R. Schluter; H. Ney

doi:10.1109/ICASSP.2013.6639310

Comparison of feedforward and recurrent neural network language models

Sundermeyer, M., Oparin, I., Gauvain, J.-L., Freiberg, B., Schluter, R., Ney, H.

Source

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 8430 - 8434

Abstract

Research on language modeling for speech recognition has increasingly focused on the application of neural networks. Two competing concepts have been developed: On the one hand, feedforward neural networks representing an n-gram approach, on the other hand recurrent neural networks that may learn context dependencies spanning more than a fixed number of predecessor words. To the best of our knowledge, no comparison has been carried out between feedforward and state-of-the-art recurrent networks when applied to speech recognition. This paper analyzes this aspect in detail on a well-tuned French speech recognition task. In addition, we propose a simple and efficient method to normalize language model probabilities across different vocabularies, and we show how to speed up training of recurrent neural networks by parallelization.