Novel deep autoencoder features for non-intrusive speech quality assessment

Meet H. Soni; Hemant A. Patil

doi:10.1109/EUSIPCO.2016.7760662

Source

2016 24th European Signal Processing Conference (EUSIPCO), pp. 2315-2319

Abstract

Emulating human perception in quality assessment requires an objective metric or assessment method, and designing one is a challenging task. Assessing the quality of speech without any reference signal or ground truth is more difficult still. In this paper, we propose a new non-intrusive metric for the objective evaluation of speech quality. The originality of the proposed scheme lies in using a deep autoencoder to extract low-dimensional features from the spectrum of the speech signal and in finding a mapping between these features and subjective scores using an artificial neural network (ANN). We show that autoencoder features capture noise information better than state-of-the-art Filterbank Energies (FBEs). Our experimental results suggest that the proposed metric gives more accurate and better-correlated scores than the ITU-T P.563 standard, an existing benchmark for objective, non-intrusive quality assessment.
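The two-stage idea in the abstract (compress spectra with an autoencoder, then map the compressed features to subjective scores) can be illustrated with a minimal NumPy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the data are synthetic, the autoencoder has a single hidden layer rather than a deep stack, a linear least-squares fit stands in for the ANN, and the names `train_autoencoder`, `noise_level`, and `mos` are hypothetical. The correlation is computed on the training data only, so it says nothing about real performance.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_autoencoder(X, n_hidden=4, epochs=500, lr=0.5, seed=0):
    """Single-hidden-layer autoencoder trained by batch gradient descent on MSE.
    (Illustrative stand-in for a deep autoencoder.)"""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, n_hidden)); b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.1, (n_hidden, d)); b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)          # low-dimensional code
        Y = H @ W2 + b2                   # linear reconstruction of the input
        err = Y - X
        losses.append(float(np.mean(err ** 2)))
        gY = 2.0 * err / err.size         # dLoss/dY for the mean squared error
        gH = gY @ W2.T                    # backpropagate before updating W2
        gZ = gH * H * (1.0 - H)           # through the sigmoid
        W2 -= lr * (H.T @ gY); b2 -= lr * gY.sum(axis=0)
        W1 -= lr * (X.T @ gZ); b1 -= lr * gZ.sum(axis=0)
    return W1, b1, losses

# Synthetic stand-in data: each "utterance" is a 20-bin spectrum whose shape
# depends on a hidden noise level; the score drops as the noise level rises.
rng = np.random.default_rng(1)
n_utts, n_bins = 200, 20
noise_level = rng.uniform(0.0, 1.0, n_utts)
clean_shape = np.linspace(1.0, 0.2, n_bins)
noise_shape = np.linspace(0.1, 1.0, n_bins)
spectra = (clean_shape + np.outer(noise_level, noise_shape)
           + 0.05 * rng.normal(size=(n_utts, n_bins)))
mos = 4.5 - 3.0 * noise_level + 0.1 * rng.normal(size=n_utts)  # synthetic scores

# Stage 1: standardize the spectra and extract bottleneck features.
X = (spectra - spectra.mean(axis=0)) / spectra.std(axis=0)
W1, b1, losses = train_autoencoder(X)
features = sigmoid(X @ W1 + b1)

# Stage 2: map features to scores (least squares here, an ANN in the paper).
A = np.hstack([features, np.ones((n_utts, 1))])
coef, *_ = np.linalg.lstsq(A, mos, rcond=None)
predicted = A @ coef

# Pearson correlation between predicted and synthetic subjective scores.
r = float(np.corrcoef(predicted, mos)[0, 1])
print(f"reconstruction MSE: {losses[0]:.3f} -> {losses[-1]:.3f}, correlation: {r:.3f}")
```

A realistic evaluation would hold out utterances for testing and compare the predicted scores against listener ratings; the sketch only shows how the pieces connect.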