Search results for: Aleksej Chinaev

chapter

A generalized log-spectral amplitude estimator for single-channel speech enhancement

Aleksej Chinaev, Reinhold Haeb-Umbach

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4980 - 4984

The benefits of both a logarithmic spectral amplitude (LSA) estimation and a modeling in a generalized spectral domain (where short-time amplitudes are raised to a generalized power exponent, not restricted to magnitude or power spectrum) are combined in this contribution to achieve a better tradeoff between speech quality and noise suppression in single-channel speech enhancement. A novel gain function...

chapter

BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge

Jahn Heymann, Lukas Drude, Aleksej Chinaev, Reinhold Haeb-Umbach

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) > 444 - 451

We present a new beamformer front-end for Automatic Speech Recognition and apply it to the 3rd-CHiME Speech Separation and Recognition Challenge. Without any further modification of the back-end, we achieve a 53% relative reduction of the word error rate over the best baseline enhancement system for the relevant test data set. Our approach leverages the power of a bi-directional Long Short-Term Memory...

chapter

Towards online source counting in speech mixtures applying a variational EM for complex Watson mixture models

Lukas Drude, Aleksej Chinaev, Dang Hai Tran Vu, Reinhold Haeb-Umbach

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) > 213 - 217

This contribution describes a step-wise source counting algorithm to determine the number of speakers in an offline scenario. Each speaker is identified by a variational expectation maximization (VEM) algorithm for complex Watson mixture models and therefore directly yields beamforming vectors for a subsequent speech separation process. An observation selection criterion is proposed which improves...

chapter

Source counting in speech mixtures using a variational EM approach for complex WATSON mixture models

Lukas Drude, Aleksej Chinaev, Dang Hai Tran Vu, Reinhold Haeb-Umbach

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6834 - 6838

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this contribution we derive a variational EM (VEM) algorithm for model selection in complex Watson mixture models, which have been recently proposed as a model of the distribution of normalized microphone array signals in the short-time Fourier transform domain. The VEM algorithm is applied to count the number of active sources in a speech mixture by iteratively estimating the mode vectors of the...

chapter

Improved single-channel nonstationary noise tracking by an optimized MAP-based postprocessor

Aleksej Chinaev, Reinhold Haeb-Umbach, Jalal Taghia, Rainer Martin

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 7477 - 7481

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we present an improved version of the recently proposed Maximum A-Posteriori (MAP) based noise power spectral density estimator. An empirical bias compensation and bandwidth adjustment reduce bias and variance of the noise variance estimates. The main advantage of the MAP-based postprocessor is its low estimation variance. The estimator is employed in the second stage of a two-stage...

chapter

MAP-based estimation of the parameters of a Gaussian Mixture Model in the presence of noisy observations

Aleksej Chinaev, Reinhold Haeb-Umbach

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 3352 - 3356

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this contribution we derive the Maximum A-Posteriori (MAP) estimates of the parameters of a Gaussian Mixture Model (GMM) in the presence of noisy observations. We assume the distortion to be white Gaussian noise of known mean and variance. An approximate conjugate prior of the GMM parameters is derived allowing for a computationally efficient implementation in a sequential estimation framework...

chapter

Improved noise power spectral density tracking by a MAP-based postprocessor

Aleksej Chinaev, Alexander Krueger, Dang Hai Tran Vu, Reinhold Haeb-Umbach

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4041 - 4044

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper we present a novel noise power spectral density tracking algorithm and its use in single-channel speech enhancement. It has the unique feature that it is able to track the noise statistics even if speech is dominant in a given time-frequency bin. As a consequence it can follow non-stationary noise superposed by speech, even in the critical case of rising noise power. The algorithm requires...
