S. Williamson

article

Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising

Donald S. Williamson, DeLiang Wang

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 7 > 1492 - 1501

In real-world situations, speech is masked by both background noise and reverberation, which negatively affect perceptual quality and intelligibility. In this paper, we address monaural speech separation in reverberant and noisy environments. We perform dereverberation and denoising using supervised learning with a deep neural network. Specifically, we enhance the magnitude and phase by performing...

chapter

Speech dereverberation and denoising using complex ratio masks

Donald S. Williamson, DeLiang Wang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5590 - 5594

Traditional speech separation systems enhance the magnitude response of noisy speech. Recent studies, however, have shown that perceptual speech quality is significantly improved when magnitude and phase are both enhanced. These studies, however, have not determined if phase enhancement is beneficial in environments that contain reverberation as well as noise. In this paper, we present an approach...

chapter

Complex ratio masking for joint enhancement of magnitude and phase

Donald S. Williamson, Yuxuan Wang, DeLiang Wang

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5220 - 5224

The phase response of noisy speech has largely been ignored, but recent research shows the importance of phase for perceptual speech quality. A few phase enhancement approaches have been developed. These systems, however, require a separate algorithm for enhancing the magnitude response. In this paper, we present a novel framework for performing monaural speech separation in the complex domain. We...

chapter

Deep neural networks for estimating speech model activations

Donald S. Williamson, Yuxuan Wang, DeLiang Wang

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5113 - 5117

This paper presents an approach for improving the perceptual quality of speech separated from background noise at low signal-to-noise ratios. Our approach uses two stages of deep neural networks, where the first stage estimates the ideal ratio mask that separates speech from noise, and the second stage maps the ratio-masked speech to the clean speech activation matrices that are used for nonnegative...

chapter

A two-stage approach for improving the perceptual quality of separated speech

Donald S. Williamson, Yuxuan Wang, DeLiang Wang

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7034 - 7038

Binary time-frequency masking and model-based nonnegative matrix factorization (NMF) are two common approaches to speech separation. However, binary masking often suffers from poor perceptual quality, while NMF typically requires pretrained models for both speech and noise and frequently does not perform well. In this paper we examine whether a single or two-stage approach should be used for performing...

chapter

A sparse representation approach for perceptual quality improvement of separated speech

Donald S. Williamson, Yuxuan Wang, DeLiang Wang

2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7015 - 7019

Speech separation based on time-frequency masking has been shown to improve intelligibility of speech signals corrupted by noise. A perceived weakness of binary masking is the quality of separated speech. In this paper, an approach for improving the perceptual quality of separated speech from binary masking is proposed. Our approach consists of two stages, where a binary mask is generated in the first...

INFONA - science communication portal

Search results for: S. Williamson
