2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 1 to 7 out of 7 results

chapter

Transductive nonnegative matrix factorization for semi-supervised high-performance speech separation

Naiyang Guan, Long Lan, Dacheng Tao, Zhigang Luo, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2534 - 2538

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Regarding the non-negativity property of the magnitude spectrogram of speech signals, nonnegative matrix factorization (NMF) has obtained promising performance for speech separation by independently learning a dictionary on the speech signals of each known speaker. However, traditional NM-F fails to represent the mixture signals accurately because the dictionaries for speakers are learned in the absence...

chapter

Single-channel speech separation with memory-enhanced recurrent neural networks

Felix Weninger, Florian Eyben, Bjorn Schuller

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3709 - 3713

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we propose the use of Long Short-Term Memory recurrent neural networks for speech enhancement. Networks are trained to predict clean speech as well as noise features from noisy speech features, and a magnitude domain soft mask is constructed from these features. Extensive tests are run on 73 k noisy and reverberated utterances from the Audio-Visual Interest Corpus of spontaneous, emotionally...

chapter

Discriminative non-negative matrix factorization for single-channel speech separation

Zi Wang, Fei Sha

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3749 - 3753

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Non-negative matrix factorization (NMF) has emerged as a promising approach for single-channel speech separation. In this paper, we propose a new method of discriminative learning of NMF. In contrast to conventional approaches where the basis vectors are learned independently on clean signals from each speaker, our approach optimizes all basis vectors jointly to reconstruct both clean signals and...

chapter

A structure-preserving training target for supervised speech separation

Yuxuan Wang, DeLiang Wang

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6107 - 6111

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Supervised learning based speech separation has shown considerable success recently. In its simplest form, a discriminative model is trained as a time-frequency masking function, where the training target is an ideal mask. Ideal masks, such as the ideal binary masks, are structured spectro-temporal patterns. However, previous formulations do not model prominent output structure. In this paper, we...

chapter

Deep stacking networks with time series for speech separation

Shuai Nie, Hui Zhang, XueLiang Zhang, WenJu Liu

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6667 - 6671

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In many present speech separation approaches, the separation task is formulated as a binary classification problem. Several classification-based approaches have been proposed and performed satisfactorily. However, they do not explicitly model the correlation in time and each time-frequency (T-F) unit is still classified individually. As we know, the speech signal has a very rich time series and temporal...

chapter

A two-stage approach for improving the perceptual quality of separated speech

Donald S. Williamson, Yuxuan Wang, DeLiang Wang

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7034 - 7038

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Binary time-frequency masking and model-based nonnegative matrix factorization (NMF) are two common approaches to speech separation. However, binary masking often suffers from poor perceptual quality, while NMF typically requires pretrained models for both speech and noise and frequently does not perform well. In this paper we examine whether a single or two-stage approach should be used for performing...

chapter

A feature study for classification-based speech separation at very low signal-to-noise ratio

Jitong Chen, Yuxuan Wang, DeLiang Wang

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7039 - 7043

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speech separation is a challenging problem at low signal-to-noise ratios (SNRs). Separation can be formulated as a classification problem. In this study, we focus on the SNR level of −5 dB in which speech is generally dominated by background noise. In such a low SNR condition, extracting robust features from a noisy mixture is crucial for successful classification. Using a common neural network classifier,...

Filter options

Keywords:
SPEECH SEPARATION

Publication date

Set your own date range

Keywords

NONNEGATIVE MATRIX FACTORIZATION (2)
ARMA FILTERING (1)
BINARY CLASSIFICATION (1)
BINARY MASKING (1)
CLASSIFICATION (1)
COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA) (1)
DEEP NEURAL NETWORKS (1)
DEEP STACKING NETWORKS (1)
DISCRIMINATIVE TRAINING (1)
LONG SHORT-TERM MEMORY (1)
MULTIRESOLUTION COCHLEAGRAM (1)
NON-NEGATIVE MATRIX FACTORIZATION (1)
RECURRENT NEURAL NETWORKS (1)
SPECTRO-TEMPORAL PATTERNS (1)
SPEECH ENHANCEMENT (1)
SPEECH QUALITY (1)
TRAINING TARGET (1)
TRANSDUCTIVE LEARNING (1)
more

INFONA - science communication portal

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Transductive nonnegative matrix factorization for semi-supervised high-performance speech separation

Single-channel speech separation with memory-enhanced recurrent neural networks

Discriminative non-negative matrix factorization for single-channel speech separation

A structure-preserving training target for supervised speech separation

Deep stacking networks with time series for speech separation

A two-stage approach for improving the perceptual quality of separated speech

A feature study for classification-based speech separation at very low signal-to-noise ratio

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)