ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 1 to 13 out of 13 results

chapter

Alternating direction method of multipliers for non-negative matrix factorization with the beta-divergence

Dennis L. Sun, Cedric Fevotte

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6201 - 6205

Non-negative matrix factorization (NMF) is a popular method for learning interpretable features from non-negative data, such as counts or magnitudes. Different cost functions are used with NMF in different applications. We develop an algorithm, based on the alternating direction method of multipliers, that tackles NMF problems whose cost function is a beta-divergence, a broad class of divergence functions...

chapter

Piecewise constant nonnegative matrix factorization

N. Seichepine, S. Essid, C. Fevotte, O. Cappe

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6721 - 6725

In this paper we propose a non-negative matrix factorization (NMF) model with piecewise-constant activation coefficients. This structure is enforced using a total variation penalty on the rows of the activation matrix. The resulting optimization problem is solved with a majorization-minimization procedure. The proposed algorithm is well suited to analyze data explained by underlying piecewise-constant...

chapter

Exploiting long-term temporal dependencies in NMF using recurrent neural networks with application to source separation

Nicolas Boulanger-Lewandowski, Gautham J. Mysore, Matthew Hoffman

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6969 - 6973

This paper seeks to exploit high-level temporal information during feature extraction from audio signals via non-negative matrix factorization. Contrary to existing approaches that impose local temporal constraints, we train powerful recurrent neural network models to capture long-term temporal dependencies and event co-occurrence in the data. This gives our method the ability to “fill in the blanks”...

chapter

Discriminative non-negative matrix factorization for single-channel speech separation

Zi Wang, Fei Sha

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3749 - 3753

Non-negative matrix factorization (NMF) has emerged as a promising approach for single-channel speech separation. In this paper, we propose a new method of discriminative learning of NMF. In contrast to conventional approaches where the basis vectors are learned independently on clean signals from each speaker, our approach optimizes all basis vectors jointly to reconstruct both clean signals and...

chapter

Speech-guided source separation using a pitch-adaptive guide signal model

Romain Hennequin, Juan Jose Burred, Simon Maller, Pierre Leveau

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6672 - 6676

In this paper, we present a new method to perform underdetermined audio source separation using a spoken or sung reference signal to inform the separation process. This method explicitly models possible differences between the spoken reference and the target signal, such as pitch differences and time lag. We show that the proposed algorithm outperforms state-of-the-art methods.

chapter

A study of instrument-wise onset detection in Beijing Opera percussion ensembles

Mi Tian, Ajay Srinivasamurthy, Mark Sandler, Xavier Serra

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2159 - 2163

Note onset detection and instrument recognition are two of the most investigated tasks in Music Information Retrieval (MIR). Various detection methods have been proposed in previous research for western music, with less focus on other music cultures of the world. In this paper, we focus on onset detection for percussion instruments in Beijing Opera, a major genre of Chinese traditional music. A dataset...

chapter

Phase constrained complex NMF: Separating overlapping partials in mixtures of harmonic musical sources

James Bronson, Philippe Depalle

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7475 - 7479

This paper examines complex non-negative matrix factorization (CMF) as a tool for separating overlapping partials in mixtures of harmonic musical sources. Unlike non-negative matrix factorization (NMF), CMF allows for the development of source separation procedures founded on a mixture model rooted in the complex-spectrum domain (in which the superposition of overlapping sources is preserved). This...

chapter

Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization

Joonas Nikunen, Tuomas Virtanen

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6677 - 6681

This paper studies multichannel audio separation using non-negative matrix factorization (NMF) combined with a new model for spatial covariance matrices (SCM). The proposed model for SCMs is parameterized by source direction of arrival (DoA) and its parameters can be optimized to yield a spatially coherent solution over frequencies thus avoiding permutation ambiguity and spatial aliasing. The model...

chapter

Multimodal voice conversion using non-negative matrix factorization in noisy environments

Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1542 - 1546

This paper presents a multimodal voice conversion (VC) method for noisy environments. In our previous NMF-based VC method, source exemplars and target exemplars are extracted from parallel training data, in which the same texts are uttered by the source and target speakers. The input source signal is then decomposed into source exemplars, noise exemplars obtained from the input signal, and their weights...

chapter

Non-negative source-filter dynamical system for speech enhancement

Umut Simsekli, Jonathan Le Roux, John R. Hershey

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6206 - 6210

Model-based speech enhancement methods, which rely on separately modeling the speech and the noise, have been shown to be powerful in many different problem settings. When the structure of the noise can be arbitrary, which is often the case in practice, model-based methods have to focus on developing good speech models, whose quality will be key to their performance. In this study, we propose a novel...

chapter

Speech enhancement combining statistical models and NMF with update of speech and noise bases

Kisoo Kwon, Jong Won Shin, Sukanya Sonowat, Inkyu Choi, et al.

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7053 - 7057

Speech enhancement based on statistical models has shown good performance, but the performance degrades when environment noise is highly non-stationary due to the stationary assumption. On the contrary, the template-based enhancement methods are more robust to non-stationary noise, but these are heavily dependent on a priori information present in training data. In order to get over both of the shortcomings,...

chapter

Active-set Newton algorithm for non-negative sparse coding of audio

Tuomas Virtanen, Bhiksha Raj, Jort F. Gemmeke, Hugo Van hamme

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3092 - 3096

We propose a new algorithm to efficiently obtain non-negative sparse representations for audio. The spectrum of an audio signal is represented as a sparse linear combination of atoms taken from an overcomplete dictionary. The algorithm is based on minimizing the generalized Kullback-Leibler divergence between an observed magnitude spectrum and a non-negative linear combination of atoms, plus an ℓ₁...

chapter

Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition

Yi Luan, Daisuke Saito, Yosuke Kashiwagi, Nobuaki Minematsu, et al.

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1745 - 1748

The exemplar-based approaches, which model signals as a sparse linear combination of exemplars of signals, are proved to have state-of-the-art performance in noise robust ASR, especially on low SNRs. However, since both the speech exemplars and noise exemplars are built from training data and are fixed throughout the process of enhancing speech features, the conventional approach is especially weak...
