2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In Cajun slang, Lagniappe means something extra, and ICASSP 2017 is really a Lagniappe. On behalf of ICASSP 2017 and IEEE Signal Processing Society, I welcome you to the beautiful and historic city of New Orleans. New Orleans is the heart of great bayous; the melting pot of Cajun, Zydeco, and Creole cultures; the capital of Jazz music; and the home of Mardi Gras.

chapter

Technical program chairs' overview

Tulay Adali, Eli Saber

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > xxx - xxxii

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Welcome to ICASSP 2017, the world's largest and most comprehensive conference on Acoustics, Speech, and Signal Processing, held this year in the beautiful city of New Orleans, Louisiana—the home of jazz, year long festivities, and a unique cuisine with a Cajun kick. We are happy to welcome you to New Orleans and hope you will enjoy the colors of the city, the music, the active nightlife, and of course...

chapter

Informed source separation via compressive graph signal sampling

Gilles Puy, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Perez

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1 - 5

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a novel informed source separation method for audio object coding based on a recent sampling theory for smooth signals on graphs. Assuming that only one source is active at each time-frequency point, we compute an ideal map indicating which source is active at each time-frequency point at the encoder. This map is then sampled with a compressive graph signal sampling strategy that guarantees...

chapter

Motion informed audio source separation

Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6 - 10

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we tackle the problem of single channel audio source separation driven by descriptors of the sounding object's motion. As opposed to previous approaches, motion is included as a soft-coupling constraint within the nonnegative matrix factorization framework. The proposed method is applied to a multimodal dataset of instruments in string quartet performance recordings where bow motion...

chapter

Supervised monaural source separation based on autoencoders

Keiichi Osako, Yuki Mitsufuji, Rita Singh, Bhiksha Raj

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 11 - 15

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a new supervised monaural source separation based on autoencoders. We employ the autoencoder for the dictionary training such that the nonlinear network can encode the target source with high expressiveness. The dictionary is trained by each target source without the mixture signal, which makes the system independent from the context where the dictionaries will be used. In...

chapter

An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures

Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 16 - 20

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We present a probabilistic model for joint source separation and diarisation of multichannel convolutive speech mixtures. We build upon the framework of local Gaussian model (LGM) with non-negative matrix factorization (NMF). The diarisation is introduced as a temporal labeling of each source in the mix as active or inactive at the short-term frame level. We devise an EM algorithm in which the source...

chapter

Blind source separation based on independent low-rank matrix analysis with sparse regularization for time-series activity

Yoshiki Mitsui, Daichi Kitamura, Shinnosuke Takamichi, Nobutaka Ono, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 21 - 25

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a new blind source separation (BSS) method based on independent low-rank matrix analysis (ILRMA) with novel sparse regularization. ILRMA is a recently proposed BSS algorithm that simultaneously estimates a demixing matrix and source spectrogram models based on nonnegative matrix factorization (NMF). To improve the separation accuracy and stability, an additional constraint...

chapter

Multichannel audio source separation: Variational inference of time-frequency sources from time-domain observations

Simon Leglaive, Roland Badeau, Gael Richard

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 26 - 30

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A great number of methods for multichannel audio source separation are based on probabilistic approaches in which the sources are modeled as latent random variables in a Time-Frequency (TF) domain. For reverberant mixtures, it is common to approximate the time-domain convolutive mixing process as being instantaneous in the short-term Fourier transform domain, under a short mixing filters assumption...

chapter

Overlapping sound event detection with supervised Nonnegative Matrix Factorization

Victor Bisot, Slim Essid, Gael Richard

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 31 - 35

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we propose a supervised Nonnegative Matrix Factorization (NMF) model for overlapping sound event detection in real life audio. We start by highlighting the usefulness of non-euclidean NMF to learn representations for detecting and classifying acoustic events in a multi-label setting. Then, we propose to learn a classifier and the NMF decomposition in a joint optimization problem. This...

chapter

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification

Romain Serizel, Victor Bisot, Slim Essid, Gael Richard

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 36 - 40

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents supervised feature learning approaches for speaker identification that rely on nonnegative matrix factorisation. Recent studies have shown that group nonnegative matrix factorisation and task-driven supervised dictionary learning can help performing effective feature learning for audio classification problems. This paper proposes to integrate a recent method that relies on group...

chapter

Tracking metrical structure changes with sparse-NMF

Elio Quinton, Ken O'Hanlon, Simon Dixon, Mark Sandler

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 41 - 45

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The estimation of rhythmic properties such as tempo, beat positions or metrical structure are central aspects of Music Information Retrieval (MIR) research. Meter inference algorithms are typically designed to track metrical structure in presence of mild deviations of the feature estimates over time in order to account for performance imprecisions, expressive timing or musical effects such as accelerando...

chapter

Drum extraction in single channel audio signals using multi-layer Non negative Matrix Factor Deconvolution

Clement Laroche, Helene Papadopoulos, Matthieu Kowalski, Gael Richard

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 46 - 50

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a supervised multilayer factorization method designed for harmonic/percussive source separation and drum extraction. Our method decomposes the audio signals in sparse orthogonal components which capture the harmonic content, while the drum is represented by an extension of non negative matrix factorization which is able to exploit time-frequency dictionaries to take into...

INFONA - science communication portal

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Author index

Cover page

Title page

Copyright page

Organizing committee

Technical program committee

Reviewers

General chairs' welcome

Table of contents

Technical program chairs' overview

Informed source separation via compressive graph signal sampling

Motion informed audio source separation

Supervised monaural source separation based on autoencoders

An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures

Blind source separation based on independent low-rank matrix analysis with sparse regularization for time-series activity

Multichannel audio source separation: Variational inference of time-frequency sources from time-domain observations

Overlapping sound event detection with supervised Nonnegative Matrix Factorization

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification

Tracking metrical structure changes with sparse-NMF

Drum extraction in single channel audio signals using multi-layer Non negative Matrix Factor Deconvolution

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)