Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Welcome to spectacular Vancouver for the 38th edition of ICASSP, the premier conference in Signal Processing to be held at the Vancouver Convention Center in British Columbia, Canada. This year, we received 3314 regular paper submissions (not including special session papers). Submission figures are listed below with topics represented by a Technical Committee (TC) of Signal Processing Society.

chapter

General chairs' message

Rabab Ward, Li Deng

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > xvi - xvii

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

On behalf of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2013 Organizing Committee, we would like to cordially welcome you to Vancouver, a city that has been chosen by the United Nations as the world's “Most Livable City” eight times in the last 10 years.

chapter

Future SPS conferences

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > xx

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

chapter

Exploiting structural relationships in audio music signals using Markov Logic Networks

Helene Papadopoulos, George Tzanetakis

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 1 - 5

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose an innovative approach for music description at several time-scales in a single unified formalism. More specifically, chord information at the analysis-frame level and global semantic structure are integrated in an elegant and flexible model. Using Markov Logic Networks (MLNs) low-level signal features are encoded with high-level information expressed by logical rules, without the need...

chapter

A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization

Felix Weninger, Christian Kirst, Bjorn Schuller, Hans-Joachim Bungartz

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 6 - 10

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We introduce a novel method for the transcription of polyphonic piano music by discriminative training of support vector machines (SVMs). As features, we use pitch activations computed by supervised non-negative matrix factorization from low-level spectral features. Different approaches to low-level feature extraction, NMF dictionary learning and activation feature extraction are analyzed in a large-scale...

chapter

Does inharmonicity improve an NMF-based piano transcription model?

Francois Rigaud, Antoine Falaize, Bertrand David, Laurent Daudet

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 11 - 15

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates how precise a model should be for a robust model-based NMF analysis of piano recordings. While inharmonicity is an essential feature of piano tones from a perceptual point of view, its explicit inclusion in sound models is not straightforward and may even damage the quality of the analysis. Here, we assess the quality of the analysis with a transcription task, and compare three...

chapter

Automatic Music Transcription using row weighted decompositions

Ken O'Hanlon, Mark D. Plumbley

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 16 - 20

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Automatic Music Transcription (AMT) seeks to understand a musical piece in terms of note activities. Matrix decomposition methods are often used for AMT, seeking to decompose a spectrogram over a dictionary matrix of note-specific template vectors. The performance of these methods can suffer due to the large harmonic overlap found in tonal musical spectra. We propose a row weighting scheme that transforms...

chapter

Unsupervised training of detection threshold for polyphonic musical note tracking based on event periodicity

Tiago Fernandes Tavares, Jayme Garcia Arnal Barbedo, Romis Attux, Amauri Lopes

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 21 - 25

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A common approach to the detection of simultaneous musical notes in an acoustic recording involves defining a function that yields activation levels for each candidate musical note over time. These levels tend to be high when the note is active and low when it is not. Therefore, by applying a simple threshold decision process, it is possible to decide whether each note is active or not at a given...

chapter

Missing template estimation for user-assisted music transcription

Holger Kirchhoff, Simon Dixon, Anssi Klapuri

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 26 - 30

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

For a user-assisted music transcription system in which the user is asked to label some notes for each instrument in the recording, we investigate ways to limit the amount of information the user has to provide. Different methods are proposed and experimentally compared that enable the estimation of template spectra at pitch positions that have not been annotated by the user, in order to derive a...

chapter

A psychoacoustical preprocessing technique for virtual bass enhancement of the parametric loudspeaker

Chuang Shi, Hao Mu, Woon-Seng Gan

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 31 - 35

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The parametric loudspeaker is a novel type of loudspeaker that can project a directional sound beam. It is commonly used in creating personal sound zone and projecting private messages to a targeted audience. However, the parametric loudspeaker possesses a very poor bass (or low-frequency) response due inherently to the nonlinear acoustic principle generating sound from ultrasound in air. A psychoacoustic...

chapter

A timbre matching approach to enhance audio quality of psychoacoustic bass enhancement system

Hao Mu, Woon-Seng Gan, Ee-Leng Tan

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 36 - 40

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Small and flat loudspeakers usually result in poor low-frequency (or bass) responses. Conventional gain equalization does not help significantly and may even result in overdriving and distortion. A psychoacoustic approach has been found to be suitable in tricking the human ear to perceive the fundamental frequency from its higher harmonics. Past research efforts have generally focused on weighting...

INFONA - science communication portal

2013 IEEE International Conference on Acoustics, Speech and Signal Processing

Author index

Front cover

Title page

Copyright page

Blank page

ICASSP 2013 conference committee

Reviewers

Technical program committee

Technical chair's overview

General chairs' message

Future SPS conferences

Table of contents

Exploiting structural relationships in audio music signals using Markov Logic Networks

A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization

Does inharmonicity improve an NMF-based piano transcription model?

Automatic Music Transcription using row weighted decompositions

Unsupervised training of detection threshold for polyphonic musical note tracking based on event periodicity

Missing template estimation for user-assisted music transcription

A psychoacoustical preprocessing technique for virtual bass enhancement of the parametric loudspeaker

A timbre matching approach to enhance audio quality of psychoacoustic bass enhancement system

Filter options

Publication date

Keywords

INFONA - science communication portal

2013 IEEE International Conference on Acoustics, Speech and Signal Processing $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2013 IEEE International Conference on Acoustics, Speech and Signal Processing