The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Welcome to spectacular Vancouver for the 38th edition of ICASSP, the premier conference in Signal Processing to be held at the Vancouver Convention Center in British Columbia, Canada. This year, we received 3314 regular paper submissions (not including special session papers). Submission figures are listed below with topics represented by a Technical Committee (TC) of Signal Processing Society.
On behalf of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2013 Organizing Committee, we would like to cordially welcome you to Vancouver, a city that has been chosen by the United Nations as the world's “Most Livable City” eight times in the last 10 years.
We propose an innovative approach for music description at several time-scales in a single unified formalism. More specifically, chord information at the analysis-frame level and global semantic structure are integrated in an elegant and flexible model. Using Markov Logic Networks (MLNs) low-level signal features are encoded with high-level information expressed by logical rules, without the need...
We introduce a novel method for the transcription of polyphonic piano music by discriminative training of support vector machines (SVMs). As features, we use pitch activations computed by supervised non-negative matrix factorization from low-level spectral features. Different approaches to low-level feature extraction, NMF dictionary learning and activation feature extraction are analyzed in a large-scale...
This paper investigates how precise a model should be for a robust model-based NMF analysis of piano recordings. While inharmonicity is an essential feature of piano tones from a perceptual point of view, its explicit inclusion in sound models is not straightforward and may even damage the quality of the analysis. Here, we assess the quality of the analysis with a transcription task, and compare three...
Automatic Music Transcription (AMT) seeks to understand a musical piece in terms of note activities. Matrix decomposition methods are often used for AMT, seeking to decompose a spectrogram over a dictionary matrix of note-specific template vectors. The performance of these methods can suffer due to the large harmonic overlap found in tonal musical spectra. We propose a row weighting scheme that transforms...
A common approach to the detection of simultaneous musical notes in an acoustic recording involves defining a function that yields activation levels for each candidate musical note over time. These levels tend to be high when the note is active and low when it is not. Therefore, by applying a simple threshold decision process, it is possible to decide whether each note is active or not at a given...
For a user-assisted music transcription system in which the user is asked to label some notes for each instrument in the recording, we investigate ways to limit the amount of information the user has to provide. Different methods are proposed and experimentally compared that enable the estimation of template spectra at pitch positions that have not been annotated by the user, in order to derive a...
The parametric loudspeaker is a novel type of loudspeaker that can project a directional sound beam. It is commonly used in creating personal sound zone and projecting private messages to a targeted audience. However, the parametric loudspeaker possesses a very poor bass (or low-frequency) response due inherently to the nonlinear acoustic principle generating sound from ultrasound in air. A psychoacoustic...
Small and flat loudspeakers usually result in poor low-frequency (or bass) responses. Conventional gain equalization does not help significantly and may even result in overdriving and distortion. A psychoacoustic approach has been found to be suitable in tricking the human ear to perceive the fundamental frequency from its higher harmonics. Past research efforts have generally focused on weighting...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.