2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 1 to 8 out of 8 results

chapter

Multi-scale modulation filtering in automatic detection of emotions in telephone speech

Jouni Pohjalainen, Paavo Alku

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 980 - 984

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This study investigates emotion detection from noise-corrupted telephone speech. A generic modulation filtering approach for audio pattern recognition is proposed that utilizes inherent long-term properties of acoustic features in different classes. When applied to binary classification along the activation and valence dimensions, filtering the baseline short-time timbral features in both the training...

chapter

UT-Vocal Effort II: Analysis and constrained-lexicon recognition of whispered speech

Shabnam Ghaffarzadegan, Hynek Boril, John H. L. Hansen

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2544 - 2548

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This study focuses on acoustic variations in speech introduced by whispering, and proposes several strategies to improve robustness of automatic speech recognition of whispered speech with neutral-trained acoustic models. In the analysis part, differences in neutral and whispered speech captured in the UT-Vocal Effort II corpus are studied in terms of energy, spectral slope, and formant center frequency...

chapter

Trajectory analysis of speech using continuous state hidden Markov Models

P. Weber, S. M. Houghton, C. J. Champion, M. J. Russell, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3042 - 3046

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Many current speech models used in recognition involve thousands of parameters, whereas the mechanisms of speech production are conceptually very simple. We present and evaluate a new continuous state probabilistic model (CS-HMM) for recovering dwell-transition and phoneme sequences from dynamic speech production features. We show that with very few parameters, these features can be tracked, and phoneme...

chapter

Ecologically valid long-term mood monitoring of individuals with bipolar disorder using speech

Zahi N. Karam, Emily Mower Provost, Satinder Singh, Jennifer Montgomery, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4858 - 4862

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speech patterns are modulated by the emotional and neurophysiological state of the speaker. There exists a growing body of work that computationally examines this modulation in patients suffering from depression, autism, and post-traumatic stress disorder. However, the majority of the work in this area focuses on the analysis of structured speech collected in controlled environments. Here we expand...

chapter

Robust full-band adaptive Sinusoidal analysis and synthesis of speech

George P. Kafentzis, Olivier Rosec, Yannis Stylianou

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6260 - 6264

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recent advances in speech analysis have shown that voiced speech can be very well represented using quasi-harmonic frequency tracks and local parameter adaptivity to the underlying signal. In this paper, we revisit the quasi-harmonicity approach through the extended adaptive Quasi-Harmonic Model — eaQHM, and we show that the application of a continuous f₀ estimation method plus an adaptivity scheme...

chapter

Time varying linear prediction using sparsity constraints

Srikanth Raj Chetupalli, T. V. Sreenivas

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6290 - 6293

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Time-varying linear prediction has been studied in the context of speech signals, in which the auto-regressive (AR) coefficients of the system function are modeled as a linear combination of a set of known bases. Traditionally, least squares minimization is used for the estimation of model parameters of the system. Motivated by the sparse nature of the excitation signal for voiced sounds, we explore...

chapter

Guslar: A framework for automated singing voice correction

Elias Azarov, Maxim Vashkevich, Alexander Petrovsky

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7919 - 7923

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The paper presents a solution for singing voice processing that is used in a karaoke application with automated voice correction¹. The intended purpose of the application is to automatically improve user's performance towards performance of a professional singer by implementation of voice effects such as pitch correction, artificial polyphony, time stretching and other. The proposed framework incorporates...

chapter

Pitch modifications of speech based on an adaptive Harmonic Model

George P. Kafentzis, Gilles Degottex, Olivier Rosec, Yannis Stylianou

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7924 - 7928

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, a simple method for pitch-scale modifications of speech based on a recently suggested model for AM-FM decomposition of speech signals, is presented. This model is referred to as the adaptive Harmonic Model (aHM). The aHM models speech as a sum of harmonically related sinusoids that can adapt to the local characteristics of the signal. It was shown that this model provides high quality...

Filter options

Keywords:
SPEECH ANALYSIS

Publication date

Set your own date range

Keywords

1-NORM MINIMIZATION (1)
ADAPTIVE HARMONIC MODEL (1)
ADAPTIVE QUASI-HARMONIC MODEL (1)
BIPOLAR DISORDER (1)
COMPUTATIONAL PARALINGUISTICS (1)
CONTINUOUS STATE HIDDEN MARKOV MODEL (1)
DYNAMIC FEATURES (1)
EMOTION DETECTION (1)
EXTENDED ADAPTIVE QUASI-HARMONIC MODEL (1)
FILTER-BANK OPTIMIZATION (1)
LINEAR PREDICTION (1)
MOOD MODELING (1)
NON-STATIONARY SIGNALS (1)
PITCH MODIFICATION (1)
PROBABILISTIC MODEL (1)
SINGING VOICE PROCESSING (1)
SINUSOIDAL MODELLING (1)
SPARSE REPRESENTATION (1)
SPEECH MODELLING (1)
SPEECH SYNTHESIS (1)
TIME-VARYING SYSTEMS (1)
WHISPER SPEECH RECOGNITION (1)
more

INFONA - science communication portal

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)