Search results for: L. Girin

Items from 1 to 6 out of 6 results

article

Informed Source Separation of Linear Instantaneous Under-Determined Audio Mixtures by Source Index Embedding

M Parvaix, L Girin

IEEE Transactions on Audio, Speech, and Language Processing > 2011 > 19 > 6 > 1721 - 1733

In this paper, we address the issue of underdetermined source separation of I nonstationary audio sources from a J-channel linear instantaneous mixture (J < I). This problem is addressed with a specific coder-decoder configuration. At the coder, source signals are assumed to be available before the mixing is processed. A time-frequency (TF) joint analysis of each source signal and mixture signal...

chapter

A watermarking-based method for single-channel audio source separation

M. Parvaix, L. Girin, J.-M. Brossier

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 101 - 104

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we address the issue of audio source separation with a single channel, i.e. the estimation of source signals from a single mixture of these signals. This problem is addressed with a specific configuration: source signals are assumed to be available before the mix is processed. We propose an original method that uses a watermarking technique to embed information about the source signals...

chapter

Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies

K. Hermus, L. Girin, H. Van hamme, S. Irhimeh

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4473 - 4476

ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic...

chapter

Long-term flexible 2D cepstral modeling of speech spectral amplitudes

M. Firouzmand, L. Girin

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3937 - 3940

ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

This paper presents a method for modeling the envelope of spectral amplitude parameters of speech signals in "two dimensions" (2D). It consists of two cascaded modelings: the first one along the frequency axis is the usual cepstrum technique, which consists of modeling the log-scaled spectral envelope with a discrete cosine model (DCM). The second one, along the time axis, consists of modeling...

chapter

Using a Visual Voice Activity Detector to Regularize the Permutations in Blind Separation of Convolutive Speech Mixtures

B. Rivet, L. Girin, C. Serviere, Dinh-Tuan Pham, et al.

2007 15th International Conference on Digital Signal Processing > 223 - 226

Audio-visual speech source separation consists in mixing visual speech processing techniques (e.g. lip parameters tracking) with source separation methods to improve and/or simplify the extraction of a speech signal from a mixture of acoustic signals. In this paper, we present a new approach to this problem: visual information is used here as a voice activity detector (VAD). Results show that, in...

chapter

An Analysis of Visual Speech Information Applied to Voice Activity Detection

D. Sodoyer, B. Rivet, L. Girin, J.-L. Schwartz, et al.

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

We present a new approach to the voice activity detection (VAD) problem for speech signals embedded in non-stationary noise. The method is based on automatic lipreading: the objective is to detect voice activity or non-activity by exploiting the coherence between the speech acoustic signal and the speaker's lip movements. From a comprehensive analysis of lip shape parameters during speech and non-speech...

INFONA - science communication portal