Search results for: Dongmei Jiang

Items from 1 to 7 out of 7 results

chapter

Multimodal depression recognition with dynamic visual and audio cues

Lang He, Dongmei Jiang, Hichem Sahli

2015 International Conference on Affective Computing and Intelligent Interaction (ACII) > 260 - 266

2015 International Conference on Affective Computing and Intelligent Interaction (ACII)

In this paper, we present our system design for audio visual multi-modal depression recognition. To improve the estimation accuracy of the Beck Depression Inventory (BDI) score, besides the Low Level Descriptors (LLD) features and the Local Gabor Binary Pattern-Three Orthogonal Planes (LGBP-TOP) features provided by the 2014 Audio/Visual Emotion Challenge and Workshop (AVEC2014), we extract extra...

chapter

Dimensional emotion driven facial expression synthesis based on the multi-stream DBN model

Hao Wu, Dongmei Jiang, Yong Zhao, Hichem Sahli

Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference > 1 - 6

2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

This paper proposes a dynamic Bayesian network (DBN) based MPEG-4 compliant 3D facial animation synthesis method driven by the (Evaluation, Activation) values in the continuous emotion space. For each emotion, a state synchronous DBN model (SS_DBN) is firstly trained using the Cohn-Kanade (CK) database with two streams of inputs: (i) the annotated (Evaluation, Activation) values, and (ii) the extracted...

chapter

Realistic mouth animation based on an articulatory DBN model with constrained asynchrony

Dongmei Jiang, I Ravyse, Peizhen Liu, Hichem Sahli, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 2478 - 2481

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper, we propose an approach to convert acoustic speech to video realistic mouth animation based on an articulatory dynamic Bayesian network model with constrained asynchrony (AF_AVDBN). Conditional probability distributions are defined to control the asynchronies between the articulators such as lips, tongue and glottis/velum. An EM-based conversion algorithm is also presented to learn the...

chapter

A Visual Silence Detector Constraining Speech Source Separation

I. Gonzalez, I. Ravyse, H. Brouckxon, W. Verhelst, more

2009 Fifth International Conference on Image and Graphics > 463 - 470

Fifth International Conference on Image and Graphics (ICIG 2009)

We propose an audiovisual source separation algorithm for speech signals. In our proposed algorithm we first extract the time segments with low activity of the mouth region from synchronous video recordings. An automatically selected optimal classifier is used to detect silent intervals in these instants of low visual mouth activity. Then, the source separation problem is formulated and solved for...

chapter

Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony

Danqi Chen, Dongmei Jiang, I. Ravyse, H. Sahli

2009 Fifth International Conference on Image and Graphics > 912 - 916

Fifth International Conference on Image and Graphics (ICIG 2009)

This paper presents an audio visual multi-stream DBN model (Asy_DBN) for emotion recognition with constraint asynchrony, in which audio state and visual state transit individually in their corresponding stream but the transition is constrained by the allowed maximum audio visual asynchrony. Emotion recognition experiments of Asy_DBN with different asynchrony constraints are carried out on an audio...

chapter

Video Realistic Mouth Animation Based on an Audio Visual DBN Model with Articulatory Features and Constrained Asynchrony

Dongmei Jiang, Peizhen Liu, I. Ravyse, H. Sahli, more

2009 Fifth International Conference on Image and Graphics > 658 - 662

Fifth International Conference on Image and Graphics (ICIG 2009)

This paper presents a mouth animation construction method based on the DBN models with articulatory features (AF_AVDBN), in which the articulatory features of lips, tongue, glottis/velum can be asynchronous within a maximum asynchrony constraint to describe the speech production process more reasonably. Given an audio input and the trained AF_AVDBN models, the optimal visual feature learning algorithm...

chapter

Accurate visual speech synthesis based on diviseme unit selection and concatenation

Dongmei Jiang, I. Ravyse, H. Sahli, Yanning Zhang

2008 IEEE 10th Workshop on Multimedia Signal Processing > 906 - 909

2008 IEEE 10th Workshop on Multimedia Signal Processing (MMSP)

This paper presents a novel speech driven accurate realistic visual speech synthesis approach. Firstly, an audio visual instance database is built for different viseme context combinations, i.e. diviseme units, using 100 audio visual speech sentences of a female speaker. Then a diviseme instance selection algorithm is introduced to choose the optimal diviseme instances for the viseme contexts in the...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Keywords

VISUALIZATION (6)
FEATURE EXTRACTION (4)
HIDDEN MARKOV MODELS (4)
MOUTH (4)
IMAGE SEQUENCES (3)
SPEECH PROCESSING (3)
AF AVDBN (2)
ANIMATION (2)
ASYNCHRONY (2)
AUDIO-VISUAL SYSTEMS (2)
COMPUTER ANIMATION (2)
CONSTRAINED ASYNCHRONY (2)
FACIAL ANIMATION (2)
MAXIMUM LIKELIHOOD ESTIMATION (2)
MOUTH ANIMATION (2)
SPEECH RECOGNITION (2)
SPEECH SYNTHESIS (2)
VIDEO REALISTIC MOUTH ANIMATION (2)
ACOUSTIC PRONUNCIATION PROCESS (1)
ACOUSTIC SPEECH (1)
ACOUSTICS (1)
AF DBN (1)
ARTICULATORY DBN MODEL (1)
ARTICULATORY DYNAMIC BAYESIAN NETWORK MODEL (1)
ARTICULATORY FEATURES (1)
ASYNCHRONOUS DBN MODEL (1)
AUDIO VISUAL ASYNCHRONY (1)
AUDIO VISUAL DBN MODEL (1)
AUDIO VISUAL INSTANCE DATABASE (1)
AUDIO VISUAL MULTI-STREAM (1)
AUDIO-VISUAL EMOTION RECOGNITION (1)
AUDIOVISUAL SOURCE SEPARATION (1)
AUDITORY INPUT (1)
BELIEF NETWORKS (1)
CONDITIONAL PROBABILITY DISTRIBUTION (1)
CONSTRAINT HANDLING (1)
DBN MODEL (1)
DEPRESSION RECOGNITION (1)
DISCRETE COSINE TRANSFORMS (1)
DIVISEME INSTANCE SELECTION ALGORITHM (1)
DOWNSAMPLED YUV SPATIAL FREQUENCY (1)
EM-BASED CONVERSION ALGORITHM (1)
EMOTION RECOGNITION (1)
FACE (1)
FACE RECOGNITION (1)
FREQUENCY DOMAIN ANALYSIS (1)
GLOBAL FEATURES (1)
HEAD (1)
HEARING (1)
HISTOGRAMS (1)
HISTORY (1)
HMM (1)
IMAGE MATCHING (1)
IMAGE MOTION ANALYSIS (1)
IMAGE SEGMENTATION (1)
INTERPOLATED MOUTH IMAGE SEQUENCES (1)
INTERPOLATION (1)
LOW VISUAL MOUTH ACTIVITY (1)
MAXIMUM LIKELIHOOD ESTIMATION CRITERION (1)
MICROPHONES (1)
MOUTH ANIMATIONS (1)
MOUTH IMAGES (1)
MOUTH MOVEMENTS MATCHING (1)
MULTI-MODAL FUSION (1)
MULTISTREAM HIDDEN MARKOV MODEL (1)
OPTICAL IMAGING (1)
OPTIMAL CLASSIFIER (1)
OPTIMAL VISUAL FEATURES (1)
PROBABILITY DISTRIBUTIONS (1)
SOURCE SEPARATION (1)
SPATIAL UPSAMPLING (1)
SPATIO-TEMPORAL CHANGES (1)
SPEAKERS (1)
SPEECH CORPORA (1)
SPEECH SIGNALS (1)
SPEECH SOURCE SEPARATION (1)
STATISTICAL DISTRIBUTIONS (1)
SYNCHRONOUS VIDEO RECORDINGS (1)
TEMPORAL DOWNSAMPLING (1)
TRACKING (1)
TRAJECTORY (1)
TRANSFORM CODING (1)
TV (1)
VIDEO RECORDING (1)
VISUAL COMMUNICATION (1)
VISUAL FEATURE LEARNING ALGORITHM (1)
VISUAL SILENCE DETECTOR (1)
VISUAL SPEECH SYNTHESIS APPROACH (1)
more

INFONA - science communication portal

Search results for: Dongmei Jiang

Multimodal depression recognition with dynamic visual and audio cues

Dimensional emotion driven facial expression synthesis based on the multi-stream DBN model

Realistic mouth animation based on an articulatory DBN model with constrained asynchrony

A Visual Silence Detector Constraining Speech Source Separation

Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony

Video Realistic Mouth Animation Based on an Audio Visual DBN Model with Articulatory Features and Constrained Asynchrony

Accurate visual speech synthesis based on diviseme unit selection and concatenation

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options