Lijuan Wang

chapter

Photo-real talking head with deep bidirectional LSTM

Bo Fan, Lijuan Wang, Frank K. Soong, Lei Xie

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4884 - 4888

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Long short-term memory (LSTM) is a specific recurrent neural network (RNN) architecture that is designed to model temporal sequences and their long-range dependencies more accurately than conventional RNNs. In this paper, we propose to use deep bidirectional LSTM (BLSTM) for audio/visual modeling in our photo-real talking head system. An audio/visual database of a subject's talking is firstly recorded...

chapter

Improved minimum converted trajectory error training for real-time speech-to-lips conversion

Wei Han, Lijuan Wang, Frank Soong, Bo Yuan

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4513 - 4516

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Gaussian mixture model (GMM) based speech-to-lips conversion often operates in two alternative ways: batch conversion and sliding window-based conversion for real-time processing. Previously, Minimum Converted Trajectory Error (MCTE) training has been proposed to improve the performance of batch conversion. In this paper, we extend previous work and propose a new training criteria, MCTE for Real-time...

chapter

High quality lips animation with speech and captured facial action unit as A/V input

Lijuan Wang, Frank K. Soong

Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference > 1 - 4

2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Rendering realistic lips movements in avatar with camera captured human's facial features is desirable in many applications, e.g. telepresence, video gaming, social networking, etc. We have proposed to use Gaussian Mixture Model (GMM) to generate lips trajectory and successfully tested in speech-to-lips conversion experiments, where only audio signal (speech) is used as input. In this paper real-time...

chapter

Synthesizing visual speech trajectory with minimum generation error

Lijuan Wang, Yi-Jian Wu, Xiaodan Zhuang, Frank K. Soong

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4580 - 4583

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a minimum generation error (MGE) training method to refine the audio-visual HMM to improve visual speech trajectory synthesis. Compared with the traditional maximum likelihood (ML) estimation, the proposed MGE training explicitly optimizes the quality of generated visual speech trajectory, where the audio-visual HMM modeling is jointly refined by using a heuristic method...

INFONA - science communication portal

Search results for: Lijuan Wang

Photo-real talking head with deep bidirectional LSTM

Improved minimum converted trajectory error training for real-time speech-to-lips conversion

High quality lips animation with speech and captured facial action unit as A/V input

Synthesizing visual speech trajectory with minimum generation error

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Lijuan Wang

Photo-real talking head with deep bidirectional LSTM

Improved minimum converted trajectory error training for real-time speech-to-lips conversion

High quality lips animation with speech and captured facial action unit as A/V input

Synthesizing visual speech trajectory with minimum generation error

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options