Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System

Masato Akagi; Xiao Han; Reda Elbarougy; Yasuhiro Hamada; Junfeng Li

doi:10.1109/IIH-MSP.2014.148

Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System

Akagi, Masato, Han, Xiao, Elbarougy, Reda, Hamada, Yasuhiro, Li, Junfeng

Source

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 574 - 577

Abstract

Speech-to-speech translation (S2ST) is the process by which a spoken utterance in one language is used to produce a spoken output in another language. The conventional approach to S2ST has focused on processing linguistic information only by directly translating the spoken utterance from the source language to the target language without taking into account paralinguistic and non-linguistic information such as the emotional states at play in the source language. In this work, we explore how to deal with Para-and non-linguistic information among multiple languages, with a particular focus on speakers' emotional states, in S2ST scenarios called "affective S2ST." In our efforts to construct an effective system, we discuss (1) how to describe emotions in speech and how to model the perception/production of emotions and (2) the commonality and differences among multiple languages in the proposed model. We then use these discussions as context for (3) an examination of our "affective S2ST" system in operation.

Identifiers

book e-ISBN :	978-1-4799-5390-5 , 978-1-4799-5389-9
DOI	10.1109/IIH-MSP.2014.148

Authors

Keywords

Speech Speech recognition Databases Acoustics Emotion recognition Production Semantics multiple languages Speech-to-speech translation (S2ST) system paralinguistic and non-linguistic information emotion recognition/synthesis

Additional information

Data set: ieee

Publisher

IEEE

chapter

Read online
Download
Add to read later
Add to collection
Add to followed
Share

Export to bibliography


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Akagi, Masato

Han, Xiao

Elbarougy, Reda

Hamada, Yasuhiro

Keywords

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System