Search results

chapter

Extraction of user preferences based on voice interaction

Takahiro Uchiya, Satoshi Otake, Ryota Nishimura, Daisuke Yamamoto, more

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 2

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

Our research group at Nagoya Institute of Technology is developing “MMDAgent” as a voice interaction toolkit. Using MMDAgent, system developers can create various speech dialogue contents. When developers create voice interaction contents, it is important to consider user needs. Therefore, an approach is necessary to elicit preference information of the user. In this paper, we propose a method to...

chapter

Automatic reference point assignment technique for voice morphing

Shota Takizawa, Shinichi Kawamoto

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 3

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

In this study, a method for determining the reference points in the time and frequency domains for voice morphing is proposed. Many studies have considered voice morphing out of which many methods require manual determination of the reference points. In this study, we automatically determine the reference points via modified restricted temporal decomposition and a line spectral frequency. The evaluation...

chapter

Multimodal spoken dialog system using state estimation by body motion

Takeru Koseki, Tetsuo Kosaka

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 4

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

Spoken dialog systems are presently used widely. However, some users avoid using them because of poor usability and unattractiveness. In this study, we develop a system that captures the user's movement and estimates the user state. This function is incorporated into the existing spoken dialog system to build a multimodal dialog system. In the experiments, the recognition performance of body motion...

chapter

Investigation of efficient semi-automatic correction method using STD for automatic captioning

Yuji Terada, Kenta Tamiya, Atsuhiko Kai

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 2

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

Captioning lecture speech is very useful for better understanding. However, it takes high cost to do real-time manual captioning or even if we employ automatic speech recognition system and human correction together. In this paper, we propose a method to reduce a cost for human correction as a prerequisite of a framework for captioning using automatic speech recognition system. Specifically, we investigate...

chapter

Sound event classification with feature vector combination for automatic audio-based surveillance

Seunghyung Lee, Jinuk Park, Sangjun Park, Minsoo Hahn

2016 IEEE International Conference on Consumer Electronics (ICCE) > 147 - 148

2016 IEEE International Conference on Consumer Electronics (ICCE)

This paper deals with the sound event classification for automatic audio-based surveillance. To improve the performance, we proposed a feature vector combination scheme to use multiple feature vectors simultaneously. Then, the performance is evaluated by using the combination of three segment-based features. The result shows significant amount of improvement compare to the conventional method.

chapter

Technologies, implementations and applications for the Kleistian development of thoughts in speech

Oksana Arnold, Andre Schulz

The 1st IEEE Global Conference on Consumer Electronics 2012 > 75 - 79

2012 IEEE 1st Global Conference on Consumer Electronics (GCCE)

Innovations in the fields of consumer electronics and media technologies are pervading the daily life and invading the private home. There emerge novel communication opportunities. Educational processes such as university studies, for illustration, are unconventionally expanded towards a casual home office. Web browsers and networking technologies provide access to serious content and direct manipulation...

INFONA - science communication portal

Search results

Extraction of user preferences based on voice interaction

Automatic reference point assignment technique for voice morphing

Multimodal spoken dialog system using state estimation by body motion

Investigation of efficient semi-automatic correction method using STD for automatic captioning

Sound event classification with feature vector combination for automatic audio-based surveillance

Technologies, implementations and applications for the Kleistian development of thoughts in speech

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Extraction of user preferences based on voice interaction

Automatic reference point assignment technique for voice morphing

Multimodal spoken dialog system using state estimation by body motion

Investigation of efficient semi-automatic correction method using STD for automatic captioning

Sound event classification with feature vector combination for automatic audio-based surveillance

Technologies, implementations and applications for the Kleistian development of thoughts in speech

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options