Search results for: K. Komatani

Items from 1 to 20 out of 48 results

chapter

Robot musical accompaniment: integrating audio and visual cues for real-time synchronization with a human flutist

Angelica Lim, T Mizumoto, L Cahier, T Otsuka, more

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 1964 - 1969

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

Musicians often have the following problem: they have a music score that requires 2 or more players, but they have no one with whom to practice. So far, score-playing music robots exist, but they lack adaptive abilities to synchronize with fellow players' tempo variations. In other words, if the human speeds up their play, the robot should also increase its speed. However, computer accompaniment systems...

chapter

Motion generation based on reliable predictability using self-organized object features

S Nishide, T Ogata, J Tani, T Takahashi, more

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 3453 - 3458

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

Predictability is an important factor for determining robot motions. This paper presents a model to generate robot motions based on reliable predictability evaluated through a dynamics learning model which self-organizes object features. The model is composed of a dynamics learning module, namely Recurrent Neural Network with Parametric Bias (RNNPB), and a hierarchical neural network as a feature...

chapter

Speedup and performance improvement of ICA-based robot audition by parallel and resampling-based block-wise processing

R Takeda, K Nakadai, T Takahashi, K Komatani, more

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 1949 - 1956

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

This paper describes a speedup and performance improvement of multi-channel semi-blind ICA (MCSB-ICA) with parallel and resampling-based block-wise processing. MCSB-ICA is an integrated method of sound source separation that accomplishes blind source separation, blind dereverberation, and echo cancellation. This method enables robots to separate user's speech signals from observed signals including...

chapter

Human-robot ensemble between robot thereminist and human percussionist using coupled oscillator model

T Mizumoto, T Otsuka, K Nakadai, T Takahashi, more

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 1957 - 1963

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

This paper presents a novel synchronizing method for a human-robot ensemble using coupled oscillators. We define an ensemble as a synchronized performance produced through interactions between independent players. To attain better synchronized performance, the robot should predict the human's behavior to reduce the difference between the human's and robot's onset timings. Existing studies in such...

chapter

Exploiting harmonic structures to improve separating simultaneous speech in under-determined conditions

Y Hirasawa, T Takahashi, K Komatani, T Ogata, more

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 450 - 457

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

In real-world situations, a robot may often encounter “under-determined” situation, where there are more sound sources than microphones. This paper presents a speech separation method using a new constraint on the harmonic structure for a simultaneous speech-recognition system in under-determined conditions. The requirements for a speech separation method in a simultaneous speech-recognition system...

chapter

An improvement in automatic speech recognition using soft missing feature masks for robot audition

T Takahashi, K Nakadai, K Komatani, T Ogata, more

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 964 - 969

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

We describe integration of preprocessing and automatic speech recognition based on Missing-Feature-Theory (MFT) to recognize a highly interfered speech signal, such as the signal in a narrow angle between a desired and interfered speakers. As a speech signal separated from a mixture of speech signals includes the leakage from other speech signals, recognition performance of the separated speech degrades...

chapter

Voice quality manipulation for humanoid robots consistent with their head movements

T. Otsuka, K. Nakadai, T. Takahashi, K. Komatani, more

2009 9th IEEE-RAS International Conference on Humanoid Robots > 405 - 410

2009 9th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2009)

This paper presents voice-quality control of humanoid robots based on a new model of spectral envelope modification corresponding to the vertical head motions, and left-right sound-pressure modulation corresponding to the horizontal head motions. We assume that a pitch-axis rotation, or a vertical head motion, and a yaw-axis rotation, or a horizontal head motion, affect the voice quality independently...

chapter

Automatic estimation of reverberation time with robot speech to improve ICA-based robot audition

R. Takeda, K. Nakadai, T. Takahashi, K. Komatani, more

2009 9th IEEE-RAS International Conference on Humanoid Robots > 250 - 255

2009 9th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2009)

This paper presents an ICA-based robot audition system which estimates the reverberation time of the environment automatically by using the robot's own speech. The system is based on multi-channel semi-blind independent component analysis (MCSB-ICA), a source separation method using a microphone array that can separate user and robot speech under reverberant environments. Perception of the reverberation...

chapter

Phoneme acquisition model based on vowel imitation using Recurrent Neural Network

H. Kanda, T. Ogata, T. Takahashi, K. Komatani, more

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems > 5388 - 5393

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2009)

A phoneme-acquisition system was developed using a computational model that explains the developmental process of human infants in the early period of acquiring language. There are two important findings in constructing an infant's acquisition of phonemes: (1) an infant's vowel like cooing tends to invoke utterances that are imitated by its caregiver, and (2) maternal imitation effectively reinforces...

chapter

Incremental polyphonic audio to score alignment using beat tracking for singer robots

T. Otsuka, T. Takahashi, H.G. Okuno, K. Komatani, more

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems > 2289 - 2296

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2009)

We aim at developing a singer robot capable of listening to music with its own ??ears?? and interacting with a human's musical performance. Such a singer robot requires at least three functions: listening to the music, understanding what position in the music is being performed, and generating a singing voice. In this paper, we focus on the second function, that is, the capability to align an audio...

chapter

Step-size parameter adaptation of multi-channel semi-blind ICA with piecewise linear model for barge-in-able robot audition

R. Takeda, K. Nakadai, T. Takahashi, K. Komatani, more

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems > 2277 - 2282

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2009)

This paper describes a step-size parameter adaptation technique of multi-channel semi-blind independent component analysis (MCSB-ICA) for a ??barge-in-able?? robot audition system. By ??barge-in??, we mean that the user can speak simultaneously when the robot is speaking.We focused on MCSB-ICA to achieve such an audition system because it can separate a user's and a robot's speech under reverberant...

chapter

Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model

T. Takahashi, K. Nakadai, K. Komatani, T. Ogata, more

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems > 2730 - 2735

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2009)

A humanoid robot must recognize a target speech signal while people around the robot chat with them in real-world. To recognize the target speech signal, robot has to separate the target speech signal among other speech signals and recognize the separated speech signal. As separated signal includes distortion, automatic speech recognition (ASR) performance degrades. To avoid the degradation, we trained...

chapter

Development of a Meeting Browser towards Supporting Public Involvement

S. Shiramatsu, T. Ozono, T. Shintani, K. Komatani, more

2009 International Conference on Computational Science and Engineering > 4 > 717 - 722

2009 International Conference on Computational Science and Engineering (CSE)

This paper presents novel methods for support for browsing a long meeting record towards supporting public involvement. Facilitating public involvement in the consensus building process for community development needs a lot of effort and time for sharing context and concerns among citizens and stakeholders. A record of public meeting often becomes too long to overview and to understand for people...

chapter

Analysis of motion searching based on reliable predictability using recurrent neural network

S. Nishide, T. Ogata, J. Tani, K. Komatani, more

2009 IEEE/ASME International Conference on Advanced Intelligent Mechatronics > 192 - 197

2009 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM)

Reliable predictability is one of the main factors that determine human behaviors. The authors developed a model that searches and generates robot motions based on reliable predictability. Training of the model consists of three phases. In the first phase, the model trains a sequential learner, namely recurrent neural network with parametric bias, to self-organize robot and object dynamics. In the...

chapter

Continuous vocal imitation with self-organized vowel spaces in Recurrent Neural Network

H. Kanda, T. Ogata, T. Takahashi, K. Komatani, more

2009 IEEE International Conference on Robotics and Automation > 4438 - 4443

2009 IEEE International Conference on Robotics and Automation (ICRA)

A continuous vocal imitation system was developed using a computational model that explains the process of phoneme acquisition by infants. Human infants perceive speech sounds not as discrete phoneme sequences but as continuous acoustic signals. One of critical problems in phoneme acquisition is the design for segmenting these continuous speech sounds. The key idea to solve this problem is that articulatory...

chapter

Prediction and imitation of other's motions by reusing own forward-inverse model in robots

T. Ogata, R. Yokoya, J. Tani, K. Komatani, more

2009 IEEE International Conference on Robotics and Automation > 4144 - 4149

2009 IEEE International Conference on Robotics and Automation (ICRA)

This paper proposes a model that enables a robot to predict and imitate the motions of another by reusing its body forward-inverse model. Our model includes three approaches: (i) projection of a self-forward model for predicting phenomena in the external environment (other individuals), (ii) ldquotriadic relationrdquo that is mediation by a physical object between self and others, (iii) introduction...

chapter

ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition

R. Takeda, K. Nakadai, T. Takahashi, K. Komatani, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 3677 - 3680

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper describes a new method that allows ldquoBarge-Inrdquo in various environments for robot audition. ldquoBarge-inrdquo means that a user begins to speak simultaneously while a robot is speaking. To achieve the function, we must deal with problems on blind dereverberation and echo cancellation at the same time. We adopt Independent Component Analysis (ICA) because it essentially provides a...

chapter

3D Auditory Scene Visualizer with Face Tracking: Design and Implementation for Auditory Awareness Compensation

Y. Kubota, S. Shiramatsu, M. Yoshida, K. Komatani, more

2008 Second International Symposium on Universal Communication > 42 - 49

2008 Second International Symposium on Universal Communication

This paper presents the design and implementation of 3D Auditory Scene Visualizer based on the visual information seeking mantra, ``overview first, zoom and filter, then details on demand''. The machine audition system called HARK captures 3D sounds with a microphone array.The natural language processing called SalienceGraph visualizes topic transition by using discourse salience. The 3D visualizer...

chapter

Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking

Y. Kubota, M. Yoshida, K. Komatani, T. Ogata, more

2008 Tenth IEEE International Symposium on Multimedia > 468 - 476

2008 Tenth IEEE International Symposium on Multimedia

If machine audition can recognize an auditory scene containing simultaneous and moving talkers, what kinds of awareness will people gain from an auditory scene visualizer? This paper presents the design and implementation of 3D Auditory Scene Visualizer based on the visual information seeking mantra, i.e., ldquooverview first, zoom and filter, then details on demandrdquo. The machine audition system...

chapter

Target speech detection and separation for humanoid robots in sparse dialogue with noisy home environments

Hyun-Don Kim, Jinsung Kim, K. Komatani, T. Ogata, more

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems > 1705 - 1711

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems

In normal human communication, people face the speaker when listening and usually pay attention to the speakerpsila face. Therefore, in robot audition, the recognition of the front talker is critical for smooth interactions. This paper presents an enhanced speech detection method for a humanoid robot that can separate and recognize speech signals originating from the front even in noisy home environments...

Publication date

Set your own date range

INFONA - science communication portal

Search results for: K. Komatani

Robot musical accompaniment: integrating audio and visual cues for real-time synchronization with a human flutist

Motion generation based on reliable predictability using self-organized object features

Speedup and performance improvement of ICA-based robot audition by parallel and resampling-based block-wise processing

Human-robot ensemble between robot thereminist and human percussionist using coupled oscillator model

Exploiting harmonic structures to improve separating simultaneous speech in under-determined conditions

An improvement in automatic speech recognition using soft missing feature masks for robot audition

Voice quality manipulation for humanoid robots consistent with their head movements

Automatic estimation of reverberation time with robot speech to improve ICA-based robot audition

Phoneme acquisition model based on vowel imitation using Recurrent Neural Network

Incremental polyphonic audio to score alignment using beat tracking for singer robots

Step-size parameter adaptation of multi-channel semi-blind ICA with piecewise linear model for barge-in-able robot audition

Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model

Development of a Meeting Browser towards Supporting Public Involvement

Analysis of motion searching based on reliable predictability using recurrent neural network

Continuous vocal imitation with self-organized vowel spaces in Recurrent Neural Network

Prediction and imitation of other's motions by reusing own forward-inverse model in robots

ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition

3D Auditory Scene Visualizer with Face Tracking: Design and Implementation for Auditory Awareness Compensation

Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking

Target speech detection and separation for humanoid robots in sparse dialogue with noisy home environments

Filter options

Publication date

Content availability

Publication type

Keywords

Data set

Journal

INFONA - science communication portal

Search results for: K. Komatani

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Data set

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options