Search results for: S. Nakamura

Items from 1 to 6 out of 6 results

chapter

Active learning of confidence measure function in robot language acquisition framework

K Sugiura, N Iwahashi, H Kashioka, S Nakamura

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 1774 - 1779

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

In an object manipulation dialogue, a robot may misunderstand an ambiguous command from a user, such as “Place the cup down (on the table),” potentially resulting in an accident. Although making confirmation questions before all motion will decrease the risk of this failure, the user will find it more convenient if confirmation questions are not made under trivial situations. This paper proposes a...

chapter

Temporal Modulation Normalization for Robust Speech Feature Extraction and Recognition

Xugang Lu, S. Matsuda, M. Unoki, S. Nakamura

2009 2nd International Congress on Image and Signal Processing > 1 - 4

2009 2nd International Congress on Image and Signal Processing (CISP)

Traditional noise reduction methods usually are based on the assumption that the short-term statistical distributions of speech and noise are different. Differently from that assumption, we have proposed a noise reduction method based on the assumption that the temporal modulations of noise and speech are different. Two steps are used in the proposed algorithm: one is the temporal modulation contrast...

chapter

Toward translating Indonesian spoken utterances to/from other languages

S. Sakti, M. Paul, R. Maia, S. Sakai, more

2009 Oriental COCOSDA International Conference on Speech Database and Assessments > 137 - 142

2009 Oriental COCOSDA International Conference on Speech Database and Assessments

This paper outlines the National Institute of Information and Communications Technology / Advanced Telecommunications Research Institute International (NICT/ATR) research activities in developing a spoken language translation system, specially for translating Indonesian spoken utterances into/from Japanese or English. Since the NICT/ATR Japanese-English speech translation system is an established...

chapter

CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories

Jinfu Ni, S. Sakai, T. Shimizu, S. Nakamura

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4253 - 4256

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

We propose an approach to modeling Chinese tonal patterns, focusing on the basic fundamental frequency (F₀) patterns characterized by the contextual linguistic features that can be directly extracted from text. We analyze tonal patterns as sparse target points (tonal F₀ peaks and valleys) and represent them in parametric form within the framework of a functional F₀ model. The relationships between...

chapter

Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model

J. Ni, S. Sakai, T. Shimizu, S. Nakamura

2008 Second International Symposium on Universal Communication > 397 - 404

2008 Second International Symposium on Universal Communication

Chinese is a tonal language. It has both lexical tones and intonation. The fundamental frequency (F₀) contours thereby consist of tone and intonation components. This paper presents an approach to modeling the two components in separate ways and combining them to form the final F₀ contours based on a functional F₀ model. We analyze tonal patterns as sparse target points (tonal F₀ peaks and valleys)...

chapter

Normalization on Temporal Modulation Transfer Function for Robust Speech Recognition

X. Lu, S. Matsuda, T. Shimizu, S. Nakamura

2008 Second International Symposium on Universal Communication > 16 - 23

2008 Second International Symposium on Universal Communication

In this paper, we proposed a robust speech feature extraction algorithm for automatic speech recognition which reduced the noise effect in the temporal modulation domain. The proposed algorithm has two steps to deal with the time series of cepstral coefficients. The first step adopted a modulation contrast normalization to normalize the temporal modulation contrast of both clean and noisy speech to...

Filter options

Keywords:
FEATURE EXTRACTION
SPEECH

Publication date

Set your own date range

Keywords

TRAINING (4)
HIDDEN MARKOV MODELS (3)
MODULATION (3)
SPEECH RECOGNITION (3)
SPEECH SYNTHESIS (3)
CART-BASED MODELING (2)
CEPSTRAL ANALYSIS (2)
CONTEXTUAL LINGUISTIC FEATURES (2)
DATA MINING (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
NOISE (2)
NOISE MEASUREMENT (2)
PROSODY MODELING (2)
SMOOTHING METHODS (2)
SPEECH PROCESSING (2)
ACOUSTICS (1)
ACTIVE LEARNING (1)
ADDITIVE NOISE (1)
ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (1)
AFE (1)
AUTOMATIC SPEECH RECOGNITION (1)
BAYESIAN LOGISTIC REGRESSION (1)
BELIEF NETWORKS (1)
CART (1)
CEPSTRAL COEFFICIENT (1)
CHINESE (1)
CHINESE INTONATION (1)
CHINESE TONAL PATTERNS (1)
CLASSIFICATION AND REGRESSION TREES (1)
CONTEXT (1)
CORPUS-BASED SPEECH (1)
CORRELATION (1)
EDGE-PRESERVED SMOOTHING (1)
ETSI ADVANCED FRONT-END METHOD (1)
F0 MODEL (1)
FAULT CURRENTS (1)
FUNCTIONAL F<INF>0</INF> MODEL (1)
FUNCTIONAL MODEL TRACING (1)
FUNDAMENTAL FREQUENCY CONTOURS (1)
FUNDAMENTAL FREQUENCY TRAJECTORIES (1)
HUMAN ROBOT SPOKEN DIALOGUE (1)
INDEPENDENT FRONT-END PROCESSOR (1)
INDONESIAN SPEECH SYNTHESIZER (1)
INDONESIAN SPOKEN LANGUAGE TECHNOLOGY (1)
INDONESIAN-ENGLISH MACHINE TRANSLATORS (1)
INDONESIAN-JAPANESE MACHINE TRANSLATOR (1)
INTERACTIVE PROGRAMMING (1)
INTONATION (1)
JAPANESE-ENGLISH SPEECH TRANSLATION SYSTEM (1)
LANGUAGE TRANSLATION (1)
LEAST MEAN SQUARES METHODS (1)
LEXICAL TONE CONTEXT (1)
LINGUISTICS (1)
MACHINE LEARNING (1)
MANIPULATORS (1)
MINIMUM MEAN SQUARE ERROR ESTIMATION (1)
MMSE (1)
MODULATION EVENT PRESERVED SMOOTHING (1)
MULTILINGUAL SPEECH TRANSLATION SYSTEM (1)
NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (1)
NOISE REDUCTION (1)
OBJECT MANIPULATION DIALOGUE (1)
OPTICAL TRANSFER FUNCTION (1)
PARTICLE FILTERING (1)
PARTICLE FILTERING (NUMERICAL METHODS) (1)
PREDICTIVE MODELS (1)
REGRESSION ANALYSIS (1)
REGRESSION TREE ANALYSIS (1)
ROBOT LANGUAGE ACQUISITION (1)
ROBOTS (1)
ROBUST SPEECH FEATURE EXTRACTION (1)
ROBUST SPEECH FEATURE EXTRACTION ALGORITHM (1)
ROBUST SPEECH RECOGNITION (1)
SHORT-TERM STATISTICAL DISTRIBUTION (1)
SIGNAL DENOISING (1)
SPEAKER RECOGNITION (1)
SPEAKER-INDEPENDENT MODEL (1)
SPEECH INTELLIGIBILITY (1)
SPOKEN LANGUAGE TRANSLATION SYSTEM (1)
STATISTICAL DISTRIBUTIONS (1)
TEMPORAL MODULATION CONTRAST NORMALIZATION (1)
TEMPORAL MODULATION TRANSFER FUNCTION NORMALIZATION (1)
TIME SERIES (1)
TONE (1)
TRAJECTORY (1)
TREES (MATHEMATICS) (1)
VISUALIZATION (1)
VOCABULARY (1)
VOCABULARY CONTINUOUS SPEECH RECOGNIZER (1)
more

INFONA - science communication portal

Search results for: S. Nakamura

Active learning of confidence measure function in robot language acquisition framework

Temporal Modulation Normalization for Robust Speech Feature Extraction and Recognition

Toward translating Indonesian spoken utterances to/from other languages

CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories

Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model

Normalization on Temporal Modulation Transfer Function for Robust Speech Recognition

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options