Search results for: S. Sakai

Items from 1 to 6 out of 6 results

chapter

Toward translating Indonesian spoken utterances to/from other languages

S. Sakti, M. Paul, R. Maia, S. Sakai, et al.

2009 Oriental COCOSDA International Conference on Speech Database and Assessments, pp. 137-142

This paper outlines the National Institute of Information and Communications Technology / Advanced Telecommunications Research Institute International (NICT/ATR) research activities in developing a spoken language translation system, specially for translating Indonesian spoken utterances into/from Japanese or English. Since the NICT/ATR Japanese-English speech translation system is an established...

chapter

Optimal learning of P-Layer additive F0 models with cross-validation

S. Sakai, T. Kawahara, T. Shimizu, S. Nakamura

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4245-4248

In this paper, we present the derivation of the backfitting training algorithms for generic p-layer additive F₀ models for arbitrary positive integer p. We have presented the special cases of the algorithms with p = 2 and p = 3 that have been successfully applied to the modelings of Japanese and English F₀ contours, whereas the derivation of the algorithm was presented only for the two-layer case...

chapter

CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories

Jinfu Ni, S. Sakai, T. Shimizu, S. Nakamura

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4253-4256

We propose an approach to modeling Chinese tonal patterns, focusing on the basic fundamental frequency (F₀) patterns characterized by the contextual linguistic features that can be directly extracted from text. We analyze tonal patterns as sparse target points (tonal F₀ peaks and valleys) and represent them in parametric form within the framework of a functional F₀ model. The relationships between...

chapter

Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model

J. Ni, S. Sakai, T. Shimizu, S. Nakamura

2008 Second International Symposium on Universal Communication, pp. 397-404

Chinese is a tonal language. It has both lexical tones and intonation. The fundamental frequency (F₀) contours thereby consist of tone and intonation components. This paper presents an approach to modeling the two components in separate ways and combining them to form the final F₀ contours based on a functional F₀ model. We analyze tonal patterns as sparse target points (tonal F₀ peaks and valleys)...

chapter

Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTS Conversion Systems

K. Oura, Y. Nankaku, T. Toda, K. Tokuda, et al.

2008 6th International Symposium on Chinese Spoken Language Processing, pp. 1-4

A new integrated model for simultaneous modeling of linguistic and acoustic models, and a training algorithm is proposed. Usually, text-to-speech (TTS) systems based on the hidden Markov model (HMM) consist of text analysis and speech synthesis modules. Linguistic and acoustic model training are performed independently using different training data sets. Integrated model parameters were simultaneously...

chapter

Content-based music retrieval with nonlinear feature space transformation using relevance feedback

S. Sakai, K. Kameyama

2008 IEEE International Conference on Systems, Man and Cybernetics (SMC 2008), pp. 1379-1384

In recent years, studies of similar music retrieval have been conducted actively. However, because the similarity of music is based on subjective measures, the systems need to be adaptive to user preference. In this paper, we propose an effective method for adaptive similar music retrieval reflecting the user preference by nonlinear feature space transformation based on relevance feedback. The user's...

Filter options

Keywords:
TRAINING

Keywords

SPEECH (5)
SPEECH SYNTHESIS (5)
FEATURE EXTRACTION (4)
HIDDEN MARKOV MODELS (4)
DATA MINING (3)
ACOUSTICS (2)
CART-BASED MODELING (2)
CONTEXTUAL LINGUISTIC FEATURES (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
PROSODY MODELING (2)
TRAINING DATA (2)
ACOUSTIC MODEL TRAINING (1)
ADDITIVE MODELS (1)
ADDITIVES (1)
ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (1)
ARTIFICIAL NEURAL NETWORKS (1)
BACKFITTING TRAINING ALGORITHMS (1)
BOSTON UNIVERSITY RADIO NEWS CORPUS (1)
CART (1)
CHINESE (1)
CHINESE INTONATION (1)
CHINESE TONAL PATTERNS (1)
CLASSIFICATION AND REGRESSION TREES (1)
COARSE DIVISION (1)
COMPUTATIONAL MODELING (1)
CONTENT-BASED MUSIC RETRIEVAL (1)
CONTENT-BASED RETRIEVAL (1)
CORPUS-BASED SPEECH (1)
CORRELATION (1)
CURVE FITTING (1)
DATA MODELS (1)
DATA SET TRAINING (1)
DATABASES (1)
DISTANCE MEASUREMENT (1)
F0 MODEL (1)
FAULT CURRENTS (1)
FITTED CURVES SMOOTHNESS (1)
FUNCTIONAL F₀ MODEL (1)
FUNCTIONAL MODEL TRACING (1)
FUNDAMENTAL FREQUENCY (1)
FUNDAMENTAL FREQUENCY CONTOURS (1)
FUNDAMENTAL FREQUENCY TRAJECTORIES (1)
HIDDEN MARKOV MODEL (1)
INDONESIAN SPEECH SYNTHESIZER (1)
INDONESIAN SPOKEN LANGUAGE TECHNOLOGY (1)
INDONESIAN-ENGLISH MACHINE TRANSLATORS (1)
INDONESIAN-JAPANESE MACHINE TRANSLATOR (1)
INTEGRATED MODEL PARAMETER (1)
INTONATION (1)
INTONATION MODELING (1)
JAPANESE-ENGLISH SPEECH TRANSLATION SYSTEM (1)
LANGUAGE TRANSLATION (1)
LEXICAL TONE CONTEXT (1)
LINGUISTIC MODEL TRAINING (1)
LINGUISTICS (1)
MACHINE LEARNING (1)
MODULATION (1)
MULTILINGUAL SPEECH TRANSLATION SYSTEM (1)
MULTIPLE SIGNAL CLASSIFICATION (1)
MUSIC (1)
MUSIC RETRIEVAL (1)
NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (1)
NATURAL LANGUAGE PROCESSING (1)
NEURAL NETWORK (1)
NONLINEAR FEATURE SPACE TRANSFORMATION (1)
NONLINEAR TRANSFORMATION (1)
PHRASING MODEL TRAINING (1)
PREDICTIVE MODELS (1)
PROSODIC MODEL TRAINING (1)
REGRESSION ANALYSIS (1)
REGRESSION TREE ANALYSIS (1)
RELEVANCE FEEDBACK (1)
SMOOTHING METHODS (1)
SPEAKER RECOGNITION (1)
SPEAKER-INDEPENDENT MODEL (1)
SPEECH PROCESSING (1)
SPEECH RECOGNITION (1)
SPEECH SYNTHESIS MODULES (1)
SPOKEN LANGUAGE TRANSLATION SYSTEM (1)
STATISTICAL LEARNING (1)
TEXT-TO-SPEECH SYSTEMS (1)
TONE (1)
TRANSFORMS (1)
TREES (MATHEMATICS) (1)
VOCABULARY (1)
VOCABULARY CONTINUOUS SPEECH RECOGNIZER (1)
WORD SEQUENCE (1)

INFONA - science communication portal
