Search results for: Daisuke Saito

Items from 1 to 4 out of 4 results

chapter

SAS: A speaker verification spoofing database containing diverse attacks

Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4440 - 4444

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents the first version of a speaker verification spoofing and anti-spoofing database, named SAS corpus. The corpus includes nine spoofing techniques, two of which are speech synthesis, and seven are voice conversion. We design two protocols, one for standard speaker verification evaluation, and the other for producing spoofing materials. Hence, they allow the speech synthesis community...

chapter

Voice conversion based on matrix variate Gaussian mixture model

Daisuke Saito, Hidenobu Doi, Nobuaki Minematsu, Keikichi Hirose

2014 12th International Conference on Signal Processing (ICSP) > 567 - 571

2014 12th International Conference on Signal Processing (ICSP 2014)

This paper describes a novel approach to construct a mapping function between a given speaker pair using probability density functions (PDF) of matrix variate. In voice conversion studies, two important functions should be realized: 1) precise modeling of both the source and target feature spaces, and 2) construction of a proper transform function between these spaces. Voice conversion based on Gaussian...

chapter

High accurate model-integration-based voice conversion using dynamic features and model structure optimization

Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4576 - 4579

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper combines a parameter generation algorithm and a model optimization approach with the model-integration-based voice conversion (MIVC). We have proposed probabilistic integration of a joint density model and a speaker model to mitigate a requirement of the parallel corpus in voice conversion (VC) based on Gaussian Mixture Model (GMM). As well as the other VC methods, MIVC also suffers from...

chapter

HMM-based sequence-to-frame mapping for voice conversion

Yu Qiao, Daisuke Saito, N Minematsu

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4830 - 4833

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Voice conversion can be reduced to a problem to find a transformation function between the corresponding speech sequences of two speakers. Perhaps the most voice conversions methods are GMM-based statistical mapping methods. However, the classical GMM-based mapping is frame-to-frame, and cannot take account of the contextual information existing over a speech sequence. It is well known that HMM yields...

Filter options

Keywords:
SPEECH
VOICE CONVERSION

Publication date

Set your own date range

Keywords

EQUATIONS (2)
GAUSSIAN MIXTURE MODEL (2)
JOINTS (2)
MATHEMATICAL MODEL (2)
SPEECH SYNTHESIS (2)
TRAINING (2)
VECTORS (2)
ADAPTATION MODELS (1)
CEPSTRUM (1)
COVARIANCE MATRICES (1)
DATA MODELS (1)
DATABASE (1)
DATABASES (1)
DYNAMIC FEATURES (1)
FRAME POSTERIOR PROBABILITY (1)
GAUSSIAN PROCESSES (1)
GMM-BASED STATISTICAL MAPPING METHODS (1)
HIDDEN MARKOV MODELS (1)
HMM (1)
HMM-BASED SEQUENCE-TO-FRAME MAPPING (1)
INFORMATION CRITERION (1)
LEAST SQUARES APPROXIMATION (1)
MATRIX VARIATE DISTRIBUTION (1)
MATRIX VARIATE GAUSSIAN MIXTURE MODEL (1)
MATRIX VARIATE NORMAL (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
OPTIMIZATION (1)
PROBABILISTIC INTEGRATION (1)
SECURITY (1)
SEQUENCE-TO-FRAME MAPPING (1)
SOFT MAPPING FUNCTION (1)
SPEAKER VERIFICATION (1)
SPEECH PROCESSING (1)
SPEECH RECOGNITION (1)
SPEECH SEQUENCE (1)
SPOOFING ATTACK (1)
STANDARDS (1)
STATISTICAL ANALYSIS (1)
SYNTHETIC APERTURE SONAR (1)
more

INFONA - science communication portal

Search results for: Daisuke Saito

SAS: A speaker verification spoofing database containing diverse attacks

Voice conversion based on matrix variate Gaussian mixture model

High accurate model-integration-based voice conversion using dynamic features and model structure optimization

HMM-based sequence-to-frame mapping for voice conversion

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options