2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

chapter

AP16-OL7: A multilingual database for oriental languages and a language recognition baseline

Dong Wang, Lantian Li, Difei Tang, Qing Chen

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

We present the AP16-OL7 database which was released as the training and test data for the oriental language recognition (OLR) challenge on APSIPA 2016. Based on the database, a baseline system was constructed on the basis of the i-vector model. We report the baseline results evaluated in various metrics defined by the AP16-OLR evaluation plan and demonstrate that AP16-OL7 is a reasonable data resource...

chapter

Speech emotion classification using multiple kernel Gaussian process

Sih-Huei Chen, Jia-Ching Wang, Wen-Chi Hsieh, Yu-Hao Chin, et al.

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

Given the increasing attention paid to speech emotion classification in recent years, this work presents a novel speech emotion classification approach based on the multiple kernel Gaussian process. Two major aspects of a classification problem that play an important role in classification accuracy are addressed, i.e. feature extraction and classification. Prosodic features and other features widely...

chapter

On the use of I-vectors and average voice model for voice conversion without parallel data

Jie Wu, Zhizheng Wu, Lei Xie

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

Recently, deep and/or recurrent neural networks (DNNs/RNNs) have been employed for voice conversion, and have significantly improved the performance of converted speech. However, DNNs/RNNs generally require a large amount of parallel training data (e.g., hundreds of utterances) from source and target speakers. It is expensive to collect such a large amount of data, and impossible in some applications,...

chapter

Unsupervised single-channel speech separation via deep neural network for different gender mixtures

Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

In this study, we propose a regression approach via deep neural network (DNN) for unsupervised speech separation in a single-channel setting. We rely on a key assumption that two speakers could be well segregated if they are not too similar to each other. A dissimilarity measure between two speakers is then proposed to characterize the separation ability between competing speakers. We demonstrate...

chapter

Multi-task recurrent model for speech and speaker recognition

Zhiyuan Tang, Lantian Li, Dong Wang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

Although highly correlated, speech and speaker recognition have been regarded as two independent tasks and studied by two communities. This is certainly not the way that people behave: we decipher both speech content and speaker traits at the same time. This paper presents a unified model to perform speech and speaker recognition simultaneously and altogether. The model is based on a unified neural...

chapter

A study on target feature activation and normalization and their impacts on the performance of DNN based speech dereverberation systems

Bo Wu, Kehuang Li, Minglei Yang, Chin-Hui Lee

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

We adopt a linear activation function at the output layer and globally normalize the target features into zero mean and unit variance to learn the complicated mapping from reverberant to anechoic speech with a regression model based on deep neural networks (DNNs). The proposed feature activation and normalization framework was found to retain clearly observable harmonics and improve the speech quality...

Filter options

Keywords:
SPEECH PROCESSING

Keywords

TRAINING (4)
FEATURE EXTRACTION (3)
HIDDEN MARKOV MODELS (3)
SPEECH RECOGNITION (3)
ADAPTATION MODELS (1)
AVERAGE VOICE MODEL (1)
COMPUTATIONAL MODELING (1)
DATA MINING (1)
DATA MODELS (1)
DATABASES (1)
GAUSSIAN PROCESSES (1)
HARMONIC ANALYSIS (1)
I-VECTOR (1)
KERNEL (1)
LONG SHORT-TERM MEMORY (1)
MATRIX DECOMPOSITION (1)
MEASUREMENT (1)
MOBILE COMMUNICATION (1)
MULTIPLE KERNEL GAUSSIAN PROCESS (1)
NEURAL NETWORKS (1)
NONPARALLEL TRAINING (1)
REVERBERATION (1)
SEMI-NONNEGATIVE MATRIX FACTORIZATION (1)
SIGNAL TO NOISE RATIO (1)
SPEAKER RECOGNITION (1)
SPECTROGRAM (1)
SPEECH EMOTION CLASSIFICATION (1)
TRAINING DATA (1)
VOICE CONVERSION (1)

INFONA - science communication portal
