2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

chapter

Improving BLSTM RNN based Mandarin speech recognition using accent dependent bottleneck features

Jiangyan Yi, Hao Ni, Zhengqi Wen, Jianhua Tao

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

This paper proposes an approach to perform accent adaptation by using accent dependent bottleneck (BN) features to improve the performance of multi-accent Mandarin speech recognition system. The architecture of the adaptation uses two neural networks. First, deep neural network (DNN) acoustic model acts as a feature extractor which is used to extract accent dependent BN (BN-DNN) features. The input...

chapter

Speaker recognition in duration-mismatched condition using bootstrapped i-vectors

Atsushi Ando, Taichi Asami, Yoshikazu Yamaguchi, Yushi Aono

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper presents a novel speaker recognition framework that handles duration mismatch between registered and test utterances. The i-vectors extracted from short utterances exhibit high variance due to phoneme imbalance, which causes performance degradation in the duration mismatch condition. Most conventional methods attempt to decrease the variance by offsetting i-vectors or speaker similarity...

chapter

Fast and accurate personal authentication using ear acoustics

Takayuki Arakawa, Takafumi Koshinaka, Shohei Yano, Hideki Irisawa, more

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper presents a biometric personal-authentication method that exploits acoustic characteristics of human ears. It transmits a probe signal into the ear and receives its reflection, which contains personal identity information about the shape of the ear canal. Based on a study of effective and efficient acoustic feature representation and the use of audio equipment suitable for acquiring features...

chapter

DNN based detection of pronunciation erroneous tendency in data sparse condition

Yingming Gao, Yanlu Xie, Ju Lin, Jinsong Zhang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Detecting pronunciation erroneous tendency (PET) can provide second languages learners with detailedly instructive feedbacks in the computer aided pronunciation training (CAPT) systems. Due to the data sparseness, DNN-HMM achieved limited improvement over GMM-HMM in our previous work. Instead of directly employing DNN-HMM to detect PETs, this paper investigated how to further improve the performance...

chapter

Highlighting root notes in chord recognition using cepstral features and multi-task learning

Mu-Heng Yang, Li Su, Yi-Hsuan Yang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 8

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

A musical chord is usually described by its root note and the chord type. While a substantial amount of work has been done in the field of music information retrieval (MIR) to automate chord recognition, the role of root notes in this task has seldom received specific attention. In this paper, we present a new approach and empirical studies demonstrating improved accuracy in chord recognition by properly...

chapter

Locality sensitive discriminant analysis for speaker verification

Danwei Cai, Weicheng Cai, Zhidong Ni, Ming Li

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we apply Locality Sensitive Discriminant Analysis (LSDA) to speaker verification system for intersession variability compensation. As opposed to LDA which fails to discover the local geometrical structure of the data manifold, LSDA finds a projection which maximizes the margin between i-vectors from different speakers at each local area. Since the number of samples varies in a wide...

INFONA - science communication portal

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Improving BLSTM RNN based Mandarin speech recognition using accent dependent bottleneck features

Speaker recognition in duration-mismatched condition using bootstrapped i-vectors

Fast and accurate personal authentication using ear acoustics

DNN based detection of pronunciation erroneous tendency in data sparse condition

Highlighting root notes in chord recognition using cepstral features and multi-task learning

Locality sensitive discriminant analysis for speaker verification

Filter options

Publication date

Keywords

INFONA - science communication portal

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) $("#expandableTitles").expandable();

Improving BLSTM RNN based Mandarin speech recognition using accent dependent bottleneck features

Speaker recognition in duration-mismatched condition using bootstrapped i-vectors

Fast and accurate personal authentication using ear acoustics

DNN based detection of pronunciation erroneous tendency in data sparse condition

Highlighting root notes in chord recognition using cepstral features and multi-task learning

Locality sensitive discriminant analysis for speaker verification

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)