Search results for: Kun Li

Items from 1 to 6 out of 6 results

article

Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks

Kun Li, Xiaojun Qian, Helen Meng

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 1 > 193 - 207

This paper investigates the use of multidistribution deep neural networks (DNNs) for mispronunciation detection and diagnosis (MDD), to circumvent the difficulties encountered in an existing approach based on extended recognition networks (ERNs). The ERNs leverage existing automatic speech recognition technology by constraining the search space via including the likely phonetic error patterns of the...

chapter

Phonetic posteriorgrams for many-to-one voice conversion without parallel data training

Lifa Sun, Kun Li, Hao Wang, Shiyin Kang, more

2016 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2016 IEEE International Conference on Multimedia and Expo (ICME)

This paper proposes a novel approach to voice conversion with non-parallel training data. The idea is to bridge between speakers by means of Phonetic PosteriorGrams (PPGs) obtained from a speaker-independent automatic speech recognition (SI-ASR) system. It is assumed that these PPGs can represent articulation of speech sounds in a speaker-normalized space and correspond to spoken content speaker-independently...

chapter

Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks

Lifa Sun, Shiyin Kang, Kun Li, Helen Meng

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4869 - 4873

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates the use of Deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks (DBLSTM-RNNs) for voice conversion. Temporal correlations across speech frames are not directly modeled in frame-based methods using conventional Deep Neural Networks (DNNs), which results in a limited quality of the converted speech. To improve the naturalness and continuity of the speech...

chapter

Perceptually-motivated assessment of automatically detected lexical stress in L2 learners' speech

Kun Li, Helen Meng

2012 8th International Symposium on Chinese Spoken Language Processing > 179 - 183

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This paper presents a method of automatic lexical stress assessment for L2 English speech. Syllable stress can be labeled at three levels - primary (P), secondary (S) and no (N) stress, but secondary stress may vary among word pronunciations within and across accents and present difficulties for human perception. Hence, evaluation of lexical stress based on all three levels (i.e., the P-S-N criterion...

chapter

Research on Inland Ship Navigation Status Monitoring System

Kun Li, Xinping Yan, Zhe Mao, Lingzhi Sang

2012 11th International Symposium on Distributed Computing and Applications to Business, Engineering & Science > 366 - 370

2012 11th International Symposium on Distributed Computing and Applications to Business, Engineering & Science

Inland ship navigation status monitoring system plays an important role in the prevention and reduction of inland ship accident as well as navigation security. The system has functions of ship status data collection and recording, status monitoring and alarming. The hardware and software of the system are designed and the feasibility of the system is discussed. The hardware part of the system achieves...

chapter

Detection of intonation in L2 English speech of native Mandarin learners

Kun Li, Shuang Zhang, Mingxing Li, Wai-Kit Lo, more

2010 7th International Symposium on Chinese Spoken Language Processing > 69 - 74

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

We aim to detect salient mispronunciations in intonation of English speech uttered by Mandarin speakers. The goal of our project is to detect intonation errors and provide corrective feedback to English second language (ESL) learners. An intonational event includes the pitch accent and edge tone, and the intonation is closely related to the nuclear tone of an intonational phrase (IP). Hence, we first...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

FEATURE EXTRACTION (3)
ACCURACY (2)
ACOUSTICS (2)
DETECTORS (2)
L2 ENGLISH SPEECH (2)
TRAINING (2)
VOICE CONVERSION (2)
ACCIDENTS (1)
BIDIRECTIONAL LONG SHORT-TERM MEMORY (1)
COMPUTATIONAL MODELING (1)
CONTEXT (1)
CONTEXT MODELING (1)
CORRECTIVE FEEDBACK (1)
CORRELATION (1)
DBLSTM (1)
DEEP NEURAL NETWORKS (1)
DICTIONARIES (1)
DYNAMIC FEATURES (1)
EARLY-WARNING (1)
EDGE TONE (1)
EDUCATIONAL INSTITUTIONS (1)
ENGLISH INTONATION (1)
ENGLISH SECOND LANGUAGE LEARNER (1)
ENGLISH SPEECH (1)
ESL LEARNERS (1)
GAUSSIAN DISCRIMINATOR (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
HIDDEN MARKOV MODELS (1)
HUMANS (1)
IMAGE EDGE DETECTION (1)
INLAND SHIP (1)
INTONATION DETECTION (1)
INTONATIONAL PHRASE (1)
IP NETWORKS (1)
L2 SUPRASEGMENTAL FEATURES (1)
LANGUAGE LEARNING (1)
LOGIC GATES (1)
MANDARIN SPEAKER (1)
MANY-TO-ONE (1)
MARINE VEHICLES (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MISPRONUNCIATION DETECTION (1)
MISPRONUNCIATION DIAGNOSIS (1)
MONITORING (1)
NATIVE MANDARIN LEARNER (1)
NATURAL LANGUAGES (1)
NAVIGATION (1)
NEURAL NETWORKS (1)
NON-PARALLEL (1)
PHONETIC POSTERIORGRAMS (1)
PITCH ACCENT DETECTOR (1)
PITCH CONTOUR (1)
RECURRENT NEURAL NETWORKS (1)
SI-ASR (1)
SPEAKER RECOGNITION (1)
SPEECH RECOGNITION (1)
STATUS (1)
STRESS (1)
STRESS ASSESSMENT (1)
STRESS DETECTION (1)
STRESS PERCEPTION (1)
TRAINING DATA (1)
more

INFONA - science communication portal

Search results for: Kun Li

Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks

Phonetic posteriorgrams for many-to-one voice conversion without parallel data training

Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks

Perceptually-motivated assessment of automatically detected lexical stress in L2 learners' speech

Research on Inland Ship Navigation Status Monitoring System

Detection of intonation in L2 English speech of native Mandarin learners

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options