Wyniki wyszukiwania dla: Kun Li

Pozycje od 1 do 6 spośród 6 wyników

artykuł

Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks

Kun Li, Xiaojun Qian, Helen Meng

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 1 > 193 - 207

This paper investigates the use of multidistribution deep neural networks (DNNs) for mispronunciation detection and diagnosis (MDD), to circumvent the difficulties encountered in an existing approach based on extended recognition networks (ERNs). The ERNs leverage existing automatic speech recognition technology by constraining the search space via including the likely phonetic error patterns of the...

rozdział

Phonetic posteriorgrams for many-to-one voice conversion without parallel data training

Lifa Sun, Kun Li, Hao Wang, Shiyin Kang, więcej

2016 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2016 IEEE International Conference on Multimedia and Expo (ICME)

This paper proposes a novel approach to voice conversion with non-parallel training data. The idea is to bridge between speakers by means of Phonetic PosteriorGrams (PPGs) obtained from a speaker-independent automatic speech recognition (SI-ASR) system. It is assumed that these PPGs can represent articulation of speech sounds in a speaker-normalized space and correspond to spoken content speaker-independently...

rozdział

Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks

Lifa Sun, Shiyin Kang, Kun Li, Helen Meng

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4869 - 4873

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates the use of Deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks (DBLSTM-RNNs) for voice conversion. Temporal correlations across speech frames are not directly modeled in frame-based methods using conventional Deep Neural Networks (DNNs), which results in a limited quality of the converted speech. To improve the naturalness and continuity of the speech...

rozdział

Perceptually-motivated assessment of automatically detected lexical stress in L2 learners' speech

Kun Li, Helen Meng

2012 8th International Symposium on Chinese Spoken Language Processing > 179 - 183

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This paper presents a method of automatic lexical stress assessment for L2 English speech. Syllable stress can be labeled at three levels - primary (P), secondary (S) and no (N) stress, but secondary stress may vary among word pronunciations within and across accents and present difficulties for human perception. Hence, evaluation of lexical stress based on all three levels (i.e., the P-S-N criterion...

rozdział

Research on Inland Ship Navigation Status Monitoring System

Kun Li, Xinping Yan, Zhe Mao, Lingzhi Sang

2012 11th International Symposium on Distributed Computing and Applications to Business, Engineering & Science > 366 - 370

2012 11th International Symposium on Distributed Computing and Applications to Business, Engineering & Science

Inland ship navigation status monitoring system plays an important role in the prevention and reduction of inland ship accident as well as navigation security. The system has functions of ship status data collection and recording, status monitoring and alarming. The hardware and software of the system are designed and the feasibility of the system is discussed. The hardware part of the system achieves...

rozdział

Detection of intonation in L2 English speech of native Mandarin learners

Kun Li, Shuang Zhang, Mingxing Li, Wai-Kit Lo, więcej

2010 7th International Symposium on Chinese Spoken Language Processing > 69 - 74

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

We aim to detect salient mispronunciations in intonation of English speech uttered by Mandarin speakers. The goal of our project is to detect intonation errors and provide corrective feedback to English second language (ESL) learners. An intonational event includes the pitch accent and edge tone, and the intonation is closely related to the nuclear tone of an intonational phrase (IP). Hence, we first...

Opcje filtrowania

Słowa kluczowe:
SPEECH

Data publikacji

Ustaw własny zakres dat

Typ publikacji

książka (5)
artykuł (1)

Słowa kluczowe

FEATURE EXTRACTION (3)
ACCURACY (2)
ACOUSTICS (2)
DETECTORS (2)
L2 ENGLISH SPEECH (2)
TRAINING (2)
VOICE CONVERSION (2)
ACCIDENTS (1)
BIDIRECTIONAL LONG SHORT-TERM MEMORY (1)
COMPUTATIONAL MODELING (1)
CONTEXT (1)
CONTEXT MODELING (1)
CORRECTIVE FEEDBACK (1)
CORRELATION (1)
DBLSTM (1)
DEEP NEURAL NETWORKS (1)
DICTIONARIES (1)
DYNAMIC FEATURES (1)
EARLY-WARNING (1)
EDGE TONE (1)
EDUCATIONAL INSTITUTIONS (1)
ENGLISH INTONATION (1)
ENGLISH SECOND LANGUAGE LEARNER (1)
ENGLISH SPEECH (1)
ESL LEARNERS (1)
GAUSSIAN DISCRIMINATOR (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
HIDDEN MARKOV MODELS (1)
HUMANS (1)
IMAGE EDGE DETECTION (1)
INLAND SHIP (1)
INTONATION DETECTION (1)
INTONATIONAL PHRASE (1)
IP NETWORKS (1)
L2 SUPRASEGMENTAL FEATURES (1)
LANGUAGE LEARNING (1)
LOGIC GATES (1)
MANDARIN SPEAKER (1)
MANY-TO-ONE (1)
MARINE VEHICLES (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MISPRONUNCIATION DETECTION (1)
MISPRONUNCIATION DIAGNOSIS (1)
MONITORING (1)
NATIVE MANDARIN LEARNER (1)
NATURAL LANGUAGES (1)
NAVIGATION (1)
NEURAL NETWORKS (1)
NON-PARALLEL (1)
PHONETIC POSTERIORGRAMS (1)
PITCH ACCENT DETECTOR (1)
PITCH CONTOUR (1)
RECURRENT NEURAL NETWORKS (1)
SI-ASR (1)
SPEAKER RECOGNITION (1)
SPEECH RECOGNITION (1)
STATUS (1)
STRESS (1)
STRESS ASSESSMENT (1)
STRESS DETECTION (1)
STRESS PERCEPTION (1)
TRAINING DATA (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Kun Li

Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks

Phonetic posteriorgrams for many-to-one voice conversion without parallel data training

Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks

Perceptually-motivated assessment of automatically detected lexical stress in L2 learners' speech

Research on Inland Ship Navigation Status Monitoring System

Detection of intonation in L2 English speech of native Mandarin learners

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu