Search results for: Wenju Liu

Items from 1 to 11 out of 11 results

chapter

An improved steady segment based decoding algorithm by using response probability for LVCSR

Zhanlei Yang, Wenju Liu, Hao Chao

2012 8th International Symposium on Chinese Spoken Language Processing > 306 - 310

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This paper proposes a novel decoding algorithm by integrating both steady speech segments and observations' location information into conventional path extension framework. First, speech segments which possess stable spectrum are extracted. Second, a preliminarily improved algorithm is given by modifying traditional inter-HMM extension framework using the detected steady segments. Then, at probability...

chapter

Improved tone modeling by exploiting articulatory features for mandarin speech recognition

Hao Chao, Zhanlei Yang, Wenju Liu

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4741 - 4744

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

For the same tone pattern, different articulatory characteristics may make the pitch contour change. This paper applies articulatory features, which represent the articulatory information, as well as prosodic features to the tone modeling. Three kinds of tone models are trained to verify the effectiveness of articulatory features. Tone recognition experiments indicate significant improvement can be...

chapter

Improved Syllable Based Acoustic Modeling by Inter-Syllable Transition Model for Continuous Chinese Speech Recognition

Hao Chao, Wenju Liu

2009 Chinese Conference on Pattern Recognition > 1 - 4

2009 Chinese Conference on Pattern Recognition. (CCPR 2009) and the First CJK Joint Workshop on Pattern Recognition (CJKPR)

Accurately modeling the acoustic variabilities caused by coarticulation is important in continuous speech recognition. Recent research indicates that syllable units do better in modeling intra-syllable co-articulation effect than sub-syllable units. However, most continuous Mandarin speech recognition systems use context dependent phones or initial/finals (IFs) as the basic acoustic unit because it...

chapter

Improved Large Vocabulary Mandarin Speech Recognition Using Prosodic and Lexical Information in Maximum Entropy Framework

Chongjia Ni, Wenju Liu, Bo Xu

2009 Chinese Conference on Pattern Recognition > 1 - 4

2009 Chinese Conference on Pattern Recognition. (CCPR 2009) and the First CJK Joint Workshop on Pattern Recognition (CJKPR)

Tone plays an important role in distinguishing ambiguous words in Chinese Mandarin speech recognition. In this paper, we make full use of pitch information. On the one hand, we interpolate F0 contour to make the F0 contour continuous between voiced and unvoiced segments in order to embed F0 into speech recognition system in two streams, which cepstrum and its first and second order derivatives constitute...

chapter

A novel interpolated N-gram language model based on class hierarchy

Zhenyu Lv, Wenju Liu, Zhanlei Yang

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

In this paper, we propose a novel interpolated language model that combines the interpolation and the backing-off along hierarchical classes based on class hierarchy. And the corresponding approach to the estimation of interpolation coefficients is also presented. We use the Minimum Discriminative Information (MDI) method to cluster the vocabulary into a word-clustering tree hierarchically. The tree...

chapter

HMM-based phonemic distance in different speaking styles and its influence on substitutions in Mandarin speech recognition

Zhanlei Yang, Wenju Liu, Zhenyu Lv

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Statistical confusability between different acoustic models is important to character substitution error rate in large vocabulary continuous speech recognition. In this paper, we take factors of gender and speaking styles into consideration in Mandarin speech recognition. We modeled phonemes in different speaking styles, including read speech of female, male, and spontaneous dialogue. Then minimum...

chapter

Mandarin pitch accent prediction using hierarchical model based ensemble machine learning

Chongjia Ni, Wenju Liu, Bo Xu

2009 IEEE Youth Conference on Information, Computing and Telecommunication > 327 - 330

2009 IEEE Youth Conference on Information, Computing and Telecommunication (YC-ICT 2009)

In this study, we combine the Mandarin characteristics with Mandarin acoustic attribute and text information and use hierarchical model based ensemble machine learning to predict Mandarin pitch accent. Our model could make the best of advantages of prosody hierarchical structure and ensemble machine learning. When comparing our model with classification and regression tree (CART), support vector machine...

chapter

Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic Information

Chongjia Ni, Wenju Liu, Bo Xu

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Prosody is an important factor for a high quality text-to- speech (TTS) system. Prosody is often described with a hierarchical structure. So the generation of the hierarchical prosody structure is very important both in the corpus building and the real-time text analysis, but the prosody labeling procedure is laborious and time consuming. In this paper, an automatic prosody boundary label system is...

chapter

Stochastic Segment Model Decoding Algorithm Based on Neighboring Segments and its Application in LVCSR

Shouye Peng, Wenju Liu, Hua Zhang

2008 Chinese Conference on Pattern Recognition > 1 - 5

2008 Chinese Conference on Pattern Recognition

In the large vocabulary continuous speech recognition system based on stochastic segment model (SSM), the multistage decoding and pruning algorithm could decrease decoding time obviously. Generally, we only decode and prune for one segment each time. In this paper, a decoding algorithm based on neighboring segments is proposed. This algorithm decodes for multi-segments at the same time, so that the...

chapter

Durational Characteristics and Pitch Characteristics of the Prosodic Phrase in Mandarin Chinese

Chongjia Ni, Wenju Liu

2008 Chinese Conference on Pattern Recognition > 1 - 5

2008 Chinese Conference on Pattern Recognition

It is the key to improve the natural degree of speech synthesis and reduce the error rate of speech recognition that analyzes the information structure and prosodic structure of sentence and chapters. Based on large speech corpus (ASCCD) with prosodic structure label, we measured the characteristics of duration and pitch on prosodic phrase. The statistical results on duration and pitch are presented...

chapter

Research on Adaptive Step Decoding in Segment-based LVCSR

Hua Zhang, Wenju Liu, Bo Xu

2007 International Conference on Natural Language Processing and Knowledge Engineering > 463 - 467

2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE '07)

In this paper, a novel adaptive step decoding method using steady-energy pieces (SEPs) is explored in segment model (SM) based LVCSR system. Using speech analysis methods and statistical classification tools, the start and end points of SEPs are detected firstly. In SM decoding stage, frame-by-frame decoding for segments which start or end in SEPs are overleaped, replaced by SEP-based decoding. In...

Filter options

Keywords:
SPEECH RECOGNITION

Publication date

Set your own date range

Content availability

Available (10)
None (1)

Keywords

SPEECH (8)
HIDDEN MARKOV MODELS (7)
ACOUSTICS (4)
NATURAL LANGUAGE PROCESSING (4)
CLASSIFICATION ALGORITHMS (3)
DECODING (3)
ERROR ANALYSIS (3)
COMPUTATIONAL MODELING (2)
ENTROPY (2)
MANDARIN (2)
MANDARIN SPEECH RECOGNITION (2)
SPEECH PROCESSING (2)
SPEECH SYNTHESIS (2)
STATISTICAL ANALYSIS (2)
SUPPORT VECTOR MACHINES (2)
TONE MODELING (2)
TRAINING (2)
TREES (MATHEMATICS) (2)
VOCABULARY (2)
ACCURACY (1)
ACOUSTIC INFORMATION (1)
ACOUSTIC MODEL (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTIC VARIABILITIES MODELING (1)
ADABOOST (1)
ADAPTIVE STEP DECODING METHOD (1)
ARTICULATION (1)
ASCCD (1)
AUTOMATIC PROSODY BOUNDARY LABELING (1)
BACK-OFF (1)
CART ALGORITHM (1)
CHAPTERS (1)
CHARACTER SUBSTITUTION ERROR RATE (1)
CHINESE CHARACTER ERROR RATE REDUCTION (1)
CHINESE INITIAL-FINAL MODEL PAIR (1)
CLASS HIERARCHY (1)
CLASSIFICATION AND REGRESSION TREE (1)
CLASSIFICATION AND REGRESSION TREE FRAMEWORK (1)
CLUSTER (1)
CLUSTERING ALGORITHMS (1)
COMPOUNDS (1)
CONTEXT (1)
CONTEXT DEPENDENT PHONES (1)
CONTEXT INDEPENDENT SYLLABLE (1)
CONTEXT MODELING (1)
CONTINUOUS CHINESE SPEECH RECOGNITION (1)
CONTINUOUS MANDARIN SPEECH RECOGNITION SYSTEMS (1)
DATA MINING (1)
DATA MODELS (1)
DATABASES (1)
DECODING ALGORITHM (1)
DURATIONAL CHARACTERISTICS (1)
ELECTRONIC MAIL (1)
ENSEMBLE MACHINE LEARNING (1)
ERROR RATE (1)
ERROR RATE REDUCTION (1)
ESTIMATION (1)
FEATURE EXTRACTION (1)
GAUSSIAN DISTRIBUTION (1)
GAUSSIAN PROCESSES (1)
GENDER FACTOR (1)
GENERALISATION (ARTIFICIAL INTELLIGENCE) (1)
GENERALIZATION ABILITY (1)
HIERARCHICAL MODEL BASED ENSEMBLE MACHINE LEARNING (1)
HIGH QUALITY TEXT-TO-SPEECH SYSTEM (1)
HMM BASED PHONEMIC DISTANCE (1)
INTERPOLATE (1)
INTERPOLATED N-GRAM LANGUAGE MODEL (1)
INTERPOLATION (1)
INTERPOLATION COEFFICIENT ESTIMATION (1)
INTERSYLLABLE TRANSITION MODELS (1)
INTRASYLLABLE COARTICULATION EFFECT MODELING (1)
LABELING (1)
LABORATORIES (1)
LANGUAGE MODEL (1)
LARGE SPEECH CORPUS (1)
LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (1)
LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM (1)
LATTICES (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEXICAL INFORMATION (1)
MACHINE LEARNING (1)
MANDARIN CHINESE (1)
MANDARIN PITCH ACCENT PREDICTION (1)
MATHEMATICS (1)
MAXIMUM ENTROPY FRAMEWORK (1)
MAXIMUM ENTROPY METHODS (1)
MINIMUM DISCRIMINATIVE INFORMATION METHOD (1)
MINIMUM GAUSSIAN DISTANCES (1)
MULTISTAGE DECODING (1)
MULTISTAGE PRUNING (1)
N-GRAM EVENT LIKELIHOOD ESTIMATION (1)
NATURAL LANGUAGES (1)
PATH EXTENSION (1)
PATTERN CLUSTERING (1)
PATTERN RECOGNITION (1)
PHONE SIZE (1)
PHONEMIC DISTANCE (1)
PITCH ACCENT (1)
more

INFONA - science communication portal

Search results for: Wenju Liu

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options