Search results for: Hua Wang

Items from 1 to 20 out of 22 results

chapter

Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score

Heng Lu, Zhen-Hua Ling, Li-Rong Dai, Ren-Hua Wang

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5352 - 5355

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes a unit-selection and waveform concatenation speech synthesis system based on synthetic speech naturalness evaluation. A Support Vector Machine (SVM) and Log Likelihood Ratio (LLR) based synthetic speech naturalness evaluation system was introduced in our previous work. In this paper, the evaluation system is improved in three aspects. Finally, a unit-selection and concatenation...

chapter

Self-adaptive Method Based on Software Architecture by Inspecting Uncertainty

Hua Wang, Zhijun Zheng

2010 International Conference on Artificial Intelligence and Computational Intelligence > 3 > 208 - 214

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

A recent common approach to monitor and adapt system behavior at runtime is to decouple one or more external modules and self-adaptive mechanisms from the target system. The non-invasive manners have the main advantage of realizing separation of concerns. However, some uncertainty aspects emerge while utilizing these separate control units. The unanticipated inherence and complexity of upcoming services...

chapter

Plume Source Localizing in Different Distributions and Noise Types Based on WSN

Hua Wang, Yiming Zhou, Xianglong Yang, Liren Wang

2010 International Conference on Communications and Mobile Computing > 3 > 63 - 66

2010 International Conference on Communications and Mobile Computing (CMC 2010)

Accidental gas leaks from unknown sites will cause the serious environmental pollution. One of the efficient methods to solve the problem is tracking and locating the plume source position. This paper presents a wireless sensor network installed with the gas sensor to on-line monitor the environment and estimate the location of a gas source based on the concentration readings at the wireless sensor...

chapter

HMM-based pseudo-clean speech synthesis for splice algorithm

Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4570 - 4573

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper, we present a novel approach to relax the constraint of stereo-data which is needed in a series of algorithms for noise-robust speech recognition. As a demonstration in SPLICE algorithm, we generate the pseudo-clean features to replace the ideal clean features from one of the stereo channels, by using HMM-based speech synthesis. Experimental results on aurora2 database show that the...

chapter

Full covariance state duration modeling for HMM-based speech synthesis

Heng Lu, Yi-Jian Wu, K. Tokuda, Li-Rong Dai, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4033 - 4036

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper proposes a state duration modeling method using full covariance matrix for HMM-based speech synthesis. In this method, a full covariance matrix instead of the conventional diagonal covariance matrix is adopted in the multi-dimensional Gaussian distribution to model the state duration of each context-dependent phoneme. At synthesis stage, the state durations are predicted using the clustered...

article

Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis

Zhen-Hua Ling, K. Richmond, J. Yamagishi, Ren-Hua Wang

IEEE Transactions on Audio, Speech, and Language Processing > 2009 > 17 > 6 > 1171 - 1185

This paper presents an investigation into ways of integrating articulatory features into hidden Markov model (HMM)-based parametric speech synthesis. In broad terms, this may be achieved by estimating the joint distribution of acoustic and articulatory features during training. This may in turn be used in conjunction with a maximum-likelihood criterion to produce acoustic synthesis parameters for...

chapter

An Improvement for Training Efficiency of Semi-Tied Covariance

Si-Bao Chen, Yu Hu, Bin Luo, Ren-Hua Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Semi-tied covariance (STC) is applied widely in speech recognition due to its feature de-correlation ability. Solving the transform matrices of STC is a nonlinear optimization problem. Gales proposed an efficient method by iteratively updating a row of transform matrices. However, it needs to solve cofactors of elements of a matrix row in two layers of loops. Directly solving them is very time-consuming...

chapter

Pronunciation Space Models for Pronunciation Evaluation

Si Wei, Yi-Qian Pan, Guo-Ping Hu, Yu Hu, more

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Posterior probability is mostly used for pronunciation evaluation. This paper introduces pronunciation space models to calculate posterior probability replacing traditional phone-based acoustic models, which makes the calculated posterior probability more precise. Pronunciation space models are constructed using unsupervised clustering method guided by human scores and phone-level posterior probability...

chapter

Cross-Stream Dependency Modeling for HMM-Based Speech Synthesis

Zhen-Hua Ling, Wei Zhang, Ren-Hua Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

This paper presents a method that the dependency between F0 and spectral features are modeled for the HMM-based parametric speech synthesis system. In conventional systems these two features are modeled as two independent streams, which is inconsistent with the fact that there always exists interaction between the extracted F0 and spectral parameters for model training. A piecewise linear transform...

chapter

Exploiting Non-Target Region Information for Confidence Measure Based on Bayesian Information Criterion

Cong Liu, Yu Hu, Xiong-Guo Lei, Zhi-Guo Wang, more

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper appropriate confidence measures (CMs) are investigated for Mandarin command word recognition, both in the so-called target region and non-target region, respectively. Here the target region refers to the recognized speech part of command word while the non-target region refers to the recognized silence part. It shows that exploiting extra information in the non-target region can effectively...

chapter

Tone Evaluation of Chinese Continuous Speech Based on Prosodic Words

Yi-Qian Pan, Si Wei, Ren-Hua Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Tonal evaluation of Chinese continuous speech plays an important role in Mandarin Chinese pronunciation test. In this paper, we introduce the Multi- Space Distribution Hidden Markov Model based on prosodic word. The results show that the performance of tonal syllable error rate can be reduced. For the non-standard Chinese Mandarin speech, the correlation between computer score and expert score was...

chapter

Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion

Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang

2008 Tenth IEEE International Symposium on Multimedia > 539 - 544

2008 Tenth IEEE International Symposium on Multimedia

In order to solve the issues related to the maximum likelihood (ML) based HMM training for HMM-based speech synthesis, a minimum generation error (MGE) criterion had been proposed. This paper continues to apply the MGE criterion to model adaptation for HMM-based speech synthesis. We introduce a MGE linear regression (MGELR) based model adaptation algorithm, where the transforms from source HMMs to...

chapter

Minimum generation error criterion considering global/local variance for HMM-based speech synthesis

Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang, more

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4621 - 4624

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

Due to the inconsistency between the maximum likelihood (ML) based training and the synthesis application in HMM-based speech synthesis, a minimum generation error (MGE) criterion had been proposed for HMM training. This paper continues to apply the MGE criterion to model adaptation for HMM-based speech synthesis. We propose a MGE linear regression (MGELR) based model adaptation algorithm, where the...

chapter

Minimum word classification error training of HMMS for automatic speech recognition

Zhi-Jie Yan, Bo Zhu, Yu Hu, Ren-Hua Wang

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4521 - 4524

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

This paper presents a novel discriminative training criterion, minimum word classification error (MWCE). By localizing conventional string-level MCE loss function to word-level, a more direct measure of empirical word classification error is approximated and minimized. Because the word-level criterion better matches performance evaluation criteria such as WER, an improved word recognition performance...

chapter

Minimum unit selection error training for HMM-based unit selection speech synthesis system

Zhen-Hua Ling, Ren-Hua Wang

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3949 - 3952

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

This paper presents a minimum unit selection error (MUSE) training method for HMM-based unit selection speech synthesis system, which selects the optimal phone-sized unit sequence from the speech database by maximizing the combined likelihood of a group of trained HMMs. Under MUSE criterion, the weights and distribution parameters of these HMMs are estimated to minimize the number of different units...

chapter

A constrained line search approach to general discriminative HMM training

Peng Liu, Cong Liu, Hui Jiang, F.K. Soong, more

2007 IEEE Workshop on Automatic Speech Recognition&Understanding (ASRU) > 290 - 295

2007 IEEE Workshop on Automatic Speech Recognition and Understanding

Recently, we proposed a novel optimization algorithm called constrained line search (CLS) to train Gaussian mean vectors of HMMs in the MMI sense. In this paper, we extend and re-formulate it in a more general framework. The new CLS can optimize any discriminative objective functions including MMI, MCE, MPE/MWE etc. Also, closed-form solutions to update all Gaussian mixture parameters, including means,...

chapter

Toward Runtime Self-adaptation Method in Software-Intensive Systems Based on Hidden Markov Model

Hua Wang, Jing Ying

31st Annual International Computer Software and Applications Conference (COMPSAC 2007) > 2 > 601 - 606

2007 31st Annual International Computer Software and Applications Conference. COMPSAC 2007

To reduce the overload of human management, recently runtime self-adaptation is emerging as an important characteristic required by most intelligent software-intensive systems. Most methods are built upon the analysis of concepts of architecture and exploit some "craft" from the perspective of qualitative analysis. However, these methods are often incapable of reasoning about the history...

chapter

Word Graph Based Feature Enhancement for Noisy Speech Recognition

Zhi-Jie Yan, F.K. Soong, Ren-Hua Wang

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-373 - IV-376

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

This paper presents a word graph based feature enhancement method for robust speech recognition in noise. The approach uses signal processing based speech enhancement as a starting point, and then performs Wiener filtering to remove residual noise. During the process, a decoded word graph is used to directly guide the feature enhancement with respect to the HMM for recognition, so that the enhanced...

chapter

A Constrained Line Search Optimization for Discriminative Training in Speech Recognition

Cong Liu, Peng Liu, Hui Jiang, F. Soong, more

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-329 - IV-332

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

In this paper, we propose a novel constrained line search to optimize the MMEE objective function for training discriminative HMMs. In our method, the MMI estimation is cast as a constrained maximization problem, where Kullback-Leibler divergence between models before and after parameters adjustment is introduced as a constraint during optimization. Then, based on the idea of line search, we show...

chapter

A New Minimum Divergence Approach to Discriminative Training

Jun Du, Peng Liu, Hui Jiang, F.K. Soong, more

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-677 - IV-680

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

We propose to use minimum divergence, where acoustic similarity between HMMs is characterized by Kullback-Leibler divergence, for discriminative training. The MD objective function is defined as a posterior weighted divergence measured over the whole training set. Different from our earlier work, where KLD-based acoustic similarity is pre-computed for all initial models and stays invariant in the...

Keywords:
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Publication type

book (21)
article (1)

Keywords

SPEECH RECOGNITION (11)
SPEECH (9)
SPEECH SYNTHESIS (9)
HIDDEN MARKOV MODEL (7)
HMM (7)
TRAINING (7)
DATABASES (5)
DISCRIMINATIVE TRAINING (5)
MAXIMUM LIKELIHOOD ESTIMATION (5)
HMM-BASED SPEECH SYNTHESIS (4)
KULLBACK-LEIBLER DIVERGENCE (4)
TRANSFORMS (4)
ACOUSTICS (3)
CONTEXT MODELING (3)
COVARIANCE MATRICES (3)
COVARIANCE MATRIX (3)
PROBABILITY (3)
TESTING (3)
ADAPTATION MODEL (2)
FEATURE EXTRACTION (2)
GAUSSIAN DISTRIBUTION (2)
GAUSSIAN PROCESSES (2)
LINE SEARCH (2)
MAXIMUM MUTUAL INFORMATION (2)
MINIMUM GENERATION ERROR (2)
MINIMUM GENERATION ERROR CRITERION (2)
MODEL ADAPTATION (2)
NATURAL LANGUAGE PROCESSING (2)
NOISE MEASUREMENT (2)
NOISY SPEECH RECOGNITION (2)
REGRESSION ANALYSIS (2)
SEARCH PROBLEMS (2)
SPEECH PROCESSING (2)
WORD RECOGNITION (2)
ACCIDENTAL GAS LEAKS (1)
ACOUSTIC PARAMETER PREDICTION (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTIC SYNTHESIS PARAMETER (1)
AIR POLLUTION (1)
APPROXIMATION THEORY (1)
ARTICULATORY FEATURE (1)
ARTICULATORY FEATURES (1)
ASYNCHRONOUS-STATE MODEL STRUCTURE (1)
AUDIO DATABASES (1)
AURORA 2 DATABASE (1)
AUTOMATIC SPEECH RECOGNITION (1)
BASELINE SYSTEM (1)
BAYESIAN INFORMATION CRITERION (1)
BAYESIAN METHODS (1)
BIAS ADAPTATION ALGORITHM (1)
CHAM (1)
CHEMICAL ABSTRACT MACHINE (1)
CHINESE CONTINUOUS SPEECH (1)
CHINESE DATABASE (1)
CLS OPTIMIZATION METHOD (1)
CLUSTERED CONTEXT-DEPENDENT DISTRIBUTION (1)
COFACTOR VECTOR (1)
COMPUTATIONAL ELEMENT (1)
CONFIDENCE MEASURE (1)
CONNECTORS (1)
CONSTRAINED LINE SEARCH APPROACH (1)
CONSTRAINED LINE SEARCH OPTIMIZATION (1)
CONSTRUCTION INDUSTRY (1)
CONTEXT (1)
CONTEXT-DEPENDENT PHONEME (1)
CORRELATION (1)
COST ACCOUNTING (1)
CROSS-STREAM DEPENDENCY (1)
CROSS-STREAM FEATURE DEPENDENCY (1)
DECODED WORD GRAPH (1)
DECODING (1)
DISCRIMINATIVE OBJECTIVE FUNCTION (1)
DISPERSION (1)
DURATION (1)
EBW OPTIMIZATION METHOD (1)
ENVIRONMENTAL FACTORS (1)
ENVIRONMENTAL POLLUTION (1)
EQUAL ERROR RATE (1)
ERROR ANALYSIS (1)
ERROR CORRECTION (1)
ERROR ESTIMATION (1)
FEATURE DE-CORRELATION (1)
FEATURE ENHANCEMENT (1)
FILTERING THEORY (1)
FULL COVARIANCE (1)
FULL COVARIANCE MATRIX STATE DURATION MODELING (1)
GALERKIN METHOD (1)
GALES METHOD (1)
GAS SENSOR (1)
GAS SENSORS (1)
GAUSSIAN MEAN VECTOR (1)
GAUSSIAN MIXTURE PARAMETER (1)
GENERAL DISCRIMINATIVE HMM TRAINING (1)
GENERALIZED PROBABILISTIC DESCENT (1)
GENERALIZED PROBABILISTIC DESCENT ALGORITHM (1)
GENERATION ERROR MINIMIZATION (1)
GLOBAL VARIANCE (1)
GRAPH THEORY (1)
GUSSIAN COMPONENTS (1)
more

INFONA - science communication portal

Search results for: Hua Wang

Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score

Self-adaptive Method Based on Software Architecture by Inspecting Uncertainty

Plume Source Localizing in Different Distributions and Noise Types Based on WSN

HMM-based pseudo-clean speech synthesis for splice algorithm

Full covariance state duration modeling for HMM-based speech synthesis

Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis

An Improvement for Training Efficiency of Semi-Tied Covariance

Pronunciation Space Models for Pronunciation Evaluation

Cross-Stream Dependency Modeling for HMM-Based Speech Synthesis

Exploiting Non-Target Region Information for Confidence Measure Based on Bayesian Information Criterion

Tone Evaluation of Chinese Continuous Speech Based on Prosodic Words

Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion

Minimum generation error criterion considering global/local variance for HMM-based speech synthesis

Minimum word classification error training of HMMS for automatic speech recognition

Minimum unit selection error training for HMM-based unit selection speech synthesis system

A constrained line search approach to general discriminative HMM training

Toward Runtime Self-adaptation Method in Software-Intensive Systems Based on Hidden Markov Model

Word Graph Based Feature Enhancement for Noisy Speech Recognition

A Constrained Line Search Optimization for Discriminative Training in Speech Recognition

A New Minimum Divergence Approach to Discriminative Training

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results for: Hua Wang

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options