Search results for: Yonghong Yan

Items from 1 to 7 out of 7 results

chapter

A Spread Spectrum Audio Watermarking System with High Perceptual Quality

Xuemin Zhao, Yuhong Guo, Jian Liu, Yonghong Yan

2011 Third International Conference on Communications and Mobile Computing > 266 - 269

2011 Third International Conference on Communications and Mobile Computing (CMC)

We propose a system to embed a watermark message into an audio signal, which can be used for copyright protection. It uses spread spectrum theory to generate a watermark that is resistant to different removal attempts. We exploit the psychoacoustic auditory model to guarantee the audio signal's perceptual quality after the watermark embedding procedure. Recovery is performed without knowledge of the original...

article

Voice Activity Detection Based on an Unsupervised Learning Framework

Dongwen Ying, Yonghong Yan, Jianwu Dang, Frank K. Soong

IEEE Transactions on Audio, Speech, and Language Processing > 2011 > 19 > 8 > 2624 - 2633

How to construct models for speech/nonspeech discrimination is a crucial point for voice activity detectors (VADs). Semi-supervised learning is the most popular way for model construction in conventional VADs. In this correspondence, we propose an unsupervised learning framework to construct statistical models for VAD. This framework is realized by a sequential Gaussian mixture model. It comprises...

chapter

Tone pronunciation quality scoring of Mandarin multi-syllable words

Junbo Zhang, Hemin Wu, Yonghong Yan

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 545 - 548

2010 10th International Conference on Signal Processing (ICSP 2010)

This paper discusses tone pronunciation scoring for Mandarin multi-syllable words in a Computer Assisted Language Learning (CALL) system. A commonly used tone evaluation method uses a GMM to model various pitch sequences. Because the pattern of a pitch sequence changes considerably in the multi-syllable context, tone models trained on a mono-tone database do not perform well on multi-syllable speech...

chapter

Removing fillers to induce semantic classes for a Chinese dialogue system

Yali Li, Xuemin Zhao, Yonghong Yan

2010 2nd IEEE International Conference on Information Management and Engineering > 512 - 516

2010 2nd IEEE International Conference on Information Management and Engineering (ICIME 2010)

In this paper, we introduce an unsupervised method to remove fillers in spoken dialogues semi-automatically based on their probability distribution, and we study the effect of removing fillers on the induction of semantic classes. We compute the unigram and bigram distributions of fillers on our Chinese voice search data and find that, using only these distributions, fillers are in the first 1% of all words. We also test...

chapter

Removing fillers to induce semantic classes for a Chinese dialogue system

Yali Li, Yonghong Yan

2010 2nd International Conference on Advanced Computer Control > 4 > 163 - 166

2010 2nd International Conference on Advanced Computer Control (ICACC 2010)

In this paper, we introduce an unsupervised method to remove fillers in spoken dialogues semi-automatically based on their probability distribution. Disfluencies such as fillers and repairs often make a sentence ill-formed, longer, and hard to process. This paper focuses on fillers rather than repairs. We compute the unigram and bigram distributions of fillers on our Chinese voice search data...

chapter

Automatic Detection of Pathological Voices Using GMM-MLLR Approach

Xiang Wang, Jianping Zhang, Yonghong Yan

2009 2nd International Conference on Biomedical Engineering and Informatics > 1 - 4

2009 2nd International Conference on Biomedical Engineering and Informatics (BMEI)

Modern lifestyles have increased the risk of voice disorders. It is estimated that nearly 19% of the population have suffered from dysphonic voicing. It is very important to detect pathological voices automatically. Many classification methods have been used for this task and have achieved good results. In this paper, we focus on the automatic detection...

chapter

A Synchronous Method for Automatic Scoring of Language Learning

Bin Dong, Yonghong Yan

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 5

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper, a synchronous method based on a state graph is proposed to calculate the evaluation features for automatic scoring in computer-assisted language learning (CALL). The posterior probabilities of states are selected as the main feature. The scores of hypothesized phonemes and words are estimated using the information of the corresponding states. Traditional systems use two passes and two different...


INFONA - science communication portal
