Search results for: Pengyuan Zhang

Items from 1 to 13 out of 13 results

chapter

An unsupervised vocabulary selection technique for Chinese automatic speech recognition

Yike Zhang, Pengyuan Zhang, Ta Li, Yonghong Yan

2016 IEEE Spoken Language Technology Workshop (SLT) > 420 - 425

2016 IEEE Spoken Language Technology Workshop (SLT)

The vocabulary is a vital component of automatic speech recognition(ASR) systems. For a specific Chinese speech recognition task, using a large general vocabulary not only leads to a much longer time to decode, but also hurts the recognition accuracy. In this paper, we proposed an unsupervised algorithm to select task-specific words from a large general vocabulary. The out-of-vocabulary(OOV) rate...

chapter

A bi-scale method of link prediction

Pengyuan Zhang, Jianping Li, Qi Liu, Zheng Xie

2015 11th International Conference on Natural Computation (ICNC) > 1040 - 1044

The 2015 11th International Conference on Natural Computation

Link prediction in networks is of both theoretical interest and practical significance in many branches of science, and a great number of algorithms are based on microscale (common neighbours) or mesoscale (communities) information of observed networks. Either microscale or mesoscale methods are limited in the understanding of the topological properties at the corresponding scales. This article proposes...

chapter

An improvement of link prediction by combining local information and betweenness

Qi Liu, Jianping Li, Zheng Xie, Pengyuan Zhang

2015 11th International Conference on Natural Computation (ICNC) > 456 - 461

The 2015 11th International Conference on Natural Computation

Link prediction has significance in both theoretical interest and practical operation. Many methods via local and global structural information have been proposed. Methods based on local information like the Common Neighbours Index(CN) successfully reduce the computational expense but suffer from poor prediction performance. In this article, we put forward a new approach, namely Betweenness and Common...

chapter

Semi-supervised DNN training in meeting recognition

Pengyuan Zhang, Yulan Liu, Thomas Hain

2014 IEEE Spoken Language Technology Workshop (SLT) > 141 - 146

2014 IEEE Spoken Language Technology Workshop (SLT)

Training acoustic models for ASR requires large amounts of labelled data which is costly to obtain. Hence it is desirable to make use of unlabelled data. While unsupervised training can give gains for standard HMM training, it is more difficult to make use of unlabelled data for discriminative models. This paper explores semi-supervised training of Deep Neural Networks (DNN) in a meeting recognition...

chapter

Enhanced Out of Vocabulary Word Detection Using Local Acoustic Information

Xuyang Wang, Ta Li, Pengyuan Zhang, Jielin Pan, more

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 594 - 597

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

The detection of Out-of-vocabulary (OOV) words is a crucial problem for spoken term detection (STD). In this paper, the use of integration with local acoustic information is investigated to retrieve more OOV words. Tokens with high local acoustic probabilities propagated in the search space at the decoding stage will be forced to propagate to the next frame. In this way, acoustic similar words can...

chapter

Using neural network front-ends on far field multiple microphones based speech recognition

Yulan Liu, Pengyuan Zhang, Thomas Hain

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5542 - 5546

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents an investigation of far field speech recognition using beamforming and channel concatenation in the context of Deep Neural Network (DNN) based feature extraction. While speech enhancement with beamforming is attractive, the algorithms are typically signal-based with no information about the special properties of speech. A simple alternative to beamforming is concatenating multiple...

chapter

Variable fuzzy set model for evaluation of intergrity of pile foundation based on SPA

Pengyuan Zhang, Xiangdong Shen, Sichen Jiang

2011 Second International Conference on Mechanic Automation and Control Engineering > 7081 - 7083

2011 Second International Conference on Mechanic Automation and Control Engineering (MACE)

Based on the set pair analysis (SPA) and the variable fuzzy sets theory, In order to simplify the evaluation procedure of relative difference degree and make the accurate evaluation of the the integrity of pile foundation quantitatively, a new variable fuzzy set model for evaluation of integrity of pile foundation was established. Moreover, it was shown by comparison of results between a practical...

chapter

Keyword Spotting Based on Syllable Confusion Network

Pengyuan Zhang, Jian Shao, Qingwei Zhao, Yonghong Yan

Third International Conference on Natural Computation (ICNC 2007) > 2 > 656 - 659

2007 3rd International Conference on Natural Computation

Keyword spotting becomes a very important branch of speech recognition. But the acoustic mismatch between training and testing environments often causes a severe degradation in the recognition performance. This paper presents an improved keyword spotting strategy. A fuzzy search algorithm is proposed to extract keyword hypotheses from a syllable confusion network (SCN). SCN is linear and naturally...

chapter

Real Context Model for Tone Recognition in Mandarin Conversational Telephone Speech

Qingwei Zhao, Jian Shao, Pengyuan Zhang, Qingwei Zhao, more

Third International Conference on Natural Computation (ICNC 2007) > 2 > 696 - 699

2007 3rd International Conference on Natural Computation

This paper presents an approach to tone recognition in mandarin conversational telephone speech (CTS) based on a real context model. The real context model is proposed as a new concept designed with special consideration on the fact that mandarin CTS is characterized by complicated tone behaviors due to physiological articulation. As pitch is a supra-segmental feature, current tone's pitch value is...

chapter

Keyword Spotting Based on Syllable Confusion Network

Pengyuan Zhang, Jian Shao, Qingwei Zhao, Yonghong Yan

Third International Conference on Natural Computation (ICNC 2007) Vol II > 2 > 656 - 659

Third International Conference on Natural Computation (ICNC 2007) Vol II

chapter

Real Context Model for Tone Recognition in Mandarin Conversational Telephone Speech

Zhaojie Liu, Jian Shao, Pengyuan Zhang, Qingwei Zhao, more

Third International Conference on Natural Computation (ICNC 2007) Vol II > 2 > 696 - 699

Third International Conference on Natural Computation (ICNC 2007) Vol II

chapter

Fast Vocabulary-Independent Audio Search Based on Syllable Confusion Network Indexing in Mandarin Spontaneous Speech

Jian Shao, Pengyuan Zhang, Zhaojie Liu, Qingwei Zhao, more

2007 Second International Conference on Digital Telecommunications (ICDT'7) > 8

Second International Conference on Digital Telecommunications, ICDT 2007

This paper presents a fast vocabulary-independent audio search method in Mandarin spontaneous speech which is based on syllable confusion network (SCN) indexing. Confusion network is linear and naturally suitable for indexing. The feasibility of using syllable confusion network as lattice representation is firstly investigated. Since direct syllabic decoding may not have a very high accuracy, long-...

chapter

A New Keyword Spotting Approach for Spontaneous Mandarin Speech

Pengyuan Zhang, Jiang Han, Jian Shao, Yonghong Yan

2006 8th international Conference on Signal Processing > 1

2006 8th International Conference on Signal Processing

For many practical applications of keyword spotting, input signal is a spontaneous conversation. Generally speaking, keyword spotting system will degrade significantly because of mismatch between acoustic model and speech. To solve this problem, this paper presents a two-pass based keyword spotting strategy. Different from one-pass based system, decoding process is done in the whole acoustic space,...

Filter options

Data set:
ieee

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (6)
DECODING (3)
NATURAL LANGUAGES (3)
ACOUSTICS (2)
DEEP NEURAL NETWORKS (2)
MANDARIN CONVERSATIONAL TELEPHONE SPEECH (2)
SEARCH PROBLEMS (2)
SPEECH (2)
1-BEST PHONEME SEQUENCE (1)
ACOUSTIC BEAMS (1)
ACOUSTIC MISMATCH (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTIC SPACE (1)
ANALYTICAL MODELS (1)
AUTOMATIC SPEECH RECOGNITION (1)
BEAMFORMING (1)
BETWEENESS (1)
CHINESE VOCABULARY SELECTION (1)
COMMON NEIGHBOURS (1)
COMPLEX NETWORK (1)
COMPLEX NETWORKS (1)
COMPLICATED TONE BEHAVIORS (1)
CONCRETE (1)
CONFIDENCE SELECTION (1)
DECODING PROCESS (1)
DIRECT SYLLABIC DECODING (1)
EQUAL ERROR RATE REDUCTION (1)
ESTIMATION (1)
FAST VOCABULARY-INDEPENDENT AUDIO SEARCH (1)
FEATURE EXTRACTION (1)
FREQUENCY MODULATION (1)
FUZZY SEARCH ALGORITHM (1)
FUZZY SET THEORY (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
GMM (1)
HIDDEN MARKOV MODELS (1)
INDEXING (1)
KEYWORD HYPOTHESES EXTRACTION (1)
KEYWORD SPOTTING (1)
KEYWORD SPOTTING APPROACH (1)
LANGUAGE MODEL (1)
LATTICE BASED CONFIDENCE MEASURE (1)
LATTICE REPRESENTATION (1)
LINK PREDICITION (1)
LINK PREDICTION (1)
MANDARIN SPONTANEOUS SPEECH (1)
MINIMUM CLASSIFICATION ERROR OPTIMIZED CONFIDENCE MEASURE (1)
MISSING LINKS (1)
MULTIPLE MICROPHONE (1)
NATURAL LANGUAGE PROCESSING (1)
NC INDEX (1)
ONE-PASS BASED SYSTEM (1)
OOV (1)
OOV RATE ESTIMATION FOR CHINESE VOCABULARY (1)
PATTERN MATCHING (1)
PHYSIOLOGICAL ARTICULATION (1)
PILE INTEGRITY (1)
PITCH CONTOUR SHAPES (1)
POST-PROCESSING METHOD (1)
PRESSES (1)
PSD CURVE (1)
RA INDEX (1)
REAL CONTEXT ANNOTATED TRAINING DATA (1)
REAL CONTEXT MODEL (1)
RELATIVE PITCH LEVEL (1)
ROCKS (1)
SEMI-SUPERVISED ACOUSTIC MODEL TRAINING (1)
SIGNAL CLASSIFICATION (1)
SIGNAL PROCESSING (1)
SPA (1)
SPEECH CODING (1)
SPOKEN TERM DETECTION (1)
SPONTANEOUS MANDARIN SPEECH (1)
STANDARDS (1)
SYLLABLE COLLOCATION (1)
SYLLABLE CONFUSION MATRIX (1)
SYLLABLE CONFUSION NETWORK (1)
SYLLABLE CONFUSION NETWORK INDEXING (1)
SYLLABLE GRAPH DENSITY (1)
TESTING (1)
TESTING ENVIRONMENTS (1)
TOKEN PASSING (1)
TONE RECOGNITION (1)
TRAINING DATA (1)
TRAINING ENVIRONMENTS (1)
VARIABLE FUZZY SET (1)
VOCABULARY (1)
more

INFONA - science communication portal

Search results for: Pengyuan Zhang

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options