Search results for: Qiang Zhang

Items from 1 to 5 out of 5 results

chapter

An LSTM-CTC based verification system for proxy-word based OOV keyword search

Zhiqiang Lv, Jian Kang, Wei-Qiang Zhang, Jia Liu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5655 - 5659

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Proxy-word based out of vocabulary (OOV) keyword search has been proven to be quite effective in keyword search. In proxy-word based OOV keyword search, each OOV keyword is assigned several proxies and detections of the proxies are regarded as detections of the OOV keywords. However, the confidence scores of these detections are still those of the proxies from lattices. To obtain a better confidence...

chapter

A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction

Cong Guo, Like Hui, Wei-Qiang Zhang, Jia Liu

2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 6 - 10

2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to reconstruct the denoise signal is to use the estimated masks with direct overlap-add method ignoring...

chapter

The NDSC transcription system for the 2016 multi-genre broadcast challenge

Xu-Kui Yang, Dan Qu, Wen-Lin Zhang, Wei-Qiang Zhang

2016 IEEE Spoken Language Technology Workshop (SLT) > 273 - 278

2016 IEEE Spoken Language Technology Workshop (SLT)

The National Digital Switching System Engineering and Technological R&D Center (NDSC) speech-to-text transcription system for the 2016 multi-genre broadcast challenge is described. Various acoustic models based on deep neural network (DNN), such as hybrid DNN, long short term memory recurrent neural network (LSTM RNN), and time delay neural network (TDNN), are trained. The system also makes use...

chapter

Convolutional maxout neural networks for speech separation

Like Hui, Meng Cai, Cong Guo, Liang He, more

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 24 - 27

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

Speech separation based on deep neural networks (DNNs) has been widely studied recently, and has achieved considerable success. However, previous studies are mostly based on fully-connected neural networks. In order to capture the local information of speech signals, we propose to use convolutional maxout neural networks (CMNNs) to separate speech and noise by estimating the ideal ratio mask of the...

chapter

Improving deep neural network acoustic models using unlabeled data

Meng Cai, Wei-Qiang Zhang, Jia Liu

2013 IEEE China Summit and International Conference on Signal and Information Processing > 137 - 141

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

The Context-Dependent Deep-Neural-Network HMM, or CD-DNN-HMM, is a powerful acoustic modeling technique. Its training process typically involves unsupervised pre-training and supervised fine-tuning. In the paper, we demonstrate that the performance of DNNs can be improved by utilizing a large amount of unlabeled data in the training procedure. In our method, CD-DNN-HMM trained using 309 hours of unlabeled...

Filter options

Keywords:
NEURAL NETWORKS
SPEECH

Publication date

Set your own date range

Keywords

ACOUSTICS (2)
HIDDEN MARKOV MODELS (2)
SIGNAL TO NOISE RATIO (2)
SPEECH RECOGNITION (2)
ACOUSTIC MODELING (1)
BROADCAST TRANSCRIPTION (1)
COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA) (1)
CONVOLUTION (1)
CTC (1)
DATA MODELS (1)
DEEP NEURAL NETWORK (1)
DEEP NEURAL NETWORK (DNN) (1)
DEEP NEURAL NETWORKS (1)
FEATURE EXTRACTION (1)
FILTER BANKS (1)
FREQUENCY-DOMAIN ANALYSIS (1)
KALDI (1)
KEYWORD SEARCH (1)
LATTICES (1)
LONG-TERM INFORMATION (1)
OOV KEYWORD (1)
PROXY KEYWORD (1)
SIGNAL PROCESSING ALGORITHMS (1)
SPEECH ENHANCEMENT (1)
SWITCHES (1)
TIME-FREQUENCY ANALYSIS (1)
TUNING (1)
UNLABELED DATA (1)
VERIFICATION (1)
more

INFONA - science communication portal

Search results for: Qiang Zhang

An LSTM-CTC based verification system for proxy-word based OOV keyword search

A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction

The NDSC transcription system for the 2016 multi-genre broadcast challenge

Convolutional maxout neural networks for speech separation

Improving deep neural network acoustic models using unlabeled data

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options