Search results for: Qiang Zhang

Items from 1 to 4 out of 4 results

chapter

Deep neural networks based speaker modeling at different levels of phonetic granularity

Yao Tian, Liang He, Meng Cai, Wei-Qiang Zhang, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5440 - 5444

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently, a hybrid deep neural network/i-vector framework has been proved effective for speaker verification, where the DNN trained to predict tied-triphone states (senones) is used to produce frame alignments for sufficient statistics extraction. In this work, in order to better understand the impact of different phonetic precision to speaker verification tasks, three levels of phonetic granularity...

chapter

An LSTM-CTC based verification system for proxy-word based OOV keyword search

Zhiqiang Lv, Jian Kang, Wei-Qiang Zhang, Jia Liu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5655 - 5659

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Proxy-word based out of vocabulary (OOV) keyword search has been proven to be quite effective in keyword search. In proxy-word based OOV keyword search, each OOV keyword is assigned several proxies and detections of the proxies are regarded as detections of the OOV keywords. However, the confidence scores of these detections are still those of the proxies from lattices. To obtain a better confidence...

chapter

The NDSC transcription system for the 2016 multi-genre broadcast challenge

Xu-Kui Yang, Dan Qu, Wen-Lin Zhang, Wei-Qiang Zhang

2016 IEEE Spoken Language Technology Workshop (SLT) > 273 - 278

2016 IEEE Spoken Language Technology Workshop (SLT)

The National Digital Switching System Engineering and Technological R&D Center (NDSC) speech-to-text transcription system for the 2016 multi-genre broadcast challenge is described. Various acoustic models based on deep neural network (DNN), such as hybrid DNN, long short term memory recurrent neural network (LSTM RNN), and time delay neural network (TDNN), are trained. The system also makes use...

chapter

Lattice based transcription loss for end-to-end speech recognition

Jian Kang, Wei-Qiang Zhang, Jia Liu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

End-to-end speech recognition systems have been successfully implemented and have become competitive replacements for hybrid systems. A common loss function to train end-to-end systems is connectionist temporal classification (CTC). This method maximizes the log likelihood between the feature sequence and the associated transcription sequence. However there are some weaknesses with CTC training. The...

Filter options

Keywords:
ACOUSTICS
NEURAL NETWORKS

Publication date

Set your own date range

Keywords

TRAINING (3)
DEEP NEURAL NETWORKS (2)
HIDDEN MARKOV MODELS (2)
LATTICES (2)
SPEECH (2)
SPEECH RECOGNITION (2)
ANALYTICAL MODELS (1)
BROADCAST TRANSCRIPTION (1)
CONNECTIONIST TEMPORAL CLASSIFICATION (1)
CTC (1)
END-TO-END SYSTEM (1)
ERROR ANALYSIS (1)
FEATURE EXTRACTION (1)
FILTER BANKS (1)
KALDI (1)
KEYWORD SEARCH (1)
LATTICE (1)
LONG-TERM INFORMATION (1)
NIST (1)
OOV KEYWORD (1)
PHONETIC GRANULARITY (1)
PROXY KEYWORD (1)
SPEAKER VERIFICATION (1)
TELEPHONE SETS (1)
TRAINING DATA (1)
TRANSCRIPTION LOSS (1)
TUNING (1)
VERIFICATION (1)
more

INFONA - science communication portal

Search results for: Qiang Zhang

Deep neural networks based speaker modeling at different levels of phonetic granularity

An LSTM-CTC based verification system for proxy-word based OOV keyword search

The NDSC transcription system for the 2016 multi-genre broadcast challenge

Lattice based transcription loss for end-to-end speech recognition

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options