Search results for: Jia Liu

Items from 1 to 14 out of 14 results

chapter

Deep neural networks based speaker modeling at different levels of phonetic granularity

Yao Tian, Liang He, Meng Cai, Wei-Qiang Zhang, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5440 - 5444

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently, a hybrid deep neural network/i-vector framework has proven effective for speaker verification, where the DNN trained to predict tied-triphone states (senones) is used to produce frame alignments for sufficient statistics extraction. In this work, to better understand the impact of different levels of phonetic precision on speaker verification tasks, three levels of phonetic granularity...

chapter

An LSTM-CTC based verification system for proxy-word based OOV keyword search

Zhiqiang Lv, Jian Kang, Wei-Qiang Zhang, Jia Liu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5655 - 5659

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Proxy-word based out of vocabulary (OOV) keyword search has been proven to be quite effective in keyword search. In proxy-word based OOV keyword search, each OOV keyword is assigned several proxies and detections of the proxies are regarded as detections of the OOV keywords. However, the confidence scores of these detections are still those of the proxies from lattices. To obtain a better confidence...

chapter

Lattice based transcription loss for end-to-end speech recognition

Jian Kang, Wei-Qiang Zhang, Jia Liu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

End-to-end speech recognition systems have been successfully implemented and have become competitive replacements for hybrid systems. A common loss function to train end-to-end systems is connectionist temporal classification (CTC). This method maximizes the log likelihood between the feature sequence and the associated transcription sequence. However, there are some weaknesses with CTC training. The...

chapter

Gated recurrent units based hybrid acoustic models for robust speech recognition

Jian Kang, Wei-Qiang Zhang, Jia Liu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. However, the problem of exploding or vanishing gradients has limited their application. In recent years, long short-term memory RNNs (LSTM RNNs) have been proposed to solve this problem, and have achieved excellent results. However, because of the large size of LSTM RNNs, they suffer from overfitting more easily,...

chapter

Analysis of Ultrasonic Characteristics of Hashimoto's Thyroiditis Benign Nodules and Its Relationship with Serum TSH

Shuai Xue, Peisong Wang, Zhe Han, Chen Guang, et al.

2015 7th International Conference on Information Technology in Medicine and Education (ITME) > 46 - 52

2015 7th International Conference on Information Technology in Medicine and Education (ITME)

Objective: To discuss the relationship between ultrasonic characteristics of Hashimoto's thyroiditis benign nodules (HTBN) and serum TSH. Methods: We summarized 117 cases diagnosed as HTBN by thyroid fine needle aspiration, according to the inclusion criteria, from January 2012 to December 2013 in our department. 32 cases were misdiagnosed by ultrasound as malignant nodules. Using a random number...

chapter

The THUEE system for the openKWS14 keyword search evaluation

Meng Cai, Zhiqiang Lv, Beili Song, Yongzhe Shi, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4734 - 4738

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The OpenKWS14 keyword search evaluation is one of the most challenging and influential evaluations in the field of speech recognition. Its goal is to build a high-performance keyword search system for a minority language with limited training data in a short period of time. We present the system of the Department of Electronic Engineering, Tsinghua University (THUEE team) for the OpenKWS14 keyword...

chapter

Improve low-resource non-native mispronunciation detection with native speech by articulatory-based tandem feature

Hua Yuan, Ji Xu, Junhong Zhao, Jia Liu

2013 IEEE China Summit and International Conference on Signal and Information Processing > 127 - 131

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

In this paper, we propose a method to improve detection of the mispronunciation types of non-native learners. To cope with the low-resource condition of non-native speech and the difference between native and non-native speech, the following efforts are made: 1) train the acoustic model with the low-resource non-native data; 2) introduce the articulatory-based tandem feature; 3) pool auxiliary native...

chapter

Automatic pitch accent detection using auto-context with acoustic features

Junhong Zhao, Wei-Qiang Zhang, Hua Yuan, Jia Liu, et al.

2012 8th International Symposium on Chinese Spoken Language Processing > 247 - 251

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

In the field of prosody event detection, many local acoustic features have been proposed to represent the prosodic characteristics of speech units. The context information that represents possible regularities underlying neighboring prosody events, however, has not been used effectively. The main difficulty in utilizing prosodic context is that it is hard to capture the long-distance sequential dependency...

chapter

Improve mispronunciation detection with Tandem feature

Hua Yuan, Junhong Zhao, Jia Liu

2012 8th International Symposium on Chinese Spoken Language Processing > 184 - 187

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This paper presents a method to improve mispronunciation detection performance for low-resource acoustic models. One hour of speech data is randomly selected from CU-CHLOE to imitate the low-resource non-native English situation. The Tandem feature derived from an articulatory-based Multi-Layer Perceptron (MLP) is employed to replace the traditional spectral feature (e.g. PLP). Further, motivated by similar...

chapter

Study on the Treatment of PTA Productive Wastewater Using Ultrasound Enhanced Ozonation

Fei Rong, Jia Liu, Yejing Qiu, Wei Wu

2011 Third International Conference on Measuring Technology and Mechatronics Automation > 3 > 586 - 588

2011 International Conference on Measuring Technology and Mechatronics Automation (ICMTMA)

This paper reports a scale treatment on purified terephthalic acid (PTA) productive wastewater in a 50L column form reactor by ultrasound enhanced ozonation. The degradation effects of three kinds of productive wastewater including inlet and outlet water from treatment works as well as accident wastewater are investigated. The results show that the ultrasound enhanced ozonation is an efficient way...

chapter

Phone modeling and combining discriminative training for Mandarin-English bilingual speech recognition

Yanmin Qian, Jia Liu

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4918 - 4921

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Automatic multilingual speech recognition is always a difficult task. This paper presents recent work on the development of a Mandarin-English bilingual speech recognition system. A unified single set of bilingual acoustic models based on a novel State-Time-Alignment (STA) method is proposed to balance the performance and the complexity of the bilingual speech recognition system, and a comparison...

chapter

A Combined De-correlation Method for Acoustic Feedback Cancellation in Hearing Aids

Hong Cao, Jia Liu, Weiwei Zhang

2009 WRI World Congress on Computer Science and Information Engineering > 7 > 220 - 224

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

Acoustic feedback is a common problem in most hearing aids. It reduces the maximum usable gain and even causes "howling" when large forward gain is required, which is quite annoying for patients with severe hearing loss. This paper gives a new combination of a de-correlation LMS adaptive algorithm and a fixed forward delay to better de-correlate the input and output signals of hearing aids as...

chapter

Acoustic modeling based on Chinese phonetics knowledge

Yao Li, Jia Liu

2008 International Conference on Audio, Language and Image Processing > 1189 - 1193

2008 International Conference on Audio, Language and Image Processing

This paper presents research on using Chinese phonetics knowledge in acoustic modeling based on extended initial/final (XIF). A context-dependent (CD) model is required to improve the performance of the acoustic model, and decision tree-based state tying technology is used to address the huge number of the model's parameters. Chinese phonetics knowledge plays an important...

chapter

Automatic language identification using support vector machines and phonetic N-gram

Yan Deng, Jia Liu

2008 International Conference on Audio, Language and Image Processing > 71 - 74

2008 International Conference on Audio, Language and Image Processing

In this paper, we describe two approaches for language identification (LID) using support vector machines (SVM) and phonetic n-gram. One is to use the language model scores of phone sequences to do SVM training. The other is to use the n-gram probabilities of those phones to train SVM models. For the second approach, we propose a new effective normalization method. In the experiments of 30 s test...

Filter options

Keywords:
ACOUSTICS

INFONA - science communication portal
