Lei Xie

article

Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, more

IEEE Journal of Selected Topics in Signal Processing > 2017 > 11 > 8 > 1329 - 1339

We propose a novel technique that learns a low-dimensional feature representation from unlabeled data of a target language, and labeled data from a nontarget language. The technique is studied as a solution to query-by-example spoken term detection (QbE-STD) for a low-resource language. We extract low-dimensional features from a bottle-neck layer of a multitask deep neural network, which is jointly...

chapter

Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5645 - 5649

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose to use a feature representation obtained by pairwise learning in a low-resource language for query-by-example spoken term detection (QbE-STD). We assume that word pairs identified by humans are available in the low-resource target language. The word pairs are parameterized by a multi-lingual bottleneck feature (BNF) extractor that is trained using transcribed data in high-resource languages...

chapter

Approximate search of audio queries by using DTW with phone time boundary and data augmentation

Haikua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, more

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6030 - 6034

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Dynamic Time Warping (DTW) is widely used in language independent query-by-example (QbE) spoken term detection (STD) tasks due to its high performance. However, there are two limitations of DTW based template matching, 1) it is not straightforward to perform approximate match of audio queries; 2) DTW is sensitive to the mismatch of signal conditions between the query and the speech search data. To...

chapter

Language independent query-by-example spoken term detection using N-best phone sequences and partial matching

Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5191 - 5195

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a partial sequence matching based symbolic search (SS) method for the task of language independent query-by-example spoken term detection. One main drawback of conventional SS approach is the high miss rate for long queries. This is due to high variations in symbol representation of query and search audios, especially in language independent scenario. The successful matching...

chapter

A tighter lower bound estimate for dynamic time warping

Peng Yang, Lei Xie, Qiao Luan, Wei Feng

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 8525 - 8529

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a new lower-bound estimate for speeding up dynamic time warping (DTW) on multivariate time sequences. It has several advantages as compared with the inner-product lower bound [1] recently proposed to eliminate a large number of DTW computations. First, we prove that it is tighter than the inner product lower bound while the computational complexity remains comparable. Second,...

INFONA - science communication portal

Search results for: Lei Xie

Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection

Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection

Approximate search of audio queries by using DTW with phone time boundary and data augmentation

Language independent query-by-example spoken term detection using N-best phone sequences and partial matching

A tighter lower bound estimate for dynamic time warping

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results for: Lei Xie

Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection

Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection

Approximate search of audio queries by using DTW with phone time boundary and data augmentation

Language independent query-by-example spoken term detection using N-best phone sequences and partial matching

A tighter lower bound estimate for dynamic time warping

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options