Search results for: Yong Xu

Items from 1 to 6 out of 6 results

chapter

A joint detection-classification model for audio tagging of weakly labelled data

Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 641 - 645

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Audio tagging aims to assign one or several tags to an audio clip. Most of the datasets are weakly labelled, which means only the tags of the clip are known, without knowing the occurrence time of the tags. The labeling of an audio clip is often based on the audio events in the clip and no event level label is provided to the user. Previous works have used the bag of frames model assume the tags occur...

chapter

Deep neural network for robust speech recognition with auxiliary features from laser-Doppler vibrometer sensor

Zhipeng Xie, Jun Du, Ian McLoughlin, Yong Xu, more

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Recently, the signal captured from a laser Doppler vibrometer (LDV) sensor been used to improve the noise robustness automatic speech recognition (ASR) systems by enhancing the acoustic signal prior to feature extraction. This study proposes another approach in which auxiliary features extracted from the LDV signal are used alongside conventional acoustic features to further improve ASR performance...

chapter

Deep neural network based speech separation for robust speech recognition

Yanhui Tu, Jun Du, Yong Xu, Lirong Dai, more

2014 12th International Conference on Signal Processing (ICSP) > 532 - 536

2014 12th International Conference on Signal Processing (ICSP 2014)

In this paper, a novel deep neural network (DNN) architecture is proposed to generate the speech features of both the target speaker and interferer for speech separation without using any prior information about the interfering speaker. DNN is adopted here to directly model the highly nonlinear relationship between speech features of the mixed signals and the two competing speakers. Experimental results...

chapter

Speech separation of a target speaker based on deep neural networks

Jun Du, Yanhui Tu, Yong Xu, Lirong Dai, more

2014 12th International Conference on Signal Processing (ICSP) > 473 - 477

2014 12th International Conference on Signal Processing (ICSP 2014)

This paper proposes a novel data-driven approach based on deep neural networks (DNNs) for single-channel speech separation. DNN is adopted to directly model the highly non-linear relationship of speech features between a target speaker and the mixed signals. Both supervised and semi-supervised scenarios are investigated. In the supervised mode, both identities of the target speaker and the interfering...

chapter

Spoken term detection for OOV terms based on triphone confusion matrix

Yong Xu, Wu Guo, Shan Su, LiRong Dai

2012 8th International Symposium on Chinese Spoken Language Processing > 98 - 102

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

The search for out of vocabulary (OOV) query terms in spoken term detection (STD) task is addressed in this paper. The phone level fragment with word-position marker is naturally adopted as the speech recognition decoding unit. Then the triphone confusion matrix (TriCM) is used to expand the query space to compensate for speech recognition errors. And we also propose a new approach to construct triphone...

chapter

A hybrid fragment / syllable-based system for improved OOV term detection

Yong Xu, Wu Guo, LiRong Dai

2012 8th International Symposium on Chinese Spoken Language Processing > 378 - 382

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

Spoken term detection (STD) is a task for open vocabulary search in large recordings of speech. Although the term detection performance for in-vocabulary (INV) terms has achieved a great improvement, the detection performance for out of vocabulary (OOV) terms is still disappointing. In this paper, we propose to combine fragment-based with syllable-based search into a hybrid STD system for OOV terms...

Filter options

Content availability:
Available
Keywords:
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Keywords

TRAINING (4)
SPEECH PROCESSING (3)
SPEECH RECOGNITION (3)
DEEP NEURAL NETWORKS (2)
INDEXES (2)
NEURAL NETWORKS (2)
NIST (2)
OUT OF VOCABULARY (2)
SEMI-SUPERVISED MODE (2)
SINGLE-CHANNEL SPEECH SEPARATION (2)
SPOKEN TERM DETECTION (2)
VOCABULARY (2)
ACOUSTIC EVENT DETECTION (1)
ACOUSTICS (1)
AUDIO TAGGING (1)
AUTOMOBILES (1)
AUXILIARY FEATURES (1)
DATA MODELS (1)
DECODING (1)
DEEP NEURAL NETWORK (1)
DETECTORS (1)
EVENT DETECTION (1)
FEATURE EXTRACTION (1)
FRAGMENT (1)
FUSION METHOD (1)
JOINT DETECTION-CLASSIFICATION MODEL (1)
LASER DOPPLER VIBROMETER (1)
MATHEMATICAL MODEL (1)
POSITIONED FRAGMENT (1)
PREDICTIVE MODELS (1)
REGRESSION MODEL (1)
ROBUST SPEECH RECOGNITION (1)
ROBUSTNESS (1)
SIGNAL TO NOISE RATIO (1)
SUPERVISED MODE (1)
SYLLABLE (1)
TAGGING (1)
TRIPHONE CONFUSION MATRIX (1)
TRIPHONE INDEX (1)
WEAKLY LABELLED DATA (1)
more

INFONA - science communication portal

Search results for: Yong Xu

A joint detection-classification model for audio tagging of weakly labelled data

Deep neural network for robust speech recognition with auxiliary features from laser-Doppler vibrometer sensor

Deep neural network based speech separation for robust speech recognition

Speech separation of a target speaker based on deep neural networks

Spoken term detection for OOV terms based on triphone confusion matrix

A hybrid fragment / syllable-based system for improved OOV term detection

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options