Search results for: Wei Li

Items from 1 to 4 out of 4 results

chapter

A DNN parameter mask for the binaural reverberant speech segregation

Yi Jiang, Wei Li, Yuanyuan Zu, Runsheng Liu, more

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) > 959 - 963

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

The reverberant speech segregation is a basic problem in speech enhancement and automatic speech recognition. Based on the deep neural networks (DNN), a novel binaural speech segregation method is proposed. The binaural feature is extracted and used as the cue to train a DNN with a ideal parameter mask. The trained DNN is used to distinguish the target speech and noise, and output the estimated parameter...

chapter

A novel i-vector framework using multiple features and PCA for speaker recognition in short speech condition

Chi Zhang, Xiaoqiang Li, Wei Li, Peizhong Lu, more

2016 International Conference on Audio, Language and Image Processing (ICALIP) > 499 - 503

2016 International Conference on Audio, Language and Image Processing (ICALIP)

Speaker recognition in short speech condition is a difficult topic because the length of training and test speech is very short. One of the main disadvantage of the existing methods for speaker recognition is that they need very sufficient data and it's usually impossible in reality applications. In our experiments, the conventional methods with single feature don't make good performance in short...

chapter

Two-level approach for detecting non-lexical audio events in spontaneous speech

Yan-Xiong Li, Qian-Hua He, Wei Li, Zhi-Feng Wang

2010 International Conference on Audio, Language and Image Processing > 771 - 777

2010 International Conference on Audio, Language and Image Processing (ICALIP)

Based on analyses of characteristic differences between various audio events, a two-level approach is proposed for detecting three non-lexical audio events (filled pause, laugh, and applause) in spontaneous odel-based decision. The experiments give average precision of 87.3%, recall of 93.77%, and F-measure of 90.42%. Compared with the sliding window based approach, average F-measure is improved by...

chapter

Voice-Based Recognition System for Non-Semantics Information by Language and Gender

Wei Li, Dong-Ju Kim, Chul-Hwan Kim, Kwang-Seok Hong

2010 Third International Symposium on Electronic Commerce and Security > 84 - 88

2010 Third International Symposium on Electronic Commerce and Security (ISECS 2010)

The human voice not only provides information about the semantics of spoken words, but also contains voice information based on its characteristics. This paper designed feasible identification system for non-semantics voice information by language and gender, which are the two most important in voice signals. The proposed system is speaker-independent and text-independent: it fuses the language and...

Filter options

Keywords:
FEATURE EXTRACTION
SPEECH

Publication date

Set your own date range

Keywords

TRAINING (3)
HIDDEN MARKOV MODELS (2)
ACOUSTIC FEATURE (1)
ACOUSTIC FEATURES (1)
ACOUSTICS (1)
ARGON (1)
AUDIO SIGNAL PROCESSING (1)
CHARACTERISTIC DIFFERENCES (1)
COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA) (1)
COMPUTATIONAL MODELING (1)
COVARIANCE MATRICES (1)
DEEP NEURAL NETWORKS (DNNS) (1)
EAR (1)
F-MEASURE (1)
GENDER DETECTION (1)
GENDER RECOGNITION MODEL (1)
HUMAN COMPUTER INTERACTION (1)
HUMAN COMPUTER INTERFACE (1)
HUMAN VOICE (1)
I-VECTOR (1)
IDENTIFICATION SYSTEM (1)
INTERFERENCE (1)
LANGUAGE IDENTIFICATION (1)
LANGUAGE RECOGNITION MODEL (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MODEL-TRAINING METHOD (1)
NON-LEXICAL AUDIO EVENTS DETECTION (1)
NONSEMANTICS INFORMATION (1)
PARAMETER MASKS (1)
PCA (1)
PRINCIPAL COMPONENT ANALYSIS (1)
REVERBERANT SPEECH SEGREGATION (1)
SEMANTICS (1)
SHORT SPEECH CONDITION (1)
SIGNAL TO NOISE RATIO (1)
SILICON (1)
SPEAKER RECOGNITION (1)
SPEECH PROCESSING (1)
SPEECH RECOGNITION (1)
SPONTANEOUS SPEECH (1)
TIME-FREQUENCY ANALYSIS (1)
VOICE INFORMATION (1)
VOICE SIGNAL (1)
VOICE-BASED RECOGNITION SYSTEM (1)
more

INFONA - science communication portal

Search results for: Wei Li

A DNN parameter mask for the binaural reverberant speech segregation

A novel i-vector framework using multiple features and PCA for speaker recognition in short speech condition

Two-level approach for detecting non-lexical audio events in spontaneous speech

Voice-Based Recognition System for Non-Semantics Information by Language and Gender

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options