Search results

Items from 1 to 20 out of 36 results

chapter

Regularizing DNN acoustic models with Gaussian stochastic neurons

Hao Zhang, Yajie Miao, Florian Metze

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4964 - 4968

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Dropout and DropConnect can be viewed as regularization methods for deep neural network (DNN) training. In DNN acoustic modeling, the huge number of speech samples makes it expensive to sample the neuron mask (Dropout) or the weight mask (DropConnect) repetitively from a high dimensional distribution. In this paper we investigate the effect of Gaussian stochastic neurons on DNN acoustic modeling....

chapter

A quantitative comparison of blind C₅₀ estimators

P. Peso Parada, D. Sharma, J. Lainez, D. Barreda, more

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) > 298 - 302

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC)

The problem of blind estimation of the room acoustic clarity index C₅₀ from single-channel reverberant speech signals is presented in this paper. We analyze the performance of several machine learning methods for a regression task using 309 features derived from the speech signal and modeled with a Deep Belief Network (DBN), Classification And Regression Tree (CART) and Linear Regression (LR). These...

chapter

Generalization of supervised learning for binary mask estimation

Tobias May, Timo Gerkmann

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) > 154 - 158

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC)

This paper addresses the problem of speech segregation by estimating the ideal binary mask (IBM) from noisy speech. Two methods will be compared, one supervised learning approach that incorporates a priori knowledge about the feature distribution observed during training. The second method solely relies on a frame-based speech presence probability (SPP) es-timation, and therefore, does not depend...

chapter

An approach of agricultural price information collection based on speech recognition

Jinpu Xu, Yeping Zhu, Hailong Liu, Jinpu Xu, more

2014 10th International Conference on Natural Computation (ICNC) > 893 - 898

2014 10th International Conference on Natural Computation (ICNC)

Speech recognition technology was applied to information collection of agricultural prices, with the acoustic models trained for agricultural prices information collection environment so as to minimize the environmental influence. Firstly, we constructed the speech corpus by collecting speech under the operating scene, and then selected tri-phone modeling as the decode unit to train hidden Markov...

chapter

Weakly supervised click models for odontocete species classification

Nicole Nichols, Mari Ostendorf

OCEANS 2014 - TAIPEI > 1 - 4

OCEANS 2014 - TAIPEI

This paper addresses the problem of automatic learning of statistical models of clicks for odontocete species classifications, particularly focusing on improving accuracy of the classifier by iteratively identifying click-like sounds that are likely to be noise and removing these from the model training set. The algorithm is weakly supervised in that no hand-labeled click regions are available, but...

chapter

Intelligent ultrasound processing applied to insulation pollution estimation

T. V. Ferreira, P. B. Vilar, J. F. Araujo, M. A. O. Rodrigues, more

2012 Annual Report Conference on Electrical Insulation and Dielectric Phenomena > 924 - 927

2012 IEEE Conference on Electrical Insulation and Dielectric Phenomena - (CEIDP 2012)

This paper presents field results for a pollution estimation system based on ultrasound noise and Statistical AutoAssociative Artificial Neural Networks (SA³N²). The system extracts spectral information from the ultrasonic noise emitted by the corona discharges that occur nearby electric insulation, then correlates this information to a previously known pollution intensity situation. The entire acquisition...

chapter

Lasso environment model combination for robust speech recognition

Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4305 - 4308

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose a novel acoustic model adaptation method for noise robust speech recognition. Model combination is a common way to adapt acoustic models to a target test environment. For example, the mean supervectors of the adapted model are obtained as a linear combination of mean supervectors of many pre-trained environment-dependent acoustic models. Usually, the combination weights are...

chapter

Audio event detection from acoustic unit occurrence patterns

Anurag Kumar, Pranay Dighe, Rita Singh, Sourish Chaudhuri, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 489 - 492

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In most real-world audio recordings, we encounter several types of audio events. In this paper, we develop a technique for detecting signature audio events, that is based on identifying patterns of occurrences of automatically learned atomic units of sound, which we call Acoustic Unit Descriptors or AUDs. Experiments show that the methodology works as well for detection of individual events and their...

chapter

Improved audio event detection by use of contextual noise

Qiang Huang, Stephen Cox

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 493 - 496

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper presents new approaches to improve the detection of two key audio events in a sport game (tennis) using contextual information. When analysing a tennis match using only audio information, the sound of the ball being struck and the occurrence of a line judge's shout can be obscured by players' grunts or shouts. Furthermore, if models of these two important events are trained from labelled...

chapter

Cough detection algorithm for monitoring patient recovery from pulmonary tuberculosis

Brian H. Tracey, German Comina, Sandra Larson, Marjory Bravard, more

2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 6017 - 6020

2011 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society

In regions of the world where tuberculosis (TB) poses the greatest disease burden, the lack of access to skilled laboratories is a significant problem. A lab-free method for assessing patient recovery during treatment would be of great benefit, particularly for identifying patients who may have drug-resistant tuberculosis. We hypothesize that cough analysis may provide such a test. In this paper we...

chapter

Detection of post apnea sounds and apnea periods from sleep sounds

Ersin Karci, Yesim Serinagaoglu Dogrusoz, Tolga Ciloglu

2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 6075 - 6078

2011 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society

Obstructive Sleep Apnea Syndrome (OSAS) is defined as a sleep related breathing disorder that causes the body to stop breathing for about 10 seconds and mostly ends with a loud sound due to the opening of the airway. OSAS is traditionally diagnosed using polysomnography, which requires a whole night stay at the sleep laboratory of a hospital, with multiple electrodes attached to the patient's body...

chapter

Discrimination between healthy subjects and patients with pulmonary emphysema by detection of abnormal respiration

Masaru Yamashita, Shoichi Matsunaga, Sueharu Miyahara

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 693 - 696

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a robust classification strategy for distinguishing between a healthy subject and a patient with pulmonary emphysema on the basis of lung sounds. A symptom of pulmonary emphysema is that almost all lung sounds include some abnormal (i.e., adventitious) sounds. However, the great variety of possible adventitious sounds and noises at auscultation makes high-accuracy detection...

chapter

Robust multi-sensor classification via joint sparse representation

Nam H. Nguyen, Nasser M. Nasrabadi, Trac D. Tran

14th International Conference on Information Fusion > 1 - 8

2011 International Conference on Information Fusion (FUSION)

In this paper, we propose a novel multi-task multi-variate (MTMV) sparse representation method for multi-sensor classification, which takes into account correlations between sensors simultaneously while considering joint sparsity within each sensor's observations. This approach can be seen as the generalized model of multi-task and multivariate Lasso, where all the multi-sensor data are jointly represented...

chapter

Robust hands-free Automatic Speech Recognition for human-machine interaction

R Gomez, T Kawahara, K Nakadai

2010 10th IEEE-RAS International Conference on Humanoid Robots > 138 - 143

2010 10th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2010)

In enclosed environments where robots are deployed, the observed speech signal is smeared due to reverberation. This degrades the performance of the automatic speech recognition (ASR). Thus, hands-free speech recognition for human-machine communication is a difficult task. Most speech enhancement techniques used to address this problem enhance the contaminated waveform independent from that of the...

chapter

North Atlantic Right Whale acoustic signal processing: Part I. comparison of machine learning recognition algorithms

Peter J Dugan, Aaron N Rice, Ildar R Urazghildiiev, Christopher W Clark

2010 IEEE Long Island Systems, Applications and Technology Conference > 1 - 6

2010 IEEE Long Island Systems, Applications and Technology Conference (LISAT 2010)

This paper compares three different approaches currently used in recognizing contact calls made from the North Atlantic Right Whale (NRW), Eubalaena glacialis. We present two new approaches consisting of machine learning algorithms based on artificial neural networks (NET) and the classification and regression tree classifiers (CART), and compare their performance with earlier work that employs multi-Stage...

chapter

Support vector machines for noise robust ASR

M.J.F. Gales, A. Ragni, H. AlDamarki, C. Gautier

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 205 - 210

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

Using discriminative classifiers, such as Support Vector Machines (SVMs) in combination with, or as an alternative to, Hidden Markov Models (HMMs) has a number of advantages for difficult speech recognition tasks. For example, the models can make use of additional dependencies in the observation sequences than HMMs provided the appropriate form of kernel is used. However standard SVMs are binary classifiers,...

chapter

Prediction of muffler flow regeneration noise with neural network

Haijun Zhao, Zhaoxiang Deng, Shiju Zhao, Jie Yang

2009 9th International Conference on Electronic Measurement&Instruments > 3-40 - 3-43

2009 9th International Conference on Electronic Measurement & Instruments (ICEMI 2009)

Flow regeneration noise is a main reason effect on attenuation performance of mufflers, at present no sophisticated software or tool is found to predict effectively flow regeneration noises from mufflers. Prediction of flow regeneration noise from a muffler element of simple expansion chamber is realized using Bp neural network, and comparison of prediction with experiment is carried out. Results...

chapter

Stereo-based stochastic mapping with discriminative training for noise robust speech recognition

Xiaodong Cui, M. Afify, Yuqing Gao

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 3933 - 3936

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper presents an enhanced stochastic mapping technique in the discriminative feature (fMPE) space that exploits stereo data for noise robust LVCSR. Both MMSE and MAP estimates of the mapping are given and the performance of the two is investigated. Due to the iterative nature of the MAP estimate, we show that combining MMSE and MAP estimates is possible and yields superior performance than each...

chapter

Syllable-based automatic Arabic speech recognition in different conditions of noise

M.M. Azmi, H. Tolba

2008 9th International Conference on Signal Processing > 601 - 604

2008 9th International Conference on Signal Processing (ICSP 2008)

The presence of noise degrades the recognition percent of automatic speech recognition systems. The improvement of noise can be achieved by changing acoustic units during the recognition process. In this paper, we concentrate on automatic Arabic speech recognition in different conditions of noise using different acoustic units. Automatic Arabic speech was described by showing their constructing monophones,...

chapter

A Novel Approach Research on Low Altitude Passive Acoustic Target Recognition Based on ICA and HMM

Hui Liu, Jun-an Yang, Hao Chen

2008 Fourth International Conference on Natural Computation > 5 > 371 - 375

2008 Fourth International Conference on Natural Computation (ICNC)

An approach is proposed to classifying simultaneous multiple low altitude targets in battlefield. Based on Independent Component Analysis (ICA), the mixed signal is separated into several single and pure signals, and the noise is removed from the acoustic signal. mel-frequency cepstrum coefficients (MFCC) which responses the characteristic of the sound more aggressively is extracted as characteristic...

Keywords:
TRAINING
ACOUSTICS
NOISE

Publication date

Set your own date range

Keywords

SPEECH (18)
FEATURE EXTRACTION (14)
HIDDEN MARKOV MODELS (13)
SIGNAL PROCESSING (13)
ARTIFICIAL NEURAL NETWORKS (11)
ROBUSTNESS (10)
SIGNAL TO NOISE RATIO (10)
ALGORITHM DESIGN AND ANALYSIS (8)
DATABASES (8)
SPEECH RECOGNITION (8)
COMPUTERS (7)
FREQUENCY DOMAIN ANALYSIS (7)
SIGNAL PROCESSING ALGORITHMS (7)
CORRELATION (6)
DATA MINING (6)
ESTIMATION (6)
MEL FREQUENCY CEPSTRAL COEFFICIENT (6)
SPEECH PROCESSING (6)
CEPSTRAL ANALYSIS (5)
CLASSIFICATION ALGORITHMS (5)
COMPUTATIONAL MODELING (5)
CONFERENCES (5)
EDUCATIONAL INSTITUTIONS (5)
ELECTRONIC MAIL (5)
EQUATIONS (5)
FILTERING (5)
MATHEMATICAL MODEL (5)
TESTING (5)
ACCURACY (4)
ACOUSTIC SIGNAL PROCESSING (4)
ADAPTATION MODEL (4)
ATMOSPHERIC MODELING (4)
BAND PASS FILTERS (4)
CONVERGENCE (4)
DECISION MAKING (4)
DISCRETE FOURIER TRANSFORMS (4)
HEURISTIC ALGORITHMS (4)
MONITORING (4)
NOISE MEASUREMENT (4)
SENSORS (4)
SIMULATION (4)
SPEAKER RECOGNITION (4)
TIME FREQUENCY ANALYSIS (4)
TOPOLOGY (4)
TRAINING DATA (4)
TRANSFORMS (4)
VECTORS (4)
WAVELET TRANSFORMS (4)
WHITE NOISE (4)
ARRAY SIGNAL PROCESSING (3)
ARTIFICIAL INTELLIGENCE (3)
BACKGROUND NOISE (3)
BANDWIDTH (3)
CLUSTERING ALGORITHMS (3)
COGNITION (3)
COMPLEXITY THEORY (3)
COMPUTER ARCHITECTURE (3)
CONVOLUTION (3)
DATA MODELS (3)
DISTORTION (3)
EVENT DETECTION (3)
FAST FOURIER TRANSFORMS (3)
FAULT DIAGNOSIS (3)
FILTER BANK (3)
FILTERING THEORY (3)
FOURIER TRANSFORMS (3)
GAIN (3)
GAUSSIAN NOISE (3)
GENERATORS (3)
HELIUM (3)
INDEXES (3)
INFORMATION TECHNOLOGY (3)
MAXIMUM LIKELIHOOD ESTIMATION (3)
NOISE ROBUSTNESS (3)
OPTIMIZATION (3)
PATTERN RECOGNITION (3)
PRESSES (3)
PRODUCTION (3)
RADIAL BASIS FUNCTION NETWORKS (3)
SPECTRAL ANALYSIS (3)
STOCHASTIC PROCESSES (3)
SUPPORT VECTOR MACHINE CLASSIFICATION (3)
SUPPORT VECTOR MACHINES (3)
TIME DOMAIN ANALYSIS (3)
UNDERWATER ACOUSTICS (3)
UNDERWATER VEHICLES (3)
ACOUSTIC BEAMS (2)
ACOUSTIC MEASUREMENTS (2)
ACOUSTIC MODEL (2)
ADAPTIVE SYSTEMS (2)
ADDITIVE NOISE (2)
ANALYTICAL MODELS (2)
ATTENUATION (2)
AUDITORY SYSTEM (2)
AUTOMATIC SPEECH RECOGNITION (2)
BIOLOGICAL SYSTEM MODELING (2)
CLASSIFICATION (2)
more

INFONA - science communication portal

Search results

Regularizing DNN acoustic models with Gaussian stochastic neurons

A quantitative comparison of blind C₅₀ estimators

Generalization of supervised learning for binary mask estimation

An approach of agricultural price information collection based on speech recognition

Weakly supervised click models for odontocete species classification

Intelligent ultrasound processing applied to insulation pollution estimation

Lasso environment model combination for robust speech recognition

Audio event detection from acoustic unit occurrence patterns

Improved audio event detection by use of contextual noise

Cough detection algorithm for monitoring patient recovery from pulmonary tuberculosis

Detection of post apnea sounds and apnea periods from sleep sounds

Discrimination between healthy subjects and patients with pulmonary emphysema by detection of abnormal respiration

Robust multi-sensor classification via joint sparse representation

Robust hands-free Automatic Speech Recognition for human-machine interaction

North Atlantic Right Whale acoustic signal processing: Part I. comparison of machine learning recognition algorithms

Support vector machines for noise robust ASR

Prediction of muffler flow regeneration noise with neural network

Stereo-based stochastic mapping with discriminative training for noise robust speech recognition

Syllable-based automatic Arabic speech recognition in different conditions of noise

A Novel Approach Research on Low Altitude Passive Acoustic Target Recognition Based on ICA and HMM

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options