Search results

Items from 1 to 20 out of 35 results

chapter

Estimation of speech quality and intelligibility for some models of additive noise

Arkadiy Prodeus

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON) > 645 - 649

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON)

In this paper, the results of quality and intelligibility assessment of speech masked by stationary and nonstationary noise have been proposed. Subjective speech quality assessment technique has been used to show that white noise masking ability is lower than one for pink and even for brown noise when SNR is less than 0 dB. Two algorithms of nonstationary noise forming have been proposed. They are...

chapter

Multi-source TDOA estimation using SNR-based angular spectra

Charles Blandin, Emmanuel Vincent, Alexey Ozerov

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2616 - 2619

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper deals with the localization of multiple sources from two-channel mixtures recorded in a reverberant environment. We introduce new angular spectrum-based methods relying on the signal-to-noise ratio (SNR) to estimate the time difference of arrival (TDOA) of each source. We propose and compare five ways of estimating the SNR in each time-frequency point and in each direction, using beamforming...

chapter

Improved codebook constrained Wiener filter speech enhancement

S Chehresa, M H Savoji

2010 5th International Symposium on Telecommunications > 614 - 618

2010 5th International Symposium on Telecommunications (IST)

In this paper an improved method of speech enhancement using Power Spectral Density (PSD) codebooks of clean speech and several types of noise, is proposed. The proposed algorithm estimates the PSDs of speech and noise of unknown nature and evaluates the input Signal-to-Noise Ratio (SNR) by solving an over-determined set of equations as in the previous version. However, the search method used for...

chapter

Lombard speech model for automatic enhancement of speech intelligibility over telephone channel

D Huang, E P Ong

2010 International Conference on Audio, Language and Image Processing > 429 - 434

2010 International Conference on Audio, Language and Image Processing (ICALIP)

This paper aims at evaluating the performance of a “Lombard effect model” for improving speech intelligibility over telephone channel. It is well known that the naturalness and intelligibility of speech degrades rapidly in communication channels, such as phone networks or public address systems. To reduce the degradation, a ”Lombard effect mimicking” system has been proposed to modify the variations...

chapter

Perturbation analysis of mel-frequency cepstrum coefficients

Wei-Qiang Zhang, Dengzhou Yang, Jia Liu, Xiuguo Bao

2010 International Conference on Audio, Language and Image Processing > 715 - 718

2010 International Conference on Audio, Language and Image Processing (ICALIP)

Mel-frequency cepstrum coefficient (MFCC) is a widely used feature vector in speech signal precessing. Its feature extraction procedure can be seen as a mapping function which transfers the input speech signals to output MFCC feature vectors. However, this function is too complex to analyze and even a simple approximation is not easy to obtain. This paper studies the effects of each MFCC feature extraction...

chapter

CELP-like compression of spotlight-mode SAR raw data in transform domain

M Naraghi-Pour, R Cortez, T Ikuma, T Lewis

2010 - MILCOM 2010 MILITARY COMMUNICATIONS CONFERENCE > 870 - 874

2010 Military Communications Conference (MILCOM 2010)

Transmission of synthetic aperture radar (SAR) data requires large bandwidth due to its inherently high data rate. Consequently, compression of the data is often required. In this paper, we propose a raw SAR data compression algorithm that employs a predictive coding scheme, based on the analysis-by-synthesis encoding method. The proposed algorithm is inspired by code excited linear prediction (CELP)...

chapter

Speech presence probability estimation based on temporal cepstrum smoothing

T Gerkmann, M Krawczyk, R Martin

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4254 - 4257

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

We propose a novel, robust estimator for the probability of speech presence at each time-frequency point in the short-time discrete Fourier domain. While existing estimators perform quite reliably in stationary noise environments, they usually exhibit a large false-alarm rate in nonstationary noise that results in a great deal of noise leakage when applied to a speech enhancement task. The proposed...

chapter

Speech presence probability estimation based on integrated time-frequency minimum tracking for speech enhancement in adverse environments

Zhong-Hua Fu, Jhing-Fa Wang

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4258 - 4261

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Speech enhancement under nonstationary environments is a challenging problem. This paper addresses the problem of speech presence probability (SPP) estimation. According to the fact that speech is approximately sparse in time-frequency domain, we integrate time and frequency minimum tracking results to estimate the noise power spectral density and the a posteriori signal-to-noise ratio. A sparseness...

chapter

Histogram equalization and noise masking for robust speech recognition

Xueru Zhang, K Demuynck, H Van hamme

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4578 - 4581

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Mismatch between training and test conditions deteriorates the performance of speech recognizers. This paper investigates the combination of parametric histogram equalization (pHEQ) and noise masking to compensate for the mismatch caused by additive noise. The proposed front-end maps the distribution of the observed power spectrum vectors to a target distribution. The target distribution matches the...

chapter

A New Adaptive Threshold Algorithm to Speech Enhancement Based on Minimum Description Length Criterion

Li Nan, Liu Hua-bin

2009 International Conference on Information Engineering and Computer Science > 1 - 4

2009 International Conference on Information Engineering and Computer Science. ICIECS 2009

In regards to difficult selection of a threshold in wavelet speech enhancement algorithm, a new adaptive threshold algorithm based on minimum description length criterion is proposed in this paper. The algorithm is a completely data-driven method and has very strong adaptability. It has characteristics of no requirement for prior knowledge of noise level and nature, preset threshold and choosing threshold...

chapter

Codebook constrained iterative and Parametric Wiener filter speech enhancement

S Chehresa, M H Savoji

2009 IEEE International Conference on Signal and Image Processing Applications > 548 - 553

2009 IEEE International Conference on Signal and Image Processing Applications (ICSIPA 2009)

In this paper a new iterative method of speech enhancement using Power Spectral Density (PSD) codebooks of clean speech and several types of noise, is proposed. The proposed algorithm estimates the PSDs of speech and noise of unknown nature and, evaluates the input Signal-to-Noise Ratio (SNR) by solving an over-determined set of equations. No Voice Activity Detection (VAD) or other means of noise...

chapter

A Wiener-based implementation of equalization-cancellation pre-processing for binaural speech intelligibility prediction

N.N. Ellaham, C. Giguere, W. Gueaieb

2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics > 233 - 236

2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

This paper presents a precursor to an objective measure to predict speech intelligibility in binaural listening conditions. Such measures typically consist of a binaural pre-processing stage followed by intelligibility prediction using a monaural measure such as the Speech Intelligibility Index. In this work, an implementation of the equalization-cancellation process using Wiener filters is presented...

chapter

Gain adaptation based on signal-to-noise ratio for noise suppression

D.N. Parikh, S. Ravindran, D.V. Anderson

2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics > 185 - 188

2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

In this paper we describe a technique that uses adaptive gain control to achieve noise suppression in speech signals. The method used to map the dynamic range of the signal is based on the human auditory perceptual model. Since the processing is based on the model of human perception, the resulting noise suppressed speech is natural sounding. The computational complexity of the proposed method is...

chapter

Biologically inspired algorithm for enhancement of speech intelligibility over telephone channel

Dong-Yan Huang, S. Rahardja, Ee Ping Ong

2009 IEEE International Workshop on Multimedia Signal Processing > 1 - 6

2009 IEEE International Workshop on Multimedia Signal Processing (MMSP)

This paper describes a method to increase speech intelligibility when the speech signal is being transmitted over telephone lines. In order to detect all factors which affect speech intelligibility, we use telephone simulation tool in ITUT Software Tools Library release 2005 (STL2005) to identify the most problematic telephone-channel deteriorations. Of the various effects considered, additive noise...

chapter

On the application of variable-step adaptive noise cancelling for improving the robustness of speech recognition

Yang Jie, Wang Zhenli

2009 ISECS International Colloquium on Computing, Communication, Control, and Management > 2 > 419 - 422

2009 ISECS International Colloquium on Computing, Communication, Control, and Management (CCCM)

As speech recognition and spoken language technologies are being transferred to real applications, the need for greater robustness against adverse noise is becoming increasingly apparent. This paper researches a robust speech recognition method based on adaptive noise cancelling (ANC). It obtained the enhanced speech signal by applying a variable-step adaptive noise cancelling algorithm to reduce...

chapter

Speech endpoint detection in strong noisy environment based on the Hilbert-Huang Transform

Zhimao Lu, Baisen Liu, Liran Shen

2009 International Conference on Mechatronics and Automation > 4322 - 4326

2009 IEEE International Conference on Mechatronics and Automation

Speech endpoint detection in strong noise environment plays an important role in speech signal processing. Hilbert-Huang Transform (HHT) is based on the local characteristics of signals, which is an adaptive and efficient transformation method. It is particularly suitable for analyzing the non-linear and non-stationary signals such as speech signal. In this paper, we chose the noisy speech signal...

chapter

Using Artificial Neural Network for Robust Voice Activity Detection Under Adverse Conditions

T.V. Pham, C.T. Tang, M. Stadtschnitzer

2009 IEEE-RIVF International Conference on Computing and Communication Technologies > 1 - 8

2009 IEEE-RIVF International Conference on Computing and Communication Technologies (RIVF). Research, Innovation and Vision for the Future

We present an approach to model-based voice activity detection (VAD) for harsh environments. By using mel-frequency cepstral coefficients feature extracted from clean and noisy speech samples, an artificial neural network is trained optimally in order to provide a reliable model. There are three main aspects to this study: First, in addition to the developed model, recent state-of-the-art VAD methods...

chapter

A time-frequency domain formant frequency estimation scheme for noisy speech signals

S.A. Fattah, W.-P. Zhu, M.O. Ahmad

2009 IEEE International Symposium on Circuits and Systems > 1201 - 1204

2009 IEEE International Symposium on Circuits and Systems - ISCAS 2009

Formant frequency is a one of the most important speech feature, which has widespread applications in speech recognition, synthesis, and compression. In this paper, a new time-frequency domain scheme for the estimation of formant frequencies from noise-corrupted speech signals is presented. In order to overcome the adverse effect of noise, instead of conventional autocorrelation function (ACF), a...

chapter

AR-Based Bayesian Speech Enhancement for Nonstationary Environments

Qinghua Huang, Kai Liu

2009 International Joint Conference on Computational Sciences and Optimization > 1 > 918 - 921

2009 International Joint Conference on Computational Sciences and Optimization, CSO

A new technique for enhancing audio signal from a noisy nonstationary environment is presented in the paper. Autoregressive (AR) model is used to efficiently exploit the temporally correlated information of audio and noise signals during a short stationary frame. The temporal models of signals and noisy process are combined to construct a state space. The state space appropriately describes that the...

chapter

New insights into non-causal multichannel linear filtering for noise reduction

M. Souden, J. Benesty, S. Affes

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 141 - 144

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

We investigate a general framework for noise reduction which consists in controlling the level of signal distortion while reducing the level of noise. A parameterized non-causal filter that allows for tuning the signal distortion and noise reduction inversely is obtained and is referred to as parameterized multichannel non-causal Wiener filter (PMWF) herein. The same optimization problem leads to...

Keywords:
SIGNAL TO NOISE RATIO
SIGNAL-TO-NOISE RATIO
SPEECH
Publication type:
book

Publication date

Set your own date range

Keywords

SPEECH PROCESSING (18)
NOISE MEASUREMENT (17)
NOISE (13)
SPEECH ENHANCEMENT (12)
SPEECH RECOGNITION (9)
ESTIMATION (8)
DISTORTION (5)
SPEECH INTELLIGIBILITY (5)
WIENER FILTER (5)
CORRELATION METHODS (4)
FEATURE EXTRACTION (4)
INDEXES (4)
MICROPHONES (4)
SIGNAL DENOISING (4)
SIGNAL DETECTION (4)
SNR (4)
SPEECH CODING (4)
TIME FREQUENCY ANALYSIS (4)
WIENER FILTERS (4)
CEPSTRAL ANALYSIS (3)
MEL FREQUENCY CEPSTRAL COEFFICIENT (3)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (3)
NOISE REDUCTION (3)
NOISY SPEECH SIGNAL (3)
PITCH DETECTION (3)
ROBUSTNESS (3)
SPECTRAL ANALYSIS (3)
SPECTRAL SUBTRACTION (3)
SPEECH PRESENCE PROBABILITY (3)
SPEECH SIGNAL PROCESSING (3)
TRAINING (3)
VOICE ACTIVITY DETECTION (3)
ACOUSTIC NOISE (2)
ACOUSTICS (2)
ADDITIVE NOISE (2)
ALGORITHM DESIGN AND ANALYSIS (2)
APPROXIMATION METHODS (2)
ARTIFICIAL NEURAL NETWORKS (2)
AUTOCORRELATION FUNCTION (2)
BAYES METHODS (2)
DATA COMPRESSION (2)
DISCRETE COSINE TRANSFORMS (2)
ENERGY MEASUREMENT (2)
EQUATIONS (2)
FORMANT FREQUENCY ESTIMATION (2)
FREQUENCY ESTIMATION (2)
GAIN (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
LINEAR PREDICTIVE CODING (2)
NOISE ESTIMATION (2)
NOISY ENVIRONMENT (2)
SIGNAL PROCESSING (2)
SMOOTHING METHODS (2)
SPEECH ANALYSIS (2)
SPEECH ENDPOINT DETECTION (2)
SPEECH PRESENCE PROBABILITY ESTIMATION (2)
SPEECH QUALITY (2)
SPEECH SIGNALS (2)
TIME-FREQUENCY ANALYSIS (2)
1/F NOISE (1)
ACCUMULATIVE NORMAL DISTRIBUTION (1)
ADAPTATION MODEL (1)
ADAPTIVE ACOUSTIC BEAMFORMER (1)
ADAPTIVE GAIN (1)
ADAPTIVE GAIN CONTROL (1)
ADAPTIVE MULTIRESOLUTION FORM OF SVD (1)
ADAPTIVE NOISE CANCELLING (ANC) (1)
ADAPTIVE THRESHOLD ALGORITHM (1)
ADAPTIVE TRANSFORMATION METHOD (1)
ADVERSE ENVIRONMENTS (1)
ANALYSIS-BY-SYNTHESIS (1)
ANALYSIS-BY-SYNTHESIS ENCODING METHOD (1)
ANGULAR SPECTRUM (1)
AR MODEL (1)
ARRAY SIGNAL PROCESSING (1)
ARRAYS (1)
ARTIFICIAL NEURAL NETWORK (1)
AUDIO SIGNAL (1)
AUDIO SIGNAL PROCESSING (1)
AURORA4 DATABASE (1)
AUTOCORRELATION FUNCTION PITCH DETECTION METHOD (1)
AUTOREGRESSIVE MODEL (1)
AUTOREGRESSIVE PROCESSES (1)
AVATARS (1)
BACKGROUND NOISE (1)
BAND-SPLITTING SPECTRUM DOMAIN (1)
BAYES RULE (1)
BAYESIAN METHODS (1)
BAYESIAN SPEECH ENHANCEMENT (1)
BINAURAL HEARING (1)
BINAURAL LISTENING CONDITION (1)
BINAURAL SPEECH INTELLIGIBILITY PREDICTION (1)
BIOLOGICALLY INSPIRED ALGORITHM (1)
BIT RATE (1)
BLOCK ADAPTIVE QUANTIZATION (1)
BOOLEAN FORM (1)
BROWN NOISE (1)
more

INFONA - science communication portal

Search results

Estimation of speech quality and intelligibility for some models of additive noise

Multi-source TDOA estimation using SNR-based angular spectra

Improved codebook constrained Wiener filter speech enhancement

Lombard speech model for automatic enhancement of speech intelligibility over telephone channel

Perturbation analysis of mel-frequency cepstrum coefficients

CELP-like compression of spotlight-mode SAR raw data in transform domain

Speech presence probability estimation based on temporal cepstrum smoothing

Speech presence probability estimation based on integrated time-frequency minimum tracking for speech enhancement in adverse environments

Histogram equalization and noise masking for robust speech recognition

A New Adaptive Threshold Algorithm to Speech Enhancement Based on Minimum Description Length Criterion

Codebook constrained iterative and Parametric Wiener filter speech enhancement

A Wiener-based implementation of equalization-cancellation pre-processing for binaural speech intelligibility prediction

Gain adaptation based on signal-to-noise ratio for noise suppression

Biologically inspired algorithm for enhancement of speech intelligibility over telephone channel

On the application of variable-step adaptive noise cancelling for improving the robustness of speech recognition

Speech endpoint detection in strong noisy environment based on the Hilbert-Huang Transform

Using Artificial Neural Network for Robust Voice Activity Detection Under Adverse Conditions

A time-frequency domain formant frequency estimation scheme for noisy speech signals

AR-Based Bayesian Speech Enhancement for Nonstationary Environments

New insights into non-causal multichannel linear filtering for noise reduction

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options