Search results

Items from 101 to 120 out of 1,610 results

chapter

Intelligibility evaluation of speech coding standards in severe background noise and packet loss conditions

Emma Jokinen, Jeremie Lecomte, Nadja Schinkel-Bielefeld, Tom Backstrom

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5152 - 5156

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speech intelligibility is an important aspect of speech transmission, but when speech coding standards are compared, often only the quality is evaluated using perceptual tests. In this study, the performance of three wideband speech coding standards, adaptive multi-rate wideband (AMR-WB), G.718, and enhanced voice services (EVS), is evaluated in a subjective intelligibility test. The test covers different...

chapter

Coherent channel based subband multichannel dereverberation

JeeSok Lee, Sejin Oh, Hong-Goo Kang

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2704 - 2708

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a multichannel dereverberation algorithm that uses only coherent acoustic channels. In the framework of the multi-input/output inverse theorem (MINT), the equalization performance varies depending on the length of the input acoustic channels. However, only the portion of the observed channel that resembles the true acoustic channel contributes to performance enhancement when measurement...

chapter

Estimation of relative transfer function in the presence of stationary noise based on segmental power spectral density matrix subtraction

Xiaofei Li, Laurent Girin, Radu Horaud, Sharon Gannot

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 320 - 324

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper addresses the problem of relative transfer function (RTF) estimation in the presence of stationary noise. We propose an RTF identification method based on segmental power spectral density (PSD) matrix subtraction. First, multichannel microphone signals are divided into segments corresponding to speech-plus-noise and noise-only activity. Then, the subtraction of two segmental PSD matrices...

chapter

Phase-optimized K-SVD for signal extraction from underdetermined multichannel sparse mixtures

Antoine Deleforge, Walter Kellermann

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 355 - 359

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a novel sparse representation for heavily underdetermined multichannel sound mixtures, i.e., with many more sources than microphones. The proposed approach operates in the complex Fourier domain, thus preserving spatial characteristics carried by phase differences. We derive a generalization of K-SVD which jointly estimates a dictionary capturing both spectral and spatial features, a sparse...

chapter

A pairwise algorithm for pitch estimation and speech separation using deep stacking network

Hui Zhang, Xueliang Zhang, Shuai Nie, Guanglai Gao, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 246 - 250

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pitch information is an important cue for speech separation. However, pitch estimation in noisy conditions is itself as challenging as speech separation. In this paper, we propose a supervised learning architecture which combines these two problems concisely. The proposed algorithm is based on a deep stacking network (DSN), which provides a method of stacking simple processing modules in building...

chapter

On speech quality estimation of phase-aware single-channel speech enhancement

Andreas Gaich, Pejman Mowlaee

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 216 - 220

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

To approximate the speech quality of a given speech enhancement system, most of the existing instrumental metrics rely on the calculation of a distortion metric defined between the clean reference signal and the enhanced signal in the spectral amplitude domain. Several recent studies have demonstrated the effectiveness of employing a phase modification stage in single-channel speech enhancement showing...

chapter

A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds

Zafar Rafii, Antoine Liutkus, Bryan Pardo

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 271 - 275

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Repetition is a fundamental element in generating and perceiving structure in audio. Especially in music, structures tend to be composed of patterns that repeat through time (e.g., rhythmic elements in a musical accompaniment), and also frequency (e.g., different notes of the same instrument). The auditory system has the remarkable ability to parse such patterns by identifying repetitions within the...

chapter

Binaural multichannel Wiener filter with directional interference rejection

Elior Hadad, Daniel Marquardt, Simon Doclo, Sharon Gannot

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 644 - 648

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we consider an acoustic scenario with a desired source and a directional interference picked up by hearing devices in a noisy and reverberant environment. We present an extension of the binaural multichannel Wiener filter (BMWF), by adding an interference rejection constraint to its cost function, in order to combine the advantages of spatial and spectral filtering while mitigating directional...

chapter

Representation models in single channel source separation

Matthias Zohrer, Franz Pernkopf

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 713 - 717

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Model-based single-channel source separation (SCSS) is an ill-posed problem requiring source-specific prior knowledge. In this paper, we use representation learning and compare general stochastic networks (GSNs), Gauss Bernoulli restricted Boltzmann machines (GBRBMs), conditional Gauss Bernoulli restricted Boltzmann machines (CGBRBMs), and higher order contractive autoencoders (HCAEs) for modeling...

chapter

Robust sound event recognition using convolutional neural networks

Haomin Zhang, Ian McLoughlin, Yan Song

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 559 - 563

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Traditional sound event recognition methods based on informative front end features such as MFCC, with back end sequencing methods such as HMM, tend to perform poorly in the presence of interfering acoustic noise. Since noise corruption may be unavoidable in practical situations, it is important to develop more robust features and classifiers. Recent advances in this field use powerful machine learning...

chapter

Harmonic phase estimation in single-channel speech enhancement using von Mises distribution and prior SNR

Josef Kulmer, Pejman Mowlaee

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5063 - 5067

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In single-channel speech enhancement, the spectral amplitude of the noisy signal is often modified while the noisy spectral phase is directly employed for signal reconstruction. Recently, additional improvement in speech enhancement performance has been reported when the noisy phase is modified. In this work, we propose a Bayesian estimator for the phase of harmonics given the noisy speech. The proposed...

chapter

Leveraging automatic speech recognition in cochlear implants for improved speech intelligibility under reverberation

Oldooz Hazrati, Shabnam Ghaffarzadegan, John H.L. Hansen

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5093 - 5097

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Despite recent advancements in digital signal processing technology for cochlear implant (CI) devices, there still remains a significant gap between speech identification performance of CI users in reverberation compared to that in anechoic quiet conditions. Alternatively, automatic speech recognition (ASR) systems have seen significant improvements in recent years resulting in robust speech recognition...

chapter

Delayless speech enhancement with a virtual zero-phase response using a prediction of periodic signal components

Kristian Timm Andersen, Thomas Bo Elmedyb, Marc Moonen

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5098 - 5102

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, a delayless speech enhancement scheme with zero phase distortion is proposed. It is based on a cascade of adaptive filters that predicts periodic components with a significant auto-correlation for lags larger than a value D. The adaptive filter is positioned at the output of a speech enhancement algorithm, to adjust the phase of the periodic components to the noisy signal, and to remove...

chapter

Sparse HMM-based speech enhancement method for stationary and non-stationary noise environments

Feng Deng, Chang-chun Bao, W. Bastiaan Kleijn

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5073 - 5077

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a sparse hidden Markov model (HMM)-based single-channel speech enhancement method that models the speech and noise gains accurately in both stationary and nonstationary environments. The objective function is augmented with an lp regularization term resulting in a sparse autoregressive HMM (SARHMM). The method encourages sparsity in the speech and noise modeling, which eliminates the...

chapter

Enhanced time domain packet loss concealment in switched speech/audio codec

Jeremie Lecomte, Adrian Tomasek, Goran Markovic, Michael Schnabel, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5922 - 5926

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes new time domain techniques for concealing packet loss in the new 3GPP Enhanced Voice Services codec. Enhancements to the existing ACELP concealment methods include guided, improved pitch prediction, increased flexibility and accuracy of pulse resynchronization. Furthermore, the new method of separate linear predictive (LP) filter synthesis aims for sound quality improvement in...

chapter

Wind noise short term power spectrum estimation using pitch adaptive inverse binary masks

Christoph M. Nelke, Peter Vary

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5068 - 5072

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a method to enhance a speech signal disturbed by wind noise. The wind noise is generated by turbulence in an air stream close to the microphone that picks up the desired speech signal. As the majority of speech enhancement algorithms work in the frequency domain, the short term power spectrum (STPS) of the unwanted noise must be estimated to reduce the wind noise. Conventional...

chapter

Advances in deep neural network approaches to speaker recognition

Mitchell McLaren, Yun Lei, Luciana Ferrer

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4814 - 4818

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recent application of deep neural networks (DNN) to speaker identification (SID) has resulted in significant improvements over the current state of the art on telephone speech. In this work, we report a similar achievement in DNN-based SID performance on microphone speech. We consider two approaches to DNN-based SID: one that uses the DNN to extract features, and another that uses the DNN during feature...

chapter

Assistive listening headsets for high noise environments: Protection and communication

Sven Nordholm, Alan Davis, Pei Chee Yong, Hai Huyen Dam

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5753 - 5757

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In industrial noise environments, the use of assistive listening headsets is a means to provide adequate access to voice communication while wearing hearing protection. This paper presents a performance evaluation and comparison of two different methods of providing binaural speech enhancement in real industrial noise scenarios. The investigated binaural methods based on differential beamforming...

chapter

Single channel speech enhancement in the modulation domain: New insights in the modulation channel selection framework

Jesper B. Boldt, Andreas T. Bertelsen, Fredrik Gran, Soren Jorgensen, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5748 - 5752

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently, the ideal binary mask has been introduced in the modulation domain by extending the ideal channel selection method to modulation channel selection [1]. This new method shows substantial improvement in speech intelligibility but less than its predecessor despite the higher complexity. Here, we extend the previous finding from [1] and provide a more direct comparison of binary masking in the...

chapter

Robot audition: Its rise and perspectives

Hiroshi G. Okuno, Kazuhiro Nakadai

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5610 - 5614

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The ability of robots to listen to several things at once with their own “ears”, that is, robot audition, is an important factor in improving interaction and symbiosis between humans and robots. The critical issue in robot audition is real-time processing and robustness against noisy environments with high flexibility to support various kinds of robots and hardware configurations. This paper first...

Data set: ieee
Keywords: NOISE, SPEECH

Content availability

Available (1,596)
None (14)

Publication type

book (1,337)
article (273)

Keywords

NOISE MEASUREMENT (525)
SPEECH ENHANCEMENT (491)
SPEECH RECOGNITION (458)
SPEECH PROCESSING (394)
MICROPHONES (268)
ESTIMATION (249)
FEATURE EXTRACTION (236)
HIDDEN MARKOV MODELS (214)
NOISE REDUCTION (198)
SIGNAL TO NOISE RATIO (187)
ACOUSTICS (174)
ROBUSTNESS (162)
TRAINING (156)
MEL FREQUENCY CEPSTRAL COEFFICIENT (125)
SIGNAL PROCESSING ALGORITHMS (121)
CORRELATION (112)
ACCURACY (107)
DATABASES (95)
SIGNAL DENOISING (92)
HARMONIC ANALYSIS (89)
ALGORITHM DESIGN AND ANALYSIS (88)
SIGNAL PROCESSING (83)
SPEAKER RECOGNITION (82)
VECTORS (82)
ARRAYS (81)
REVERBERATION (79)
SPEECH CODING (78)
MATHEMATICAL MODEL (75)
ARRAY SIGNAL PROCESSING (72)
ADAPTIVE FILTERS (71)
CEPSTRAL ANALYSIS (70)
TRANSFORMS (70)
TIME FREQUENCY ANALYSIS (69)
AUDITORY SYSTEM (64)
EQUATIONS (62)
WAVELET TRANSFORMS (62)
ACOUSTIC SIGNAL PROCESSING (61)
DATA MINING (59)
FILTERING THEORY (59)
SPECTRAL ANALYSIS (59)
COMPUTATIONAL MODELING (58)
MICROPHONE ARRAYS (58)
GAIN (56)
INTERFERENCE SUPPRESSION (56)
AUTOMATIC SPEECH RECOGNITION (55)
MAXIMUM LIKELIHOOD ESTIMATION (54)
ARTIFICIAL NEURAL NETWORKS (52)
SOURCE SEPARATION (51)
BLIND SOURCE SEPARATION (48)
FREQUENCY DOMAIN ANALYSIS (48)
INDEXES (48)
ROBOTS (48)
SPEECH INTELLIGIBILITY (48)
VOICE ACTIVITY DETECTION (48)
ROBUST SPEECH RECOGNITION (45)
FILTERING (44)
SPECTRAL SUBTRACTION (44)
ACOUSTIC NOISE (43)
ADAPTATION MODEL (43)
DISCRETE FOURIER TRANSFORMS (43)
SPEECH SYNTHESIS (43)
TIME-FREQUENCY ANALYSIS (43)
INTERFERENCE (42)
WIENER FILTER (42)
FREQUENCY ESTIMATION (41)
SPEECH SIGNAL (41)
WIENER FILTERS (41)
CLASSIFICATION ALGORITHMS (40)
DISTORTION (40)
SPECTROGRAM (40)
ADDITIVE NOISE (39)
HEARING (38)
EDUCATIONAL INSTITUTIONS (37)
WHITE NOISE (37)
AUDIO SIGNAL PROCESSING (36)
DISCRETE COSINE TRANSFORMS (36)
NOISE ROBUSTNESS (36)
COMPLEXITY THEORY (34)
CONFERENCES (34)
DETECTORS (34)
ENCODING (34)
LEAST MEAN SQUARES METHODS (34)
MICROPHONE ARRAY (34)
SIGNAL CLASSIFICATION (34)
FREQUENCY MODULATION (33)
REAL TIME SYSTEMS (33)
DECODING (32)
INDEPENDENT COMPONENT ANALYSIS (32)
ADAPTATION MODELS (31)
COMPUTATIONAL COMPLEXITY (31)
MUSIC (31)
POWER HARMONIC FILTERS (31)
SMOOTHING METHODS (31)
SUPPORT VECTOR MACHINES (31)
COMPUTERS (30)
COVARIANCE MATRIX (30)
GAUSSIAN PROCESSES (30)
KALMAN FILTERS (30)