Search results

Items from 61 to 80 out of 1,337 results

chapter

Combined acoustic echo control and noise reduction for hands-free telephony — State of the art and perspectives

Rainer Martin, Peter Vary

1996 8th European Signal Processing Conference (EUSIPCO 1996) > 1 - 4

1996 8th European Signal Processing Conference (EUSIPCO 1996)

In this paper we summarize and discuss recent results in acoustic echo cancellation and noise reduction with emphasis on methods which combine both aspects. It is shown that echo control and noise reduction can support each other in a true synergy. The paper discusses fundamental issues of algorithm design and suggests that a frequency domain multi-microphone solution might be best suited to achieve...

chapter

Speech enhancement for hearing aids

Douglas R. Campbell

1996 8th European Signal Processing Conference (EUSIPCO 1996) > 1 - 4

1996 8th European Signal Processing Conference (EUSIPCO 1996)

The performance of hearing aids in noisy reverberant surroundings remains a major source of complaint and discomfort to wearers. Given the current capabilities and pace of development in microelectronics, the major problem is to find successful speech enhancement schemes. “Binaural unmasking” experiments demonstrate an enhancement advantage, due to binaural correlation properties, which can lower...

chapter

Voice controlled mobile phone for car environment

Ivan Bourmeyster, Jamil Chetoni, Silvio Cucchi, Nicola Griggio, more

1996 8th European Signal Processing Conference (EUSIPCO 1996) > 1 - 4

1996 8th European Signal Processing Conference (EUSIPCO 1996)

The development of an application of speech processing in a car environment is addressed. The main objective is to provide the user of a vehicular phone with a powerful and friendly bidirectional vocal interface. In particular, the paper focusses on the speech recogniser component of the interface as it was specifically designed and tuned to operate in the very hostile acoustic environment of a moving...

chapter

Binaural speech enhancement with instantaneous coherence smoothing using the cepstral correlation coefficient

Rainer Martin, Masoumeh Azarpour, Gerald Enzner

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 111 - 115

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we propose a novel approach to cepstral smoothing for reducing musical noise fluctuations in binaural speech enhancement. Similar to other methods, our approach computes a preliminary spectral gain function using the magnitude-squared coherence function and applies an instantaneous weighting to the gain function in the cepstral domain. In this contribution, the weighting function is...

chapter

Foreground suppression for capturing and reproduction of crowded acoustic environments

Nikolaos Stefanakis, Athanasios Mouchtaris

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 51 - 55

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Traditionally, sensor arrays and spatial filtering aim to enhance individual sources by suppressing ambient noise and reverberation. In this paper, the exactly opposite problem is examined, that of suppressing individual sources in favour of the ambient sound and of the whole acoustic scene in general. We consider a compact circular sensor array which is embedded in a crowded ambient acoustic environment...

chapter

A directional noise suppressor with a specified beamwidth

Akihiko Sugiyama, Ryoji Miyahara

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 524 - 528

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

this paper proposes a directional noise suppressor with a specified constant beamwidth. A directional gain is calculated based on interchannel phase difference and combined with a spectral gain commonly used in single-channelnoise suppressors (NSs). The beamwidth can be specified as passband edges of the directional gain. In order to implement frequency-independent constant beamwidth, frequency-proportionate...

chapter

Trinicon-BSS system incorporating robust dual beamformers for noise reduction

Craig A. Anderson, Stefan Meier, Walter Kellermann, Paul D. Teal, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 529 - 533

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, a method of adaptive noise suppression combining spatially robust fixed beamforming and the TRINICON blind source separation algorithm is presented. A multichannel sensor array is first processed using complementary fixed beamformers into maximum and minimum SINR channels. The channels form the inputs to a single 2×2 second-order statistics TRINICON-BSS system which adaptively compensates...

chapter

Coherent modification of pitch and energy for expressive prosody implantation

Alexander Sorin, Slava Shechtman, Vincent Pollet

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4914 - 4918

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In expressive TTS and voice transformation systems, implantation of expressive prosody derived from external out-of-domain sources often leads to extreme pitch modification that compromises the naturalness of the synthesized speech.

chapter

Super-wideband bandwidth extension for speech in the 3GPP EVS codec

Venkatraman Atti, Venkatesh Krishnan, Duminda Dewasurendra, Venkata Chebiyyam, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5927 - 5931

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes the time-domain bandwidth extension (TBE) framework employed to code wideband and super-wideband speech in the newly standardized 3GPP EVS codec. The TBE algorithm uses a nonlinear harmonic modeling technique that incorporates principles of time-domain envelope-modulated noise mixing. At 13.2 kbps, the super-wideband coding of speech uses as low as 1.55 kbps for encoding the spectral...

chapter

New post-processing techniques for low bit rate celp codecs

Tommy Vaillancourt, Redwan Salami, Milan Jelinek

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5908 - 5912

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents two new post-processing techniques to address limitations of the deployed low bit rate speech codecs in case of unvoiced speech and background noise, and in case of music. Both post-processing techniques enhance the spectrum of the decoded excitation signal without increasing the codec algorithmic delay. The paper discusses how to integrate the enhancement procedure of unvoiced...

chapter

Linear prediction based comfort noise generation in the EVS codec

Zhe Wang, Lei Miao, Jon Gibbs, Tomas Toftgard, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5903 - 5907

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A Discontinuous transmission (DTX) system, which is widely adopted in speech codecs, is an important function for speech communication systems that can reduce the transmission bandwidth by at least a half. Within a DTX system, the comfort noise generation (CNG) plays a key role in the overall quality. Critical performance parameters with respect to the CNG including the transition quality from active...

chapter

Individualizing a monaural beamformer for cochlear implant users

Waldo Nogueira, Marta Lopez, Thilo Rode, Simon Doclo, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5738 - 5742

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speech intelligibility in noisy environments is still quite limited for cochlear implant (CI) users. Classical beamformers such as the Generalized Sidelobe Canceller (GSC) can provide large improvements in speech intelligibility for CI users. These algorithms have been adopted from hearing aids and multimedia applications into the CI field. However, their optimization taking into consideration the...

chapter

Low delay LPC and MDCT-based audio coding in the EVS codec

Guillaume Fuchs, Christian R. Helmrich, Goran Markovic, Matthias Neusinger, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5723 - 5727

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speech coders operating in time domain can be extended with a frequency domain mode to improve encoding of music, even though this is challenging at low delay. In such a scenario, the short analysis window limits the benefit of the transform coder, while a delayless switch between the two coders constrains the system further. The paper presents an LPC and MDCT-based audio coder part of the new 3GPP...

chapter

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

Rohit Prabhavalkar, Raziel Alvarez, Carolina Parada, Preetum Nakkiran, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4704 - 4708

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We explore techniques to improve the robustness of small-footprint keyword spotting models based on deep neural networks (DNNs) in the presence of background noise and in far-field conditions. We find that system performance can be improved significantly, with relative improvements up to 75% in far-field conditions, by employing a combination of multi-style training and a proposed novel formulation...

chapter

Robust overlapped speech detection and its application in word-count estimation for Prof-Life-Log data

Navid Shokouhi, Ali Ziaei, Abhijeet Sangwan, John H. L. Hansen

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4724 - 4728

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The ability to estimate the number of words spoken by an individual over a certain period of time is valuable in second language acquisition, healthcare, and assessing language development. However, establishing a robust automatic framework to achieve high accuracy is non-trivial in realistic/naturalistic scenarios due to various factors such as different styles of conversation or types of noise that...

chapter

Spatial diffuseness features for DNN-based speech recognition in noisy and reverberant environments

Andreas Schwarz, Christian Huemmer, Roland Maas, Walter Kellermann

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4380 - 4384

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a spatial diffuseness feature for deep neural network (DNN)-based automatic speech recognition to improve recognition accuracy in reverberant and noisy environments. The feature is computed in real-time from multiple microphone signals without requiring knowledge or estimation of the direction of arrival, and represents the relative amount of diffuse noise in each time and frequency bin...

chapter

Speech reinforcement in noisy reverberant conditions under an approximation of the short-time SII

Richard C. Hendriks, Joao B. Crespo, Jesper Jensen, Cees H. Taal

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4400 - 4404

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

While most contributions on speech reinforcement only consider the presence of environmental noise, late reverberation can also severely degrade the intelligibility of speech. In this paper we address the problem of speech reinforcement in noisy and reverberant environments. We use a short-time version of a recently presented approximation of the speech intelligibility index, which we optimize locally...

chapter

Pitch estimation and tracking with harmonic emphasis on the acoustic spectrum

Sam Karimian-Azari, Nasser Mohammadiha, Jesper R. Jensen, Mads G. Christensen

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4330 - 4334

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we use unconstrained frequency estimates (UFEs) from a noisy harmonic signal and propose two methods to estimate and track the pitch over time. We assume that the UFEs are multivariate-normally-distributed random variables, and derive a maximum likelihood (ML) pitch estimator by maximizing the likelihood of the UFEs over short time-intervals. As the main contribution of this paper,...

chapter

Multichannel speech enhancement using MEMS microphones

Z. I. Skordilis, A. Tsiami, P. Maragos, G. Potamianos, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2729 - 2733

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this work, we investigate the efficacy of Micro Electro-Mechanical System (MEMS) microphones, a newly developed technology of very compact sensors, for multichannel speech enhancement. Experiments are conducted on real speech data collected using a MEMS microphone array. First, the effectiveness of the array geometry for noise suppression is explored, using a new corpus containing speech recorded...

chapter

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop

Hynek Hermansky, Lukas Burget, Jordan Cohen, Emmanuel Dupoux, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5009 - 5013

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A group of junior and senior researchers gathered as a part of the 2014 Frederick Jelinek Memorial Workshop in Prague to address the problem of predicting the accuracy of a nonlinear Deep Neural Network probability estimator for unknown data in a different application domain from the domain in which the estimator was trained. The paper describes the problem and summarizes approaches that were taken...

Keywords:
NOISE
SPEECH

Publication date

Set your own date range

Content availability

Available (1,323)
None (14)

Keywords

NOISE MEASUREMENT (444)
SPEECH RECOGNITION (401)
SPEECH ENHANCEMENT (391)
SPEECH PROCESSING (318)
MICROPHONES (220)
FEATURE EXTRACTION (206)
ESTIMATION (199)
SIGNAL TO NOISE RATIO (176)
HIDDEN MARKOV MODELS (172)
ACOUSTICS (148)
NOISE REDUCTION (147)
ROBUSTNESS (137)
TRAINING (121)
MEL FREQUENCY CEPSTRAL COEFFICIENT (115)
ACCURACY (98)
SIGNAL PROCESSING ALGORITHMS (96)
CORRELATION (89)
SIGNAL DENOISING (85)
DATABASES (84)
ALGORITHM DESIGN AND ANALYSIS (79)
SIGNAL PROCESSING (77)
HARMONIC ANALYSIS (73)
SPEAKER RECOGNITION (72)
ARRAYS (71)
SPEECH CODING (66)
ADAPTIVE FILTERS (64)
REVERBERATION (64)
ARRAY SIGNAL PROCESSING (62)
CEPSTRAL ANALYSIS (62)
MATHEMATICAL MODEL (62)
TRANSFORMS (62)
TIME FREQUENCY ANALYSIS (60)
DATA MINING (59)
WAVELET TRANSFORMS (59)
EQUATIONS (55)
ACOUSTIC SIGNAL PROCESSING (53)
VECTORS (53)
AUDITORY SYSTEM (52)
FILTERING THEORY (52)
MICROPHONE ARRAYS (52)
ARTIFICIAL NEURAL NETWORKS (51)
SPECTRAL ANALYSIS (50)
INTERFERENCE SUPPRESSION (49)
FREQUENCY DOMAIN ANALYSIS (46)
ROBOTS (45)
COMPUTATIONAL MODELING (44)
AUTOMATIC SPEECH RECOGNITION (43)
BLIND SOURCE SEPARATION (43)
GAIN (43)
SPEECH INTELLIGIBILITY (42)
SOURCE SEPARATION (41)
FILTERING (40)
SPECTRAL SUBTRACTION (40)
INDEXES (39)
MAXIMUM LIKELIHOOD ESTIMATION (39)
SPEECH SIGNAL (38)
WIENER FILTERS (38)
ROBUST SPEECH RECOGNITION (37)
SPEECH SYNTHESIS (37)
VOICE ACTIVITY DETECTION (37)
WIENER FILTER (37)
ADDITIVE NOISE (36)
FREQUENCY ESTIMATION (36)
HEARING (36)
ADAPTATION MODEL (35)
CLASSIFICATION ALGORITHMS (35)
EDUCATIONAL INSTITUTIONS (35)
AUDIO SIGNAL PROCESSING (34)
DISTORTION (34)
ACOUSTIC NOISE (33)
CONFERENCES (33)
DISCRETE COSINE TRANSFORMS (33)
DISCRETE FOURIER TRANSFORMS (33)
SPECTROGRAM (33)
WHITE NOISE (33)
REAL TIME SYSTEMS (32)
INDEPENDENT COMPONENT ANALYSIS (31)
INTERFERENCE (30)
MUSIC (30)
TIME-FREQUENCY ANALYSIS (30)
COMPLEXITY THEORY (29)
DETECTORS (29)
ENCODING (29)
SIGNAL CLASSIFICATION (29)
COMPUTERS (28)
ELECTRONIC MAIL (28)
FILTERING ALGORITHMS (28)
FREQUENCY MODULATION (28)
BANDWIDTH (27)
LEAST MEAN SQUARES METHODS (27)
MICROPHONE ARRAY (27)
POWER HARMONIC FILTERS (27)
SUPPORT VECTOR MACHINES (27)
APPROXIMATION METHODS (26)
BACKGROUND NOISE (26)
DELAY (26)
MFCC (26)
MODULATION (26)
more

INFONA - science communication portal

Search results

Combined acoustic echo control and noise reduction for hands-free telephony — State of the art and perspectives

Speech enhancement for hearing aids

Voice controlled mobile phone for car environment

Binaural speech enhancement with instantaneous coherence smoothing using the cepstral correlation coefficient

Foreground suppression for capturing and reproduction of crowded acoustic environments

A directional noise suppressor with a specified beamwidth

Trinicon-BSS system incorporating robust dual beamformers for noise reduction

Coherent modification of pitch and energy for expressive prosody implantation

Super-wideband bandwidth extension for speech in the 3GPP EVS codec

New post-processing techniques for low bit rate celp codecs

Linear prediction based comfort noise generation in the EVS codec

Individualizing a monaural beamformer for cochlear implant users

Low delay LPC and MDCT-based audio coding in the EVS codec

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

Robust overlapped speech detection and its application in word-count estimation for Prof-Life-Log data

Spatial diffuseness features for DNN-based speech recognition in noisy and reverberant environments

Speech reinforcement in noisy reverberant conditions under an approximation of the short-time SII

Pitch estimation and tracking with harmonic emphasis on the acoustic spectrum

Multichannel speech enhancement using MEMS microphones

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options