ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

Items from 1 to 20 out of 1,363 results

chapter

Multi-location wideband through-the-wall beamforming

F. Ahmad

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 5193 - 5196

Significant multipath propagation and heavy clutter in indoor environments render through-the-wall radar imaging a difficult and complex proposition. It is highly desirable to properly interpret the radar images and determine the contents of the indoor scene with a high level of confidence. Data collected from multiple positions around a structure can be used to improve imaging visibility into the...

chapter

Using dialogue acts to learn better repair strategies for spoken dialogue systems

M. Frampton, O. Lemon

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 5045 - 5048

Repair or error-recovery strategies are an important design issue in spoken dialogue systems (SDSs) - how to conduct the dialogue when there is no progress (e.g. due to repeated ASR errors). Nearly all current SDSs use hand-crafted repair rules, but a more robust approach is to use reinforcement learning (RL) for data-driven dialogue strategy learning. However, as well as usually being tested only...

chapter

Universal background model based speech recognition

D. Povey, S.M. Chu, B. Varadarajan

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4561 - 4564

The universal background model (UBM) is an effective framework widely used in speaker recognition. But so far it has received little attention from the speech recognition field. In this work, we make a first attempt to apply the UBM to acoustic modeling in ASR. We propose a tree-based parameter estimation technique for UBMs, and describe a set of smoothing and pruning methods to facilitate learning...

chapter

A novel approach to part-of-speech tagging based on latent analogy

J.R. Bellegarda

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4685 - 4688

Part-of-speech tagging is a necessary pre-processing step for many natural language tasks. Recent statistical approaches, such as conditional random fields, rely on well chosen feature functions to ensure that important characteristics of the empirical training distribution are reflected in the trained model. In practice, however, it is not always clear how to best select these feature functions in...

chapter

Dual-microphone speech dereverberation using GARCH modeling

A. Abramson, E.A.P. Habets, S. Gannot, I. Cohen

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4565 - 4568

In this paper, we develop a dual-microphone speech dereverberation algorithm for noisy environments, which is aimed at suppressing late reverberation and background noise. The spectral variance of the late reverberation is obtained with adaptively-estimated direct path compensation. A Markov-switching generalized autoregressive conditional heteroscedasticity (GARCH) model is used to estimate the spectral...

chapter

A novel approach to mixed phase room impulse response inversion for speech dereverberation

N. Cahill, R. Lawlor

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4593 - 4596

Outlined in this paper is a novel approach to speech dereverberation when an estimate of the source-receiver transfer function is known. It is a two-stage algorithm based on the minimum phase/allpass decomposition of a mixed phase room impulse response (RIR). The reverberant speech is first filtered with the inverse minimum phase component of the RIR. Then a non-negative matrix factorization (NMF)...

chapter

Comparative evaluations of robust and accurate F0 estimates in reverberant environments

M. Unoki, T. Hosorogiya, Y. Ishimoto

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4569 - 4572

This paper reports comparative evaluations of our previously proposed method of estimating fundamental frequency (F₀), which is based on complex cepstrum analysis, against nine typical methods over large speech-sound datasets in both artificial and realistic reverberant environments (in room acoustics). They involve several classic algorithms (Cepstrum, AMDF, TPC, and modified autocorrelation) and a few modern...

chapter

Speech babble: Analysis and modeling for speech systems

N. Krishnamurthy, J.H.L. Hansen

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4505 - 4508

Speech babble represents the most challenging noise interference in all speech systems, yet no research has been performed at a systematic level to model the underlying structure. For the first time, this study establishes a working foundation for the analysis and modeling of babble speech. We first address the underlying model for multiple speaker babble speech - considering the number of conversations...

chapter

The role of voice source measures on automatic gender classification

Yen-Liang Shue, M. Iseli

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4493 - 4496

Differences of physiological properties of the glottis and the vocal tract are partly due to age and/or gender differences. Since these differences are reflected in the speech signal, acoustic measures related to those properties can be helpful for automatic age and gender classification. In this paper, the focus is on the role of acoustic measures related to the voice source in automatic gender classification,...

chapter

Sample selection for automatic language identification

D. Farris, C. White, S. Khudanpur

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4225 - 4228

Current approaches to automatic spoken language identification (LID) assume the availability of a large corpus of manually language-labeled speech samples for training statistical classifiers. We investigate two methods of active learning to significantly reduce the amount of labeled speech needed for training LID systems. Starting with a small training set, an automated method is used to select samples...

chapter

Target-oriented phone tokenizers for spoken language recognition

Rong Tong, Bin Ma, Haizhou Li, Eng Siong Chng

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4221 - 4224

This paper presents a new strategy for designing the parallel phone recognizers for spoken language recognition. Given a collection of parallel phone recognizers, we select a subset of phones from each phone recognizer for each target language to construct a target-oriented phone tokenizer (TOPT). As a result, the collection of target-oriented phone tokenizers is more effective than the original parallel...

chapter

HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition

Jinyu Li, Li Deng, Dong Yu, Yifan Gong, et al.

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4069 - 4072

In this paper, we present a new approach to HMM adaptation that jointly compensates for additive and convolutive acoustic distortion in environment-robust speech recognition. The hallmark of our new approach is the use of a nonlinear, phase-sensitive model of acoustic distortion that captures phase asynchrony between clean speech and the mixing noise. In the first step of the developed algorithm,...

chapter

Discriminative training by iterative linear programming optimization

B. Mak, B. Ng

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4061 - 4064

In this paper, we cast discriminative training problems into standard linear programming (LP) optimization. Besides being convex and having globally optimal solution(s), LP programs are well-studied with well-established solutions, and efficient LP solvers are freely available. In practice, however, one may not have complete knowledge of the feasible region since it is constructed from a limited number...

chapter

A novel adaptive leakage factor scheme for enhancement of a variable tap-length learning algorithm

Leilei Li, J.A. Chambers

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3837 - 3840

In this paper a new adaptive leakage factor variable tap-length learning algorithm is proposed. Through analysis the converged difference between the segmented mean square error (MSE) of a filter formed from a number of the initial coefficients of an adaptive filter, and the MSE of the full adaptive filter, is confirmed as a function of the tap-length of the adaptive filter to be monotonically non-increasing...

chapter

A fully adaptive IFIR filter with removed border effect

E.L.O. Batista, O.J. Tobias, R. Seara

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3821 - 3824

This paper presents a procedure for implementing fully adaptive interpolated FIR filters with removed border effect. The proposed approach allows reducing the steady-state mean-square error by eliminating the main sources of performance degradation from the adaptive interpolated FIR filters. In addition, the computational effort needed for implementing such a procedure is very small. Simulation results...

chapter

Spectral regrowth analysis of band-limited offset-QPSK

J. Nsenga, W. Van Thillo, A. Bourdoux, V. Ramon, et al.

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3593 - 3596

In this paper, we present an analytical analysis to predict the power spectral density (PSD) at the output of a nonlinear power amplifier (PA). We focus on offset quadrature phase shift keying (OQPSK) waveform band-limited by a square root raised cosine (SRRC) filter. This is one of the waveforms used in wideband code division multiple access (W-CDMA) wireless standard. We show that the PA output...

chapter

Adaptive Notch Filter with time-frequency tracking of continuously changing frequencies

Minh Ta, V. DeBrunner

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3557 - 3560

We propose in this paper a novel modification of the popular Adaptive Notch Filter (ANF) to improve the tracking of time-varying frequencies. Unlike previous algorithms, our new method incorporates a modeling of frequency variation directly into the cost minimization procedure. Our results show a notable improvement in the frequency estimation performance over earlier methods, and comparisons over...

chapter

Design of IIR QMF banks with near-perfect reconstruction and low complexity

H.W. Löllmann, P. Vary

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3521 - 3524

A novel design for a two-channel IIR quadrature-mirror filter (QMF) bank with near-perfect reconstruction (NPR) is presented. The analysis filter-bank is given by an efficient polyphase network (PPN) implementation based on allpass filters. The arising phase distortions are almost compensated by stable allpass filters, designed via analytical closed-form expressions. In a first design, the remaining...

chapter

Coefficient-truncated higher-order commuting matrices of the discrete Fourier transform

Soo-Chang Pei, Wen-Liang Hsue, Jian-Jiun Ding

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3545 - 3548

Recently, Candan introduced higher order DFT-commuting matrices whose eigenvectors are accurate approximations to the continuous Hermite-Gaussian functions (HGFs). However, the highest order 2k of the O(h^(2k)) N×N DFT-commuting matrices proposed by Candan is restricted by 2k+1 ≤ N. In this paper, we remove that restriction of order upper bound by developing a coefficient truncation technique to...

chapter

Improving frame-bound-ratio for frames generated by oversampled filter banks

Li Chai, Jingxin Zhang, Cishen Zhang, E. Mosca

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3525 - 3528

This paper presents a simple method to improve the frame-bounds-ratio of perfect reconstruction (PR) oversampled filter banks (FBs) by adjusting the gain of each subband filter. For a given analysis PRFB, a finite convex optimization algorithm is presented to redesign the subband gains such that the frame-bounds-ratio of the FB is minimized. The algorithm also provides an effective way to compute...

Keywords

SPEECH RECOGNITION (184)
SPEECH PROCESSING (137)
ACOUSTIC SIGNAL PROCESSING (88)
FEATURE EXTRACTION (76)
MAXIMUM LIKELIHOOD ESTIMATION (75)
GAUSSIAN PROCESSES (67)
HIDDEN MARKOV MODELS (67)
SIGNAL PROCESSING (64)
COMPUTATIONAL COMPLEXITY (59)
ARRAY SIGNAL PROCESSING (58)
MIMO COMMUNICATION (57)
AUDIO SIGNAL PROCESSING (54)
LEAST MEAN SQUARES METHODS (54)
VIDEO CODING (53)
ITERATIVE METHODS (52)
MATRIX ALGEBRA (49)
STATISTICAL ANALYSIS (49)
BAYES METHODS (47)
FILTERING THEORY (46)
SPEAKER RECOGNITION (46)
DATA COMPRESSION (44)
SPEECH SYNTHESIS (44)
WAVELET TRANSFORMS (44)
NATURAL LANGUAGE PROCESSING (43)
OPTIMISATION (43)
ADAPTIVE FILTERS (42)
CHANNEL ESTIMATION (42)
PARAMETER ESTIMATION (42)
PROBABILITY (42)
SPEECH ENHANCEMENT (42)
OFDM MODULATION (38)
SIGNAL DETECTION (37)
SPEECH CODING (37)
WIRELESS SENSOR NETWORKS (37)
DECODING (34)
LEARNING (ARTIFICIAL INTELLIGENCE) (34)
AUTOMATIC SPEECH RECOGNITION (33)
BLIND SOURCE SEPARATION (33)
PATTERN CLASSIFICATION (33)
SUPPORT VECTOR MACHINES (33)
ERROR STATISTICS (32)
MEAN SQUARE ERROR METHODS (32)
VIDEO SIGNAL PROCESSING (32)
IMAGE CODING (31)
WIRELESS CHANNELS (31)
MEDICAL SIGNAL PROCESSING (30)
ROBUSTNESS (30)
MUSIC (29)
IMAGE PROCESSING (28)
IMAGE SEQUENCES (28)
STOCHASTIC PROCESSES (28)
AUDIO CODING (27)
MICROPHONE ARRAYS (27)
SIGNAL RECONSTRUCTION (27)
MONTE CARLO METHODS (26)
TIME-FREQUENCY ANALYSIS (26)
INDEPENDENT COMPONENT ANALYSIS (25)
MIMO SYSTEMS (25)
PARTICLE FILTERING (NUMERICAL METHODS) (25)
SIGNAL CLASSIFICATION (25)
MEDICAL IMAGE PROCESSING (24)
SIGNAL DENOISING (24)
ADAPTIVE SIGNAL PROCESSING (23)
CHANNEL CODING (23)
CORRELATION METHODS (23)
FADING CHANNELS (23)
HIDDEN MARKOV MODEL (23)
IMAGE CLASSIFICATION (23)
INTERFERENCE SUPPRESSION (23)
KALMAN FILTERS (23)
OFDM (23)
SPECTRAL ANALYSIS (23)
IMAGE SEGMENTATION (22)
SENSOR FUSION (22)
SENSOR NETWORKS (22)
FACE RECOGNITION (21)
IMAGE RECONSTRUCTION (21)
INFORMATION RETRIEVAL (21)
LEAST SQUARES APPROXIMATIONS (21)
PRINCIPAL COMPONENT ANALYSIS (21)
SIGNAL REPRESENTATION (21)
SIGNAL SAMPLING (21)
APPROXIMATION THEORY (20)
PATTERN RECOGNITION (20)
QUANTIZATION (20)
RADIO RECEIVERS (20)
DIRECTION-OF-ARRIVAL ESTIMATION (19)
GAUSSIAN MIXTURE MODEL (19)
IMAGE COLOUR ANALYSIS (19)
IMAGE DENOISING (19)
INTERPOLATION (19)
MOTION ESTIMATION (19)
QUANTISATION (SIGNAL) (19)
REGRESSION ANALYSIS (19)
TRACKING (19)
IMAGE RESOLUTION (18)
IMAGE RESTORATION (18)
SOURCE SEPARATION (18)
ANTENNA ARRAYS (17)
BIOMEDICAL MRI (17)
