ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

rozdział

A sparse smoothing approach for Gaussian Mixture Model based Acoustic-to-Articulatory Inversion

Prasad Sudhakar, Laurent Jacques, Prasanta Kumar Ghosh

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3032 - 3036

It is well-known that the performance of the Gaussian Mixture Model (GMM) based Acoustic-to-Articulatory Inversion (AAI) improves by either incorporating smoothness constraint directly in the inversion criterion or smoothing (low-pass filtering) estimated articulator trajectories in a post-processing step, where smoothing is performed independently of the inversion. As the low-pass filtering is independent...

rozdział

Low complexity on-line video summarization with Gaussian mixture model based clustering

Shun-Hsing Ou, Chia-Han Lee, V. Srinivasa Somayazulu, Yen-Kuang Chen, więcej

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1260 - 1264

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Techniques of video summarization have attracted significant research interests in the past decade due to the rapid progress in video recording, computation, and communication technologies. However, most of the existing methods analyze the video in an off-line manner, which greatly reduces the flexibility of the system. On-line summarization, which can progressively process video during video recording,...

rozdział

Multi-pitch tracking using Gaussian mixture model with time varying parameters and Grating Compression Transform

M N Abhijith, Prasanta K Ghosh, K Rajgopal

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1473 - 1477

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Grating Compression Transform (GCT) is a two-dimensional analysis of speech signal which has been shown to be effective in multi-pitch tracking in speech mixtures. Multi-pitch tracking methods using GCT apply Kalman filter framework to obtain pitch tracks which requires training of the filter parameters using true pitch tracks. We propose an unsupervised method for obtaining multiple pitch tracks...

rozdział

Efficient recovery of principal components from compressive measurements with application to Gaussian mixture model estimation

Farhad Pourkamali Anaraki, Shannon M. Hughes

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2332 - 2336

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

There has been growing interest in performing signal processing tasks directly on compressive measurements, e.g. low-dimensional linear measurements of signals taken with Gaussian random vectors. In this paper, we present a highly efficient algorithm to recover the covariance matrix of high-dimensional data from compressive measurements. We show that, as the number of data samples increases, the eigenvectors...

rozdział

On combining DNN and GMM with unsupervised speaker adaptation for robust automatic speech recognition

Shilin Liu, Khe Chai Sim

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 195 - 199

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently, context-dependent Deep Neural Network (CD-DNN) has been found to significantly outperform Gaussian Mixture Model (GMM) for various large vocabulary continuous speech recognition tasks. Unlike the GMM approach, there is no meaningful interpretation of the DNN parameters, which makes it difficult to devise effective adaptation methods for DNNs. Furthermore, DNN parameter estimation is based...

rozdział

Speech enhancement usinga modulation domain Kalman filter post-processor with a Gaussian Mixture noise model

Yu Wang, Mike Brookes

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7024 - 7028

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose a speech enhancement algorithm that applies a Kalman filter in the modulation domain to the output of a conventional enhancer operating in the time-frequency domain. We show that the prediction residual signal of the spectral amplitude errors at the output of the baseline MMSE enhancer do not follow a Gaussian distribution. Accordingly, the Kalman filter used in our enhancement algorithm...

rozdział

Spear: An open source toolbox for speaker recognition based on Bob

Elie Khoury, Laurent El Shafey, Sebastien Marcel

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1655 - 1659

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we introduce Spear, an open source and extensible toolbox for state-of-the-art speaker recognition. This toolbox is built on top of Bob, a free signal processing and machine learning library. Spear implements a set of complete speaker recognition toolchains, including all the processing stages from the front-end feature extractor to the final steps of decision and evaluation. Several...

rozdział

Maximum a-posteriori estimation of missing samples with continuity constraint in Electromagnetic Articulography data

P Sujith, Prasanta Kumar Ghosh

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 940 - 944

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Electromagnetic Articulography (EMA) technique is used to record the kinematics of different articulators while one speaks. EMA data often contains missing segments due to sensor failure. In this work, we propose a maximum a-posteriori (MAP) estimation with continuity constraint to recover the missing samples in the articulatory trajectories recorded using EMA. In this approach, we combine the benefits...

INFONA - portal komunikacji naukowej

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A sparse smoothing approach for Gaussian Mixture Model based Acoustic-to-Articulatory Inversion

Low complexity on-line video summarization with Gaussian mixture model based clustering

Multi-pitch tracking using Gaussian mixture model with time varying parameters and Grating Compression Transform

Efficient recovery of principal components from compressive measurements with application to Gaussian mixture model estimation

On combining DNN and GMM with unsupervised speaker adaptation for robust automatic speech recognition

Speech enhancement usinga modulation domain Kalman filter post-processor with a Gaussian Mixture noise model

Spear: An open source toolbox for speaker recognition based on Bob

Maximum a-posteriori estimation of missing samples with continuity constraint in Electromagnetic Articulography data

Opcje filtrowania

Data publikacji

Słowa kluczowe

INFONA - portal komunikacji naukowej

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)