ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 101 to 120 out of 1,693 results

1 ...
3
4
5
6
7
8
9

chapter

Visual tracking using Blind Source Separation for mixed images

Hsiao-Tzu Chen, Chih-Wei Tang

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6548 - 6552

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Mixed images cannot be avoided in visual tracking since the transmitted scene may be captured with specular reflections. Since few previous methods tackle this important problem, this paper proposes a novel visual tracking method using Blind Source Separation (BSS) for mixed images. Based on the framework of particle filter with compensated motion model at the prediction stage for mobile cameras,...

chapter

An integrated system for object tracking, detection, and online learning with real-time RGB-D video

I-Kuei Chen, Chung-Yu Chi, Szu-Lu Hsu, Liang-Gee Chen

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6558 - 6562

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper introduces a highly integrated system providing very accurate object detection with RGB-D sensor. To solve the problem that there are always insufficient training sets for object detection in real world, we present an online learning architecture to learn templates and to detect objects real-time. The proposed novel concept skips the training phase required in previous recognition works,...

chapter

Parametric multichannel noise reduction algorithm utilizing temporal correlations in reverberant environment

Yu Gwang Jin, Jong Won Shin, Chul Min Lee, Soo Hyun Bae, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7049 - 7052

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a parametric multichannel noise reduction algorithm utilizing temporal correlations in a noisy and reverberant environment. Under the reverberant condition, the received acoustic signal becomes highly correlated in the time domain and it makes successful noise reduction quite difficult. The proposed parametric noise reduction method takes account of interdependencies between...

chapter

Distributed energy-efficient power optimization in cellular relay networks with minimum rate constraints

Giacomo Bacci, E. Veronica Belmega, Luca Sanguinetti

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7014 - 7018

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this work, we derive a distributed power control algorithm for energy-efficientuplink transmissions in interference-limited cellular networks, equipped with either multiple or shared relays. The proposed solution is derived by modeling the mobile terminals as utility-driven rational agents that engage in a noncooperative game, under minimum-rate constraints. The theoretical analysis of the game...

chapter

Can voice conversion be used to reduce non-native accents?

Sandesh Aryal, Ricardo Gutierrez-Osuna

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7879 - 7883

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Voice-conversion (VC) techniques aim to transform utterances from a source speaker to sound as if a target speaker had produced them. For this reason, VC is generally ill-suited for accent-conversion (AC) purposes, where the goal is to capture the regional accent of the source while preserving the voice quality of the target. In this paper, we propose a modification of the conventional training process...

chapter

Featherweight phonetic keyword search for conversational speech

Keith Kintzley, Aren Jansen, Hynek Hermansky

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7859 - 7863

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The point process model (PPM) for keyword search is a phonetic event-driven approach that provides a whole-word focused alternative to fast lattice matching techniques. Recent efforts in PPMs have been focused on improved model estimation techniques and efficient search algorithms, but past evaluations have been limited to searching relatively easy scripted corpora for simple unigram queries, preventing...

chapter

Speaker Adaptive Training using Deep Neural Networks

Tsubasa Ochiai, Shigeki Matsuda, Xugang Lu, Chiori Hori, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6349 - 6353

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Among many speaker adaptation embodiments, Speaker Adaptive Training (SAT) has been successfully applied to a standard Hidden-Markov-Model (HMM) speech recognizer, whose state is associated with Gaussian Mixture Models (GMMs). On the other hand, recent studies on Speaker-Independent (SI) recognizer development have reported that a new type of HMM speech recognizer, which replaces GMMs with Deep Neural...

chapter

Deep neural network trained with speaker representation for speaker normalization

Yun Tang, Aanchan Mohan, Richard C. Rose, Chengyuan Ma

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6329 - 6333

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A method for speaker normalization in deep neural network (DNN) based discriminative feature estimation for automatic speech recognition (ASR) is presented. This method is applied in the context of a DNN configured for auto-encoder based low dimensional bottleneck (AE-BN) feature extraction where the derived features are used as input to a continuous Gaussian density hidden Markov model (HMM/GMM)...

chapter

Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network

Jian Xue, Jinyu Li, Dong Yu, Mike Seltzer, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6359 - 6363

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The large number of parameters in deep neural networks (DNN) for automatic speech recognition (ASR) makes speaker adaptation very challenging. It also limits the use of speaker personalization due to the huge storage cost in large-scale deployments. In this paper we address DNN adaptation and personalization issues by presenting two methods based on the singular value decomposition (SVD). The first...

chapter

Regularized constrained maximum likelihood linear regression for speech recognition

Sina Hamidi Ghalehjegh, Richard C. Rose

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6319 - 6323

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The use of a graph embedding framework is investigated as a regularization technique in the expectation-maximization (EM) algorithm applied to automatic speech recognition (ASR). The technique is motivated by the fact that graph em-beddings of feature vectors have been shown to provide useful characterizations of the underlying manifolds on which these features lie. Incorporating intrinsic graphs...

chapter

Compressed sensing for magnetic resonance images with phase variations

Satoshi Ito, Yoshifumi Yamada

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6598 - 6601

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The application of compressed sensing (CS) to MRI has the potential to significantly reduce scan time. However, the quality of reconstructed images will be degraded when the MR images have strong phase variations. In the present paper, we propose a new CS method that is easy to implement and robust to phase variations on MR images. When the signal trajectory in k-space is symmetrical with respect...

chapter

Two-stage speaker adaptation in subspace Gaussian mixture models

Sina Hamidi Ghalehjegh, Richard C. Rose

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6324 - 6328

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A two-stage speaker adaptation approach is proposed for the subspace Gaussian mixture model (SGMM) [1] in large vocabulary automatic speech recognition (ASR). The SGMM differs from the more well known continuous density hidden Markov model (CDHMM) in that a large portion of the SGMM parameters are dedicated to shared full covariance Gaussian subspace parameters and a relatively small number of parameters...

chapter

Direct adaptation of hybrid DNN/HMM model for fast speaker adaptation in LVCSR based on speaker code

Shaofei Xue, Ossama Abdel-Hamid, Hui Jiang, Lirong Dai

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6339 - 6343

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently an effective fast speaker adaptation method using discriminative speaker code (SC) has been proposed for the hybrid DNN-HMM models in speech recognition [1]. This adaptation method depends on a joint learning of a large generic adaptation neural network for all speakers as well as multiple small speaker codes using the standard back-propagation algorithm. In this paper, we propose an alternative...

chapter

I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription

Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Themos Stafylakis

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6334 - 6338

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

State of the art speaker recognition systems are based on the i-vector representation of speech segments. In this paper we show how this representation can be used to perform blind speaker adaptation of hybrid DNN-HMM speech recognition system and we report excellent results on a French language audio transcription task. The implemenation is very simple. An audio file is first diarized and each speaker...

chapter

Investigation of unsupervised adaptation of DNN acoustic models with filter bank input

Takuya Yoshioka, Anton Ragni, Mark J. F. Gales

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6344 - 6348

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Adaptation to speaker variations is an essential component of speech recognition systems. One common approach to adapting deep neural network (DNN) acoustic models is to perform global constrained maximum likelihood linear regression (CMLLR) at some point of the systems. Using CMLLR (or more generally, generative approaches) is advantageous especially in unsupervised adaptation scenarios with high...

chapter

Online semidefinite programming for power system state estimation

Seung-Jun Kim, Gang Wang, Geogios B. Giannakis

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6024 - 6027

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Power system state estimation (PSSE) constitutes a crucial prerequisite for reliable operation of the power grid. A key challenge for accurate PSSE is the inherent nonlinearity of SCADA measurements in the system states. Recent proposals for static PSSE tackle this issue by exploiting hidden convexity structure and solving a semidefinite programming (SDP) relaxation. In this work, an online PSSE algorithm...

chapter

Adaptive distributed compressed sensing for dynamic high-dimensional hypothesis testing

Nicolo Michelusi, Urbashi Mitra

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6444 - 6448

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, a framework for dynamic high-dimensional hypothesis testing in wireless sensor networks is presented. The sensor nodes (SNs) collect and transmit to a fusion center (FC), in a distributed fashion, compressed measurements of a time-correlated hypothesis vector. The FC, based on the measurements collected, tracks the hypothesis vector, and feeds back minimal information about the uncertainty...

chapter

Recovering signals with variable sparsity levels from the noisy 1-bit compressive measurements

Amin Movahed, Ashkan Panahi, Mark C. Reed

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6454 - 6458

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we consider the 1-bit compressive sensing reconstruction problem in a scenario that the sparsity level of the signal is unknown and time variant, and the binary measurements are contaminated with the noise. We introduce a new reconstruction algorithm which we refer to as Noise-Adaptive Restricted Step Shrinkage (NARSS). NARSS is superior in terms of performance, complexity and speed...

chapter

Optimization of transmit signals to interfere eavesdropping in a wireless LAN

Shuichi Ohno, Yuji Wakasa, Shui Qiang Yan, Emmanuel Manasseh

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6052 - 6056

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We consider physical-layer security of a wireless LAN where multiple receivers collude to eavesdrop the information from the basestation to the intended receiver. To enhance the physical-layer security, we design the interference signals to combat the eavesdropping. Our design problems are resolved using semidefinite relaxation problems, which can be numerically solved efficiently by the existing...

chapter

Time-varying STAP for nonstationary hot clutter cancellation

Giuseppe A. Fabrizio, Alfonso Farina

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6062 - 6066

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper addresses the problem of mitigating non-stationary diffusely scattered multipath interference or “hot clutter” by space-time adaptive processing (STAP) in radar systems that use a multi-channel receive antenna array. A computationally efficient time-varying (TV) fast-time STAP algorithm that can effectively cancel hot clutter during the coherent processing interval (CPI) while simultaneously...

1 ...
3
4
5
6
7
8
9

Publication date

Set your own date range

Content availability

Available (1,692)
None (1)

Keywords

SPEECH RECOGNITION (35)
COMPRESSED SENSING (34)
SPARSITY (26)
DEEP NEURAL NETWORKS (24)
COMPRESSIVE SENSING (22)
DEEP NEURAL NETWORK (21)
CLASSIFICATION (20)
SPARSE REPRESENTATION (20)
SPEECH ENHANCEMENT (20)
CONVEX OPTIMIZATION (19)
DICTIONARY LEARNING (19)
SPEAKER RECOGNITION (18)
AUTOMATIC SPEECH RECOGNITION (17)
BEAMFORMING (14)
COGNITIVE RADIO (14)
DEEP LEARNING (14)
I-VECTOR (14)
NON-NEGATIVE MATRIX FACTORIZATION (13)
MIMO (12)
NEURAL NETWORKS (12)
SPEECH SYNTHESIS (12)
CLUSTERING (11)
INTERFERENCE ALIGNMENT (11)
MUSIC INFORMATION RETRIEVAL (11)
OPTIMIZATION (11)
SPARSE CODING (11)
SPOKEN TERM DETECTION (11)
HEVC (10)
HIDDEN MARKOV MODEL (10)
I-VECTORS (10)
OFDM (10)
SOURCE SEPARATION (10)
SPEAKER ADAPTATION (10)
SPEAKER VERIFICATION (10)
ADAPTIVE FILTERING (9)
CHANNEL ESTIMATION (9)
DETECTION (9)
DISTRIBUTED ESTIMATION (9)
EEG (9)
KALMAN FILTER (9)
NONNEGATIVE MATRIX FACTORIZATION (9)
VOICE CONVERSION (9)
BLIND SOURCE SEPARATION (8)
FACE RECOGNITION (8)
GAUSSIAN MIXTURE MODEL (8)
HMM (8)
KERNEL METHODS (8)
KEYWORD SEARCH (8)
NOISE REDUCTION (8)
NOISE ROBUSTNESS (8)
ROBUSTNESS (8)
SPARSE RECOVERY (8)
SPEECH ANALYSIS (8)
TRACKING (8)
WIRELESS SENSOR NETWORKS (8)
ARRAY PROCESSING (7)
CONSENSUS (7)
ELECTROENCEPHALOGRAPHY (7)
EMOTION RECOGNITION (7)
FEATURE EXTRACTION (7)
GAUSSIAN PROCESS (7)
HIDDEN MARKOV MODELS (7)
HMM-BASED SPEECH SYNTHESIS (7)
KEYWORD SPOTTING (7)
LINEAR PREDICTION (7)
MACHINE LEARNING (7)
MAXIMUM LIKELIHOOD (7)
PLDA (7)
RECURRENT NEURAL NETWORKS (7)
ROBUST SPEECH RECOGNITION (7)
SEGMENTATION (7)
SOURCE LOCALIZATION (7)
SPARSE REPRESENTATIONS (7)
SPECTRAL ESTIMATION (7)
SPECTRUM SENSING (7)
SPEECH SEPARATION (7)
SUPER-RESOLUTION (7)
UNSUPERVISED LEARNING (7)
ACOUSTIC MODELING (6)
ALTERNATING DIRECTION METHOD OF MULTIPLIERS (6)
ARRAY SIGNAL PROCESSING (6)
DENOISING (6)
DISCRIMINATIVE TRAINING (6)
ENTROPY (6)
ESTIMATION (6)
INDEPENDENT COMPONENT ANALYSIS (6)
INVERSE PROBLEM (6)
LOCALIZATION (6)
MICROPHONE ARRAY (6)
MICROPHONE ARRAYS (6)
MIMO RADAR (6)
MULTITASK LEARNING (6)
PARALLEL PROCESSING (6)
PARAMETER ESTIMATION (6)
PARTICLE FILTER (6)
PARTICLE FILTERING (6)
PARTICLE FILTERS (6)
REVERBERATION (6)
SCORE NORMALIZATION (6)
SIGNAL RECONSTRUCTION (6)
more

INFONA - science communication portal

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Visual tracking using Blind Source Separation for mixed images

An integrated system for object tracking, detection, and online learning with real-time RGB-D video

Parametric multichannel noise reduction algorithm utilizing temporal correlations in reverberant environment

Distributed energy-efficient power optimization in cellular relay networks with minimum rate constraints

Can voice conversion be used to reduce non-native accents?

Featherweight phonetic keyword search for conversational speech

Speaker Adaptive Training using Deep Neural Networks

Deep neural network trained with speaker representation for speaker normalization

Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network

Regularized constrained maximum likelihood linear regression for speech recognition

Compressed sensing for magnetic resonance images with phase variations

Two-stage speaker adaptation in subspace Gaussian mixture models

Direct adaptation of hybrid DNN/HMM model for fast speaker adaptation in LVCSR based on speaker code

I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription

Investigation of unsupervised adaptation of DNN acoustic models with filter bank input

Online semidefinite programming for power system state estimation

Adaptive distributed compressed sensing for dynamic high-dimensional hypothesis testing

Recovering signals with variable sparsity levels from the noisy 1-bit compressive measurements

Optimization of transmit signals to interfere eavesdropping in a wireless LAN

Time-varying STAP for nonstationary hot clutter cancellation

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)