ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 141 to 160 out of 1,693 results

1 ...
5
6
7
8
9
10
11

chapter

Visual reranking with improved image graph

Ziqiong Liu, Shengjin Wang, Liang Zheng, Qi Tian

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6889 - 3893

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper introduces an improved reranking method for the Bag-of-Words (BoW) based image search. Built on [1], a directed image graph robust to outlier distraction is proposed. In our approach, the relevance among images is encoded in the image graph, based on which the initial rank list is refined. Moreover, we show that the rank-level feature fusion can be adopted in this reranking method as well...

chapter

Compressed quantitative MRI: Bloch response recovery through iterated projection

Mike Davies, Gilles Puy, Pierre Vandergheynst, Yves Wiaux

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6899 - 6903

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Inspired by the recently proposed Magnetic Resonance Fingerprinting technique, we develop a principled compressed sensing framework for quantitative MRI. The three key components are: a random pulse excitation sequence following the MRF technique; a random EPI subsampling strategy and an iterative projection algorithm that imposes consistency with the Bloch equations. We show that, as long as the...

chapter

Unsupervised domain adaptation for deep neural network based voice activity detection

Xiao-Lei Zhang

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6864 - 6868

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The mismatching problem between the training and test speech corpora hinders the practical use of the machine-learning-based voice activity detection (VAD). In this paper, we try to address this problem by the unsupervised domain adaptation techniques, which try to find a shared feature subspace between the mismatching corpora. The denoising deep neural network is used as the learning machine. Three...

chapter

Extended-bag-of-features for translation, rotation, and scale-invariant image retrieval

Chia-Yin Tsai, Ting-Chu Lin, Chia-Po Wei, Yu-Chiang Frank Wang

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6874 - 6878

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

While bag-of-features (BOF) models have been widely applied for addressing image retrieval problems, the resulting performance is typically limited due to its disregard of spatial information of local image descriptors (and the associated visual words). In this paper, we present a novel spatial pooling scheme, called extended bag-of-features (EBOF), for solving the above task. Besides improving image...

chapter

Low-cost multi-camera object matching

Syed Fahad Tahir, Andrea Cavallaro

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6869 - 6873

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose an object matching approach aimed at smartphone cameras that exploits the well-known concept of local sets of features for object representation. We also enable the temporal alignment of cameras by exploiting the frames of detected objects to group objects appeared in the same time interval for the assignment within each camera. The proposed approach does not need training thus making it...

chapter

On the convergence rate of the bi-alternating direction method of multipliers

Guoqiang Zhang, Richard Heusdens, W. B. Kleijn

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3869 - 3873

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we analyze the convergence rate of the bi-alternating direction method of multipliers (BiADMM). Differently from ADMM that optimizes an augmented Lagrangian function, Bi-ADMM optimizes an augmented primal-dual Lagrangian function. The new function involves both the objective functions and their conjugates, thus incorporating more information of the objective functions than the augmented...

chapter

Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis

Heiga Zen, Andrew Senior

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3844 - 3848

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Statistical parametric speech synthesis (SPSS) using deep neural networks (DNNs) has shown its potential to produce naturally-sounding synthesized speech. However, there are limitations in the current implementation of DNN-based acoustic modeling for speech synthesis, such as the unimodal nature of its objective function and its lack of ability to predict variances. To address these limitations, this...

chapter

A stable betweenness centrality measure in networks

Santiago Segarra, Alejandro Ribeiro

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3859 - 3863

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a formal definition of stability for node centrality measures in networks and shows that the well-known betweenness centrality is not stable with respect to that metric. An alternative definition that preserves the same centrality notion while satisfying this stability criterion is then introduced. The practical implications of stability are explored by studying the behavior of...

chapter

Complex cepstrum factorization for statistical parametric synthesis

Ranniery Maia, Yannis Stylianou

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3839 - 3843

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a study on complex cepstrum-based speech factorization for acoustic modeling in statistical parametric synthesizers. The factorization is conducted assuming that both vocal tract resonance and glottal flow effect are fully represented by the complex cepstrum. We investigated four different forms to represent the complex cepstrum in the acoustic models and compared their performances...

chapter

Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization

Tomoki Koriyama, Takashi Nose, Takao Kobayashi

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3834 - 3838

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper examines two issues of a statistical speech synthesis approach based Gaussian process (GP) regression. Although GP-based speech synthesis can give higher performance in generating spectral parameters than the HMM-based one, a number of issues still remain. In this paper, we incorporate global variance (GV) feature to overcome over-smoothing problem into the parameter generation. Furthermore,...

chapter

Online dictionary learning over distributed models

Jianshu Chen, Zaid J. Towfic, Ali H. Sayed

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3874 - 3878

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we consider learning dictionary models over a network of agents, where each agent is only in charge of a portion of the dictionary elements. This formulation is relevant in big data scenarios where multiple large dictionary models may be spread over different spatial locations and it is not feasible to aggregate all dictionaries in one location due to communication and privacy considerations...

chapter

Online NON-negative Tensor Deconvolution for source detection in 3DTV audio

Yuki Mitsufuji, Marco Liuni, Alex Baker, Axel Roebel

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3082 - 3086

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The following article describes research on source detection in multi channel (3DTV) audio streams. The problem is extremely complex due to the fact that multiple layers can be present in scenes (background music, ambience, commentator). In this work a new algorithm is developed that exploits the information from the different audio channels to detect, and possibly localize and separate independent...

chapter

A general framework for dictionary based audio fingerprinting

Manuel Moussallam, Laurent Daudet

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3077 - 3081

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Fingerprint-based Audio recognition system must address concurrent objectives. Indeed, fingerprints must be both robust to distortions and discriminative while their dimension must remain to allow fast comparison. This paper proposes to restate these objectives as a penalized sparse representation problem. On top of this dictionary-based approach, we propose a structured sparsity model in the form...

chapter

Acoustic feature extraction by statistics based local binary pattern for environmental sound classification

Takumi Kobayashi, Jiaxing Ye

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3052 - 3056

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Classification of environmental sounds is a fundamental procedure for a wide range of real-world applications. In this paper, we propose a novel acoustic feature extraction method for classifying the environmental sounds. The proposed method is motivated from the image processing technique, local binary pattern (LBP), and works on a spectrogram which forms two-dimensional (time-frequency) data like...

chapter

Trajectory analysis of speech using continuous state hidden Markov Models

P. Weber, S. M. Houghton, C. J. Champion, M. J. Russell, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3042 - 3046

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Many current speech models used in recognition involve thousands of parameters, whereas the mechanisms of speech production are conceptually very simple. We present and evaluate a new continuous state probabilistic model (CS-HMM) for recovering dwell-transition and phoneme sequences from dynamic speech production features. We show that with very few parameters, these features can be tracked, and phoneme...

chapter

Transmission power variance constrained power allocation for iterative frequency domain multiuser SIMO detector

Valiteli Tervo, A Tolli, J. Karjalainen, Tad Matsumoto

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3493 - 3497

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Transmission power variance constrained power allocation in single carrier multiuser (MU) single-input multiple-output (SIMO) systems with iterative frequency domain (FD) soft cancelation (SC) minimum mean squared error (MMSE) equalization is considered in this paper. It is known in the literature that peak to average power ratio (PAPR) at the transmitter can be decreased by reducing the variance...

chapter

Sparsity fine tuning in wavelet domain with application to compressive image reconstruction

Weisheng Dong, Xiaolin Wu, Guangming Shi

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4948 - 4952

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In compressive sensing, wavelet space is widely used to generate sparse signal (image signal in particular) representations. In this work, we propose a novel approach of statistical context modeling to increase the level of sparsity of wavelet image representations. It is shown, contrary to a widely held assumption, that high-frequency wavelet coefficients have non-zero mean distributions if conditioned...

chapter

Block processing with iterative correction filters for time-interleaved ADCs

Matthias Hotz, Christian Vogel

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4923 - 4932

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a systematic approach to block processing with iterative correction filters for time-interleaved analog-to-digital converters (TI-ADCs). TI-ADCs consist of several channels and can significantly increase the achievable sampling rate, but suffer from mismatches among the channels. Iterative digital correction filters are a general approach to mitigate the impact of mismatches in...

chapter

Algebraic phase unwrapping over collection of triangles based on two-dimensional spline smoothing

Daichi Kitahara, Isao Yamada

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4963 - 4967

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Phase unwrapping is a reconstruction problem of the continuous phase function from its finite wrapped samples. Especially the two-dimensional phase unwrapping has been a common key for estimating many crucial physical information, e.g., the surface topography measured by interferometric synthetic aperture radar. However almost all two-dimensional phase unwrapping algorithms are suffering from either...

chapter

Sparse signal recovery under poisson statistics for online marketing applications

Delaram Motamedvaziri, Mohammad H. Rohban, Venkatesh Saligrama

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4953 - 4957

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We are motivated by many applications such as problems that arise in online marketing applications, where the observations are governed by non-homogeneous Poisson models. We analyze the performance of a Maximum Likelihood (ML) decoder. We prove consistency and show an exponential rate of converge for sparse recovery in the high-dimensional Poisson setting. After verifying the efficiency of ML estimator...

1 ...
5
6
7
8
9
10
11

Publication date

Set your own date range

Content availability

Available (1,692)
None (1)

Keywords

SPEECH RECOGNITION (35)
COMPRESSED SENSING (34)
SPARSITY (26)
DEEP NEURAL NETWORKS (24)
COMPRESSIVE SENSING (22)
DEEP NEURAL NETWORK (21)
CLASSIFICATION (20)
SPARSE REPRESENTATION (20)
SPEECH ENHANCEMENT (20)
CONVEX OPTIMIZATION (19)
DICTIONARY LEARNING (19)
SPEAKER RECOGNITION (18)
AUTOMATIC SPEECH RECOGNITION (17)
BEAMFORMING (14)
COGNITIVE RADIO (14)
DEEP LEARNING (14)
I-VECTOR (14)
NON-NEGATIVE MATRIX FACTORIZATION (13)
MIMO (12)
NEURAL NETWORKS (12)
SPEECH SYNTHESIS (12)
CLUSTERING (11)
INTERFERENCE ALIGNMENT (11)
MUSIC INFORMATION RETRIEVAL (11)
OPTIMIZATION (11)
SPARSE CODING (11)
SPOKEN TERM DETECTION (11)
HEVC (10)
HIDDEN MARKOV MODEL (10)
I-VECTORS (10)
OFDM (10)
SOURCE SEPARATION (10)
SPEAKER ADAPTATION (10)
SPEAKER VERIFICATION (10)
ADAPTIVE FILTERING (9)
CHANNEL ESTIMATION (9)
DETECTION (9)
DISTRIBUTED ESTIMATION (9)
EEG (9)
KALMAN FILTER (9)
NONNEGATIVE MATRIX FACTORIZATION (9)
VOICE CONVERSION (9)
BLIND SOURCE SEPARATION (8)
FACE RECOGNITION (8)
GAUSSIAN MIXTURE MODEL (8)
HMM (8)
KERNEL METHODS (8)
KEYWORD SEARCH (8)
NOISE REDUCTION (8)
NOISE ROBUSTNESS (8)
ROBUSTNESS (8)
SPARSE RECOVERY (8)
SPEECH ANALYSIS (8)
TRACKING (8)
WIRELESS SENSOR NETWORKS (8)
ARRAY PROCESSING (7)
CONSENSUS (7)
ELECTROENCEPHALOGRAPHY (7)
EMOTION RECOGNITION (7)
FEATURE EXTRACTION (7)
GAUSSIAN PROCESS (7)
HIDDEN MARKOV MODELS (7)
HMM-BASED SPEECH SYNTHESIS (7)
KEYWORD SPOTTING (7)
LINEAR PREDICTION (7)
MACHINE LEARNING (7)
MAXIMUM LIKELIHOOD (7)
PLDA (7)
RECURRENT NEURAL NETWORKS (7)
ROBUST SPEECH RECOGNITION (7)
SEGMENTATION (7)
SOURCE LOCALIZATION (7)
SPARSE REPRESENTATIONS (7)
SPECTRAL ESTIMATION (7)
SPECTRUM SENSING (7)
SPEECH SEPARATION (7)
SUPER-RESOLUTION (7)
UNSUPERVISED LEARNING (7)
ACOUSTIC MODELING (6)
ALTERNATING DIRECTION METHOD OF MULTIPLIERS (6)
ARRAY SIGNAL PROCESSING (6)
DENOISING (6)
DISCRIMINATIVE TRAINING (6)
ENTROPY (6)
ESTIMATION (6)
INDEPENDENT COMPONENT ANALYSIS (6)
INVERSE PROBLEM (6)
LOCALIZATION (6)
MICROPHONE ARRAY (6)
MICROPHONE ARRAYS (6)
MIMO RADAR (6)
MULTITASK LEARNING (6)
PARALLEL PROCESSING (6)
PARAMETER ESTIMATION (6)
PARTICLE FILTER (6)
PARTICLE FILTERING (6)
PARTICLE FILTERS (6)
REVERBERATION (6)
SCORE NORMALIZATION (6)
SIGNAL RECONSTRUCTION (6)
more

INFONA - science communication portal

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Visual reranking with improved image graph

Compressed quantitative MRI: Bloch response recovery through iterated projection

Unsupervised domain adaptation for deep neural network based voice activity detection

Extended-bag-of-features for translation, rotation, and scale-invariant image retrieval

Low-cost multi-camera object matching

On the convergence rate of the bi-alternating direction method of multipliers

Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis

A stable betweenness centrality measure in networks

Complex cepstrum factorization for statistical parametric synthesis

Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization

Online dictionary learning over distributed models

Online NON-negative Tensor Deconvolution for source detection in 3DTV audio

A general framework for dictionary based audio fingerprinting

Acoustic feature extraction by statistics based local binary pattern for environmental sound classification

Trajectory analysis of speech using continuous state hidden Markov Models

Transmission power variance constrained power allocation for iterative frequency domain multiuser SIMO detector

Sparsity fine tuning in wavelet domain with application to compressive image reconstruction

Block processing with iterative correction filters for time-interleaved ADCs

Algebraic phase unwrapping over collection of triangles based on two-dimensional spline smoothing

Sparse signal recovery under poisson statistics for online marketing applications

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)