Generating a visual codebook requires a quantization step. Several works have demonstrated the efficiency of sparse coding for feature quantization in BoW-based image representation. Sparse coding is an important method that encodes the original signal in a sparse signal space. Yet, this method neglects the relationships among features. To reduce the impact of this issue, we...
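The quantization step described above can be sketched as follows: a toy k-means codebook is learned from local features, and each feature is then hard-assigned to its nearest codeword to form a BoW histogram. This is a minimal illustration under assumed synthetic data, not the method of the abstract (which replaces hard assignment with sparse coding).

```python
import numpy as np

def build_codebook(features, k, iters=10, seed=0):
    """Toy k-means: learn a codebook of k codewords from (N, d) features."""
    rng = np.random.default_rng(seed)
    codebook = features[rng.choice(len(features), k, replace=False)]
    for _ in range(iters):
        # Assign each feature to its nearest codeword (Euclidean distance).
        dists = np.linalg.norm(features[:, None] - codebook[None], axis=2)
        assign = dists.argmin(axis=1)
        for j in range(k):
            if np.any(assign == j):
                codebook[j] = features[assign == j].mean(axis=0)
    return codebook

def bow_histogram(features, codebook):
    """Hard-assign each feature to exactly one codeword and count."""
    dists = np.linalg.norm(features[:, None] - codebook[None], axis=2)
    assign = dists.argmin(axis=1)
    hist = np.bincount(assign, minlength=len(codebook)).astype(float)
    return hist / hist.sum()  # normalized BoW representation

# Synthetic local descriptors standing in for real image features.
rng = np.random.default_rng(1)
feats = rng.normal(size=(200, 8))
cb = build_codebook(feats, k=16)
h = bow_histogram(feats, cb)
```

The hard `argmin` assignment is exactly the "each feature to only one visual word" behavior that sparse-coding-based quantization is designed to soften.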
The standard sparse representation framework decouples the problem into two subproblems, i.e., alternating sparse coding and dictionary learning with different optimizers, treating the elements of the bases and the codes separately. In this paper, we treat the elements of both bases and codes homogeneously. The original optimization is decoupled directly into several blockwise alternating subproblems rather than the two above. Hence,...
Bagging is one of the most classic ensemble learning techniques in the machine learning literature. The idea is to generate multiple subsets of the training data via bootstrapping (random sampling with replacement), and then aggregate the outputs of the models trained on each subset via voting or averaging. As music is a temporal signal, we propose and study two bagging methods in this paper: the inter-song...
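The generic bagging procedure the abstract describes can be sketched as follows. This is a minimal illustration, not the paper's inter-song variant: `fit_linear` and the synthetic data are assumptions, and aggregation is done by averaging.

```python
import numpy as np

def bagging_predict(train_X, train_y, test_X, fit_fn, n_models=10, seed=0):
    """Train n_models base learners on bootstrap resamples of the
    training set and aggregate their predictions by averaging."""
    rng = np.random.default_rng(seed)
    n = len(train_X)
    preds = []
    for _ in range(n_models):
        # Bootstrap: sample n indices with replacement.
        idx = rng.choice(n, size=n, replace=True)
        model = fit_fn(train_X[idx], train_y[idx])
        preds.append(model(test_X))
    return np.mean(preds, axis=0)

def fit_linear(X, y):
    """Simple least-squares base learner (an assumed stand-in)."""
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return lambda Xt: Xt @ w

# Synthetic regression data.
rng = np.random.default_rng(2)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=100)
pred = bagging_predict(X, y, X[:5], fit_linear, n_models=20)
```

For classification the `np.mean` aggregation would be replaced by majority voting over the base learners' predicted labels.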
To detect violence in a video, a common video description method is to apply local spatio-temporal descriptors to the query video. The low-level description is then summarized into a high-level feature based on the Bag-of-Words (BoW) model. However, traditional spatio-temporal descriptors are not discriminative enough. Moreover, the BoW model roughly assigns each feature vector to only one visual...
This paper presents a novel feature representation called sparse cepstral codes for instrument identification. We first motivate the approach by discussing why cepstrum is suitable for instrument identification. Then we propose the use of sparse coding and power normalization to derive compact codes that better represent the information of the cepstrum. Our evaluation on both uni-source and multi-source...
In this work we focus on the problem of estimating time-varying sparse signals from a sequence of under-sampled observations. We formulate this problem as estimating hidden states in a dynamic model and exploit the underlying temporal structure to find a more accurate solution, particularly when the information in the observations is scarce. We propose an optimization procedure based on smoothing...
When applying sparse representation techniques to images, the standard approach is to independently compute the representations for a set of overlapping image patches. This method performs very well in a variety of applications, but the independent sparse coding of each patch results in a representation that is not optimal for the image as a whole. A recent development is convolutional sparse coding,...
Analysis sparsity and the accompanying analysis operator learning problem provide an important framework for signal modeling. Very recently, sparsifying transform learning has been put forward as an effective and new formulation for the analysis operator learning problem. In this study, we develop a new sparsifying transform learning algorithm by using the uniform normalized tight frame constraint...
Representing music information using audio codewords has led to state-of-the-art performance on various music classification benchmarks. Compared with conventional audio descriptors, audio words offer greater flexibility in capturing the nuance of music signals, in that each codeword can be viewed as a quantization of the music universe and that the quantization becomes finer as the size of the dictionary...
We propose a new algorithm to efficiently obtain non-negative sparse representations for audio. The spectrum of an audio signal is represented as a sparse linear combination of atoms taken from an overcomplete dictionary. The algorithm is based on minimizing the generalized Kullback-Leibler divergence between an observed magnitude spectrum and a non-negative linear combination of atoms, plus an ℓ1...
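The objective described here, a generalized KL divergence between an observed magnitude spectrum and a non-negative combination of dictionary atoms plus an ℓ1 penalty, can be minimized with standard Lee-Seung-style multiplicative updates. The sketch below is one common way to solve this objective, not necessarily the paper's algorithm, and the dictionary and spectrum are synthetic assumptions.

```python
import numpy as np

def kl_sparse_code(v, W, lam=0.01, iters=500, eps=1e-9):
    """Non-negative activations h minimizing the generalized KL
    divergence D(v || W h) + lam * ||h||_1 via multiplicative updates.
    v: (m,) magnitude spectrum; W: (m, k) non-negative dictionary."""
    h = np.full(W.shape[1], 1.0)
    ones = np.ones_like(v)
    for _ in range(iters):
        wh = W @ h + eps
        # Multiplicative update; the lam term in the denominator
        # implements the l1 sparsity penalty.
        h *= (W.T @ (v / wh)) / (W.T @ ones + lam + eps)
    return h

# Synthetic overcomplete-ish dictionary and a spectrum built from 3 atoms.
rng = np.random.default_rng(0)
W = rng.random((64, 20))              # 20 non-negative spectral atoms
h_true = np.zeros(20)
h_true[[2, 7, 11]] = [1.0, 0.5, 2.0]
v = W @ h_true                        # synthetic magnitude spectrum
h = kl_sparse_code(v, W)
```

The multiplicative form guarantees that `h` stays non-negative as long as it is initialized non-negative, which is why this update style is standard for KL-based non-negative models.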
Sparse dictionary learning has attracted enormous interest in image processing and data representation in recent years. To improve the performance of dictionary learning, we propose an efficient block-structured incoherent K-SVD algorithm for the sparse representation of signals. Without relying on any prior knowledge of the group structure for the input data, we develop a two-stage agglomerative...