ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

book

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

IEEE

chapter

Implementation of interconnective systems

Thomas A. Baran, Tarek A. Lahlou

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1101 - 1105

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes a systematic strategy for the automated implementation of mixed constraint- and input-output-based representations of signal processing systems. Examples of the strategy are provided in synthesizing algorithms derived from signal-flow graphs having delay-free loops, as well as in performing automated system inversion. An algorithm that follows the strategy, and which has been deployed...

chapter

Pedestrian localization in moving platforms using dead reckoning, particle filtering and map matching

Jayaprasad Bojja, Jussi Collin, Simo Sarkka, Jarmo Takala

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1116 - 1120

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Localization in global navigation satellite system denied environments using inertial sensors alone, or radio sensors alone or a combination of both are the currently active research topics. The current research works are primarily focused on static environments with earth fixed coordinate frames, having nonmoving maps. In this research work, we use micro electromechanical sensors based inertial sensors,...

chapter

A novel image secret sharing scheme with meaningful shares

Hongliang Cai, Huajian Liu, Qizhao Yuan, Martin Steinebach, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1767 - 1771

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper a novel (t, n) threshold image secret sharing scheme is proposed. Based on the idea that there is close connection between secret sharing and coding theory, coding method on GF(2^m) is applied in our scheme instead of the classical Lagrange's interpolation method in order to deal with the fidelity loss problem in the recovery. All the generated share images are meaningful and the size...

chapter

A compact representation of sensor fingerprint for camera identification and fingerprint matching

Ruizhe Li, Chang-Tsun Li, Yu Guan

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1777 - 1781

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Sensor Pattern Noise (SPN) has been proved as an effective fingerprint of imaging devices to link pictures to the cameras that acquired them. In practice, forensic investigators usually extract this camera fingerprint from large image block to improve the matching accuracy because large image blocks tend to contain more SPN information. As a result, camera fingerprints usually have a very high dimensionality...

chapter

Quantile analysis of image sensor noise distribution

Jiachao Zhang, Keigo Hirakawa, Xiaodan Jin

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1598 - 1602

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes a study aimed at comparing the real image sensor noise distribution to the models of noise often assumed in image denoising designs. Quantile analysis in pixel, wavelet, and variance stabilization domains reveal that the tails of Poisson, signal-dependent Gaussian, and Poisson-Gaussian models are too short to capture real sensor noise behavior. Noise model mismatch would likely...

chapter

A workload balanced parallel view synthesis for FTV

Zhanqi Liu, Xin Jin, Chenyang Li, Qionghai Dai

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1558 - 1562

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, a parallel system together with an adaptive workload balancing algorithm is proposed for view synthesis on multi-core platforms. Based on system level data parallelism, an adaptive workload balancing method is proposed for depth image based rendering by evaluating the number of non-hole pixels after warping. Experimental results demonstrated that with the proposed workload balancing...

chapter

A data-driven color feature learning scheme for image retrieval

Rahul Rama Varior, Gang Wang

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1334 - 1338

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper addresses content based image retrieval based on color features. Several previous works have addressed color based image retrieval based on hand-crafted features. In this paper, a data-driven learning framework is proposed for generating color based signatures. To obtain the features, a linear transformation is learned from the pixel values based on its reconstruction error. Using this...

chapter

Dynamic ROI based on K-means for remote photoplethysmography

Litong Feng, Lai-Man Po, Xuyuan Xu, Yuming Li, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1310 - 1314

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Remote imaging photoplethysmography (RIPPG) can achieve contactless human vital signs monitoring. Though the remote operation mode brings a great convenience for RIPPG applications, the RIPPG signal quality is limited by the remote nature. Improving the RIPPG signal quality becomes an essential task in the clinical application of RIPPG. Since the region of interest (ROI) of the RIPPG transforms from...

chapter

Detecting rare events using Kullback-Leibler divergence

Jingxin Xu, Simon Denman, Clinton Fookes, Sridha Sridharan

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1305 - 1309

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

One main challenge in developing a system for visual surveillance event detection is the annotation of target events in the training data. By making use of the assumption that events with security interest are often rare compared to regular behaviours, this paper presents a novel approach by using Kullback-Leibler (KL) divergence for rare event detection in a weakly supervised learning setting, where...

chapter

Transmission distortion modeling for view synthesis prediction based 3-D video streaming

Pan Gao, Wei Xiang, Lijuan Zhang

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1448 - 1452

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View synthesis prediction (VSP) is an important tool for improving the coding efficiency in the next generation three-dimensional (3-D) video systems. However, VSP will result in a new type of inter-view error propagation when the multi-view video plus depth (MVD) data are transmitted over the lossy networks. In this paper, this new type of error propagation is characterized and modeled. Firstly,...

chapter

The efficiency of view synthesis prediction for 3D video coding: A spectral domain analysis

Yichen Zhang, Ngai-Man Cheung, Lu Yu

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1453 - 1457

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We study the coding efficiency of view synthesis prediction (VSP) in 3D video coding. Our spectral domain analysis relates the power spectral density (PSD) of the VSP prediction error to the probability density function (pdf) of the warping error. Our analysis takes into account the warping error induced by (i) depth coding and (ii) rounding error at integer-pel, half-pel and quarter-pel warping accuracy...

chapter

Reduced-rank condensed filter dictionaries for inter-picture prediction

Shunyao Li, Onur G. Guleryuz, Sehoon Yea

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1428 - 1432

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We consider the motion-compensated temporal prediction loop at the heart of modern video coders. Rather than using motion-compensated reference frame blocks directly as predictors, we incorporate their spatially-filtered versions into the prediction loop. We design adaptive filters that are geared toward successful prediction over sophisticated temporal evolutions involving lighting changes, focus...

chapter

Mode Dependent Vector Quantization with a rate-distortion optimized codebook for residue coding in video compression

Bihong Huang, Felix Henry, Christine Guillemot, Philippe Salembier

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1433 - 1437

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The High Efficiency Video Coding standard (HEVC) supports a total of 35 intra prediction modes which aim at reducing spatial redundancy by exploiting pixel correlation within a local neighborhood. In this paper, we show that spatial correlation remains after intra prediction, leading to high energy prediction residues. We propose a novel scheme for encoding the prediction residues using a Mode Dependent...

chapter

Error diffused intra prediction for HEVC

Ying-Hsiu Lai, Yinyi Lin

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1424 - 1427

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

HEVC uses up to 35 prediction modes for intra prediction and it can well predict blocks with uni-directional structures or sharp edges, but the intra prediction still suffers from its discontinuous characteristics. To improve coding performance of intra prediction, the inpainting technique has been studied but it is impractical because of its high computational complexity. In this paper, we employ...

chapter

Singing voice detection with deep recurrent neural networks

Simon Leglaive, Romain Hennequin, Roland Badeau

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 121 - 125

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a new method for singing voice detection based on a Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Network (RNN). This classifier is able to take a past and future temporal context into account to decide on the presence/absence of singing voice, thus using the inherent sequential aspect of a short-term feature extraction in a piece of music. The BLSTM-RNN contains...

chapter

Exploring multi-channel features for denoising-autoencoder-based speech enhancement

Shoko Araki, Tomoki Hayashi, Marc Delcroix, Masakiyo Fujimoto, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 116 - 120

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates a multi-channel denoising autoencoder (DAE)-based speech enhancement approach. In recent years, deep neural network (DNN)-based monaural speech enhancement and robust automatic speech recognition (ASR) approaches have attracted much attention due to their high performance. Although multi-channel speech enhancement usually outperforms single channel approaches, there has been...

chapter

Latent time-frequency component analysis: A novel pitch-based approach for singing voice separation

Xiu Zhang, Wei Li, Bilei Zhu

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 131 - 135

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Monaural singing voice separation has aroused considerable attention. Many pitch-based methods have been proposed to address this task, but generally have limited performance. The most crucial difficulties lie in the inaccurate judgment on voiced pitches and the failed recognition on unvoiced singing sounds. In this paper, we propose a novel algorithm based on the latent component analysis of time-frequency...

chapter

Binaural speech enhancement with instantaneous coherence smoothing using the cepstral correlation coefficient

Rainer Martin, Masoumeh Azarpour, Gerald Enzner

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 111 - 115

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we propose a novel approach to cepstral smoothing for reducing musical noise fluctuations in binaural speech enhancement. Similar to other methods, our approach computes a preliminary spectral gain function using the magnitude-squared coherence function and applies an instantaneous weighting to the gain function in the cepstral domain. In this contribution, the weighting function is...

chapter

Structural segmentation of Hindustani concert audio with posterior features

Prateek Verma, Vinutha T.P., Parthe Pandit, Preeti Rao

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 136 - 140

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Structural segmentation of music involves identifying boundaries between homogenous regions where the homogeneity involves one or more musical dimensions, and therefore depends on the musical genre. In this work, we address the segmentation of Hindustani instrumental concert recordings at the highest time-scale, that is, concert sections marked by prominent changes in rhythmic structure. Tempo features...

INFONA - science communication portal

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Implementation of interconnective systems

Pedestrian localization in moving platforms using dead reckoning, particle filtering and map matching

A novel image secret sharing scheme with meaningful shares

A compact representation of sensor fingerprint for camera identification and fingerprint matching

Quantile analysis of image sensor noise distribution

A workload balanced parallel view synthesis for FTV

A data-driven color feature learning scheme for image retrieval

Dynamic ROI based on K-means for remote photoplethysmography

Detecting rare events using Kullback-Leibler divergence

Transmission distortion modeling for view synthesis prediction based 3-D video streaming

The efficiency of view synthesis prediction for 3D video coding: A spectral domain analysis

Reduced-rank condensed filter dictionaries for inter-picture prediction

Mode Dependent Vector Quantization with a rate-distortion optimized codebook for residue coding in video compression

Error diffused intra prediction for HEVC

Singing voice detection with deep recurrent neural networks

Exploring multi-channel features for denoising-autoencoder-based speech enhancement

Latent time-frequency component analysis: A novel pitch-based approach for singing voice separation

Binaural speech enhancement with instantaneous coherence smoothing using the cepstral correlation coefficient

Structural segmentation of Hindustani concert audio with posterior features

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)