Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on

chapter

Video coding based on audio-visual attention

Jong-Seok Lee, F. De Simone, T. Ebrahimi

2009 IEEE International Conference on Multimedia and Expo > 57 - 60

2009 IEEE International Conference on Multimedia and Expo (ICME)

This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans' perception of multimedia content. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to video frames in order to...

chapter

Fast multi-reference motion estimation via statistical learning for H.264/AVC

Chen-Kuo Chiang, Shang-Hong Lai

2009 IEEE International Conference on Multimedia and Expo > 61 - 64

2009 IEEE International Conference on Multimedia and Expo (ICME)

In the H.264/AVC coding standard, motion estimation (ME) is allowed to use multiple reference frames to make full use of reducing temporal redundancy in a video sequence. Although it can further reduce the motion compensation errors, it introduces tremendous computational complexity as well. In this paper, we propose a statistical learning approach to reduce the computation involved in the multireference...

chapter

Block-based color correction algorithm for multi-view video coding

Boxin Shi, Yangxi Li, Lin Liu, Chao Xu

2009 IEEE International Conference on Multimedia and Expo > 65 - 68

2009 IEEE International Conference on Multimedia and Expo (ICME)

The color variations among different viewpoints in multiview video sequences may deteriorate the visual quality and coding efficiency. Various color correction methods have been proposed, however, the color appearance and histogram of corrected target frames are not similar enough to the reference frames in details. Focusing on restoring more similar color, a block-based color correction algorithm...

chapter

A Multi-layer motion estimation scheme for spatial scalability in H.264/AVC scalable extension

Sangkwon Na, Chong-Min Kyung

2009 IEEE International Conference on Multimedia and Expo > 69 - 72

2009 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, we propose a fast multi-layer motion estimation algorithm for spatial scalability provided in H.264/AVC scalable extension, based on the reuse of the motion vectors from multiple spatial layers. The reused motion vector is used to set a search center and refined within a small search area. However, the reused motion vector often produces significant prediction error at object boundaries...

chapter

Kurtosis-based super-resolution algorithm

Jianping Qiao, Ju Liu, Xiangzeng Meng, Wan-Chi Siu

2009 IEEE International Conference on Multimedia and Expo > 73 - 76

2009 IEEE International Conference on Multimedia and Expo (ICME)

A kurtosis-based super-resolution image reconstruction algorithm is proposed in this paper. Firstly, we give the definition of the kurtosis image and analyze its two properties: (i) the kurtosis image is Gaussian noise invariant, and (ii) the absolute value of a kurtosis image becomes smaller as the the image gets smoother. Then we build a constrained absolute local kurtosis maximization function...

chapter

A robust spatial-temporal line-warping based deinterlacing method

Shing-Fat Tu, O.C. Au, Yannan Wu, Enming Luo, more

2009 IEEE International Conference on Multimedia and Expo > 77 - 80

2009 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, a line-warping based deinterlacing method will be introduced. The missing pixels in interlaced videos can be derived from the warping of pixels in horizontal line pairs. In order to increase the accuracy of temporal prediction, multiple temporal-line pairs, selected according to constant velocity model, are used for warping. The stationary pixels can be well-preserved by accuracy stationary...

chapter

Fast directional image interpolation with difference projection

Zhiwei Xiong, Yonghua Zhang, Xiaoyan Sun, Feng Wu

2009 IEEE International Conference on Multimedia and Expo > 81 - 84

2009 IEEE International Conference on Multimedia and Expo (ICME)

This paper presents a new directional image interpolator, aiming to increase image resolution with high perceptual quality and low computational complexity. In our method, missing pixels in a magnified image are generated through linear interpolation on certain fixed supports to facilitate fast implementation, while local directional features are imposed on the adaptive interpolation weights which...

chapter

A new IKONOS imagery fusion approach using particle swarm optimization

Hsuan-Ying Chen, Jin-Jang Leou

2009 IEEE International Conference on Multimedia and Expo > 85 - 88

2009 IEEE International Conference on Multimedia and Expo (ICME)

Spatial resolutions of IKONOS high-resolution panchromatic (PAN) and low-resolution multispectral (MS) satellite images are 1 m and 4 m, respectively. To cope with color distortion and blocking artifacts in fused images, in this study, a new IKONOS imagery fusion approach using particle swarm optimization (PSO) is proposed. The pixels of fused images in the training set are classified into several...

chapter

Perceptual compressive sensing for image signals

Yi Yang, O.C. Au, Lu Fang, Xing Wen, more

2009 IEEE International Conference on Multimedia and Expo > 89 - 92

2009 IEEE International Conference on Multimedia and Expo (ICME)

Human eyes have different sensitivity to different frequency components of image signals, typically, low frequency components are relatively more crucial to the perceptual quality of images than high frequency components. Based on this observation, we propose a novel sampling scheme for compressive sensing framework by designing a weighting scheme for the sampling matrix. By adjusting the weighting...

chapter

Accurate and efficient stereo matching with robust piecewise voting

Ke Zhang, Jiangbo Lu, G. Lafruit, R. Lauwereins, more

2009 IEEE International Conference on Multimedia and Expo > 93 - 96

2009 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, we propose an efficient local stereo algorithm for accurate disparity estimation. First, we attain initial disparity estimates by iterating a cross-based cost aggregation process. Then, we propose a robust voting scheme to refine the initial estimates based on a piecewise smoothness prior, improving the quality in occluded regions and low-textured regions effectively. The refinement...

chapter

Bandwidth extension of low bitrate compressed audio based on statistical conversion

D. Cantzos, A. Mouchtaris, C. Kyriakakis

2009 IEEE International Conference on Multimedia and Expo > 97 - 100

2009 IEEE International Conference on Multimedia and Expo (ICME)

Algorithmic and protocol constraints of most low bitrate compression schemes lead to audio signals of low bandwidth and, inevitably, of low perceptual audio quality. Audio bandwidth extension methods address this problem by reconstructing the high frequency spectrum of a degraded signal based on information from the low frequency part. In this work, a novel audio bandwidth extension method is presented...

chapter

Low complexity intra mode selection for efficient distributed video coding

J. Ascenso, F. Pereira

2009 IEEE International Conference on Multimedia and Expo > 101 - 104

2009 IEEE International Conference on Multimedia and Expo (ICME)

Motion compensated frame interpolation (MCFI) is one of the most efficient solutions to generate side information (SI) in the context of distributed video coding. However, it creates SI with rather significant motion compensated errors for some frame regions while rather small for some other regions depending on the video content. In this paper, a low complexity intra mode selection algorithm is proposed...

chapter

Motion tubes for the representation of image sequences

M. Urvoy, N. Cammas, S. Pateux, O. Deforges, more

2009 IEEE International Conference on Multimedia and Expo > 105 - 108

2009 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, we introduce a novel way to represent an image sequence, which naturally exhibits the temporal persistence of the textures. Standardized representations have been thoroughly optimized, and getting significant improvements has become more and more difficult. As an alternative, Analysis-Synthesis (AS) coders have focused on the use of texture within a video coder. We introduce here a...

chapter

Entropy constrained color splitting for palette images

En-hui Yang, Longji Wang

2009 IEEE International Conference on Multimedia and Expo > 109 - 112

2009 IEEE International Conference on Multimedia and Expo (ICME)

This paper proposes two entropy constrained color splitting algorithms through building a binary tree structure for a progressive transmission of palette images. At each step of color splitting, a representative color is split into two new representative colors to minimize the distortion incurred by the reconstructed image subject to an entropy constraint. Among the bit rates of interest, both of...

chapter

A fast multiview video coding algorithm based dynamic multi-threshold

Zongju Peng, Gangyi Jiang, Mei Yu

2009 IEEE International Conference on Multimedia and Expo > 113 - 116

2009 IEEE International Conference on Multimedia and Expo (ICME)

A fast macroblock mode selection algorithm based on dynamic multi-threshold is proposed to improve the encoding speed of multiview video, but with insignificant degradation in rate distortion (RD) performance. The macroblock modes are divided into four classes after statistically analyzing the macroblock mode selection results. Three thresholds are adopted based on the great RD cost gaps between the...

chapter

Optimal joint linear acoustic echo cancelation and blind source separation in the presence of loudspeaker nonlinearity

M. Souden, Zicheng Liu

2009 IEEE International Conference on Multimedia and Expo > 117 - 120

2009 IEEE International Conference on Multimedia and Expo (ICME)

Acoustic echoes represent a major source of discomfort in hands free, full-duplex, communication systems. The problem becomes particularly difficult when the loudspeakers are nonlinear as considered in this paper. In contrast to the single-microphone linear and nonlinear acoustic echo cancellation techniques, we take advantage of the spatial diversity offered by the microphone arrays. Indeed, having...

chapter

Robust language identification based on fused phonotactic information with MLKSFM pre-classifier

Liang Wang, E. Ambikairajah, E.H.C. Choi

2009 IEEE International Conference on Multimedia and Expo > 121 - 124

2009 IEEE International Conference on Multimedia and Expo (ICME)

In this paper we propose a novel language identification system which utilizes fused phonotactic information. The phase spectrum of speech signals is used with the magnitude spectrum in order to obtain a more robust feature representation. Parallel Broad Phoneclass Recognition followed by Language Model (PBPRLM) is used in order to remove the bias of the likelihood scores introduced by the size inequality...

chapter

Color filter array demosaicking using joint bilateral filter

Meng-Che Chuang, Yi-Nung Liu, Tsung-Huang Chen, Shao-Yi Chien

2009 IEEE International Conference on Multimedia and Expo > 125 - 128

2009 IEEE International Conference on Multimedia and Expo (ICME)

Bilateral filter has shown its outstanding performance in image denoising and other multimedia applications. In this paper, a new color interpolation technique named joint bilateral demosaicking is proposed. Considering the image gradient, an edge-sensing initialization step is performed. In addition, joint bilateral filter exploits the correlation between color channels with the information from...

chapter

Empirical mode decomposition descriptor for plane closed curves

Soo-Chang Pei, Yu-Zhe Hsiao, Chia-Ying Lee

2009 IEEE International Conference on Multimedia and Expo > 129 - 132

2009 IEEE International Conference on Multimedia and Expo (ICME)

Empirical mode decomposition (EMD) developed by Huang et al. is a nonlinear data analysis method for nonstationary real-valued time series. It has been applied extensively in many research areas. Recently, several generalized EMD methods for complex-valued data analysis was proposed. Since a plane closed curve comprises many two-dimensional (2D) space data points, one can imagine that the boundary...

chapter

Acoustic modeling using an extended phone set considering cross-lingual pronunciation variations

Dau-Cheng Lyu, Ren-Yuan Lyu, Ming-Tat Ko

2009 IEEE International Conference on Multimedia and Expo > 133 - 136

2009 IEEE International Conference on Multimedia and Expo (ICME)

To deal with the issue of data unbalanced condition among a task of multilingual speech recognition and a phenomenon of pronunciation variations across languages, we propose an approach to clustering context dependent phones from an extended phone set in an acoustic model trained on a data unbalanced bilingual corpus. First, we generate an extended phone set using pronunciation modeling by a confidence...

INFONA - science communication portal

2009 IEEE International Conference on Multimedia and Expo

Video coding based on audio-visual attention

Fast multi-reference motion estimation via statistical learning for H.264/AVC

Block-based color correction algorithm for multi-view video coding

A Multi-layer motion estimation scheme for spatial scalability in H.264/AVC scalable extension

Kurtosis-based super-resolution algorithm

A robust spatial-temporal line-warping based deinterlacing method

Fast directional image interpolation with difference projection

A new IKONOS imagery fusion approach using particle swarm optimization

Perceptual compressive sensing for image signals

Accurate and efficient stereo matching with robust piecewise voting

Bandwidth extension of low bitrate compressed audio based on statistical conversion

Low complexity intra mode selection for efficient distributed video coding

Motion tubes for the representation of image sequences

Entropy constrained color splitting for palette images

A fast multiview video coding algorithm based dynamic multi-threshold

Optimal joint linear acoustic echo cancelation and blind source separation in the presence of loudspeaker nonlinearity

Robust language identification based on fused phonotactic information with MLKSFM pre-classifier

Color filter array demosaicking using joint bilateral filter

Empirical mode decomposition descriptor for plane closed curves

Acoustic modeling using an extended phone set considering cross-lingual pronunciation variations

Filter options

Publication date

Keywords

INFONA - science communication portal

2009 IEEE International Conference on Multimedia and Expo $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2009 IEEE International Conference on Multimedia and Expo