In this paper, a double-filter deblocking filter design is proposed to support the macroblock adaptive frame field (MBAFF) coding of the H.264/AVC main and high profiles. Without using SRAM for local memory, the proposed design adopts register arrays with a novel data exchange mechanism to efficiently reuse the intermediate data while filtering pictures coded in MBAFF format. Cooperated with the proposed...
The major challenge in wireless multicast is the heterogeneous channel conditions of multiple users. In video multicast, the combination of a layered video coding scheme and a layered transmission scheme can gracefully accommodate user heterogeneity. This paper presents MixCast, a novel physical layer scheme for layered transmission. The key innovation in MixCast is the rateless Euclidean symbol mapping...
GPUs are widely used for their parallel processing capabilities and are also of interest in 3D image processing, where computationally demanding tasks are essential. We concentrate particularly on free-view 3DTV applications, based on image warping to the new view and image artifact reduction techniques. In this paper, we report on the implementation of an efficient free-viewpoint DIBR...
A just noticeable difference (JND) model can reflect the least perceptible distortion in images, including 2D images and stereoscopic images. For the human visual system (HVS), stereoscopic images have quite different characteristics from 2D images, since stereoscopic images contain not only planar information but also depth information. This paper proposes a joint JND (JJND)...
Low-power and low-cost distributed wireless video sensors play important roles for applications in machine-to-machine (M2M) and wireless sensor networks. Distributed video coding (DVC), an emerging coding technology based on Wyner-Ziv theory, seems to be a possible solution for implementing low-power video sensors since most of the computational complexity is moved from the encoder to the decoder...
The increase in multimedia services delivered over packet-based networks has raised end-users' quality expectations. This has led to intensive research on techniques for evaluating the quality of experience perceived by the viewers of audiovisual content, considering the different degradations that it could suffer along the broadcasting system. In this paper, a comprehensive study...
This report proposes an integer color transform for lossless coding of four color components of images. An existing color transform has a fixed set of coefficients and therefore cannot adapt to each image. We utilize the eigenvectors of the covariance of the four components to increase data compaction performance. We also utilize a fixed relation between the two green components to simplify computational...
We propose a novel learning-based method for single image super-resolution (SR). Given a low-resolution input image and its image pyramid, we advance a context-constrained image segmentation to construct a super-pixel database with different context categories for learning purposes. By utilizing context-specific image sparse representation, our method aims at modeling the relationship between the...
This paper introduces a novel scalable 3D mesh compression technique based on a shape approximation prediction strategy. The proposed approach, so-called Shape Approximation Compression (SAC), directly compresses the levels of detail (LoDs) defined by the content creators, while exploiting their inter-correlations. Here, the geometry of each LoD is used in order to compute a smooth approximation of...
In this paper, a low-power parallel surveillance video encoding system based on joint power-speed scheduling is proposed. The relationships among the CPU statuses, total power consumption and encoding speed are analyzed and modeled for multi-core processors. Based on the power directional graph and the relative encoding speed model, the working statuses of the cores are controlled jointly...
Stereoscopic video transmission systems have now evolved from 2D video systems and have been commercialized for a number of application areas, driven by developments in stereo capturing and display technology. With the new developments in autostereoscopic display technology, these stereo systems need to further advance towards 3D video systems. In contrast to all previous video coding technologies,...
Traditional Fourier transform profilometry (FTP) removes the zero spectrum of the acquired fringe image using a linear filtering approach. Such an approach assumes there is no aliasing between the zero spectrum and the higher harmonics of the image, which, however, is not true in general. Thus it cannot adapt to sharp illuminant changes in the fringe image. One practical solution is to exploit one more...
In AVS and H.264/AVC, Lagrangian rate-distortion (RD) optimization techniques are widely adopted for coding mode selection and displacement vector estimation. The optimal Lagrange multipliers in these two cases are both floating-point values. If an RD-optimized video encoder is implemented on a computation-constrained fixed-point platform such as an FPGA or ASIC, fixed-point Lagrange multiplier selection...
Compound images/videos are a mixture of text, graphics and natural images/video. Despite extensive research on compound image coding, little research on compound video coding has been reported in the literature. This paper proposes three approaches to exploit inter-frame correlations in compound video. One is the motion compensation aided base color (MCA-BC) approach, which handles a special type of...
This paper presents a multiple-image compressed encryption scheme based on the compressive holography technique. A computer-generated hologram (CGH) is implemented to record multiple images simultaneously into an encrypted hologram. Because its two-dimensional (2D) Fourier transform (FT) result is analogous to a partial 3D Fourier transform sampling, the 2D FT result can be compressed by a nonuniform...
Existing image codec technologies are based on transforms, which make the image signal compressible, while quantization is used to control bit rates. Compressive sensing (CS), a novel signal processing and recovery method, can be applied to image decoding to replace inverse transform reconstruction. This paper proposes an error estimation method based on equalization quantization noise...
The denoising performance of the Non-Local Means (NLM) method decreases as the variance of additive white Gaussian noise becomes higher. In this paper, we explain this phenomenon and propose a modified version of the Non-Local Means (NLM) method, called the Enhanced-Weights NLM (EWNLM) algorithm, to denoise highly noisy images. The EWNLM algorithm evaluates weights from a pre-filtered image using...
In this paper, we propose a fast mode decision algorithm for both intra prediction and inter prediction in depth video sequences. The proposed algorithm reduces the complexity of depth video coding. According to the depth variation, depth video can be classified into depth-continuity and depth-discontinuity regions. From experiments, we determine a threshold value for classifying these...
Interactive object segmentation is widely used for extracting objects of interest to the user from natural images. A common problem with many interactive segmentation approaches is that the object segmentation quality is degraded by inaccurate object/background seeds provided by the user. This paper proposes an iterative adjustable graph cut to efficiently solve this problem. First, object/background...
When video streams are transmitted over unreliable networks, forward error correction (FEC) codes are usually used to protect them. Reed-Solomon codes are block-based FEC codes. On the one hand, enlarging the block size can enhance the performance of Reed-Solomon codes. On the other hand, a large Reed-Solomon block size leads to a long delay, which is not tolerable for real-time video applications...