In applications where shape is used as metadata to describe image and video content, image-dependent techniques exhibit superior error concealment performance compared to the traditional MPEG-4-like image-independent techniques. The image corresponding to a particular shape may not be readily known at the decoder and instead needs to be retrieved from a set of images. This paper presents a novel...
In this paper, an intra coding method is proposed that aims to efficiently code the smooth regions in depth map images. One of the characteristics of typical depth map images is that they contain large, gradually changing smooth areas. Traditional block-based transform coding algorithms cause blocking artifacts in this type of region. Instead of using transform coding, the proposed method...
In this paper, a highly efficient pixel compensator architecture for the H.264/AVC standard is proposed which can provide both inter and intra prediction functions for luma and chroma components of pixels. By decomposing the algorithms used for both prediction methods into small micro-operation steps, a suitable common arithmetic unit architecture capable of performing these operations has been determined...
Motion compensated prediction (MCP) plays an important role in video coding due to its great capability of reducing temporal redundancy. In this paper, we propose a new MCP scheme by adaptive patch matching with the full use of the reconstructed pixels surrounding the current block (referred to as the template inside the patch) aiming at achieving a more accurate prediction than conventional MCP....
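The template idea in the abstract above can be illustrated with plain template matching. This is a minimal sketch, not the authors' adaptive patch matching scheme: the function names, the L-shaped template layout, the SAD cost, and the tiny search window are all assumptions made for illustration. The decoder compares the reconstructed pixels around the current block with the same-shaped region at each candidate position in a reference frame, and copies the block at the best match.

```python
def template_pixels(frame, y, x, size, t):
    """Collect an L-shaped template: t rows above and t columns left of the block at (y, x)."""
    top = [frame[r][c] for r in range(y - t, y) for c in range(x - t, x + size)]
    left = [frame[r][c] for r in range(y, y + size) for c in range(x - t, x)]
    return top + left


def predict_block(cur, ref, y, x, size=2, t=1, search=2):
    """Predict the block at (y, x) from ref by matching its surrounding template (hypothetical helper).

    Only the template pixels of cur are read, so the block itself may be unknown.
    Returns the predicted block and the implied displacement (dy, dx).
    """
    target = template_pixels(cur, y, x, size, t)
    height, width = len(ref), len(ref[0])
    best_cost, best_pos = None, None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            ry, rx = y + dy, x + dx
            # Skip candidates whose template or block falls outside the reference frame.
            if ry - t < 0 or rx - t < 0 or ry + size > height or rx + size > width:
                continue
            candidate = template_pixels(ref, ry, rx, size, t)
            cost = sum(abs(a - b) for a, b in zip(target, candidate))  # SAD over the template
            if best_cost is None or cost < best_cost:
                best_cost, best_pos = cost, (ry, rx)
    ry, rx = best_pos
    block = [row[rx:rx + size] for row in ref[ry:ry + size]]
    return block, (ry - y, rx - x)


# Toy data: the current frame is the reference frame shifted right by one pixel.
ref = [[r * 10 + c for c in range(6)] for r in range(6)]
cur = [[r * 10 + (c - 1) for c in range(6)] for r in range(6)]
pred, mv = predict_block(cur, ref, y=2, x=3)
print(pred, mv)
```

Because the match uses only already-reconstructed pixels, a decoder can repeat the same search and no displacement needs to be transmitted in this simplified setting.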
Conventional multi-view video coding (MVC) requires high encoding complexity and high rate data transmission from cameras to a joint encoder. Distributed coding theories guarantee that low complexity and independent encoders can achieve similar coding efficiency as traditional methods. However, there is still a big performance gap between current distributed multi-view video coding (DMVC) schemes...
This paper proposes a novel method that utilizes inter-view correlation in order to reduce the bitrate required to represent the prediction mode including the intra prediction direction, the motion vectors, the reference picture indexes, and the sub-block partitioning. The proposed method derives these kinds of macroblock (MB) information at the decoder side by comparing each image signal predictor...
Multiview video coding (MVC) improves the coding efficiency by motion estimation (ME) and disparity estimation (DE). ME and DE at the encoder side involve heavy computation, which needs to be further reduced for practical applications. This paper presents fast disparity estimation by utilizing depth information to reduce DE's computational complexity. First, the coordinate offset of the encoding block...
In this paper, a perceptual multiview video coding scheme is proposed, based on the synthesized Just Noticeable Distortion (JND) maps. In JND-based perceptual video coding, the residues after intra or inter prediction are tuned according to the corresponding JND thresholds to save the bits without affecting the perceptual quality. In our scheme, to reduce the computational cost of generating the multiview...
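The residue tuning step described in the abstract above can be sketched as follows. This is an illustrative assumption about how a JND threshold might be applied (zero out residual samples below the threshold and shrink the rest toward zero by the threshold); the abstract does not give the exact rule, and the name `tune_residual` is hypothetical.

```python
def tune_residual(residual, jnd):
    """Shrink each prediction residual toward zero by its JND threshold (assumed rule)."""
    tuned = []
    for r, t in zip(residual, jnd):
        if abs(r) <= t:
            tuned.append(0)      # distortion stays below the visibility threshold
        elif r > 0:
            tuned.append(r - t)  # reduce magnitude, keep sign
        else:
            tuned.append(r + t)
    return tuned


residual = [5, -2, 1, -7, 0]
jnd = [3, 3, 3, 3, 3]
tuned = tune_residual(residual, jnd)
print(tuned)  # [2, 0, 0, -4, 0]
```

Smaller residual magnitudes generally cost fewer bits after transform and entropy coding, which is where the saving would come from under this assumption.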
In this paper, we present an efficient side information extrapolation scheme with temporal and spatial consistency for low-delay Wyner-Ziv video coding. Our method is based on the regularized local linear regression (RLLR) model, in which each pixel in SI is approximated as a linear weighted combination of samples within a local temporal neighborhood. The optimal model parameters are estimated by...
Stereoscopic video is an important format for 3-D video applications, and robust stereoscopic video transmission has posed a technical challenge for stereoscopic video coding. In this paper, an auto-regressive (AR) model based error concealment scheme is proposed for stereoscopic video coding to address this problem. The proposed error concealment scheme includes a temporal AR model for...
To further improve video coding efficiency, a novel adaptive sub-pixel interpolation filter is presented in this paper. Considering the local image characteristics, the proposed method designs interpolation filters for sub-pixels in low-frequency and high-frequency areas separately. To reduce the header information, flexible symmetry is assumed for each filter. Experimental results...
A first-order-residual/second-order-residual (FOR/SOR) based video coding algorithm that incorporates the super macroblock (SMB) and the inter-frame stripe prediction (ISP) technique is proposed for high definition (HD) video coding in this work. We first examine the limitation of the SMB in high-bit-rate coding, and show that a simple extension of the block-size is not sufficient to get a significant...
In this paper, a low-complexity motion-based saliency map estimation method for perceptual video coding is proposed. The method employs a camera motion compensated vector map computed by means of a hierarchical motion estimation (HME) procedure and a Restricted Affine Transformation (RAT)-based modeling of the camera motion. To allow for a computationally efficient solution, the number of layers of...
When 3D video is transmitted over lossy channels, different strategies can be adopted at the different layers of the communication protocol stack to cope with such errors, in a cross-layer operation framework. At the receiver side, effective error concealment methods are necessary to overcome the quality loss due to erroneous packets. The existing concealment techniques for conventional 2D video can...
Embedded consumer electronics like video processing systems require large storage capacity and high bandwidth memory access. Also, those systems are built from heterogeneous processing units, designed specifically to perform dedicated tasks in order to maximize the processing power. A single off-chip memory is shared between the processing units to reduce power and save costs. The external memory...
The H.264/AVC intra-frame codec is widely used to compress image/video data for applications like Digital Still Camera (DSC), Digital Video Camera (DVC), Television Studio Broadcast, and Surveillance video. Intra-prediction is one of the top 3 compute-intensive processing functions in the H.264/AVC baseline decoder and therefore consumes a significant number of a processor's compute cycles. In this...
Integral imaging, also known as holoscopic imaging, appears to be a promising approach for glassless 3D. This paper presents the general concepts of integral imaging and lenticular lenses, which are used in the image acquisition and displaying step. Special attention is devoted to the analysis of 3D holoscopic video compression considering its intrinsic nature of having a high cross-correlation between...
Motion estimation is the most complex part of an H.264 encoder, taking more than 90% of the encoding time in the baseline profile in test encodings with JM15.1. In the motion estimation stage, macroblock cost calculation is very important for motion vector prediction and bitstream estimation. As a new characteristic introduced in H.264, fractional ME is an essential part for improving encoder efficiency...
This paper proposes a video compression scheme, the four-dimensional vector matrix DCT integer transform (4D-VMICT) codec, based on multi-dimensional vector matrix theory. In the scheme we propose two models for the codec, which can eliminate temporal, spatial and statistical redundancy among the pixels in a video sequence by use of the excellent concentration of the 4D-VMICT. Then an e-exponential function...
We present a new graph-based transform for video signals using wavelet lifting. Graphs are created to capture spatial and temporal correlations in video sequences. Our new transforms allow spatial and temporal correlation to be jointly exploited, in contrast to existing techniques, such as motion compensated temporal filtering, which can be seen as "separable" transforms, since spatial and...