In this work a new set of edge-adaptive transforms (EATs) is presented as an alternative to the standard DCTs used in image and video coding applications. These transforms avoid filtering across edges in each image block and thus avoid creating large high-frequency coefficients. These transforms are then combined with the DCT in H.264/AVC and a transform mode selection algorithm is used to choose...
AVS-Part 2 is a mainstream video coding standard with high compression efficiency similar to that of H.264/AVC. A technique named Second Order Prediction (SOP), based on H.264/AVC, has been presented to decrease the signal correlation that remains after motion-compensated prediction. To achieve better coding performance, this paper presents a method named Block-based Second Order Prediction (BSOP) to improve SOP to adapt...
In this paper we propose a novel approach to error concealment that can be applied to MVC and other 3D video coding technologies. Image content that is lost due to errors is recovered using multiple error-concealment techniques. In our work we have used three techniques: the well-known temporal and intra-based techniques and a novel inter-view technique. The proposed inter-view recovery employs...
In 3D video communication, selecting an appropriate quantization parameter (QP) for depth map coding is very important for obtaining optimal view synthesis quality. This paper first analyzes two kinds of view synthesis error induced by depth uncertainty, namely the view synthesis error induced by the original depth error and the view synthesis error induced by depth compression, and then proposes...
This paper proposes a high-performance intra prediction architecture that supports the H.264/AVC High Profile. The proposed MB/block co-reordering avoids data dependency and improves pipeline utilization. As a result, the timing constraint of real-time 4k×2k encoding can be met with negligible quality loss. A 16×16 prediction engine and an 8×8 prediction engine work in parallel for prediction and coefficients...
This paper presents an efficient forward inter prediction method for video coding, targeting low-delay applications. The method applies the idea of template matching averaging (TMA) to conventional motion compensated prediction (MCP). TMA forms the final predictor of a target block by averaging multiple candidates. While one of the candidates is specified by a motion vector, the remaining...
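The averaging step named in the abstract above can be illustrated with a minimal sketch. It assumes equally sized candidate blocks held as nested lists of integer samples and a plain unweighted average. The function name `average_candidates` is hypothetical, and the way the remaining candidates are found is not covered here because the excerpt is truncated at that point.

```python
def average_candidates(candidates):
    """Average equally sized candidate blocks element-wise into one predictor.

    `candidates` is a list of 2-D blocks (lists of rows of integer samples).
    This is an illustrative assumption, not the paper's exact procedure.
    """
    if not candidates:
        raise ValueError("at least one candidate block is required")
    n = len(candidates)
    rows, cols = len(candidates[0]), len(candidates[0][0])
    predictor = []
    for r in range(rows):
        row = []
        for c in range(cols):
            total = sum(block[r][c] for block in candidates)
            # Integer average rounded to the nearest value (ties round up).
            row.append((total + n // 2) // n)
        predictor.append(row)
    return predictor


if __name__ == "__main__":
    mv_candidate = [[10, 20], [30, 40]]      # e.g. the block a motion vector points to
    other_candidate = [[20, 20], [30, 41]]   # e.g. one additional candidate
    print(average_candidates([mv_candidate, other_candidate]))
```

Averaging several candidates tends to suppress noise that is uncorrelated between them, which is the usual motivation for multi-hypothesis prediction of this kind.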
In this paper, an intelligent low-cost mode decision (MD) procedure is proposed for inter-view frames in multi-view video coding (MVC). Multiple parameters, such as macroblock (MB) energy and view/temporal domain mode similarities, are used to reduce the number of mode candidates so that the corresponding Rate-to-Distortion Cost (RDCost) computation can be eliminated. The simulation...
We consider the scenario of view synthesis via depth-image based rendering in multi-view imaging. We formulate a resource allocation problem of jointly assigning an optimal number of bits to compressed texture and depth images such that the maximum distortion of a synthesized view over a continuum of viewpoints between two encoded reference views is minimized, for a given bit budget. We construct...
The future of video coding for 3DTV lies in the combination of depth maps and corresponding textures. Most current video coding standards, however, are only optimized for visual quality and are not able to efficiently compress depth maps. We present in this work a content adaptive depth map meshing with tritree and entropy encoding for 3D videos. We show that this approach outperforms the intra frame...
The multi-view video plus depth (MVD) format, which consists of texture and depth images, has recently been presented as a video representation that supports depth perception of scenes and efficient view generation at arbitrary positions. In particular, the depth image has been one of the most important issues for the success of highly advanced multimedia video applications, such as three-dimensional...
Colorization is a method that adds color components to a grayscale image using only a few representative pixels provided by the user. A novel approach to image compression called colorization-based coding has recently been proposed. It automatically extracts representative pixels from an original color image at an encoder and restores a full color image by using colorization at a decoder. However,...
We propose a novel 3D space representation for multi-view video, using epipolar plane depth images (EPDI). Multi-view video plus depth (MVD) is used as a common data format for FTV (Free-viewpoint TV), which enables the synthesis of virtual view images. Due to the large amount of data and the complexity of multi-view video coding (MVC), compression of MVD is a challenging issue. We address this problem and propose...
In this work, we describe preliminary results related to an ultra-low power vision sensor, embedding contrast extraction and output data compression. The sensor consumes less than 30 μW at 3.3 V and 30 frames/s, combining data compression with event detection capabilities and an efficient sensor interface. This makes the presented sensor suitable for energy-autonomous...
In an H.264/AVC intra-frame encoder, the high-complexity mode decision evaluates all modes to select the one that achieves the minimum rate-distortion (RD) cost, which requires many computations and takes up much of the encoding time. These complicated computations make real-time applications difficult. In this paper, we propose a fast and efficient algorithm...
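As a rough illustration of the exhaustive mode decision that the abstract above describes as the baseline, the sketch below evaluates every candidate mode and keeps the one with the lowest RD cost. It assumes the common Lagrangian form J = D + λ·R, which the abstract does not spell out. The function name, the mode labels, and the distortion and rate figures are hypothetical placeholders, not values from the paper.

```python
def select_best_mode(candidates, lagrange_multiplier):
    """Return the (mode, cost) pair with the minimum cost J = D + lambda * R.

    `candidates` is an iterable of (mode, distortion, rate_bits) tuples.
    The Lagrangian cost form is an assumption made for this sketch.
    """
    best_mode, best_cost = None, float("inf")
    for mode, distortion, rate_bits in candidates:
        cost = distortion + lagrange_multiplier * rate_bits
        if cost < best_cost:
            best_mode, best_cost = mode, cost
    return best_mode, best_cost


if __name__ == "__main__":
    # Hypothetical per-mode measurements: (mode, distortion, rate in bits).
    modes = [("DC", 100, 10), ("Vertical", 80, 30), ("Horizontal", 90, 12)]
    print(select_best_mode(modes, lagrange_multiplier=2))
```

The loop itself is cheap. In a real encoder the expense lies in obtaining the distortion and rate for each mode, since each one requires prediction, transform, quantization, and entropy coding, which is why fast algorithms try to prune the candidate list before this step.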
This paper proposes a reversible watermarking technique that utilizes the IPCM macroblocks generated by the H.264/AVC encoder. The technique applies the concept of Differential Expansion (DE) to the pixels of the luminance and the chrominance components within the IPCM macroblocks of the P-frames. The performance of the proposed algorithm was tested for a number of video sequences and compared to...
We propose a novel 8 bpp YUV subsampling pattern based on checkerboard subsampling of the luminance component that explicitly preserves edges. The proposed pattern uses 1 bit to encode the edge direction at each missing luminance pixel, and this bit is stored in the chroma sample, since the chroma sample is DPCM encoded from 8 to 7 bits per sample. The complexity analysis of both encoder and decoder is concluded...
We propose a new H.264 integer-pixel motion estimation algorithm and circuit architecture to improve processing speed. The proposed circuit supports seven variable block sizes and generates 41 motion vectors. Based on the proposed algorithm and architecture, our circuit processes 60 image frames per second for 1080HD video at an operating frequency of 45.5 MHz. We implemented an MPW chip using...
This paper studies how resizing an image with different interpolation methods affects fractal image compression. The aim is to reduce the search time for matching range blocks with domain blocks. The main drawback of fractal image compression is its long computation time, which is caused by the global search. In order to reduce the computational time with acceptable...
In the scalable video coding extension of the H.264/AVC standard, an exhaustive search technique is used to select the best coding mode for each macroblock. This technique achieves the highest possible coding efficiency, but its high computational complexity constrains its use in many practical applications. This paper proposes combined fast sub-pixel motion estimation...
Encoding RGB color signals is desirable for high resolution and accurate definition of saturation and hue. This paper presents a region-adaptive inter-color prediction scheme for 4:4:4 RGB signals, which treats the G component as the primary component and predicts the B/R components from the G component with different weighting factors based on different texture regions. The proposed algorithm demonstrates...