Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.
MPEG is currently developing a standard titled Compact Descriptors for Visual Search (CDVS) for descriptor extraction and compression. In this work, we report comprehensive patch-level experiments for a direct comparison of low bitrate descriptors for visual search. For evaluating different compression schemes, we propose a dataset of matching pairs of image patches from the MPEG-CDVS image-level...
Key point features are very effective tools in image matching and key point feature aggregation is an effective scheme for creating a compact representation of the images for visual search. This solution not only achieves compression, but also offers the benefits of better accuracy in matching and indexing efficiency. Research is active in this area and recent results on Fisher Vector based aggregation...
This paper presents an approach for using hierarchically structured multi-view features for mobile visual search. We utilize a graph model to describe the feature correspondences between multi-view images. To add features of images from new viewpoints, we design a level raising algorithm and the associated multi-view geometric verification, which are based on the properties of the hierarchical structure...
For mobile augmented reality, an image captured by a mobile device's camera is often compared against a database hosted on a remote server to recognize objects in the image. It is critically important that the amount of data transmitted over the network is as small as possible to reduce the system latency. A low bitrate global signature for still images has been previously shown to achieve high-accuracy...
Screen content with complex structure contains random combinations of text, graphics and camera-captured images, which makes it difficult to compress efficiently with traditional video codecs. In this paper, we propose a 2-D dictionary based scheme to exploit the repeated patterns in screen content. In the proposed scheme, the current block is predicted from the reconstructed region using a hash-based...
We construct motion-adaptive transforms for image sequences by using the eigenvectors of Laplacian matrices defined on vertex-weighted graphs, where the weights of the vertices are defined by scale factors. The vertex weights determine only the first basis vector of the linear transform uniquely. Therefore, we use these weights to define two Laplacians of vertex-weighted graphs. The eigenvectors of...
The standard Compressive Sensing (CS) theory indicates that robust signal recovery can be obtained from just a small collection of incoherent projections. To further decrease the number of necessary measurements, an alternative to the generic CS framework assumes that signals lie on a union of subspaces (UoS). However, the UoS model is limited to a specific type of signal regularity. This paper considers a more...
Temporal in-loop filters present one possible way to reduce noise introduced in compressed video sequences at low bit rates. Some of these filtering approaches make use of the quantized and generally noisy motion information conveyed in the bit stream generated by the encoder. One key feature of such filters is an adaptive filter length depending on the image content and the quality of the motion...
The issue of backwards-compatible image and video coding has gained some attention in both MPEG and JPEG, be it as an extension for HEVC or as the JPEG XT standardization initiative of the SC29WG1 committee. These coding systems all work on the principle of a base layer, operating in the low-dynamic range regime, using a tone-mapped version of the HDR material as input, and an extension layer invisible...
LS-based adaptation cannot fully exploit high-dimensional correlations in image signals, as a linear prediction model in the input space of supports is ill-suited to capturing higher-order statistics. This paper proposes Gaussian process regression for prediction in lossless image coding. Incorporating kernel functions, the prediction support is projected into a high-dimensional feature space to fit...
In this paper, we present two adaptive edge encoding schemes for operational rate-distortion optimal polygon-based shape coding. The encoding edge is represented by an octant number, a major component, and a minor component, where the ranges of the two components are determined at two levels. At the object level, these ranges are either determined by users or adapted to the contour characteristics...
Objective approaches to 3D image quality assessment play a key role in the development of compression standards and various 3D multimedia applications. Compared with its 2D counterpart, the quality assessment of 3D images faces many new challenges, e.g. asymmetric stereo compression, depth perception, and virtual view synthesis. Moreover, the widely used 2D image quality metric (e.g. PSNR)...