This paper presents a method to extract a rendering matrix for multi-channel audio signals used as an object fed to a Moving Picture Experts Group Spatial Audio Object Coding (MPEG SAOC) encoder. This technique allows MPEG SAOC to transmit multiple multi-channel audio objects, instead of only a single multi-channel background object as specified in the MPEG SAOC standard. Listening tests show that the proposed method...
Coding of signals with a high density of transient events, like applause, rain drops, etc., is usually a difficult task for perceptual audio coders. Especially at low bit rates, coding (and the associated coding noise) leads to smeared transients and a perceived increase in noise-like signal character. We propose a post-processing technique for coded/decoded applause-like signals which is based...
Nowadays, the development of video coders is resulting in significantly increased performance. Among all coder categories, fine grain scalable video coders have the chance to show great advantage by freely extracting code-streams with different bitrates, resolutions, and frame rates from one source-coded bit file to satisfy the various requirements of subscribers in a multicast network. Based on the...
A two-layer coding scheme based on JPEG 2000 for HDR images was proposed. The proposed coding scheme, referred to as extended JPEG XT, is an extension of JPEG XT Profile A and offers the following advantages: flexible bit rate control, high compression efficiency, support for LDR images with more than eight bits, and so on. In this paper, some experimental results for the quality of the reconstructed HDR images...
A new two-layer coding scheme based on JPEG 2000 is proposed for HDR images, where the coded data consist of two layers: a base layer for the tone-mapped LDR version of an HDR image and an enhancement layer. The paper considers the coding scheme as an extended version of JPEG XT Profile A. The JPEG coders used in JPEG XT are replaced with JPEG 2000 ones. Compared to normative JPEG XT coding, the proposed...
360° video streaming to clients using Virtual Reality head mounted displays is a challenge for traditional video delivery. As transmission of the complete content in a desirable quality sacrifices a large fraction of available client and network resources, adaptivity to the user viewport promises substantial benefits. An efficient way to achieve viewport adaptive streaming without per-user or per-orientation...
The upcoming JPEG XT standard for High Dynamic Range (HDR) images defines a common framework for the lossy and lossless representation of high dynamic range images. It describes the decoding process as a combination of various processing tools that can be combined freely. In this paper we analyze the coding efficiency of different decoding tools through large-scale objective quality testing using...
This paper studies the influence of JPEG-XT on LDR generation using TMOs. JPEG-XT encodes HDR images in a two-layer scheme, encoding an LDR version of the image in a base layer and the residual HDR information in an enhancement layer. The question addressed here is whether this model allows a new LDR representation to be extracted using a different TMO, independently of the TMO used to generate...
This paper presents the results of a subjective evaluation experiment, made to compare different HDR coding technologies, conducted at the recent ITU/ISO/IEC VCEG/MPEG/JPEG Meeting in San Diego, CA, February 2016. A set of “anchor” streams, conforming to the HDR10 spec, was compared to a similar rate-matched set obtained using a method called “Reshaper”, which requires normative changes to the underlying...
Plenoptic images are one type of light field content, produced by combining a conventional camera with an additional optical component in the form of microlens arrays, which are positioned in front of the image sensor surface. This camera setup can capture a sub-sampling of the light field with high spatial fidelity over a small range, and with a more coarsely sampled angle range. The...
A novel video coding standard called high efficiency video coding (HEVC)|ITU-T H.265 was released in 2013. HEVC has twice the compression capability of MPEG-4 AVC|ITU-T H.264, a feature that is expected to enhance new broadcasting services. Japanese broadcasters plan to start a new ultra high definition television service using HEVC. To identify the required bit rates for the new broadcasting service,...
A new-generation digital set-top box (STB), as a core device of the smart home, will provide various multimedia services to mobile devices. In this study, we propose a more accurate power consumption model for mobile devices that are receiving video streaming services under the MPEG-DASH framework. The proposed model considers the influence of the encoding parameters specified by the encoder and the CPU...
Scalable video coding enables video content to be compressed into a hierarchical layered representation in which each layer depicts an enhanced version of the underlying layer. SHVC is the scalable extension of HEVC and enables spatial, SNR, color-gamut, codec and bit-depth scalability. It has been proved, in the MPEG investigations prior to the recent Call for Evidence, that SHVC can support SDR-to-HDR scalability...
Saliency-driven image coding is well worth pursuing. Previous studies on JPEG and JPEG2000 have suggested that region-of-interest coding brings little overall benefit compared to the standard implementation. We show that our saliency-driven variable quantization JPEG coding method significantly improves perceived image quality. To validate our findings, we performed large crowdsourcing experiments...
In recent years there has been an increasing interest in the H.264/MPEG-4 Part 10 video coding standard. This is the latest coding standard, and it attains very high data compression. MPEG stands for Moving Picture Experts Group, which was formed by ISO in order to define standards for audio and video coding and also transmission. This paper gives an idea of decorrelation and entropy coding of motion information...
This paper presents a panoramic video transmission system using spatially divided tiles based on the spatial relationship descriptions of MPEG-DASH. The proposed server system provides tiling and encoding functionalities for ROI-based retrieval. Moreover, it guarantees temporal and spatial synchronization when rendering multiple tiles by means of a deterministic tile size and intra-coded frames at the...
The success of audio steganography techniques depends on ensuring the imperceptibility of the embedded secret message in the stego file and on withstanding any form of intentional or unintentional degradation of the secret message (robustness). Crucial to that is the use of a digital audio file such as an MP3 file, which comes in different compression rates; however, research studies have shown that performing steganography in MP3 format...
The paper presents an extension of 3D-HEVC for circular camera arrangements. It generalizes the derivation of the disparity vectors from depth data for sequences captured using cameras located on an arc. The general equation for disparity calculations has been implemented instead of the simplified equation used in 3D-HEVC. Experiments have been performed on widely recognized multiview test sequences...
Traditional image coding has mainly been concerned with rate-distortion performance, in the sense of what rate can be achieved for a given distortion, which is usually measured by either peak signal-to-noise ratio (PSNR) or subjective quality. However, in many visual analysis scenarios, especially for mobile visual search (MVS), the rate-accuracy performance is crucial. In this paper, we propose...
In many mobile visual analysis scenarios, compressed images are transmitted over a communication network for analysis at a server. Often, the processing at the server includes some form of feature extraction and matching. Image compression has been shown to have an adverse effect on feature matching performance. To address this issue, we propose to signal the feature keypoints as side information...