The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents a method to extract rendering matrix on multi-channel audio signals as an object fed to Moving Picture Expert Group Spatial Audio Object Coding (MPEG SAOC) encoder. This technique allows MPEG SAOC to transmit multiple multi-channel audio objects, instead of only a single multi-channel background object as specified in MPEG SAOC standard. Listening tests show that the proposed method...
Virtual reality applications make use of 360-degree panoramic or omnidirectional video with high resolution and high frame rate in order to create the immersive experience to the user. The user views only a portion of the captured 360-degree scene at each time instant, hence streaming the whole omnidirectional video in highest quality is not efficient. In order to alleviate the problem of bandwidth...
Transrating is a bitrate transcoding technique that facilitates video applications in heterogeneous environments. This paper presents a fast coding-unit (CU) mode decision algorithm for transrating a once-encoded HEVC bitstreams to lower bitrate versions of diverse quality levels. The proposed method comprises three major parts. First, an early SKIP decision is based on the modes and motion vectors...
In this paper, quantized and triggered control is utilized to sample output error signal for output feedback synchronization of two Lurie systems (formed by linear part and nonlinear one constructed only by the measurable output) under limited transmission capacity. With the characteristics of output error, we come up with practical quantized and triggered strategy to sample the output error instead...
Panoramic streaming enables users to interactively navigate through high-spatial resolution videos and create an immersive and personalized user experience. Since transmission of high-resolution videos in desirable quality is not feasible given the limited throughput of access and home network links, our work is based on tile-based streaming, where only a spatial subset of the video is transmitted...
Traditional intra prediction schemes usually only use the nearest adjacent reference lineto generate the prediction. Although the nearest reference line generally has the strongeststatistical correlation with current block, the farther non-adjacent reference lines can stillprovide potentialb etter prediction in some cases. Thus, in this paper, not only the nearestreference line but also the farther...
At present, the majority of the studies in the area of multizone sound-field reproduction are focused on the decoding of the soundfield. In this work, we propose an approach to encode the multizone sound-field within the desired reproduction region based on higher order Ambisonics (HOA) formats. The B-format signals for the complex multizone soundfield can be derived based on the coefficients of a...
In this paper, we show that tensor compression techniques based on randomization and partial observations are very useful for spatial audio object coding. In this application, we aim at transmitting several audio signals called objects from a coder to a decoder. A common strategy is to transmit only the downmix of the objects along some small information permitting reconstruction at the decoder. In...
Encoder and decoder implementations of the High Efficiency Video Coding (HEVC) standard have been subject to many optimization approaches since the release in 2013. However, the real-time decoding of high quality and ultra high resolution videos is still a very challenging task. Especially entropy decoding (CABAC) is most often the throughput bottleneck for very high bitrates. Syntax Element Partitioning...
We propose a novel informed source separation method for audio object coding based on a recent sampling theory for smooth signals on graphs. Assuming that only one source is active at each time-frequency point, we compute an ideal map indicating which source is active at each time-frequency point at the encoder. This map is then sampled with a compressive graph signal sampling strategy that guarantees...
Upmixing consists in extracting audio objects out of their downmix, given some parameters computed beforehand at a coding stage. It is an important task in audio processing with many applications in the entertainment industry. One particularly successful approach for this purpose is to compress the audio objects through nonnegative matrix factorization (NMF) parameters at the coder, to be used for...
High Resolution Envelope Processing (HREP) is a new tool for improved perceptual coding of audio signals that predominantly consist of many dense transient events, such as applause, rain drop sounds, etc. These signals have traditionally been very difficult to code for perceptual audio codecs, particularly at low bit rates. Based on the gain control principle, HREP acts as a pre-/post-processor pair...
360° video streaming to clients using Virtual Reality head mounted displays is a challenge for traditional video delivery. As transmission of the complete content in a desirable quality sacrifices a large fraction of available client and network resources, adaptivity to the user viewport promises substantial benefits. An efficient way to achieve viewport adaptive streaming without per-user or per-orientation...
In this paper, we propose a network coding solution to improve transmission reliability of scalable video coding (SVC) streams over lossy networks. Since scalable video coding produces various levels of quality with variable bitrates, we design a prioritized network coding scheme combining different encoding approaches: generation-based and sliding window based schemes. We carried out a performance...
Although High Dynamic Range (HDR) content can provide an enhanced immersive experience for end-users, the impact of channel errors on its perception is unclear due to the lack of a standardized HDR video distribution framework. This paper presents an assessment of the robustness of the two main HDR video distribution architectures, the single-layer 10-bit scheme and the two-layer 8-bit backward-compatible...
The article describes an innovative bit error rate reduction technique principle and its practical implementation. The design of the technique is implemented in an FPGA and is combined with other more conventional BER reduction techniques, such as Reed-Solomon coding. Experimental results are provided. The application bit rate in function of BER for both reliable (TCP) and unreliable (UDP) mode of...
A joint source-channel rate-distortion (RD) optimization is proposed for video communication systems. The source coding and channel coding options are optimized by seeking the best trade-off between the estimated end-to-end distortion of a video packet and the sum of the number of source bits and forward error correction bits used to encode that packet. The proposed RD algorithm controls the total...
In this paper, we pose a new problem of video enhancement transcoding, which converts the compressed dark video into compressed normal-lighting one. Distinct statistics of dark and normal videos result in quite different coding modes, which thus enforces latent constraints on mode conversion during transcoding. Following this idea, we propose a fast mode decision algorithm to speed up computation...
The upcoming JPEG XT standard for High Dynamic Range (HDR) images defines a common framework for the lossy and lossless representation of high-dynamic range images. It describes the decoding process as the combination of various processing tools that can be combined freely. In this paper we analyze the coding efficiency of different decoding tools through a large scale objective quality testing using...
This paper studies the influence of JPEG-XT on LDR generation using TMOs'. JPEG-XT encodes HDR images into a two layer scheme, encoding a LDR version of the image in a base layer, and the residual HDR information in an enhancement layer. The question addressed here is to understand if this model allows to extract a new LDR representation using a different TMO, independently of the TMO used to generate...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.