The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The JPEG committee (formally, ISO SC29 WG1) is currently standardizing a lightweight mezzanine codec for video over IP transport under the name JPEG XS. A particular challenging design constraint of this codec is multi-generation robustness, that is the necessity to minimize the error built-up under multiple re-compression cycles. In this paper, we discuss the sources of such errors, how they are...
This paper presents a method to extract rendering matrix on multi-channel audio signals as an object fed to Moving Picture Expert Group Spatial Audio Object Coding (MPEG SAOC) encoder. This technique allows MPEG SAOC to transmit multiple multi-channel audio objects, instead of only a single multi-channel background object as specified in MPEG SAOC standard. Listening tests show that the proposed method...
Advances in virtual reality have generated substantial interest in accurately reproducing and storing spatial audio in the higher order ambisonics (HOA) representation, given its rendering flexibility. Recent standardization for HOA compression adopted a framework wherein HOA data are decomposed into principal components that are then encoded by standard audio coding, i.e., frequency domain quantization...
Recent advances in capturing and display technologies, as well as the proliferation of platforms to share images on the Internet, will further increase the bandwidth and storage space required by image coding based applications. To reduce the image coding rate, some techniques taking into account the properties of the human visual system can be used. In this context, this paper proposes an inpainting...
Lossy image compression methods always introduce various unpleasant artifacts into the compressed results, especially at low bit-rates. In recent years, many effective soft decoding methods for JPEG compressed images have been proposed. However, to the best of our knowledge, very few works have been done on soft decoding of JPEG 2000 compressed images. Inspired by the outstanding performance of Convolution...
In order to improve the error resistance and security of JPEG2000 standard, a joint source channel and security arithmetic coding/decoding scheme for EBCOT in JPEG2000 is proposed. Based on error resistant arithmetic coding, this scheme inserts multiple forbidden symbols and generates secure two-way decodable bitstream controlled by chaotic maps, improving the security of the scheme. Meanwhile, at...
The latest High Efficiency Video Coding (HEVC) has been increasingly used to generate video streams over Internet. However, the decoded HEVC video streams may incur severe quality degradation, especially at low bit-rates. Thus, it is necessary to enhance visual quality of HEVC videos at the decoder side. To this end, we propose in this paper a Decoder-side Scalable Convolutional Neural Network (DS-CNN)...
Traditional stacked autoencoders have an equal number of encoders and decoders. However, while fine-tuned as a deep neural network the decoder portion is detached and never used. This begs the question: ‘do we need equal number of decoders and encoders’? In this study we explore asymmetric autoencoders — unequal number of encoders and decoders. We specifically address two tasks — 1. Classification...
MPEG-4 high efficiency (HE) advanced audio coding (AAC) contains a useful tool called spectral band replication (SBR) to improve the coded audio quality at low bitrates. The SBR tool uses start-band frequency to determine from which frequency band replication starts. This paper describes an algorithm to dynamically determine this parameter based on the genre of the music content. The simulation results...
An important feature of today's mobile devices is their ability to capture and display high resolution photos in an acceptable amount of time. These images are stored in flash memory on the mobile device using the JPEG codec which is almost a quarter of a century old but remains the industry standard. With increasing pixel counts on both mobile image sensors and screens, software solutions will struggle...
Biometric system security requires cryptographic protection of sample data under certain circumstances. We assess low complexity selective encryption schemes applied to JPEG2000 compressed fingerprint data by conducting fingerprint recognition on the selectively encrypted data. This paper specifically investigates the effect of considering different sensors for data acquisition and finds significant...
ISO recently published a new image compression standard, JPEG XT, which extends the popular JPEG standard towards higher dynamic range, compression of alpha channels and lossless coding. In part 7 ofJPEG XT, a two-layer lossy image compression for HDR images isintroduced that reconstructs HDR signals by the combination of a baselayer following the legacy JPEG standard, and an extension layer thatenlarges...
This paper improves a colorization-based image coding using image segmentation and adaptive colorspaces. Recently, various approaches for color image coding based on colorization have been presented. These methods utilize a YCbCr colorspace and transfer the luminance component by a conventional compression method. Then, the chrominance components are approximated from the luminance component using...
Recently, video traffic has rapidly increased, and a considerable portion of that is caused by wireless devices. Since video streaming services now require high data rates, IEEE 802.11 has been widely used to support them in the wireless environment. On the other hand, in densely populated areas (e.g., conference rooms), multiple video streaming services using a unicast method can result in exceeding...
360° video streaming to clients using Virtual Reality head mounted displays is a challenge for traditional video delivery. As transmission of the complete content in a desirable quality sacrifices a large fraction of available client and network resources, adaptivity to the user viewport promises substantial benefits. An efficient way to achieve viewport adaptive streaming without per-user or per-orientation...
In contrast to many image watermarking schemes, the suggested method is implemented in the JPEG compressed domain with no transcoding or decompression. Therefore, this scheme is highly efficient in real-time applications and suitable for multimedia information, which is rarely available in an uncompressed form. The proposed watermarking scheme is very flexible, and can be tailored to meet the requirements...
In this paper, we present a novel image codec by leveraging sparse representation strategy for geometric pattern encoding. Specifically, we propose a Multiple Learned Geometric Dictionaries (MLGD) solution to explore various texture patterns of images, and use different dictionaries to encode homogenous smooth components and heterogeneous directional components. Profiting from model proficiency, our...
A video analyzer is a comprehensive bitstream analysis tool which accelerates development and debugging of video bitstreams while ensuring compliance with industry standards. There are many conventional analyzers present for different video standards like H.264, HEVC which are compliant only with the respective video sequence format. In this work, a generalized analyzer with integrated encoder is...
In this paper, a scrambling technique is integrated in a HDR (High Dynamic Range) video. Specifically, the scrambling process will be directly applied to the DCT (Discrete Cosine Transform) coefficients after quantization and before entropy coding. Authorized users perform unscrambling (inverse scrambling) of the resulting coefficients of entropy decoding at the decoder side.
A novel perceptual multiple description coding with randomly offset quantizers (PMDROQ) is proposed. In the proposed PMDROQ method, the input image is partitioned into M subsets, and then obtaining M descriptions. In each description, one subset is directly encoded and decoded with different-small perceptual quantization stepsizes in DCT domain, while other subsets are predictively coded and decoded...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.