The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents Neighbor-Guided SemiGlobal Matching (NG-fSGM), a new method for optical flow. It is based on SGM, a popular dynamic programming algorithm for stereo vision, where the disparity of each pixel is calculated by aggregating local matching costs over the entire image to resolve local ambiguity in texture-less and occluded regions. Unlike conventional SGM, NG-fSGM operates on a subset...
Content-aware image retargeting adjusts images to arbitrary sizes and preserves visually salient content. Previous algorithms formulate the problem in terms of either pixel level or mesh level structures, deforming salient objects inconsistently. To improve retargeting quality and reduce complexity, we introduced a patch-wise method to generate sparse image grids based on visual saliency and gradient...
A cloud-based encoding pipeline which generates streams for video-on-demand distribution typically processes a wide diversity of content that exhibit varying signal characteristics. To produce the best quality video streams, the system needs to adapt the encoding to each piece of content, in an automated and scalable way. In this paper, we describe two algorithm optimizations for a distributed cloud-based...
Bilateral filtering is a commonly used technique in image processing. However, being nonlinear, it is computationally expensive. The situation gets worse while the filter radius grows up. Several works have been proposed to accelerate the computation. Nevertheless, most techniques are tailored for grayscale image bilateral filtering or confined to specific kernel functions. In this paper, we propose...
Parametric motion models are commonly used in image sequence analysis for different tasks. A robust estimation framework is usually required to reliably compute the motion model. The choice of the right model is also important. However, dealing simultaneously with both issues remains an open question. We propose a robust motion model selection method with two variants, which relies on the Fisher test...
The latest high efficiency video coding (HEVC) standard achieves about 50% bit-rate reduction at equivalent visual quality compared to H.264/AVC. Sample adaptive offset (SAO) is one of the newly adopted tools right after deblocking filter, which can improve both coding efficiency and visual quality. However, for real-time encoding scenarios, the complexity of SAO is usually too high to be enabled...
The flexible partitioning scheme and increased number of prediction modes in the High Efficiency Video Coding (HEVC) standard are largely responsible for both its high compression efficiency and computational complexity. Each frame in HEVC is partitioned in Coding Tree Units (CTUs) of fixed size, which are then recursively partitioned in Coding Units (CUs). In typical implementations, CUs in a CTU...
This paper presents a novel algorithm that aims at minimizing the required decoding energy by exploiting a general energy model for HEVC-decoder solutions. We incorporate the energy model into the HEVC encoder such that it is capable of constructing a bit stream whose decoding process consumes less energy than the decoding process of a conventional bit stream. To achieve this, we propose to extend...
The latest video compression standard HEVC sets new benchmarks concerning the efficiency for both video coding and also still image coding, i.e., pure intra picture coding. Nevertheless, its high complexity created by the rate-distortion optimization procedure is a serious drawback. To reduce this computational burden, several algorithms for fast mode decision have been proposed. However, most of...
This paper introduces row-column transforms (RCTs) which are 2D non-separable transforms defined with the aid of a set of 1-D linear transforms and a basis ordering permutation. We propose a novel method for the design of row-column transforms that approximate desired complex transforms (such as KLTs, SOTs, etc.) so that most of the performance of the approximated transforms is retained at significantly...
The efficiency improvements achieved by new video coding standards come at the cost of a huge increase in the encoder computational complexity. Paradoxically, such increasing complexity is commonly addressed by methods that have an adverse effect on coding efficiency. In this work, we propose a method to reduce the complexity of HEVC Hadamard ME, without compromising coding efficiency. Our method...
The Karhunen-Loeve Transform (KLT) is a popular transform used in multiple image processing scenarios. Sometimes, the application of the KLT is not carried out as a single transform over an entire image. Rather, the image is divided into smaller spatial regions (segments), each of which is transformed by a smaller dimensional KLT. Such a situation may penalize the transform efficiency. An improvement...
In this paper, we present a method for affine invariant feature description. Based on the gradient distribution of an image region we calculate two basis vectors defining an affine invariant coordinate system, used to normalize the image region. The estimated basis vectors are non-orthogonal and allow for a precise representation of the gradient distribution. The proposed method can be combined with...
In this paper we propose a novel bottom-up visual saliency detection model by analysis of image complexity. Compared with existing works, we emphasize the important impact of image complexity on saliency detection. Inspired by the free energy theory, a hybrid parametric and non-parametric model is used to estimate the complexity of a visual signal. Taking the image complexity as a new feature, this...
This paper explores a pragmatic approach to multiple object tracking where the main focus is to associate objects efficiently for online and realtime applications. To this end, detection quality is identified as a key factor influencing tracking performance, where changing the detector can improve tracking by up to 18.9%. Despite only using a rudimentary combination of familiar techniques such as...
Video retrieval and video copy detection are well studied problems. The goal is to find the matching video in a database from a given query video. Typically, these query videos are short and aligning the query video is of secondary importance. Short sequences can be aligned using dynamic time warping. But, since time and memory usage increases quadratically with the length of the sequences, such process...
The directional intra prediction (DIP) modes in HEVC are capable of predicting local continuous image features. Recently, intra block copy (IBC) is proposed for screen content coding, aiming at predicting non-local recurrent image features. For natural video, we observe that recurrent features are often irregular and not aligned with blocks. Thus, we propose a combination of DIP and IBC with block...
A new algorithm, named Connected Oriented Image Foresting Transform (COIFT), is proposed, which provides global optimum solutions according to a graph-cut measure, subject to high-level boundary constraints. COIFT incorporates the connectivity constraint in the Oriented Image Foresting Transform (OIFT), ensuring the generation of connected objects, and can also handle simultaneously the boundary polarity...
In this paper, we introduce a new video-streaming framework that is based on a novel packet design in conjunction with low-complexity active queue management within the network. Our proposed framework, which we call Erasable Packets within Internet Queues (EPIQ), exploits the inherent multi-priority nature of video to deliver optimal quality-of-experience to users, yet without requiring any additional...
In this paper, we address the problem of the statistical multiplexing of video streams. Dynamic bitrate allocation is used to improve the overall video quality of a pool of channels. The balance is obtained by providing more bits to complex channels, while deprivations are applied to non-complex ones. In this study, the error minimization optimization of several compressed video is considered along...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.