The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Reflection removal aims at separating the mixture of the desired scene and the undesired reflections. Locating reflection and background edges is a key step for reflection removal. In this paper, we present a visual depth guided method to remove reflections. Our idea is to use Depth of Field (DoF) to label the background and reflection edges. We propose a DoF confidence map where pixels with higher...
Stroke is one of the leading causes of death and disability. Clinically, to establish stroke patient prognosis, an accurate delineation of brain lesion is essential, which is time consuming and prone to subjective errors. In this paper, we propose a novel method call Deep Lesion Symmetry ConvNet to automatically segment chronic stroke lesions using MRI. An 8-layer 3D convolutional neural network is...
Coral reefs exhibit significant within-class variations, complex between-class boundaries and inconsistent image clarity. This makes coral classification a challenging task. In this paper, we report the application of generic CNN representations combined with hand-crafted features for coral reef classification to take advantage of the complementary strengths of these representation types. We extract...
We propose a new denoising algorithm for camera pipelines and other photographic applications. We aim for a scheme that is (1) fast enough to be practical even for mobile devices, and (2) handles the realistic content dependent noise in real camera captures. Our scheme consists of a simple two-stage non-linear processing. We introduce a new form of boosting/blending which proves to be very effective...
We introduce a constant luminance HDR video coding pipeline, which converts the source video to linear Y u'v' color space and applies a dedicated chromaticity transformation before encoding. This reduces perceivable color artifacts without modifying the core codec itself. We validate our approach by a user study that shows a significant improvement in perceived color quality at high compression rates...
In the absence of a commercial High Dynamic Range (HDR) distribution pipeline, two-layer backward-compatible HDR video coding is a viable solution for the imminent transition from Low Dynamic Range (LDR) to HDR content transmission. However, the performance of a two-layer coding solution is governed by the extension layer coding performance. In this paper, we propose an improved two-layer backward-compatible...
A cloud-based encoding pipeline which generates streams for video-on-demand distribution typically processes a wide diversity of content that exhibit varying signal characteristics. To produce the best quality video streams, the system needs to adapt the encoding to each piece of content, in an automated and scalable way. In this paper, we describe two algorithm optimizations for a distributed cloud-based...
Digital content consumption is exploding thanks to the advances of the distributed cloud-computing infrastructures and the consumer electronics. Further challenges have been posed to engineers and researchers to satisfy the ever increasing user needs not only for high quality video delivery, but also for richer experience. In order to support various video analysis tasks in addition to transcoding,...
Banding is a common video artifact caused by compressing low texture regions with coarse quantization. Relatively few previous attempts exist to address banding and none incorporate subjective testing for calibrating the measurement. In this paper, we propose a novel metric that incorporates both edge length and contrast across the edge to measure video banding. We further introduce both reference...
Document is unavailable: This DOI was registered to an article that was not presented by the author(s) at this conference. As per section 8.2.1.B.13 of IEEE's "Publication Services and Products Board Operations Manual," IEEE has chosen to exclude this article from distribution. We regret any inconvenience.
This paper introduces a novel system for the analysis of superresolution microscopy images using a learning based approach boosting performance and simplicity of use. Key component of single-molecule-localisation (SML) microscopy techniques is the ability to localise single emitting molecules in a stack of noisy images with a high degree of accuracy. To this end, we propose a SVM-based detector coupled...
Text detection is typically the first step for any text processing such as hand-written text recognition, layout analysis, line detection, or writer identification. This paper describes a new method to detect text in images, particularly in historical document images. For a robust detection, we propose the use of the vesselness filter as a new preprocessing step for text detection. We show, that this...
We propose a method for the color stabilization of cinema shots coming from different cameras that use unknown logarithmic encoding curves. The log-encoding curves are approximated by a concatenation of gamma-curves, whose values are accurately computed using image matches. The color stabilization procedure, based on the generic color processing pipeline of a digital camera, can be performed after...
In this paper, we propose a novel fast cost propagation algorithm on spanning tree structures. By introducing local smoothness constraint during the weighted cost aggregation process on tree structures, we overcome the shortage of the “fronto-parallel plane” assumption used in most local and non-local cost aggregation algorithms. By applying it to our stereo correspondence framework, accurate results...
Cultivar identification is an important aspect in agriculture and also a typical task of fine-grained visual categorization (FGVC). In comparison with other common topics in FGVC, studies on this problem are somewhat lagged and limited. In this paper, targeting four Chinese maize cultivars of Jundan No.20, Wuyue No.3, Nongda No.108, and Zhengdan No.958, we first consider the problem of identifying...
Automatic object detection is a rapidly evolving area in surveillance and autonomous vehicles. Deformable part model (DPM) is a well-known object detector for its high precision and speed bottleneck. This paper proposes a very fast object detection pipeline based on complementary techniques to accelerate DPM. A recent fast feature pyramid technique is employed with look-up table HOG features, Fast...
Color correction is an essential image processing operation that transforms a camera-dependent RGB color space to a standard color space, e.g., the XYZ or the sRGB color space. The color correction is typically performed by multiplying the camera RGB values by a color correction matrix, which often amplifies image noise. In this paper, we propose an effective color correction pipeline for a noisy...
Inspired by the binary-based descriptors (e.g. LBP, ALOHA, FREAK, BRISK), we propose the 3D Binary Pair Differences (3DBPD) video descriptor for action recognition. By comparing several spatio-temporal sub-regions around interests points, our descriptor is a feature vector with a dimensionality of up to 30% smaller than that of existing state-of-the-art descriptors. We demonstrate the effectiveness...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.