The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Existing maximum-margin support vector machines (SVMs) generate a hyperplane which produces the clearest separation between positive and negative feature vectors. These SVMs are effective when datasets are large. However, when few training samples are available, the hyperplane is easily influenced by outliers that are geometrically located in the opposite class. We propose a modified SVM which weights...
In recent years, deep architectures have been used for transfer learning with state-of-the-art performance in many datasets. The properties of their features remain, however, largely unstudied under the transfer perspective. In this work, we present an extensive analysis of the resiliency of feature vectors extracted from deep models, with special focus on the trade-off between performance and compression...
We present an elegant solution to joint pre-alignment and rigid point set registration, given prior matches. Instead of performing pre-alignment and the actual registration in the separate steps, prior matches explicitly influence the registration procedure in our approach. This results in several advantages. Firstly, our approach solves the pre-alignment task — an approximate resolving of rotation...
In medical image analysis, multi-modal registration has been a challenging task due to the complex intensity relationship between images to be aligned. Conventional multi-modal approaches tend to assess the accuracy of the alignment by measuring a similarity based on statistical dependency of the intensity values between images. However, measuring statistical similarity measures, such as mutual information,...
In this paper, we present a novel unsupervised method for detecting outliers in image databases, when the images are misaligned by action of transformations forming a group. The main idea is that when the aligned data lie in a low dimensional subspace, the misaligned data, assuming that the group size is small, will lie in a low dimensional group-invariant subspace. We then explicitly exploit this...
In this paper, we propose a robust video colorization method automatically through limited color references in a video sequence. The proposed method first estimates motion vectors between a monochrome frame and colored reference frames for initial matching by optical flow. Then it transfers color information to matched points in the monochrome frame and further propagates color information of matched...
A two stages car detection method using deformable part models with composite feature sets (DPM/CF) is proposed to recognize cars of various types and from multiple viewing angles. In the first stage, a HOG template is matched to detect the bounding box of the entire car of a certain type and viewed from a certain angle (called a t/a pair), which yields a region of interest (ROI). In the second stage,...
Comparing to the low level local features, Transform Invariant Low-Rank Textures (TILT) can in some sense globally rectify a large class of low-rank textures in 2D images, and thus more accurate and robust. However, TILT is still rather rudimentary, and have some limitations in applications. In this paper, we proposed a novel algorithm for better solving TILT. Our method is based on the application...
Edge-preserving smoothing is one of the most important topics for image and video processing. Recently, we presented a multiscale image decomposition method based on domain transform, which is an efficient edge-preserving smoothing method. It is robust to noise compared with the original domain transform. In this paper, we generalize the scheme so that it can be applied to not only domain transform...
Document is unavailable: This DOI was registered to an article that was not presented by the author(s) at this conference. As per section 8.2.1.B.13 of IEEE's "Publication Services and Products Board Operations Manual," IEEE has chosen to exclude this article from distribution. We regret any inconvenience.
Recently, graph ranking-based methods have been introduced to visual tracking and achieved promising results due to the local structure preserving property. However, existing graph ranking-based trackers use holistic templates to construct the graphs which makes the trackers sensitive to occlusions. In this paper, we propose a part-based multi-graph ranking algorithm for robust visual tracking. In...
In this paper, we propose a new Multi-kernel Metric Learning (MKML) approach to enhance the performance of person re-identification using adaptive weighted Multi-kernel. The intuition behind our approach is that different features, i.e., low-level and middle-level features, have different nature and thus discriminating capability, utilizing different kernels could map these features into sub-spaces,...
In recent years, correlation filter based trackers outperform better than other trackers. Nevertheless, they only employ one feature and a single kernel, so they are usually not robust in complex scenes. In this paper, we derive a multi-feature and multi-kernel correlation filter based tracker which fully takes advantage of the invariance-discriminative power spectrums of various features and kernels...
Character groundtruth for camera captured documents is crucial for training and evaluating advanced OCR algorithms. Manually generating character level groundtruth is a time consuming and costly process. This paper proposes a robust groundtruth generation method based on document retrieval and image registration for camera captured documents. We use an elastic non-rigid alignment method to fit the...
Hazy images hinder image understanding in many applications such as autonomous vehicle. In this paper, we propose an efficient method to improve image quality of hazy images. Our method estimates the transmission function based on a linear model that allows efficient computation and employs quadtree to search for a region that best represents the scatter of airlight. Experiments were conducted using...
In this work we study the problem of weakly supervised human body detection under difficult poses (e.g., multiview and/or arbitrary poses) within the framework of multi-instance learning (MIL). We first point out the existence of the so-called “vanishing gradient” problem in MIL with a noisy-or rule as its bagging model. This is mainly due to the independence assumption of the noisy-or rule, which...
In this paper, we propose a supervised learning based model for ocular biometrics. Using Speeded-Up Robust Features (SURF) for detecting local features of the eye region, we create a local feature descriptor vector of each image. We cluster these feature vectors, representing an image as a normalized histogram of membership to various clusters, thereby creating a bag-of-visual-words model. We conduct...
The illumination conditions of a scene create intra-class variability in outdoor visual data, degrading the performance of high-level algorithms. Using only the image, and with hyper-spectral data as a case study, this paper proposes a deep learning approach to learn illumination invariant features from the data in an unsupervised manner. The proposed approach incorporates a similarity measure, the...
Object Re-identification is a key technology for enabling mobile visual search, virtual reality and augmented reality, and a variety of security and surveillance applications. One key problem in re-identification is to have effective key point feature aggregation schemes that can preserve recall performance in short listing, while offering indexing and hashing efficiency. In the MPEG Compact Descriptor...
Image alignment and stitching continue to be the topics of great interest. Image mosaicking is a key application that involves both alignment and stitching of multiple images. Despite significant previous effort, existing methods have limited robustness in dealing with occlusions and local object motion in different captures. To address this issue, we investigate the potential of applying sparsity-based...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.