The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper, we present a novel unsupervised method for detecting outliers in image databases, when the images are misaligned by action of transformations forming a group. The main idea is that when the aligned data lie in a low dimensional subspace, the misaligned data, assuming that the group size is small, will lie in a low dimensional group-invariant subspace. We then explicitly exploit this...
It is well known that many more than three or four spectral measurements are required for accurate measurement of color. Previous work has shown seven to ten measurements can yield accurate results on average, but with significant numbers of errors above the threshold of obvious visual detection. Furthermore, the filters used for these measurements are very difficult to fabricate. We show that such...
This paper introduces a novel class of transforms, called graph-based separable transforms (GBSTs), based on two line graphs with optimized weights. For the optimal GBST construction, we formulate a graph learning problem to design two separate line graphs using row-wise and column-wise residual block statistics, respectively. We also analyze the optimality of resulting separable transforms for both...
With the advance of 3-dimensional sensing devices, the in-air handwriting, as a more natural way for human and computer interaction, is being developed by the UCAS-CVMT Lab. Compared with the conventional handwritten Chinese characters generated by touching, it is more challenging to accurately recognize them due to unconstrained one-stroke writing style. This paper presents two recognizers to address...
Cryo electron microscopy records essentially projection images of each of many instances of a nano bio object. This data allows reconstruction of a stochastic model of the object, i.e., the mean and covariance functions of the electron scattering intensity of the object. Understanding the covariance function, which characterizes the heterogeneity of the instances of the object, is challenging because...
Cutting out and object and estimate its transparency mask is a key task in many applications. We take on the work on closed-form matting by Levin et al.[1], that is used at the core of many matting techniques, and propose an alternative formulation that offers more flexible controls over the matting priors. We also show that this new approach is efficient at upscaling transparency maps from coarse...
The Karhunen-Loeve Transform (KLT) is a popular transform used in multiple image processing scenarios. Sometimes, the application of the KLT is not carried out as a single transform over an entire image. Rather, the image is divided into smaller spatial regions (segments), each of which is transformed by a smaller dimensional KLT. Such a situation may penalize the transform efficiency. An improvement...
In this paper, we present a method for affine invariant feature description. Based on the gradient distribution of an image region we calculate two basis vectors defining an affine invariant coordinate system, used to normalize the image region. The estimated basis vectors are non-orthogonal and allow for a precise representation of the gradient distribution. The proposed method can be combined with...
This paper proposes a novel Multiview Discriminative Analysis of Canonical Correlations (MDACC) for multiview learning. The proposed MDACC can capture discriminative features. Furthermore, we present a human action recognition framework by using MDACC to fuse multimodal features, which include the hierarchical Pyramid of Depth Motion Map (HP-DMM) for the depth images, the Histogram of Oriented Displacement...
We introduce the Gaussian Process Transform (GPT), an orthogonal transform for signals defined on a finite but otherwise arbitrary set of points in a Euclidean domain. The GPT is obtained as the Karhunen-Loéve Transform (KLT) of the marginalization of a Gaussian Process defined on the domain. Compared to the Graph Transform (GT), which is the KLT of a Gauss Markov Random Field over the same set of...
In this paper, we propose a new edge model for edge adaptive graph-based transforms (EA-GBTs) in video compression. In particular, we consider step and ramp edge models to design graphs used for defining transforms, and compare their performance on coding intra and inter predicted residual blocks. In order to reduce the signaling overhead of block-adaptive coding, a new edge coding method is introduced...
This paper presents a novel infrared (IR) object tracking algorithm based on the co-difference matrix. Extraction of co-difference features is similar to the well known covariance method except that the vector product operator is redefined in a multiplication-free manner. The new operator yields a computationally efficient implementation for real time object tracking applications. Experiments on an...
We study the classification with respect to the class of curved Mahalanobis metrics that extend the celebrated flat Mahalanobis distances to constant curvature spaces. We prove that these curved Mahalanobis k-NN classifiers define piecewise linear decision boundaries, and report the performance of learning those metrics within the framework of the Large Margin Nearest Neighbor (LMNN). Finally, we...
Active Shape Models are a powerful and well known method to perform face alignment. In some applications it is common to have shape information available beforehand, such as previously detected landmarks. Introducing this prior knowledge to the statistical model may result of great advantage but it is challenging to maintain this priors unchanged once the statistical model constraints are applied...
Document is unavailable: This DOI was registered to an article that was not presented by the author(s) at this conference. As per section 8.2.1.B.13 of IEEE's "Publication Services and Products Board Operations Manual," IEEE has chosen to exclude this article from distribution. We regret any inconvenience.
This paper introduces a generalization of the Fisher vectors to the Riemannian manifold. The proposed descriptors, called Riemannian Fisher vectors, are defined first, based on the mixture model of Riemannian Gaussian distributions. Next, their expressions are derived and they are applied in the context of texture image classification. The results are compared to those given by the recently proposed...
With the rapid development of mobile applications in recent years, there is a strong desire on light weight algorithm for view synthesis using uncalibrated images and limited geometry information. To address this challenge, we propose a Bayesian based view synthesis framework to support the rendering of complex scene with multiple planar structures. In this framework, we integrate image segmentation,...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.