The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The problem of detection of label-noise in large datasets is investigated. We consider applications where data are susceptible to label error and a human expert is available to verify a limited number of such labels in order to cleanse the data. We show the support vectors of a Support Vector Machine (SVM) contain almost all of these noisy labels. Therefore, the verification of support vectors allows...
This paper presents a novel random forest learning framework to construct a discriminative and informative mid-level feature from low-level features. Since a single low-level feature based representation is not enough to capture the variations of human appearance, multiple low-level features (i.e., optical flow and histogram of gradient 3D features) are fused to further improve recognition performance...
Traditional methods based on bag-of-word representation are easily affected by noise, and they also cannot handle the problem when a test distribution differs from the training distribution. In this paper, we propose a novel method for human action recognition by bagging data dependent representation. Different with traditional methods, the proposed method represents each video by several histograms...
We present an Online Random Ferns (ORFs) classifier that progressively learns and builds enhanced models of object appearances. During the learning process, we allow the human intervention to assist the classifier and discard false positive training samples. The amount of human intervention is minimized and integrated within the online learning, such that in a few seconds, complex object appearances...
In a pattern recognition sequence consisting of alternating steps of interactive labeling, classifier training, and automated labeling (e.g., CAVIAR systems), the choice of sample size at each step affects the overall amount of human interaction necessary to label all the samples correctly. The appropriate splits depend on the error rate of the classifier as a function of the size of the training...
This paper addresses a novel head posture detection algorithm to recognize human-computer interactions. A pattern training based image segmentation algorithm is used to detect the skin and hair of students. A simple and efficient human presence detection and gaze direction estimation method is then proposed based on the segmentation results. Finally, the proposed algorithm is tested on ten different...
Most existing methods for action recognition mainly rely on manually engineered features which, despite their good performances, are highly problem dependent. We propose in this paper a fully automated model, which learns to classify human actions without using any prior knowledge. A convolutional sparse autoencoder learns to extract sparse shift-invariant representations of the 2D local patterns...
Vision-based object detectors are crucial for different applications. They rely on learnt object models. Ideally, we would like to deploy our vision system in the scenario where it must operate. Then, the system should self-learn how to distinguish the objects of interest, i.e., without human intervention. However, the learning of each object model requires labelled samples collected through a tiresome...
Recently, articulated pose estimation methods based on the pictorial structure framework have received much attention in computer vision. However, the performance of these approaches has been limited due to the presence of self-occlusion. This paper deals with the problem of handling self-occlusion in the pictorial structure framework. We propose an exemplar-based framework for implicit occlusion...
Significant progress has been made towards learning a generalized offline object detector. However, when a generalized offline detector is applied on new datasets, it often misses some instances of the object or produces false alarms in the background scene. we propose a novel and efficient incremental learning method, which improves the performance of an offline trained detector. Our approach adjusts...
This paper proposes a novel regression method based on distance metric learning for human age estimation. We take age estimation as a problem of distance-based ordinal regression, in which the facial aging trend can be discovered by a learned distance metric. Through the learned distance metric, we hope that both the ordinal information of different age groups and the local geometry structure of the...
The security of web services is nowadays one of the major concerns for Internet users. Web services may manage confidential information, monetary transactions, or even health-critical systems, such as those employed in public airports or hospitals. A key problem of web services is that they should work as expected even in the presence of malicious inputs. Unfortunately, with the increasing complexity...
Human faces undergo considerable amount of variations across ages. This paper proposes an age-invariant face verification method by using a Local Classifier Ensemble Model (LCEM). First, reference points are located based on an extended Active Shape Model and faces are aligned afterwards. Second, a face is grouped into several non-overlapping patches and each group is further divided into several...
We propose an image sharpening method that automatically optimizes the perceived sharpness of an image. Image sharpness is defined in terms of the one-dimensional contrast across region boundaries. Regions are automatically extracted for all natural scales present that are themselves identified automatically. Human judgments are collected and used to learn a function that determines the best sharpening...
Sparse representation based classification (SRC) has been widely used for face recognition (FR). Although SRC algorithm is also adopted in human action recognition, the evaluations of different regular terms have not been given. In this paper, we will discuss and evaluate the role of different regular terms of SRC in human action recognition, after that, we propose human action recognition algorithm...
This paper proposes an activity-specific 3D human pose tracking system from multiple camera views. Dimensionality reduction is used to represent a single activity in a hierarchy of low dimensional spaces. This hierarchy provides increasing independence between limbs by decoupling them, allowing higher flexibility and adaptability that result in improved accuracy. For every subspace, a deterministic...
We propose a novel human-in-the-loop surveillance system that continuously learns the properties of objects that are interesting for a human operator. The interesting objects are automatically learned by tracking the eye gaze positions of the operator while he or she monitors the surveillance video. The system automatically detects interesting objects in the surveillance video and forms a new synthetic...
The aim of this paper is to track objects during their use by humans. The task is difficult because these objects are small, fast-moving and often occluded by the user. We present a novel solution based on cascade action recognition, a learned mapping between body-and object-poses, and a hierarchical extension of importance sampling. During tracking, body pose estimates from a Kinect sensor are classified...
This paper introduces a new method for streamed action recognition using Motion Capture (MoCap) data. First, the histograms of action poses, extracted from MoCap data, are computed according to Hausdorf distance. Then, using a dynamic programming algorithm and an incremental histogram computation, our proposed solution recognizes actions in real time from streams of poses. The comparison of histograms...
This paper proposes a method for estimating the quantitative values of some attributes associated with surface qualities of an object, such as glossiness and transparency, from its image. Our approach is to learn functions that compute such attribute values from the input image by using training data given in the form of relative information. To be specific, each sample of the training data represents...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.