The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
On the problem of tracking objects in videos, a recent and distinguished approach combining tracking and detection methods is the TLD framework. The detector identifies the object by its supposedly confirmed appearances. The tracker inserts new appearances into the model using apparent motion. Their outcomes are integrated by using the same similarity metric of the detector which, in our point of...
A two stages car detection method using deformable part models with composite feature sets (DPM/CF) is proposed to recognize cars of various types and from multiple viewing angles. In the first stage, a HOG template is matched to detect the bounding box of the entire car of a certain type and viewed from a certain angle (called a t/a pair), which yields a region of interest (ROI). In the second stage,...
Mammogram images are now increasingly acquired with full-field digital mammography (FFDM) systems in the clinics. Traditionally, the “for-processing” format of FFDM images is used in computer-aided diagnosis (CAD) of breast cancer. In this study, we investigate the feasibility of using “for-presentation” format of FFDM (which are more readily available) in development of CAD algorithms for microcalcification...
In computer-aided diagnosis of clustered microcalcifications (MCs), the individual MCs in a lesion need to be first detected prior to subsequent classification as being benign or malignant. However, owing to noise characteristics and patient variability, the detection accuracy is often adversely compromised by the occurrence of false-positives (FPs) or missed MCs in detection. To deal with difficulty,...
Determining the molecular or genomic sub-type of a cancer of a particular organ is important for prognosis and treatment planning. While clinical tests determine the dominant subtype of cancer in a patient, it is believed that some cancer treatments targeting the dominant sub-types ultimately prove ineffective because they ignore the existence of additional sub-types in the same patient. We present...
Multi-object tracking is a difficult problem underlying many computer vision applications. In this work, we focus on sediment transport experiments in a flow were sediments are represented by spherical calibrated beads. The aim is to track all beads over long time sequences to obtain sediment velocities and concentration. Classical algorithms used in fluid mechanics fail to track the beads over long...
In this work we study the problem of weakly supervised human body detection under difficult poses (e.g., multiview and/or arbitrary poses) within the framework of multi-instance learning (MIL). We first point out the existence of the so-called “vanishing gradient” problem in MIL with a noisy-or rule as its bagging model. This is mainly due to the independence assumption of the noisy-or rule, which...
In this paper, a part-based technique for real time detection of users' faces on mobile devices is proposed. This method is specifically designed for detecting partially cropped and occluded faces captured using a smartphone's front-facing camera for continuous authentication. The key idea is to detect facial segments in the frame and cluster the results to obtain the region which is most likely to...
Real-world CCTV footage often poses increased challenges in object tracking due to Pan-Tilt-Zoom operations, low camera quality and diverse working environments. Most relevant challenges are moving background, motion blur and severe scale changes. Convolutional neural networks, which offer state-of-the-art performance in object detection, are increasingly utilized to pursue a more efficient tracking...
Often the filters learned by Convolutional Neural Networks (CNNs) from different image datasets appear similar. This similarity of filters is often exploited for the purposes of transfer learning. This is also being used as an initialization technique for different tasks in the same dataset or for the same task in similar datasets. Off-the-shelf CNN features have capitalized on this idea to promote...
In this paper, we propose a principled framework for pornographic image recognition. Specifically, we present our definition of pornographic images, which characterizes the pornographic contents in images as the exposure of private body parts. As the private body parts often lie in local image regions, we model each image as a bag of local image patches (instances), and assume that for each pornographic...
Starting from an object's location in a video frame, tracking-by-detection methods find the location of that object in a subsequent video frame. The tracker's detection step may produce multiple false positives during short-term occlusions, which can result in loss of track. We propose a tracking-by-detection method that is robust to short-term occlusions and false positives. Here, we extend the Struck...
Directly connected to the texture appearance, texture granularity is an effective measurement for geographic resources classification, product quality monitoring and image compression ratio selection. However, the application of existing works on texture granularity is limited by intense computation and the dependence on empirically selected parameters that vary among different textures. This paper...
In this work, we present a new multiple channel feature called Deep Compact Channel Feature (DCCF), which generates a compact, discriminative feature representation by a pre-trained deep encoder-decoder. With the combination of DCCF and boosted decision trees, a new object detector is proposed which achieved outstanding performance on standard pedestrian dataset INRIA and Caltech. Furthermore, a large...
This paper introduces a method to guide the visual search towards a searched object, analogously to what is performed by the top-down visual attention mechanism. This is done by prioritizing scene descriptors based on their Hamming distance to the descriptors of the target. The proposal has constant space and time complexity in relation to the number of descriptors of the searched object. Moreover,...
Intrinsic natures of different appearance between sub-regions of objects and non-objects in optical flows lead to more visual consistency for object proposals. Hence, visual variations in different sub-regions in video sequences over time is a good indicator for likeliness of objects. We propose a method that dynamically measures the objectness of each proposal by exploiting temporal consistency within...
Detecting eyes in images is fundamental for many computer vision applications including face detection, face recognition, and human-computer interaction. Most existing methods are designed and tested on datasets acquired under controlled lab settings (e.g., fixed scale, known poses, clean background, etc.), leaving their performance to be further examined on real-world, uncontrolled images, such as...
In this paper, a novel progressive strategy is proposed to teach the machine to accomplish face detection in the wild. Firstly, deep model named Fully-connected Face Classifier (FCFC) is built up. With the targeted training data, FCFC learns the knowledge corresponding to distinguish face in various pose, facial expression, occlusion proportion, and blur degree from background gradually. Secondly,...
This paper proposes an extended Constrained Local Model (CLM) formulation for aligning faces using depth information. The CLMs are popular methods that were initially designed to locate facial features in regular intensity images. Briefly, they combine a set of local detectors, one for each landmark, whose locations are regularized by a linear shape model. Fitting a CLM is usually framed as a two...
This paper presents a fast algorithm for deriving the defocus map from a single image. Existing methods of defocus map estimation often include a pixel-level propagation step to spread the measured sparse defocus cues over the whole image. Since the pixel-level propagation step is time-consuming, we develop an effective method to obtain the whole-image defocus blur using oversegmentation and transductive...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.