The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Active one-shot scanning techniques have been widely used for various applications. Stereo-based active one-shot scanning embeds a positional information regarding the image plane of a projector onto a projected pattern to retrieve correspondences entirely from a captured image. Many combinations of patterns and decoding algorithms for active one-shot scanning have been proposed. If the capturing...
Pedestrian detection is a challenging problem studied over decades. Most algorithms are based on human appearance. Only few works consider motion as a feature component. In this paper, however, we tackle this problem only considering short periods of pedestrian walking. This motion does not depend on the variations of pedestrian pose, body shape, illumination, and background. We model pedestrian motion...
In this paper, a method for unknown object tracking in output images from 360-degree cameras called Modified Training-Learning-Detection (MTLD) is presented. The proposed method is based on the recently introduced Training-Learning-Detection (TLD) scheme in the literature. The flaws of the TLD approach have been detected and significant modifications are proposed to enhance and to elaborate the scheme...
The ubiquitous hand gesture plays an important role in the natural human machine interaction (HMI). Recently, the consumer color and depth cameras have been used to estimate hand shapes and postures for the mid-air HMI. Under the observation that 3D hand contours possess much information of hand postures, we estimate 3D hand contours from infrared images with a limited computation complexity for the...
We present a method to reconstruct the three-dimensional shape of a moving instance of a known object category in video data. We exploit state-of-the-art semantic segmentation techniques to extract the object's two-dimensional shape in each frame. Therefore, our method is robust to occlusion, handles stationary objects and extends naturally to multiple video sequences. We apply Structure from Motion...
One-shot active stereo using structured light is a practical solution for dynamic scene acquisition. Basically, those methods are based on encoding positional information of the pixel into the single projected pattern. A disadvantage of such methods is decreases of the spatial resolution caused by requiring a certain area of the pattern to encode the positional information. Among those methods, grid-based...
How to implement an effective factorization for nonrigid structure from motion(NRSFM) has attracted much attention in recent years. Addressing this problem, we propose a novel sequential factorization method without extra priors other than the basis low-rank prior, consisting of a motion estimation module and a 3D shape recovery module. In the motion estimation module, for improving the estimation...
In this study, we propose a novel method for facial landmark detection (FLD) based on an ensemble of local weighted regressors and a global face shape model under real driving situations. Unlike other FLD approaches, the method proposed in this study first detects the nose region instead of a face-bounding box as a reference point for estimating the offset from a landmark and a reference point. Next,...
High-speed recognition of the shape of a target object is indispensable for robot arms to perform various kinds of dexterous tasks in real time. In this paper, we propose a high-speed 3-D sensing system with active target-tracking. The system consists of a high-speed camera head and a high-speed projector, which are mounted on a two-axis active vision system. By measuring coded structured light projected...
To achieve the goal of frontal vehicle detection in night-driving condition, we propose an effective method to detect the red taillights of vehicles. The challenge is that the taillight images captured with automatic exposure typically are overexposed, which makes red color segmentation often erroneous. Instead of customizing the camera hardware to tackle this problem, we combine morphological and...
The census transform in computing the matching cost of stereo matching is simple and robust under luminance variations in stereo image pairs; however, different disparity maps are generated depending on the shape and size of the census transform window. In this paper, we propose a stereo matching method with variable sizes of census transform windows based on the gradients of stereo images. Our experiment...
This paper proposes a method to reconstruct the 3D shape of objects in participating media. Shape reconstruction of objects in participating media, such as water, fog, and smoke, is difficult due to light scattering, which degrades image quality. While previous methods cope with this problem by removing the scattering components from images, the proposed method estimates optical thickness from images...
Underwater docking for an autonomous underwater vehicle is important in sense that the vehicle can stop at a docking station to recharge its battery, transfer data, and can be used for launch and recovery system. To perform docking, recognizing the station through vision is important. There are few researches conducted on underwater docking using vision to recognize targets as guidance for the underwater...
Video summarization is the process to extract informative events of a video and represent in the condensed form. The paper proposes a new method for extracting important contents of a video for summarization using geometric primitives, such as line segments, angles, and conic parts. The primitives have the capabilities to represent complex shapes and structures of objects in a video frame. Therefore,...
Automatic traffic light detection (TLD) plays an important role for driver-assistance system and autonomous vehicles. State-of-the-art TLD systems showed remarkable results by exploring visual information from static frames. However, traffic lights from different countries, regions, and manufactures are always visually distinct. The existing large intra-class variance makes the pre-trained detectors...
This paper presents an agile approach to facilitate the rapid development of traffic sign classification algorithms in heavy vehicles under a wide range of visibility conditions. A vision-based traffic sign recognition system makes a significant contribution to improving the transportation safety by enhancing the driver's awareness on important road signs in an automotive cockpit environment. It has...
This paper presents a method for image perspective correction using camera intrinsic parameters. This method is based on two assumptions: a) the taken picture have a rectangle area, but didn't know the rectangle area's aspect ratio; b) the camera's intrinsic parameters should obtain by picture (Intrinsic parameters can easily obtain in iPhone or Android phone). The rectangle's perspective distortion...
We present a novel pipeline for augmenting a 3D eye-glass mesh into a person's face. While doing so, we take care about the proper fitment of the glass in terms of pupilary distance computed automatically, which is user-friendly in compare to standard marker based approaches. Our method also performs rigid eye-glass temple correction during augmentation followed by tracking to present realistic rendering...
In object recognition techniques, specially feature-based methods, a fundamental step is to extract keypoints which are distinct and considerably interesting in the image. There are many different keypoint detectors already available, each with its own specific use and results vary enormously. It is widely agreed that evaluation of feature detectors is important. To our knowledge there is no comparative...
Analysis of near-infrared images has a possibility to simply find vein disease. If super-resolution (SR) techniques improve the quality of near-infrared images with a low signal-to-noise ratio, they could detect abnormal veins at an early stage. Deep convolutional neural networks (DCNNs) as a SR technique were applied to downgraded images, and the effectiveness was investigated. The DCNNs with the...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.