The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Although light field data provides abundant cues for depth estimation, light field depth estimation suffers from occlusion and uncertain edges. In this paper, we propose occlusion robust light field depth estimation using segmentation guided bilateral filtering. First, we calculate refocused images from light field data using digital refocusing. Second, we perform support vector machines (SVM) classification...
This paper deals with automatic estimation of the horizon in videos from fixed surveillance cameras. The proposed algorithm is fully automatic in the sense that no user input is needed per-camera and it works with various scenes (indoor, outdoor, traffic, pedestrian, livestock, etc.). The algorithm detects moving objects, tracks them in time, assesses some of their geometric properties related to...
We propose a novel middle level estimation of traffic scenes: Collision Risk Rating (CRR). Given a video sequence from a dashboard camera as input, the objective is to estimate a rate that describes "how likely a collision could happen". CRR's problem setting is similar to that of video classification, but it is more complicated and requires rich feature representations to capture the different...
In monocular vision systems, lack of knowledge about metric distances caused by the inherent scale ambiguity can be a strong limitation for some applications. We offer a method for fusing inertial measurements with monocular odometry or tracking to estimate metric distances in inertial-monocular systems and to increase the rate of pose estimates. As we performed the fusion in a loosely-coupled manner,...
Lane estimation plays a central role for Driver Assistance Systems, therefore many approaches have been proposed to measure its performance. However, no commonly agreed metric exists. In this work, we first present a detailed survey of the current measures. Most of them apply pixel-level benchmarks on camera images and require a time-consuming and fault-prone labeling process. Moreover, these metrics...
Stixel-based segmentation is specifically designed towards obstacle detection which combines road surface estimation in traffic scenes, stixel calculations, and stixel clustering. Stixels are defined by observed height above road surface. Road surfaces (ground manifolds) are represented by using an occupancy grid map. Stixel-based segmentation may improve the accuracy of real-time obstacle detection,...
A human height is one property used in conjunction with others for person identification. The method of human height estimation, using only one camera and some simple settings on the floor, is proposed to automatically determine the vertical distance of human head from the ground in real-time. Bimodal background subtraction technique helps locate the position of head top and lower foot bottom in each...
State-of-the-art video deblurring methods are capable of removing non-uniform blur caused by unwanted camera shake and/or object motion in dynamic scenes. However, most existing methods are based on batch processing and thus need access to all recorded frames, rendering them computationally demanding and time-consuming and thus limiting their practical use. In contrast, we propose an online (sequential)...
We propose a lightweight method for dense online monocular depth estimation capable of reconstructing 3D meshes on computationally constrained platforms. Our main contribution is to pose the reconstruction problem as a non-local variational optimization over a time-varying Delaunay graph of the scene geometry, which allows for an efficient, keyframeless approach to depth estimation. The graph can...
The intensive annotation cost and the rich but unlabeled data contained in videos motivate us to propose an unsupervised video-based person re-identification (re-ID) method. We start from two assumptions: 1) different video tracklets typically contain different persons, given that the tracklets are taken at distinct places or with long intervals; 2) within each tracklet, the frames are mostly of the...
Traditional imaging methods and computer vision algorithms are often ineffective when images are acquired in scattering media, such as underwater, fog, and biological tissue. Here, we explore the use of light field imaging and algorithms for image restoration and depth estimation that address the image degradation from the medium. Towards this end, we make the following three contributions. First,...
In this paper we introduce a novel Depth-Aware Video Saliency approach to predict human focus of attention when viewing videos that contain a depth map (RGBD) on a 2D screen. Saliency estimation in this scenario is highly important since in the near future 3D video content will be easily acquired yet hard to display. Despite considerable progress in 3D display technologies, most are still expensive...
Hand-Eye Calibration (HEC) which is necessary in robotics is proposed to determine the relative transformations between the camera and the Inertial Measurement Unit (IMU) in this paper. We know HEC could be degenerated into the problem of solving an optimization issue for the homogeneous matrix composed of rotation and translation. After acquiring an initial estimation with Kronecker product and singular...
This paper presents a solution to the Projective Structure from Motion (PSfM) problem able to deal efficiently with missing data, outliers and, for the first time, large scale 3D reconstruction scenarios. By embedding the projective depths into the projective parameters of the points and views, we decrease the number of unknowns to estimate and improve computational speed by optimizing standard linear...
This paper proposed robust color constancy method for changing illuminant by using local chromaticity distribution and analysis of illuminant influence for each hue angle. First, changing in chromaticity distribution direction for each color with respect to various illuminant is analyzed using principal component analysis. Next, change in standard deviation of chromaticity distribution with respect...
Direct method for visual odometry has gained popularity, it needs not to compute feature descriptor and uses the actual values of camera sensors directly. Hence, it is very fast. However, its accuracy and consistency are not satisfactory. Based on these considerations, we propose a tightly-coupled, optimization-based method to fuse inertial measurement unit (IMU) and visual measurement, in which uses...
Hand Gesture Recognition is completed on top-view hand images observed by a Time of Flight(ToF) camera in a car. The work attempts to solve two important problems of touchless interactions inside a car. First, low latency identification of the gestures which are unobtrusive for the driver. Second, reducing the labelled data required to train learning based solutions, this is particularly important...
This paper proposes a pseudo-dolly-in video generation method that reproduces motion parallax by applying image reconstruction processing to multi-view videos. Since dolly-in video is taken by moving a camera forward to reproduce motion parallax, we can present a sense of immersion. However, at a sporting event in a large-scale space, moving a camera is difficult. Our research generates dolly-in video...
A method is proposed for estimation of occluded space and generation of auxiliary points for 3D position estimation of strongly occluded objects. First, occlusion space detection calculates 3D keypoints at the rear side of a target object, thus obtaining a silhouette around the object on the near side, as found from a camera image by an object detector. The method calculates the space containing the...
Transparency of optical see-through head-mounted displays (OST-HMDs) makes them suffer from background blending. Existing works have tackled this problem by color correction, but have not addressed how to estimate the background color accurately. In this paper, we apply colorimetric estimation to the subtraction compensation for background blending. Moreover, we propose an optimization framework that...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.