Depth perception, or 3D perception, can greatly enhance the feeling of immersion in many applications such as 3D TV and 3D teleconferencing. Stereopsis and motion parallax are two of the most important cues for depth perception. Most 3D displays today rely on stereopsis to create 3D perception. In this paper, we propose to improve users' depth perception by tracking their motions and creating...
In many cases, visual tracking is based on detecting, describing, and then matching local features. A variety of algorithms for these steps have been proposed and used in tracking systems, leading to an increased need for independent comparisons. However, existing evaluations are geared towards object recognition and image retrieval, and their results have limited validity for real-time visual tracking...
Accurate online localization is crucial for mobile robotics. In this paper, we describe a real-time image-based localization technique, which is based on a single calibrated camera. This can be supported by a second camera to improve accuracy and to provide the correct translational scale. Our goal is a robust and unbiased pose estimation in highly dynamic scenes on resource-limited systems. The presented...
3D hand gesture tracking is one of the key problems in human-computer interface design and has attracted increasing attention as an alternative to traditional input devices, such as mice and keyboards. Generally speaking, a human hand, as a typical articulated structure, has 15 joints and high dimensionality with 26 DOFs, which makes real-time tracking very difficult. A novel approach...
The region of interest (ROI) in multiview video (MVV) differs from that in conventional single-view video because MVV provides three-dimensional perception and draws viewers' attention to depth discontinuities and pop-out regions. In this paper, we define a novel depth perceptual ROI for MVV and discuss four ROI extraction schemes according to temporal and inter-view correlation of the MVV. Then, depth...
New developments have been made in optical motion tracking for awake animal imaging that measures 3D position and orientation (pose) for a single photon emission computed tomography (SPECT) imaging system. Ongoing SPECT imaging research has been directed towards head motion measurement for brain studies in awake, unrestrained mice. In contrast to previous results using external markers, this work...
Automatic people detection and tracking is an essential task of video surveillance systems. It can improve a system's performance in important fields such as security, safety, and human activity monitoring. In this paper we present a novel approach for people detection and 3D tracking. Our method is based on a human upper body 3D model and a likelihood function to evaluate its presence in a certain...
In this paper we demonstrate that the motion of a sparse set of tracked features can be used to extract 3D pose from a single viewpoint. The purpose of this work is to illustrate the wealth of information present in the temporal dimension of a sequence of images that is currently not being exploited. Our approach is entirely dependent upon motion. We use low-level part detectors consisting of 3D motion...
In recent years, security camera systems have been installed in various public facilities. More intelligent processes are needed to track people in image sequences for security camera systems. In this paper, we propose a face tracking and recognition method based on a Bayesian framework. We assume that an observed space is three-dimensional, and we estimate the 3D position of a person. We use facial...
This paper presents a method to robustly track planes and estimate their 3D poses in a video. A weighted incremental normal estimation method for planes (WINEP) is presented using Bayesian inference. This estimation method guarantees an optimal solution based on all the observations up to the current time, and the computational cost at each time step does not increase with the growing number of past...
Driving assistance systems provide either safety or comfort functions. Such systems must evaluate the state of the world and take necessary actions. A preliminary step for evaluating the state of the world is to detect, track and classify scene objects. The classification step becomes especially important in complex urban traffic scenarios. In such scenarios the sensors of choice are vision based,...
Real-time camera tracking is steadily gaining in importance due to the drive from various applications, such as augmented reality (AR), human-machine interfaces, and ubiquitous computing. However, real-time camera tracking with a single camera in an unknown environment is not a trivial task. In this paper, we describe a real-time camera tracking framework specifically designed to track a monocular...
In recent years, security camera systems have been installed in various public facilities. More intelligent processes are needed to track people in image sequences for security camera systems. In this paper, we propose a face tracking and recognition method based on a Bayesian framework. We assume that an observed space is three-dimensional, and we estimate the 3D position of a person. We use facial 3D shape,...
Most approaches to vehicle tracking have adopted a single calibrated camera for the task, which leads to an under-conditioned problem. We present a surveillance system for on-line vehicle tracking based on two cameras and structure from motion (SfM). Our surveillance system starts by tracking feature points. A novel matching scheme is proposed that allows a subset of feature points to be corresponded...
In this paper, we propose a movable hand-held display system that uses a projector to project display content onto an ordinary piece of cardboard, which can move freely within the projection area. Such a system gives users greater freedom to control the display, such as the viewing angle and distance. At the same time, the cardboard can be sized to fit the application. A projector-camera...
The tracking and recognition of human motion, action, and events using computer vision has recently gained widespread interest in both academic research and industry, with much emphasis on real-time systems. This paper presents a fast and accurate method for tracking the motion path of a person from the video stream. The motion data of the people are acquired in possible cases such as occlusion...
We document the progress in the design and implementation of a motion control strategy that exploits visual feedback from a narrow-baseline stereo head mounted in the hand of a wheelchair-mounted robot arm (WMRA) to recognize and grasp textured ADL objects for which one or more templates exist in a large image database. The problem is made challenging by kinematic uncertainty in the robot, imperfect...
When broadcasting sports events it is useful to be able to place virtual 3D annotations on the ground, to indicate things such as world record lines and distances. This requires the camera pose to be estimated in real time, so that the graphics can be rendered to match the camera view. Whilst camera calibration data can be obtained by using sensors on the camera mount and lens, such sensors can be...
We present an effective real-time approach for automatically reconstructing 3D human body poses from monocular video sequences. In this approach, the human body is automatically detected in the video sequence, then image features such as silhouette, edge, and color are extracted and integrated to infer 3D human poses iteratively by minimizing the cost function defined between 2D features from the...
To work at video rate, the maps that monocular SLAM builds are bound to be sparse, making them sensitive to the erroneous inclusion of moving points and to the deletion of valid points through temporary occlusion. This paper describes the parallel implementation of monoSLAM with a 3D object tracker, allowing reasoning about moving objects and occlusion. The SLAM process provides the object tracker...