The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Wireless localization has attracted increasing attention from both researchers and engineers. Most of the state-of-the-art localization techniques are designed based on line-of-sight (LOS) radio propagation. The present paper focuses on indoor scenarios which are multipath and rich scattering environments considering non-line-of-sight (NLOS) propagations. Based on the measured angle-of-departure (AOD),...
This paper presents a novel real-time approach for robust high precision and high quality depth estimation. It extends recent work on real-time Patch-Sweeping by combining the advantages of a robust hybrid stereo-based disparity estimator with the high accuracy of the Patch-Sweeping approach. It overcomes limitations of the existing Patch-Sweep approach, such as limited search range. Further, it implicitly...
In this paper we present a novel object classification and pose recovery algorithm which takes advantage of existing 3D models and multiple synchronized and calibrated views. Having a calibrated scenario provides redundant data which can be exploited for gathering spatial consistency of an object's 3D pose and its class. In a first step, the cameras need to be calibrated and aligned to one common...
Facial landmark detection is an essential module in many face related applications and it often appears as the most time consuming part in face processing pipeline. This paper proposes a fast and effective method for facial landmark detection using Haar cascade classifiers and a simple 3D head model, which not only detects the position of landmark points but also gives an estimation of head pose such...
Inaccuracy depth estimation may influence on depth coding and virtual view rendering in the free-viewpoint television (FTV) system, an improved depth map estimation is proposed to solve the problem for coding and view synthesis. Firstly, check the consistency of initial depth, and the influence of initial miss-matches is minimized by introduction of an additional adaptive matching error selection...
We describe an approach to2D-to-3D video conversion for the stereoscopic display. Targeting the problem of synthesizing the frames of a virtual ‘right view’ from the original monocular 2D video, we generate the stereoscopic video in steps as following. (1) A 2.5D depth map is first estimated in a multi-cue fusion manner by leveraging motion cues and photometric cues in video frames with a depth prior...
Multi-camera 3D tracking systems with overlapping cameras represent a powerful mean for scene analysis, as they potentially allow greater robustness than monocular systems and provide useful 3D information about object location and movement. However, their performance relies on accurately calibrated camera networks, which is not a realistic assumption in real surveillance environments. Here, we introduce...
Single camera stereo system utilizes mirrors and a single camera for computational stereo, where the mirrors provide extra views needed for stereo and 3D reconstruction. In this paper, we investigate the basic epiploar geometric properties of the single camera stereo image and propose a novel image rectification technique to map the epipolar lines in the original image into the horizontally aligned...
Holoscopic imaging, also known as integral imaging, provides a solution for glassless 3D, and is promising to change the market for 3D television. To start, this paper briefly describes the general concepts of holoscopic imaging, focusing mainly on the spatial correlations inherent to this new type of content, which appear due to the micro-lens array that is used for both acquisition and display....
In this contribution a novel method to compute dense point-to-point correspondences between 3D faces is presented. The faces are aligned in 3D space with a Generalized Procrustes Analysis and subsequently mapped into 2D space. To compute a correspondence flow between two faces an energy function is minimized which is based on the following assumptions: smoothness of the flow, mapping of landmarks...
A depth map represents three-dimensional (3D) scene information and is used to synthesize virtual views in 3D video. Since the quality of synthesized virtual views highly depends on the quality of depth map, efficient depth compression is crucial to realize the 3D video system. However compressing depth map using existing video coding techniques yields unacceptable distortions while rendering virtual...
In surveillance videos, cues such as head or body pose provide important information for analyzing people's behavior and interactions. In this paper we propose an approach that jointly estimates body location and body pose in monocular surveillance video. Our approach is based on tracks derived by multi-object tracking. First, body pose classification is conducted using sparse representation technique...
In this study the perception of crowds was investigated in urban environment. The images of crowds were viewed non-stereoscopically and stereoscopically with HMD (head-mounted display). The task of the participants was to count the number of persons in the crowds. The results clearly indicate that stereoscopic viewing enhances perception of crowds. The counting task was determined to be easiest with...
A typical video surveillance system consists of at least one camera, controlled by an operator. To decrease the human error rate and to generally lessen the burden of operators, many object tracking systems have been implemented, most of which work in 2D image space. If used centralized, this is a very expensive task. Furthermore, if several views are to be fused, large inaccuracies arise due to ground...
The work presented in this paper addresses a practical approach to the problem of 3D pose estimation. The proposed method extends a classical 2D dead reckoning system to a 3D pose estimation system by merging data from odometry and multiple low cost rate gyros and accelerometers. The localization problem is decomposed into two parts, i.e. attitude estimation followed by pose estimation. Based on the...
In this paper, we address the recovering from monocular images focusing on designing a novel image descriptor derived from the second generation Bandelet transformation, noted as Bandelet2, to tackle with estimation accuracy combined with state-of-art prediction methods. The proposed Bandelet2 image representation could boost the accuracy for the final 3D pose prediction in monocular video images...
In this paper, we explore the combined use of inertial sensors and the Kinect for applications on rehabilitation robotics and assistive devices. In view of the deficiencies of each individual system, a new method based on Kalman filtering was developed in order to perform online calibration of sensor errors automatically whenever measurements from Kinect are available. The method was evaluated on...
For radiotherapy planning, contouring of target volume and healthy structures at risk in CT volumes is essential. To automate this process, one of the available segmentation techniques can be used for many thoracic organs except the esophagus, which is very hard to segment due to low contrast. In this work we propose to initialize our previously introduced model based 3D level set esophagus segmentation...
The occlusion between real and virtual objects influences not only seamless merging of virtual and real environments but also users' visual perception of orientations & locations and spatial interactions in augmented reality. If there exist a large amount of video sequences for representing the real environment, and each video sequence utilizes computer vision algorithms to deal with all of occlusions...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.