Search results

chapter

3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images

Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5679 - 5688

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a simple, yet effective approach for real-time hand pose estimation from single depth images using three-dimensional Convolutional Neural Networks (3D CNNs). Image based features extracted by 2D CNNs are not directly suitable for 3D hand pose estimation due to the lack of 3D spatial information. Our proposed 3D CNN taking a 3D volumetric representation of the hand depth image as input can...

chapter

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

Denis Tome, Chris Russell, Lourdes Agapito

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5689 - 5698

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a unified formulation for the problem of 3D human pose estimation from a single raw RGB image that reasons jointly about 2D joint estimation and 3D pose reconstruction to improve both tasks. We take an integrated approach that fuses probabilistic knowledge of 3D human pose with a multi-stage CNN architecture and uses the knowledge of plausible 3D landmark locations to refine the search...

chapter

Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations

Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1253 - 1262

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent advances with Convolutional Networks (ConvNets) have shifted the bottleneck for many computer vision tasks to annotated data collection. In this paper, we present a geometry-driven approach to automatically collect annotations for human pose prediction tasks. Starting from a generic ConvNet for 2D human pose, and assuming a multi-view setup, we describe an automatic way to collect accurate...

chapter

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 388 - 397

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Monocular 3D object parsing is highly desirable in various scenarios including occlusion reasoning and holistic scene interpretation. We present a deep convolutional neural network (CNN) architecture to localize semantic parts in 2D image and 3D space while inferring their visibility states, given a single RGB image. Our key insight is to exploit domain knowledge to regularize the network by deeply...

chapter

Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes from 2D Ones in RGB-Depth Images

Zhuo Deng, Longin Jan Latecki

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 398 - 406

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses the problem of amodal perception of 3D object detection. The task is to not only find object localizations in the 3D world, but also estimate their physical sizes and poses, even if only parts of them are visible in the RGB-D image. Recent approaches have attempted to harness point cloud from depth channel to exploit 3D features directly in the 3D space and demonstrated the superiority...

chapter

3D Bounding Box Estimation Using Deep Learning and Geometry

Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Kosecka

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5632 - 5640

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a method for 3D object detection and pose estimation from a single image. In contrast to current techniques that only regress the 3D orientation of an object, our method first regresses relatively stable 3D object properties using a deep convolutional neural network and then combines these estimates with geometric constraints provided by a 2D object bounding box to produce a complete 3D...

chapter

Deep MANTA: A Coarse-to-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis from Monocular Image

Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Celine Teuliere, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1827 - 1836

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we present a novel approach, called Deep MANTA (Deep Many-Tasks), for many-task vehicle analysis from a given image. A robust convolutional network is introduced for simultaneous vehicle detection, part localization, visibility characterization and 3D dimension estimation. Its architecture is based on a new coarse-to-fine object proposal that boosts the vehicle detection. Moreover,...

chapter

Unite the People: Closing the Loop Between 3D and 2D Human Representations

Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4704 - 4713

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

3D models provide a common ground for different representations of human bodies. In turn, robust 2D estimation has proven to be a powerful tool to obtain 3D fits in-the-wild. However, depending on the level of detail, it can be hard to impossible to acquire labeled data for training 2D estimators on large scale. We propose a hybrid approach to this problem: with an extended version of the recently...

chapter

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Joao Carreira, Andrew Zisserman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4724 - 4733

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it difficult to identify good video architectures, as most methods obtain similar performance on existing small-scale benchmarks. This paper re-evaluates state-of-the-art architectures in light of the new Kinetics Human Action Video dataset. Kinetics has two orders of magnitude more data, with 400 human...

chapter

Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting

Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3681 - 3690

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a new Cascaded Shape Regression (CSR) architecture, namely Dynamic Attention-Controlled CSR (DAC-CSR), for robust facial landmark detection on unconstrained faces. Our DAC-CSR divides facial landmark detection into three cascaded sub-tasks: face bounding box refinement, general CSR and attention-controlled CSR. The first two stages refine initial face bounding boxes and output intermediate...

chapter

3D lunar craters detection based on stereo matching

Hongmei Zhu, Jihao Yin, Ding Yuan

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 2333 - 2336

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

In this paper, we focus on the 3D crater detection problem on lunar surface, which helps high-precision spacecraft landing and rover navigation in moon exploration projects. A random structured forests method is firstly applied to detect the 2D edges of craters, and then dense correspondence between CCD stereo images estimates the elevations of craters. Finally, we propose a 3D crater detection model,...

chapter

Efficient quality-eactor estimation of a vertical cavity employing a high-contrast grating

Alireza Taghizadeh, Jesper Mork, Il-Sug Chung

2017 International Conference on Numerical Simulation of Optoelectronic Devices (NUSOD) > 87 - 88

2017 International Conference on Numerical Simulation of Optoelectronic Devices (NUSOD)

Hybrid vertical cavity lasers employing high-contrast grating reflectors are attractive for Si-integrated light source applications. Here, a method for reducing a three-dimensional (3D) optical simulation of this laser structure to lower-dimensional simulations is suggested, which allows for very fast and approximate analysis of the quality-factor of the 3D cavity. This approach enables us to efficiently...

chapter

Depth-Stretch: Enhancing Depth Perception Without Depth

Hagit Hel-Or, Yacov Hel-Or, Renato Keshet

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1006 - 1014

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

A simple and efficient method is presented to enhance the depth perception of an image. The approach termed Depth-Stretch (D-stretch) is a tone mapping operation that is applied to the shading component of the given image. Although re-rendering a scene under geometric transformations typically requires extracting the 3D model of the scene, we show that under very simple assumptions D-stretch can be...

chapter

Position Determines Perspective: Investigating Perspective Distortion for Image Forensics of Faces

Bo Peng, Wei Wang, Jing Dong, Tieniu Tan

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1813 - 1821

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

This paper points out a new telltale trace – the characteristic of perspective distortion (CPD), for the image forensics of faces. The perspective distortion is determined by the position of image shooting, and it is often overlooked when creating a forgery, which results in the inconsistency between the claimed camera parameters and the CPD in the face image. To investigate this consistency problem,...

chapter

3D Pose Regression Using Convolutional Neural Networks

Siddharth Mahendran, Haider Ali, Rene Vidal

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 494 - 495

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

3D pose estimation is a key component of many important computer vision tasks like autonomous navigation and robot manipulation. Current state-of-the-art approaches for 3D object pose estimation, like Viewpoints & Keypoints and Render for CNN, solve this problem by discretizing the pose space into bins and solving a pose-classification task. We argue that 3D pose is continuous and can be solved...

chapter

3D-Assisted Coarse-to-Fine Extreme-Pose Facial Landmark Detection

Shengtao Xiao, Jianshu Li, Yunpeng Chen, Zhecan Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2060 - 2068

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

We propose a novel 3D-assisted coarse-to-fine extreme-pose facial landmark detection system in this work. For a given face image, our system first refines the face bounding box with landmark locations inferred from a 3D face model generated by a Recurrent 3D Regressor at coarse level. Another R3R is then employed to fit a 3D face model onto the 2D face image cropped with the refined bounding box at...

chapter

Joint 3D Human Motion Capture and Physical Analysis from Monocular Videos

Petrissa Zell, Bastian Wandt, Bodo Rosenhahn

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 17 - 26

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Motion analysis is often restricted to a laboratory setup with multiple cameras and force sensors which requires expensive equipment and knowledgeable operators. Therefore it lacks in simplicity and flexibility. We propose an algorithm combining monocular 3D pose estimation with physics-based modeling to introduce a statistical framework for fast and robust 3D motion analysis from 2D video-data. We...

chapter

Model-based pose estimation on-board MAVs equipped with 2D laser scanners for the automatic inspection of electric towers

Carlos Vina, Pascal Morin

2017 11th International Workshop on Robot Motion and Control (RoMoCo) > 78 - 84

2017 11th International Workshop on Robot Motion and Control (RoMoCo)

We propose a model-based approach to obtain local pose estimates of micro aerial vehicles (MAVs), with respect to electric towers, using 2D laser scanners. A simple planar model for the body of an electric tower is presented, which is used in an iterative closest point (ICP) framework to register incoming laser scans. This is complemented with attitude estimates from IMU measurements to obtain a complete...

chapter

The effectiveness of random selection for IGA-based texture search

Ken Ishibashi

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 657 - 662

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

This paper provides a novel texture search method for texture images. Creating a computer graphics (CG) is a popular task in many media creations. However, CG creators require their abundant time and effort. In addition, it is difficult for non-professional creators to make a 3D CG scene. This is because that they have to choose appropriate colors, textures, and lighting patterns in addition to 3D...

chapter

Evaluation of the radius of rebars inside a reinforced concrete sample by using 2D inverse problem on radar measurements

M. Albrand, G. Klysz, X. Ferrieres

2017 9th International Workshop on Advanced Ground Penetrating Radar (IWAGPR) > 1 - 4

2017 9th International Workshop on Advanced Ground-Penetrating Radar (IWAGPR)

The paper describes an optimization process to evaluate the radius of rebars located inside reinforced concrete sample, by solving a 2D inverse problem. This process has been applied to measurements and gives acceptable value of radius. However, there exist differences between measured and computed fields by using the optimized radius. A 3D model of the device has been proposed to improve this drawback.

INFONA - science communication portal

Search results

3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes from 2D Ones in RGB-Depth Images

3D Bounding Box Estimation Using Deep Learning and Geometry

Deep MANTA: A Coarse-to-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis from Monocular Image

Unite the People: Closing the Loop Between 3D and 2D Human Representations

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting

3D lunar craters detection based on stereo matching

Efficient quality-eactor estimation of a vertical cavity employing a high-contrast grating

Depth-Stretch: Enhancing Depth Perception Without Depth

Position Determines Perspective: Investigating Perspective Distortion for Image Forensics of Faces

3D Pose Regression Using Convolutional Neural Networks

3D-Assisted Coarse-to-Fine Extreme-Pose Facial Landmark Detection

Joint 3D Human Motion Capture and Physical Analysis from Monocular Videos

Model-based pose estimation on-board MAVs equipped with 2D laser scanners for the automatic inspection of electric towers

The effectiveness of random selection for IGA-based texture search

Evaluation of the radius of rebars inside a reinforced concrete sample by using 2D inverse problem on radar measurements

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options