2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Deep Image Matting

Ning Xu, Brian Price, Scott Cohen, Thomas Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 311 - 320

Image matting is a fundamental computer vision problem and has many applications. Previous algorithms have poor performance when an image has similar foreground and background colors or complicated textures. The main reasons are prior methods 1) only use low-level features and 2) lack high-level context. In this paper, we propose a novel deep learning based algorithm that can tackle both these problems...

chapter

Wetness and Color from a Single Multispectral Image

Mihoko Shimano, Hiroki Okawa, Yuta Asano, Ryoma Bise, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 321 - 329

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Visual recognition of wet surfaces and their degrees of wetness is important for many computer vision applications. It can inform slippery spots on a road to autonomous vehicles, muddy areas of a trail to humanoid robots, and the freshness of groceries to us. In the past, monochromatic appearance change, the fact that surfaces darken when wet, has been modeled to recognize wet surfaces. In this paper,...

chapter

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling

Yuanming Hu, Baoyuan Wang, Stephen Lin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 330 - 339

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Improvements in color constancy have arisen from the use of convolutional neural networks (CNNs). However, the patch-based CNNs that exist for this problem are faced with the issue of estimation ambiguity, where a patch may contain insufficient information to establish a unique or even a limited possible range of illumination colors. Image patches with estimation ambiguity not only appear with great...

chapter

Face Normals "In-the-Wild" Using Fully Convolutional Networks

George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 340 - 349

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work we pursue a data-driven approach to the problem of estimating surface normals from a single intensity image, focusing in particular on human faces. We introduce new methods to exploit the currently available facial databases for dataset construction and tailor a deep convolutional neural network to the task of estimating facial surface normals in-the-wild. We train a fully convolutional...

chapter

A Non-convex Variational Approach to Photometric Stereo under Inaccurate Lighting

Yvain Queau, Tao Wu, Francois Lauze, Jean-Denis Durou, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 350 - 359

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper tackles the photometric stereo problem in the presence of inaccurate lighting, obtained either by calibration or by an uncalibrated photometric stereo method. Based on a precise modeling of noise and outliers, a robust variational approach is introduced. It explicitly accounts for self-shadows, and enforces robustness to cast-shadows and specularities by resorting to redescending M-estimators...

chapter

A Linear Extrinsic Calibration of Kaleidoscopic Imaging System from Single 3D Point

Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 360 - 368

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes a new extrinsic calibration of kaleidoscopic imaging system by estimating normals and distances of the mirrors. The problem to be solved in this paper is a simultaneous estimation of all mirror parameters consistent throughout multiple reflections. Unlike conventional methods utilizing a pair of direct and mirrored images of a reference 3D object to estimate the parameters on a...

chapter

Polarimetric Multi-view Stereo

Zhaopeng Cui, Jinwei Gu, Boxin Shi, Ping Tan, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 369 - 378

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-view stereo relies on feature correspondences for 3D reconstruction, and thus is fundamentally flawed in dealing with featureless scenes. In this paper, we propose polarimetric multi-view stereo, which combines per-pixel photometric information from polarization with epipolar constraints from multiple views for 3D reconstruction. Polarization reveals surface normal information, and is thus helpful...

chapter

An Exact Penalty Method for Locally Convergent Maximum Consensus

Huu Le, Tat-Jun Chin, David Suter

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 379 - 387

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Maximum consensus estimation plays a critically important role in computer vision. Currently, the most prevalent approach draws from the class of non-deterministic hypothesize-and-verify algorithms, which are cheap but do not guarantee solution quality. On the other extreme, there are global algorithms which are exhaustive search in nature and can be costly for practical-sized inputs. This paper aims...

chapter

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 388 - 397

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Monocular 3D object parsing is highly desirable in various scenarios including occlusion reasoning and holistic scene interpretation. We present a deep convolutional neural network (CNN) architecture to localize semantic parts in 2D image and 3D space while inferring their visibility states, given a single RGB image. Our key insight is to exploit domain knowledge to regularize the network by deeply...

chapter

Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes from 2D Ones in RGB-Depth Images

Zhuo Deng, Longin Jan Latecki

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 398 - 406

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses the problem of amodal perception of 3D object detection. The task is to not only find object localizations in the 3D world, but also estimate their physical sizes and poses, even if only parts of them are visible in the RGB-D image. Recent approaches have attempted to harness point cloud from depth channel to exploit 3D features directly in the 3D space and demonstrated the superiority...

chapter

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Guillermo Garcia-Hernando, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 407 - 415

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A human action can be seen as transitions between ones body poses over time, where the transition depicts a temporal relation between two poses. Recognizing actions thus involves learning a classifier sensitive to these pose transitions as well as to static poses. In this paper, we introduce a novel method called transitions forests, an ensemble of decision trees that both learn to discriminate static...

chapter

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks

Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 416 - 425

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene flow describes the motion of 3D objects in real world and potentially could be the basis of a good feature for 3D action recognition. However, its use for action recognition, especially in the context of convolutional neural networks (ConvNets), has not been previously studied. In this paper, we propose the extraction and use of scene flow for action recognition from RGB-D data. Previous works...

chapter

Detecting Masked Faces in the Wild with LLE-CNNs

Shiming Ge, Jia Li, Qiting Ye, Zhao Luo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 426 - 434

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Detecting faces with occlusions is a challenging task due to two main reasons: 1) the absence of large datasets of masked faces, and 2) the absence of facial cues from the masked regions. To address these two issues, this paper first introduces a dataset, denoted as MAFA, with 30, 811 Internet images and 35, 806 masked faces. Faces in the dataset have various orientations and occlusion degrees, while...

chapter

A Domain Based Approach to Social Relation Recognition

Qianru Sun, Bernt Schiele, Mario Fritz

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 435 - 444

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Social relations are the foundation of human daily life. Developing techniques to analyze such relations from visual data bears great potential to build machines that better understand us and are capable of interacting with us at a social level. Previous investigations have remained partial due to the overwhelming diversity and complexity of the topic and consequently have only focused on a handful...

chapter

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Junwu Weng, Chaoqun Weng, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 445 - 454

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Motivated by previous success of using non-parametric methods to recognize objects, e.g., NBNN [2], we extend it to recognize actions using skeletons. Each 3D action is presented by a sequence of 3D poses. Similar to NBNN, our proposed Spatio-Temporal-NBNN applies stage-to-class distance to classify actions. However, ST-NBNN takes the spatio-temporal structure of 3D actions into consideration and...

chapter

Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks

Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 455 - 464

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Building robust classifiers trained on data susceptible to group or subject-specific variations is a challenging pattern recognition problem. We develop hierarchical Bayesian neural networks to capture subject-specific variations and share statistical strength across subjects. Leveraging recent work on learning Bayesian neural networks, we build fast, scalable algorithms for inferring the posterior...

chapter

Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core

Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 465 - 473

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a novel method to track 3D models in color and depth data. To this end, we introduce approximations that accelerate the state-of-the-art in region-based tracking by an order of magnitude while retaining similar accuracy. Furthermore, we show how the method can be made more robust in the presence of depth data and consequently formulate a new joint contour and ICP tracking energy. We present...

chapter

Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild

Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 474 - 483

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene text detection has attracted great attention these years. Text potentially exist in a wide variety of images or videos and play an important role in understanding the scene. In this paper, we present a novel text detection algorithm which is composed of two cascaded steps: (1) a multi-scale fully convolutional neural network (FCN) is proposed to extract text block regions, (2) a novel instance...

chapter

Viraliency: Pooling Local Virality

Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 484 - 492

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In our overly-connected world, the automatic recognition of virality – the quality of an image or video to be rapidly and widely spread in social networks – is of crucial importance, and has recently awaken the interest of the computer vision community. Concurrently, recent progress in deep learning architectures showed that global pooling strategies allow the extraction of activation...

chapter

A Non-local Low-Rank Framework for Ultrasound Speckle Reduction

Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 493 - 501

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Speckle refers to the granular patterns that occur in ultrasound images due to wave interference. Speckle removal can greatly improve the visibility of the underlying structures in an ultrasound image and enhance subsequent post processing. We present a novel framework for speckle removal based on low-rank non-local filtering. Our approach works by first computing a guidance image that assists in...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep Image Matting

Wetness and Color from a Single Multispectral Image

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling

Face Normals "In-the-Wild" Using Fully Convolutional Networks

A Non-convex Variational Approach to Photometric Stereo under Inaccurate Lighting

A Linear Extrinsic Calibration of Kaleidoscopic Imaging System from Single 3D Point

Polarimetric Multi-view Stereo

An Exact Penalty Method for Locally Convergent Maximum Consensus

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes from 2D Ones in RGB-Depth Images

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks

Detecting Masked Faces in the Wild with LLE-CNNs

A Domain Based Approach to Social Relation Recognition

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks

Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core

Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild

Viraliency: Pooling Local Virality

A Non-local Low-Rank Framework for Ultrasound Speckle Reduction

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)