2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Direct Photometric Alignment by Mesh Deformation

Kaimo Lin, Nianjuan Jiang, Shuaicheng Liu, Loong-Fah Cheong, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2701 - 2709

The choice of motion models is vital in applications like image/video stitching and video stabilization. Conventional methods explored different approaches ranging from simple global parametric models to complex per-pixel optical flow. Mesh-based warping methods achieve a good balance between computational complexity and model flexibility. However, they typically require high quality feature correspondences...

chapter

CNN-Based Patch Matching for Optical Flow with Thresholded Hinge Embedding Loss

Christian Bailer, Kiran Varanasi, Didier Stricker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2710 - 2719

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Learning based approaches have not yet achieved their full potential in optical flow estimation, where their performance still trails heuristic approaches. In this paper, we present a CNN based patch matching approach for optical flow estimation. An important contribution of our approach is a novel thresholded loss for Siamese networks. We demonstrate that our loss performs clearly better than existing...

chapter

FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos

Suyog Dutt Jain, Bo Xiong, Kristen Grauman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2117 - 2126

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose an end-to-end learning framework for segmenting generic objects in videos. Our method learns to combine appearance and motion information to produce pixel level segmentation masks for all prominent objects in videos. We formulate this task as a structured prediction problem and design a two-stream fully convolutional neural network which fuses together motion and appearance in a unified...

chapter

Spatiotemporal Pyramid Network for Video Action Recognition

Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2097 - 2106

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Two-stream convolutional networks have shown strong performance in video action recognition tasks. The key idea is to learn spatiotemporal features by fusing convolutional networks spatially and temporally. However, it remains unclear how to model the correlations between the spatial and temporal structures at multiple abstraction levels. First, the spatial stream tends to fail if two videos share...

chapter

Flexible Spatio-Temporal Networks for Video Prediction

Chaochao Lu, Michael Hirsch, Bernhard Scholkopf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2137 - 2145

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We describe a modular framework for video frame prediction. We refer to it as a Flexible Spatio-Temporal Network (FSTN) as it allows the extrapolation of a video sequence as well as the estimation of synthetic frames lying in between observed frames and thus the generation of slow-motion videos. By devising a customized objective function comprising decoding, encoding, and adversarial losses, we are...

chapter

S2F: Slow-to-Fast Interpolator Flow

Yanchao Yang, Stefano Soatto

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3767 - 3776

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce a method to compute optical flow at multiple scales of motion, without resorting to multi-resolution or combinatorial methods. It addresses the key problem of small objects moving fast, and resolves the artificial binding between how large an object is and how fast it can move before being diffused away by classical scale-space. Even with no learning, it achieves top performance on the...

chapter

Fast Multi-frame Stereo Scene Flow with Motion Segmentation

Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6891 - 6900

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a new multi-frame method for efficiently computing scene flow (dense depth and optical flow) and camera ego-motion for a dynamic scene observed from a moving stereo camera rig. Our technique also segments out moving objects from the rigid scene. In our method, we first estimate the disparity map and the 6-DOF camera motion using stereo matching and visual odometry. We then identify regions...

chapter

Optical Flow Requires Multiple Strategies (but Only One Network)

Tal Schuster, Lior Wolf, David Gadot

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6921 - 6930

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We show that the matching problem that underlies optical flow requires multiple strategies, depending on the amount of image motion and other factors. We then study the implications of this observation on training a deep neural network for representing image patches in the context of descriptor based optical flow. We propose a metric learning method, which selects suitable negative samples based on...

chapter

Video Frame Interpolation via Adaptive Convolution

Simon Niklaus, Long Mai, Feng Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2270 - 2279

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Video frame interpolation typically involves two steps: motion estimation and pixel synthesis. Such a two-step approach heavily depends on the quality of motion estimation. This paper presents a robust video frame interpolation method that combines these two steps into a single process. Specifically, our method considers pixel synthesis for the interpolated frame as local convolution over two input...

chapter

Optical Flow Estimation Using a Spatial Pyramid Network

Anurag Ranjan, Michael J. Black

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2720 - 2729

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We learn to compute optical flow by combining a classical spatial-pyramid formulation with deep learning. This estimates large motions in a coarse-to-fine approach by warping one image of a pair at each pyramid level by the current flow estimate and computing an update to the flow. Instead of the standard minimization of an objective function at each pyramid level, we train one deep network per level...

chapter

Predicting Behaviors of Basketball Players from First Person Videos

Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1206 - 1215

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a method to predict the future movements (location and gaze direction) of basketball players as a whole from their first person videos. The predicted behaviors reflect an individual physical space that affords to take the next actions while conforming to social behaviors by engaging to joint attention. Our key innovation is to use the 3D reconstruction of multiple first person...

chapter

Filter Flow Made Practical: Massively Parallel and Lock-Free

Sathya N. Ravi, Yunyang Xiong, Lopamudra Mukherjee, Vikas Singh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5009 - 5018

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper is inspired by a relatively recent work of Seitz and Baker which introduced the so-called Filter Flow model. Filter flow finds the transformation relating a pair of (or multiple) images by identifying a large set of local linear filters, imposing additional constraints on certain structural properties of these filters enables Filter Flow to serve as a general one stop construction for a...

chapter

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks

Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 416 - 425

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene flow describes the motion of 3D objects in real world and potentially could be the basis of a good feature for 3D action recognition. However, its use for action recognition, especially in the context of convolutional neural networks (ConvNets), has not been previously studied. In this paper, we propose the extraction and use of scene flow for action recognition from RGB-D data. Previous works...

chapter

Deep Video Deblurring for Hand-Held Cameras

Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 237 - 246

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Motion blur from camera shake is a major problem in videos captured by hand-held devices. Unlike single-image deblurring, video-based approaches can take advantage of the abundant information that exists across neighboring frames. As a result the best performing methods rely on the alignment of nearby frames. However, aligning images is a computationally expensive and fragile procedure, and methods...

chapter

InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation

Shay Zweig, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6363 - 6372

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Sparse-to-dense interpolation for optical flow is a fundamental phase in the pipeline of most of the leading optical flow estimation algorithms. The current state-of-the-art method for interpolation, EpicFlow, is a local average method based on an edge aware geodesic distance. We propose a new data-driven sparse-to-dense interpolation algorithm based on a fully convolutional network. We draw inspiration...

chapter

Event-Based Visual Inertial Odometry

Alex Zihao Zhu, Nikolay Atanasov, Kostas Daniilidis

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5816 - 5824

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Event-based cameras provide a new visual sensing model by detecting changes in image intensity asynchronously across all pixels on the camera. By providing these events at extremely high rates (up to 1MHz), they allow for sensing in both high speed and high dynamic range situations where traditional cameras may fail. In this paper, we present the first algorithm to fuse a purely event-based tracking...

chapter

Accurate Optical Flow via Direct Cost Volume Processing

Jia Xu, Rene Ranftl, Vladlen Koltun

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5807 - 5815

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an optical flow estimation approach that operates on the full four-dimensional cost volume. This direct approach shares the structural benefits of leading stereo matching pipelines, which are known to yield high accuracy. To this day, such approaches have been considered impractical due to the size of the cost volume. We show that the full four-dimensional cost volume can be constructed...

chapter

DeMoN: Depth and Motion Network for Learning Monocular Stereo

Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5622 - 5631

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper we formulate structure from motion as a learning problem. We train a convolutional network end-to-end to compute depth and camera motion from successive, unconstrained image pairs. The architecture is composed of multiple stacked encoder-decoder networks, the core part being an iterative network that is able to improve its own predictions. The network estimates not only depth and motion,...

chapter

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1647 - 1655

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The FlowNet demonstrated that optical flow estimation can be cast as a learning problem. However, the state of the art with regard to the quality of the flow has still been defined by traditional methods. Particularly on small displacements and real-world data, FlowNet cannot compete with variational methods. In this paper, we advance the concept of end-to-end learning of optical flow and make it...

chapter

Deep Crisp Boundaries

Yupei Wang, Xin Zhao, Kaiqi Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1724 - 1732

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Edge detection had made significant progress with the help of deep Convolutional Networks (ConvNet). ConvNet based edge detectors approached human level performance on standard benchmarks. We provide a systematical study of these detector outputs, and show that they failed to accurately localize edges, which can be adversarial for tasks that require crisp edge inputs. In addition, we propose a novel...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Direct Photometric Alignment by Mesh Deformation

CNN-Based Patch Matching for Optical Flow with Thresholded Hinge Embedding Loss

FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos

Spatiotemporal Pyramid Network for Video Action Recognition

Flexible Spatio-Temporal Networks for Video Prediction

S2F: Slow-to-Fast Interpolator Flow

Fast Multi-frame Stereo Scene Flow with Motion Segmentation

Optical Flow Requires Multiple Strategies (but Only One Network)

Video Frame Interpolation via Adaptive Convolution

Optical Flow Estimation Using a Spatial Pyramid Network

Predicting Behaviors of Basketball Players from First Person Videos

Filter Flow Made Practical: Massively Parallel and Lock-Free

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks

Deep Video Deblurring for Hand-Held Cameras

InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation

Event-Based Visual Inertial Odometry

Accurate Optical Flow via Direct Cost Volume Processing

DeMoN: Depth and Motion Network for Learning Monocular Stereo

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Deep Crisp Boundaries

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)