2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos

Suyog Dutt Jain, Bo Xiong, Kristen Grauman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2117 - 2126

We propose an end-to-end learning framework for segmenting generic objects in videos. Our method learns to combine appearance and motion information to produce pixel level segmentation masks for all prominent objects in videos. We formulate this task as a structured prediction problem and design a two-stream fully convolutional neural network which fuses together motion and appearance in a unified...

chapter

Coarse-to-Fine Segmentation with Shape-Tailored Continuum Scale Spaces

Naeemullah Khan, Byung-Woo Hong, Anthony Yezzi, Ganesh Sundaramoorthi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1733 - 1742

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We formulate an energy for segmentation that is designed to have preference for segmenting the coarse over fine structure of the image, without smoothing across boundaries of regions. The energy is formulated by integrating a continuum of scales from a scale space computed from the heat equation within regions. We show that the energy can be optimized without computing a continuum of scales, but instead...

chapter

Weakly Supervised Dense Video Captioning

Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5159 - 5167

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper focuses on a novel and challenging vision task, dense video captioning, which aims to automatically describe a video clip with multiple informative and diverse caption sentences. The proposed method is trained without explicit annotation of fine-grained sentence to video region-sequence correspondence, but is only based on weak video-level sentence annotations. It differs from existing...

chapter

Probabilistic Temporal Subspace Clustering

Behnam Gholami, Vladimir Pavlovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4313 - 4322

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Subspace clustering is a common modeling paradigm used to identify constituent modes of variation in data with locally linear structure. These structures are common to many problems in computer vision, including modeling time series of complex human motion. However classical subspace clustering algorithms learn the relationships within a set of data without considering the temporal dependency and...

chapter

Online Video Object Segmentation via Convolutional Trident Network

Won-Dong Jang, Chang-Su Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7474 - 7483

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A semi-supervised online video object segmentation algorithm, which accepts user annotations about a target object at the first frame, is proposed in this work. We propagate the segmentation labels at the previous frame to the current frame using optical flow vectors. However, the propagation is error-prone. Therefore, we develop the convolutional trident network (CTN), which has three decoding branches:...

chapter

Fast Multi-frame Stereo Scene Flow with Motion Segmentation

Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6891 - 6900

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a new multi-frame method for efficiently computing scene flow (dense depth and optical flow) and camera ego-motion for a dynamic scene observed from a moving stereo camera rig. Our technique also segments out moving objects from the rigid scene. In our method, we first estimate the disparity map and the 6-DOF camera motion using stereo matching and visual odometry. We then identify regions...

chapter

Weakly Supervised Semantic Segmentation Using Web-Crawled Videos

Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2224 - 2232

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel algorithm for weakly supervised semantic segmentation based on image-level class labels only. In weakly supervised setting, it is commonly observed that trained model overly focuses on discriminative parts rather than the entire object area. Our goal is to overcome this limitation with no additional human intervention by retrieving videos relevant to target class labels from web...

chapter

Learning from Synthetic Humans

Gul Varol, Javier Romero, Xavier Martin, Naureen Mahmood, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4627 - 4635

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Estimating human pose, shape, and motion from images and videos are fundamental challenges with many applications. Recent advances in 2D human pose estimation use large amounts of manually-labeled training data for learning convolutional neural networks (CNNs). Such data is time consuming to acquire and difficult to extend. Moreover, manual labeling of 3D pose, depth and motion is impractical. In...

chapter

Primary Object Segmentation in Videos Based on Region Augmentation and Reduction

Yeong Jun Koh, Chang-Su Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7417 - 7425

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A novel algorithm to segment a primary object in a video sequence is proposed in this work. First, we generate candidate regions for the primary object using both color and motion edges. Second, we estimate initial primary object regions, by exploiting the recurrence property of the primary object. Third, we augment the initial regions with missing parts or reducing them by excluding noisy parts repeatedly...

chapter

Turning an Urban Scene Video into a Cinemagraph

Hang Yan, Yebin Liu, Yasutaka Furukawa

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1629 - 1637

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes an algorithm that turns a regular video capturing urban scenes into a high-quality endless animation, known as a Cinemagraph. The creation of a Cinemagraph usually requires a static camera in a carefully configured scene. The task becomes challenging for a regular video with a moving camera and objects. Our approach first warps an input video into the viewpoint of a reference camera...

chapter

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3530 - 3538

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or simulation environment. We advocate learning a generic vehicle motion model from large scale crowd-sourced video data, and develop an end-to-end...

chapter

Optical Flow in Mostly Rigid Scenes

Jonas Wulff, Laura Sevilla-Lara, Michael J. Black

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6911 - 6920

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The optical flow of natural scenes is a combination of the motion of the observer and the independent motion of objects. Existing algorithms typically focus on either recovering motion and structure under the assumption of a purely static world or optical flow for general unconstrained scenes. We combine these approaches in an optical flow algorithm that estimates an explicit segmentation of moving...

chapter

Learning Features by Watching Objects Move

Deepak Pathak, Ross Girshick, Piotr Dollar, Trevor Darrell, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6024 - 6033

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a novel yet intuitive approach to unsupervised feature learning. Inspired by the human visual system, we explore whether low-level motion-based grouping cues can be used to learn an effective visual representation. Specifically, we use unsupervised motion-based segmentation on videos to obtain segments, which we use as pseudo ground truth to train a convolutional network to segment...

chapter

DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction

Antonio Agudo, Francesc Moreno-Noguer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1513 - 1521

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an approach to reconstruct the 3D shape of multiple deforming objects from incomplete 2D trajectories acquired by a single camera. Additionally, we simultaneously provide spatial segmentation (i.e., we identify each of the objects in every frame) and temporal clustering (i.e., we split the sequence into primitive actions). This advances existing work, which only tackled the problem for...

chapter

Minimum Delay Moving Object Detection

Dong Lao, Ganesh Sundaramoorthi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4809 - 4818

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a general framework and method for detection of an object in a video based on apparent motion. The object moves relative to background motion at some unknown time in the video, and the goal is to detect and segment the object as soon it moves in an online manner. Due to unreliability of motion between frames, more than two frames are needed to reliably detect the object. Our method is designed...

chapter

Learning Motion Patterns in Videos

Pavel Tokmakov, Karteek Alahari, Cordelia Schmid

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 531 - 539

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The problem of determining whether an object is in motion, irrespective of camera motion, is far from being solved. We address this challenging task by learning motion patterns in videos. The core of our approach is a fully convolutional network, which is learned entirely from synthetic video sequences, and their ground-truth optical flow and motion segmentation. This encoder-decoder style architecture...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos

Coarse-to-Fine Segmentation with Shape-Tailored Continuum Scale Spaces

Weakly Supervised Dense Video Captioning

Probabilistic Temporal Subspace Clustering

Online Video Object Segmentation via Convolutional Trident Network

Fast Multi-frame Stereo Scene Flow with Motion Segmentation

Weakly Supervised Semantic Segmentation Using Web-Crawled Videos

Learning from Synthetic Humans

Primary Object Segmentation in Videos Based on Region Augmentation and Reduction

Turning an Urban Scene Video into a Cinemagraph

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Optical Flow in Mostly Rigid Scenes

Learning Features by Watching Objects Move

DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction

Minimum Delay Moving Object Detection

Learning Motion Patterns in Videos

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)