2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

The World of Fast Moving Objects

Denys Rozumnyi, Jan Kotera, Filip Sroubek, Lukas Novotny, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4838 - 4846

The notion of a Fast Moving Object (FMO), i.e. an object that moves over a distance exceeding its size within the exposure time, is introduced. FMOs may, and typically do, rotate with high angular speed. FMOs are very common in sports videos, but are not rare elsewhere. In a single frame, such objects are often barely visible and appear as semitransparent streaks. A method for the detection and tracking...

chapter

Identifying First-Person Camera Wearers in Third-Person Videos

Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4734 - 4742

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We consider scenarios in which we wish to perform joint scene understanding, object tracking, activity recognition, and other tasks in scenarios in which multiple people are wearing body-worn cameras while a third-person static camera also captures the scene. To do this, we need to establish person-level correspondences across first-and third-person videos, which is challenging because the camera...

chapter

PoseTrack: Joint Multi-person Pose Estimation and Tracking

Umar Iqbal, Anton Milan, Juergen Gall

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4654 - 4663

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we introduce the challenging problem of joint multi-person pose estimation and tracking of an unknown number of persons in unconstrained videos. Existing methods for multi-person pose estimation in images cannot be applied directly to this problem, since it also requires to solve the problem of person association over time in addition to the pose estimation for each person. We therefore...

chapter

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition

Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4255 - 4263

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This work is about recognizing human activities occurring in videos at distinct semantic levels, including individual actions, interactions, and group activities. The recognition is realized using a two-level hierarchy of Long Short-Term Memory (LSTM) networks, forming a feed-forward deep architecture, which can be trained end-to-end. In comparison with existing architectures of LSTMs, we make two...

chapter

LSTM Self-Supervision for Detailed Behavior Analysis

Biagio Brattoli, Uta Buchler, Anna-Sophia Wahl, Martin E. Schwab, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3747 - 3756

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Behavior analysis provides a crucial non-invasive and easily accessible diagnostic tool for biomedical research. A detailed analysis of posture changes during skilled motor tasks can reveal distinct functional deficits and their restoration during recovery. Our specific scenario is based on a neuroscientific study of rodents recovering from a large sensorimotor cortex stroke and skilled forelimb grasping...

chapter

Fine-Grained Recognition as HSnet Search for Informative Image Parts

Michael Lam, Behrooz Mahasseni, Sinisa Todorovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6497 - 6506

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This work addresses fine-grained image classification. Our work is based on the hypothesis that when dealing with subtle differences among object classes it is critical to identify and only account for a few informative image parts, as the remaining image context may not only be uninformative but may also hurt recognition. This motivates us to formulate our problem as a sequential search for informative...

chapter

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification

Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3165 - 3174

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we introduce a new video representation for action classification that aggregates local convolutional features across the entire spatio-temporal extent of the video. We do so by integrating state-of-the-art two-stream networks [42] with learnable spatio-temporal feature aggregation [6]. The resulting architecture is end-to-end trainable for whole-video classification. We investigate...

chapter

Deep Network Flow for Multi-object Tracking

Samuel Schulter, Paul Vernaza, Wongun Choi, Manmohan Chandraker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2730 - 2739

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Data association problems are an important component of many computer vision applications, with multi-object tracking being one of the most prominent examples. A typical approach to data association involves finding a graph matching or network flow that minimizes a sum of pairwise association costs, which are often either hand-crafted or learned as linear functions of fixed features. In this work,...

chapter

Predicting Behaviors of Basketball Players from First Person Videos

Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1206 - 1215

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a method to predict the future movements (location and gaze direction) of basketball players as a whole from their first person videos. The predicted behaviors reflect an individual physical space that affords to take the next actions while conforming to social behaviors by engaging to joint attention. Our key innovation is to use the 3D reconstruction of multiple first person...

chapter

Unrolling the Shutter: CNN to Correct Motion Distortions

Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2345 - 2353

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Row-wise exposure delay present in CMOS cameras is responsible for skew and curvature distortions known as the rolling shutter (RS) effect while imaging under camera motion. Existing RS correction methods resort to using multiple images or tailor scene-specific correction schemes. We propose a convolutional neural network (CNN) architecture that automatically learns essential scene features from a...

chapter

Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing

Yu-Chuan Su, Kristen Grauman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1368 - 1376

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

360° Video requires human viewers to actively control where to look while watching the video. Although it provides a more immersive experience of the visual content, it also introduces additional burden for viewers, awkward interfaces to navigate the video lead to suboptimal viewing experiences. Virtual cinematography is an appealing direction to remedy these problems, but conventional methods...

chapter

Fine-to-Coarse Global Registration of RGB-D Scans

Maciej Halber, Thomas Funkhouser

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6660 - 6669

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

RGB-D scanning of indoor environments is important for many applications, including real estate, interior design, and virtual reality. However, it is still challenging to register RGB-D images from a hand-held camera over a long video sequence into a globally consistent 3D model. Current methods often can lose tracking or drift and thus fail to reconstruct salient structures in large environments...

chapter

On the Two-View Geometry of Unsynchronized Cameras

Cenek Albl, Zuzana Kukelova, Andrew Fitzgibbon, Jan Heller, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5593 - 5602

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present new methods of simultaneously estimating camera geometry and time shift from video sequences from multiple unsynchronized cameras. Algorithms for simultaneous computation of a fundamental matrix or a homography with unknown time shift between images are developed. Our methods use minimal correspondence sets (eight for fundamental matrix and four and a half for homography) and therefore...

chapter

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4169 - 4177

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In recent years, both online retail and video hosting service have been exponentially grown. In this paper, a novel deep neural network, called AsymNet, is proposed to explore a new cross-domain task, Video2Shop, targeting for matching clothes appeared in videos to the exactly same items in online shops. For the image side, well-established methods are used to detect and extract features for clothing...

chapter

Spatio-Temporal Alignment of Non-overlapping Sequences from Independently Panning Cameras

S. Morteza Safdarnejad, Xiaoming Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6393 - 6401

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses the problem of spatio-temporal alignment of multiple video sequences. We identify and tackle a novel scenario of this problem referred to as Nonoverlapping Sequences (NOS). NOS are captured by multiple freely panning handheld cameras whose field of views (FOV) might have no direct spatial overlap. With the popularity of mobile sensors, NOS rise when multiple cooperative users...

chapter

Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras

Artem Rozantsev, Sudipta N. Sinha, Debadeepta Dey, Pascal Fua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2482 - 2491

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a new method to estimate the 6-dof trajectory of a flying object such as a quadrotor UAV within a 3D airspace monitored using multiple fixed ground cameras. It is based on a new structure from motion formulation for the 3D reconstruction of a single moving point with known motion dynamics. Our main contribution is a new bundle adjustment procedure, which in addition to optimizing the camera...

chapter

DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2165 - 2174

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce a Deep Stochastic IOC RNN Encoder-decoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes. DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i.e., given the same context, future may vary), 2) foreseeing the potential future outcomes and...

chapter

Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization

Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1330 - 1338

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

For survival, a living agent (e.g., human in Fig. 1(a)) must have the ability to assess risk (1) by temporally anticipating accidents before they occur (Fig. 1(b)), and (2) by spatially localizing risky regions (Fig. 1(c)) in the environment to move away from threats. In this paper, we take an agent-centric approach to study the accident anticipation and risky region localization tasks. We propose...

chapter

DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction

Antonio Agudo, Francesc Moreno-Noguer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1513 - 1521

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an approach to reconstruct the 3D shape of multiple deforming objects from incomplete 2D trajectories acquired by a single camera. Additionally, we simultaneously provide spatial segmentation (i.e., we identify each of the objects in every frame) and temporal clustering (i.e., we split the sequence into primitive actions). This advances existing work, which only tackled the problem for...

chapter

Forecasting Interactive Dynamics of Pedestrians with Fictitious Play

Wei-Chiu Ma, De-An Huang, Namhoon Lee, Kris M. Kitani

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4636 - 4644

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We develop predictive models of pedestrian dynamics by encoding the coupled nature of multi-pedestrian interaction using game theory and deep learning-based visual analysis to estimate person-specific behavior parameters. We focus on predictive models since they are important for developing interactive autonomous systems (e.g., autonomous cars, home robots, smart homes) that can understand different...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The World of Fast Moving Objects

Identifying First-Person Camera Wearers in Third-Person Videos

PoseTrack: Joint Multi-person Pose Estimation and Tracking

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition

LSTM Self-Supervision for Detailed Behavior Analysis

Fine-Grained Recognition as HSnet Search for Informative Image Parts

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification

Deep Network Flow for Multi-object Tracking

Predicting Behaviors of Basketball Players from First Person Videos

Unrolling the Shutter: CNN to Correct Motion Distortions

Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing

Fine-to-Coarse Global Registration of RGB-D Scans

On the Two-View Geometry of Unsynchronized Cameras

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

Spatio-Temporal Alignment of Non-overlapping Sequences from Independently Panning Cameras

Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras

DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization

DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction

Forecasting Interactive Dynamics of Pedestrians with Fictitious Play

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)