2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

SRN: Side-Output Residual Network for Object Symmetry Detection in the Wild

Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 302 - 310

In this paper, we establish a baseline for object symmetry detection in complex backgrounds by presenting a new benchmark and an end-to-end deep learning approach, opening up a promising direction for symmetry detection in the wild. The new benchmark, named Sym-PASCAL, spans challenges including object diversity, multi-objects, part-invisibility, and various complex backgrounds that are far beyond...

chapter

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Junwu Weng, Chaoqun Weng, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 445 - 454

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Motivated by previous success of using non-parametric methods to recognize objects, e.g., NBNN [2], we extend it to recognize actions using skeletons. Each 3D action is presented by a sequence of 3D poses. Similar to NBNN, our proposed Spatio-Temporal-NBNN applies stage-to-class distance to classify actions. However, ST-NBNN takes the spatio-temporal structure of 3D actions into consideration and...

chapter

Deep Learning on Lie Groups for Skeleton-Based Action Recognition

Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1243 - 1252

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In recent years, skeleton-based action recognition has become a popular 3D classification problem. State-of-the-art methods typically first represent each motion sequence as a high-dimensional trajectory on a Lie group with an additional dynamic time warping, and then shallowly learn favorable Lie group features. In this paper we incorporate the Lie group structure into a deep network architecture...

chapter

Awesome Typography: Statistics-Based Text Effects Transfer

Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2886 - 2895

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we explore the problem of generating fantastic special-effects for the typography. It is quite challenging due to the model diversities to illustrate varied text effects for different characters. To address this issue, our key idea is to exploit the analytics on the high regularity of the spatial distribution for text effects to guide the synthesis process. Specifically, we characterize...

chapter

Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks

Hongsong Wang, Liang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3633 - 3642

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, skeleton based action recognition gains more popularity due to cost-effective depth sensors coupled with real-time skeleton estimation algorithms. Traditional approaches based on handcrafted features are limited to represent the complexity of motion patterns. Recent methods that use Recurrent Neural Networks (RNN) to handle raw skeletons only focus on the contextual dependency in the temporal...

chapter

Forecasting Human Dynamics from Static Images

Yu-Wei Chao, Jimei Yang, Brian Price, Scott Cohen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3643 - 3651

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents the first study on forecasting human dynamics from static images. The problem is to input a single RGB image and generate a sequence of upcoming human body poses in 3D. To address the problem, we propose the 3D Pose Forecasting Network (3D-PFNet). Our 3D-PFNet integrates recent advances on single-image human pose estimation and sequence prediction, and converts the 2D predictions...

chapter

Global Context-Aware Attention LSTM Networks for 3D Action Recognition

Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3671 - 3680

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Long Short-Term Memory (LSTM) networks have shown superior performance in 3D human action recognition due to their power in modeling the dynamics and dependencies in sequential data. Since not all joints are informative for action analysis and the irrelevant joints often bring a lot of noise, we need to pay more attention to the informative ones. However, original LSTM does not have strong attention...

chapter

Object Co-skeletonization with Co-segmentation

Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3881 - 3889

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent advances in the joint processing of images have certainly shown its advantages over the individual processing. Different from the existing works geared towards co-segmentation or co-localization, in this paper, we explore a new joint processing topic: co-skeletonization, which is defined as joint skeleton extraction of common objects in a set of semantically similar images. Object skeletonization...

chapter

A New Representation of Skeleton Sequences for 3D Action Recognition

Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Sohel, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4570 - 4579

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a new method for 3D action recognition with skeleton sequences (i.e., 3D trajectories of human skeleton joints). The proposed method first transforms each skeleton sequence into three clips each consisting of several frames for spatial temporal feature learning using deep neural networks. Each clip is generated from one channel of the cylindrical coordinates of the skeleton sequence...

chapter

Learning and Refining of Privileged Information-Based RNNs for Action Recognition from Depth Sequences

Zhiyuan Shi, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4684 - 4693

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing RNN-based approaches for action recognition from depth sequences require either skeleton joints or hand-crafted depth features as inputs. An end-to-end manner, mapping from raw depth maps to action classes, is non-trivial to design due to the fact that: 1) single channel map lacks texture thus weakens the discriminative power, 2) relatively small set of depth training data. To address these...

chapter

Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition

Yufei Wang, Zhe Lin, Xiaohui Shen, Scott Cohen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7378 - 7387

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, there has been a lot of interest in automatically generating descriptions for an image. Most existing language-model based approaches for this task learn to generate an image description word by word in its original word order. However, for humans, it is more natural to locate the objects and their relationships first, and then elaborate on each object, describing notable attributes. We...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

SRN: Side-Output Residual Network for Object Symmetry Detection in the Wild

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Deep Learning on Lie Groups for Skeleton-Based Action Recognition

Awesome Typography: Statistics-Based Text Effects Transfer

Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks

Forecasting Human Dynamics from Static Images

Global Context-Aware Attention LSTM Networks for 3D Action Recognition

Object Co-skeletonization with Co-segmentation

A New Representation of Skeleton Sequences for 3D Action Recognition

Learning and Refining of Privileged Information-Based RNNs for Action Recognition from Depth Sequences

Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)