2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3309 - 3318

Semantic image segmentation is an essential component of modern autonomous driving systems, as an accurate understanding of the surrounding scene is crucial to navigation and action planning. Current state-of-the-art approaches in semantic image segmentation rely on pre-trained networks that were initially developed for classifying images as a whole. While these networks exhibit outstanding recognition...

chapter

Generating the Future with Adversarial Transformers

Carl Vondrick, Antonio Torralba

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2992 - 3000

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We learn models to generate the immediate future in video. This problem has two main challenges. Firstly, since the future is uncertain, models should be multi-modal, which can be difficult to learn. Secondly, since the future is similar to the past, models store low-level details, which complicates learning of high-level semantics. We propose a framework to tackle both of these challenges. We present...

chapter

StyleBank: An Explicit Representation for Neural Image Style Transfer

Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2770 - 2779

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose StyleBank, which is composed of multiple convolution filter banks and each filter bank explicitly represents one style, for neural image style transfer. To transfer an image to a specific style, the corresponding filter bank is operated on top of the intermediate feature embedding produced by a single auto-encoder. The StyleBank and the auto-encoder are jointly learnt, where the learning...

chapter

All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation

Di Xie, Jiang Xiong, Shiliang Pu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5075 - 5084

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep neural network is difficult to train and this predicament becomes worse as the depth increases. The essence of this problem exists in the magnitude of backpropagated errors that will result in gradient vanishing or exploding phenomenon. We show that a variant of regularizer which utilizes orthonormality among different filter banks can alleviate this problem. Moreover, we design a backward error...

chapter

Identifying First-Person Camera Wearers in Third-Person Videos

Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4734 - 4742

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We consider scenarios in which we wish to perform joint scene understanding, object tracking, activity recognition, and other tasks in scenarios in which multiple people are wearing body-worn cameras while a third-person static camera also captures the scene. To do this, we need to establish person-level correspondences across first-and third-person videos, which is challenging because the camera...

chapter

Fully-Adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification

Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1131 - 1140

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-task learning aims to improve generalization performance of multiple prediction tasks by appropriately sharing relevant information across them. In the context of deep neural networks, this idea is often realized by hand-designed network architectures with layers that are shared across tasks and branches that encode task-specific features. However, the space of possible multi-task deep architectures...

chapter

Removing Rain from Single Images via a Deep Detail Network

Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1715 - 1723

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a new deep network architecture for removing rain streaks from individual images based on the deep convolutional neural network (CNN). Inspired by the deep residual network (ResNet) that simplifies the learning process by changing the mapping form, we propose a deep detail network to directly reduce the mapping range from input to output, which makes the learning process easier. To further...

chapter

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 105 - 114

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Despite the breakthroughs in accuracy and speed of single image super-resolution using faster and deeper convolutional neural networks, one central problem remains largely unsolved: how do we recover the finer texture details when we super-resolve at large upscaling factors? The behavior of optimization-based super-resolution methods is principally driven by the choice of the objective function. Recent...

chapter

PolyNet: A Pursuit of Structural Diversity in Very Deep Networks

Xingcheng Zhang, Zhizhong Li, Chen Change Loy, Dahua Lin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3900 - 3908

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A number of studies have shown that increasing the depth or width of convolutional networks is a rewarding approach to improve the performance of image recognition. In our study, however, we observed difficulties along both directions. On one hand, the pursuit for very deep networks is met with a diminishing return and increased training difficulty, on the other hand, widening a network would result...

chapter

Improved Stereo Matching with Constant Highway Networks and Reflective Confidence Learning

Amit Shaked, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6901 - 6910

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an improved three-step pipeline for the stereo matching problem and introduce multiple novelties at each stage. We propose a new highway network architecture for computing the matching cost at each possible disparity, based on multilevel weighted residual shortcuts, trained with a hybrid loss that supports multilevel comparison of image patches. A novel post-processing step is then introduced,...

chapter

Deep Pyramidal Residual Networks

Dongyoon Han, Jiwhan Kim, Junmo Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6307 - 6315

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep convolutional neural networks (DCNNs) have shown remarkable performance in image classification tasks in recent years. Generally, deep neural network architectures are stacks consisting of a large number of convolutional layers, and they perform downsampling along the spatial dimension via pooling to reduce memory usage. Concurrently, the feature map dimension (i.e., the number of channels) is...

chapter

Densely Connected Convolutional Networks

Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2261 - 2269

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we embrace this observation and introduce the Dense Convolutional Network (DenseNet), which connects each layer to every other layer in a feed-forward fashion. Whereas...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

Generating the Future with Adversarial Transformers

StyleBank: An Explicit Representation for Neural Image Style Transfer

All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation

Identifying First-Person Camera Wearers in Third-Person Videos

Fully-Adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification

Removing Rain from Single Images via a Deep Detail Network

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

PolyNet: A Pursuit of Structural Diversity in Very Deep Networks

Improved Stereo Matching with Constant Highway Networks and Reflective Confidence Learning

Deep Pyramidal Residual Networks

Densely Connected Convolutional Networks

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)