2017 IEEE International Conference on Computer Vision (ICCV)

chapter

Video Frame Interpolation via Adaptive Separable Convolution

Simon Niklaus, Long Mai, Feng Liu

2017 IEEE International Conference on Computer Vision (ICCV) > 261 - 270

Standard video frame interpolation methods first estimate optical flow between input frames and then synthesize an intermediate frame guided by motion. Recent approaches merge these two steps into a single convolution process by convolving input frames with spatially adaptive kernels that account for motion and re-sampling simultaneously. These methods require large kernels to handle large motion,...

chapter

SafetyNet: Detecting and Rejecting Adversarial Examples Robustly

Jiajun Lu, Theerasit Issaranon, David Forsyth

2017 IEEE International Conference on Computer Vision (ICCV) > 446 - 454

2017 IEEE International Conference on Computer Vision (ICCV)

We describe a method to produce a network where current methods such as DeepFool have great difficulty producing adversarial samples. Our construction suggests some insights into how deep networks work. We provide a reasonable analyses that our construction is difficult to defeat, and show experimentally that our method is hard to defeat with both Type I and Type II attacks using several standard...

chapter

Deep Determinantal Point Process for Large-Scale Multi-label Classification

Pengtao Xie, Ruslan Salakhutdinov, Luntian Mou, Eric P. Xing

2017 IEEE International Conference on Computer Vision (ICCV) > 473 - 482

2017 IEEE International Conference on Computer Vision (ICCV)

We study large-scale multi-label classification (MLC) on two recently released datasets: Youtube-8M and Open Images that contain millions of data instances and thousands of classes. The unprecedented problem scale poses great challenges for MLC. First, finding out the correct label subset out of exponentially many choices incurs substantial ambiguity and uncertainty. Second, the large data-size and...

chapter

Coordinating Filters for Faster Deep Neural Networks

Wei Wen, Cong Xu, Chunpeng Wu, Yandan Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 658 - 666

2017 IEEE International Conference on Computer Vision (ICCV)

Very large-scale Deep Neural Networks (DNNs) have achieved remarkable successes in a large variety of computer vision tasks. However, the high computation intensity of DNNs makes it challenging to deploy these models on resource-limited systems. Some studies used low-rank approaches that approximate the filters by low-rank basis to accelerate the testing. Those works directly decomposed the pre-trained...

chapter

Beyond Planar Symmetry: Modeling Human Perception of Reflection and Rotation Symmetries in the Wild

Christopher Funk, Yanxi Liu

2017 IEEE International Conference on Computer Vision (ICCV) > 793 - 803

2017 IEEE International Conference on Computer Vision (ICCV)

Humans take advantage of real world symmetries for various tasks, yet capturing their superb symmetry perception mechanism with a computational model remains elusive. Motivated by a new study demonstrating the extremely high inter-person accuracy of human perceived symmetries in the wild, we have constructed the first deeplearning neural network for reflection and rotation symmetry detection (Sym-NET),...

chapter

Learning to Reason: End-to-End Module Networks for Visual Question Answering

Ronghang Hu, Jacob Andreas, Marcus Rohrbach, Trevor Darrell, more

2017 IEEE International Conference on Computer Vision (ICCV) > 804 - 813

2017 IEEE International Conference on Computer Vision (ICCV)

Natural language questions are inherently compositional, and many are most easily answered by reasoning about their decomposition into modular sub-problems. For example, to answer “is there an equal number of balls and boxes?” we can look for balls, look for boxes, count them, and compare the results. The recently proposed Neural Module Network (NMN) architecture [3, 2] implements this approach to...

chapter

Hard-Aware Deeply Cascaded Embedding

Yuhui Yuan, Kuiyuan Yang, Chao Zhang

2017 IEEE International Conference on Computer Vision (ICCV) > 814 - 823

2017 IEEE International Conference on Computer Vision (ICCV)

Riding on the waves of deep neural networks, deep metric learning has achieved promising results in various tasks by using triplet network or Siamese network. Though the basic goal of making images from the same category closer than the ones from different categories is intuitive, it is hard to optimize the objective directly due to the quadratic or cubic sample size. Hard example mining is widely...

chapter

Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation

Ruichi Yu, Ang Li, Vlad I. Morariu, Larry S. Davis

2017 IEEE International Conference on Computer Vision (ICCV) > 1068 - 1076

2017 IEEE International Conference on Computer Vision (ICCV)

Understanding the visual relationship between two objects involves identifying the subject, the object, and a predicate relating them. We leverage the strong correlations between the predicate and the hsubj; obji pair (both semantically and spatially) to predict predicates conditioned on the subjects and the objects. Modeling the three entities jointly more accurately reflects their relationships...

chapter

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1201 - 1210

2017 IEEE International Conference on Computer Vision (ICCV)

3D context has been shown to be extremely important for scene understanding, yet very little research has been done on integrating context information with deep neural network architectures. This paper presents an approach to embed 3D context into the topology of a neural network trained to perform holistic scene understanding. Given a depth image depicting a 3D scene, our network aligns the observed...

chapter

Structured Attentions for Visual Question Answering

Chen Zhu, Yanpeng Zhao, Shuaiyi Huang, Kewei Tu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1300 - 1309

2017 IEEE International Conference on Computer Vision (ICCV)

Visual attention, which assigns weights to image regions according to their relevance to a question, is considered as an indispensable part by most Visual Question Answering models. Although the questions may involve complex rela- tions among multiple regions, few attention models can ef- fectively encode such cross-region relations. In this paper, we demonstrate the importance of encoding such relations...

chapter

SORT: Second-Order Response Transform for Visual Recognition

Yan Wang, Lingxi Xie, Chenxi Liu, Siyuan Qiao, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1368 - 1377

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we reveal the importance and benefits of introducing second-order operations into deep neural networks. We propose a novel approach named Second-Order Response Transform (SORT), which appends element-wise product transform to the linear sum of a two-branch network module. A direct advantage of SORT is to facilitate cross-branch response propagation, so that each branch can update its...

chapter

Primary Video Object Segmentation via Complementary CNNs and Neighborhood Reversible Flow

Jia Li, Anlin Zheng, Xiaowu Chen, Bin Zhou

2017 IEEE International Conference on Computer Vision (ICCV) > 1426 - 1434

2017 IEEE International Conference on Computer Vision (ICCV)

This paper proposes a novel approach for segmenting primary video objects by using Complementary Convolutional Neural Networks (CCNN) and neighborhood reversible flow. The proposed approach first pre-trains CCNN on massive images with manually annotated salient objects in an end-to-end manner, and the trained CCNN has two separate branches that simultaneously handle two complementary tasks, i.e.,...

chapter

Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization

Xun Huang, Serge Belongie

2017 IEEE International Conference on Computer Vision (ICCV) > 1510 - 1519

2017 IEEE International Conference on Computer Vision (ICCV)

Gatys et al. recently introduced a neural algorithm that renders a content image in the style of another image, achieving so-called style transfer. However, their framework requires a slow iterative optimization process, which limits its practical application. Fast approximations with feed-forward neural networks have been proposed to speed up neural style transfer. Unfortunately, the speed improvement...

chapter

Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation

Matan Sela, Elad Richardson, Ron Kimmel

2017 IEEE International Conference on Computer Vision (ICCV) > 1585 - 1594

2017 IEEE International Conference on Computer Vision (ICCV)

It has been recently shown that neural networks can recover the geometric structure of a face from a single given image. A common denominator of most existing face geometry reconstruction methods is the restriction of the solution space to some low-dimensional subspace. While such a model significantly simplifies the reconstruction problem, it is inherently limited in its expressiveness. As an alternative,...

chapter

PanNet: A Deep Network Architecture for Pan-Sharpening

Junfeng Yang, Xueyang Fu, Yuwen Hu, Yue Huang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1753 - 1761

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a deep network architecture for the pan-sharpening problem called PanNet. We incorporate domain-specific knowledge to design our PanNet architecture by focusing on the two aims of the pan-sharpening problem: spectral and spatial preservation. For spectral preservation, we add up-sampled multispectral images to the network output, which directly propagates the spectral information to the...

chapter

Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems

Tim Meinhardt, Michael Moeller, Caner Hazirbas, Daniel Cremers

2017 IEEE International Conference on Computer Vision (ICCV) > 1799 - 1808

2017 IEEE International Conference on Computer Vision (ICCV)

While variational methods have been among the most powerful tools for solving linear inverse problems in imaging, deep (convolutional) neural networks have recently taken the lead in many challenging benchmarks. A remaining drawback of deep learning approaches is their requirement for an expensive retraining whenever the specific problem, the noise level, noise type, or desired measure of fidelity...

chapter

Multi-task Self-Supervised Visual Learning

Carl Doersch, Andrew Zisserman

2017 IEEE International Conference on Computer Vision (ICCV) > 2070 - 2079

2017 IEEE International Conference on Computer Vision (ICCV)

We investigate methods for combining multiple selfsupervised tasks—i.e., supervised tasks where data can be collected without manual labeling—in order to train a single visual representation. First, we provide an apples-toapples comparison of four different self-supervised tasks using the very deep ResNet-101 architecture. We then combine tasks to jointly train a network. We also explore lasso regularization...

chapter

Decoder Network over Lightweight Reconstructed Feature for Fast Semantic Style Transfer

Ming Lu, Hao Zhao, Anbang Yao, Feng Xu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2488 - 2496

2017 IEEE International Conference on Computer Vision (ICCV)

Recently, the community of style transfer is trying to incorporate semantic information into traditional system. This practice achieves better perceptual results by transferring the style between semantically-corresponding regions. Yet, few efforts are invested to address the computation bottleneck of back-propagation. In this paper, we propose a new framework for fast semantic style transfer. Our...

chapter

Robust Video Super-Resolution with Learned Temporal Dynamics

Ding Liu, Zhaowen Wang, Yuchen Fan, Xianming Liu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2526 - 2534

2017 IEEE International Conference on Computer Vision (ICCV)

Video super-resolution (SR) aims to generate a highresolution (HR) frame from multiple low-resolution (LR) frames in a local temporal window. The inter-frame temporal relation is as crucial as the intra-frame spatial relation for tackling this problem. However, how to utilize temporal information efficiently and effectively remains challenging since complex motion is difficult to model and can introduce...

chapter

CREST: Convolutional Residual Learning for Visual Tracking

Yibing Song, Chao Ma, Lijun Gong, Jiawei Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2574 - 2583

2017 IEEE International Conference on Computer Vision (ICCV)

Discriminative correlation filters (DCFs) have been shown to perform superiorly in visual tracking. They only need a small set of training samples from the initial frame to generate an appearance model. However, existing DCFs learn the filters separately from feature extraction, and update these filters using a moving average operation with an empirical weight. These DCF trackers hardly benefit from...

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV)

Video Frame Interpolation via Adaptive Separable Convolution

SafetyNet: Detecting and Rejecting Adversarial Examples Robustly

Deep Determinantal Point Process for Large-Scale Multi-label Classification

Coordinating Filters for Faster Deep Neural Networks

Beyond Planar Symmetry: Modeling Human Perception of Reflection and Rotation Symmetries in the Wild

Learning to Reason: End-to-End Module Networks for Visual Question Answering

Hard-Aware Deeply Cascaded Embedding

Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Structured Attentions for Visual Question Answering

SORT: Second-Order Response Transform for Visual Recognition

Primary Video Object Segmentation via Complementary CNNs and Neighborhood Reversible Flow

Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization

Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation

PanNet: A Deep Network Architecture for Pan-Sharpening

Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems

Multi-task Self-Supervised Visual Learning

Decoder Network over Lightweight Reconstructed Feature for Fast Semantic Style Transfer

Robust Video Super-Resolution with Learned Temporal Dynamics

CREST: Convolutional Residual Learning for Visual Tracking

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Computer Vision (ICCV)