2017 IEEE International Conference on Computer Vision (ICCV)

chapter

Zero-Order Reverse Filtering

Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 222 - 230

In this paper, we study an unconventional but practically meaningful reversibility problem of commonly used image filters. We broadly define filters as operations to smooth images or to produce layers via global or local algorithms. And we raise the intriguingly problem if they are reservable to the status before filtering. To answer it, we present a novel strategy to understand general filter via...

chapter

Video Frame Interpolation via Adaptive Separable Convolution

Simon Niklaus, Long Mai, Feng Liu

2017 IEEE International Conference on Computer Vision (ICCV) > 261 - 270

2017 IEEE International Conference on Computer Vision (ICCV)

Standard video frame interpolation methods first estimate optical flow between input frames and then synthesize an intermediate frame guided by motion. Recent approaches merge these two steps into a single convolution process by convolving input frames with spatially adaptive kernels that account for motion and re-sampling simultaneously. These methods require large kernels to handle large motion,...

chapter

Deformable Convolutional Networks

Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, more

2017 IEEE International Conference on Computer Vision (ICCV) > 764 - 773

2017 IEEE International Conference on Computer Vision (ICCV)

Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in their building modules. In this work, we introduce two new modules to enhance the transformation modeling capability of CNNs, namely, deformable convolution and deformable RoI pooling. Both are based on the idea of augmenting the spatial sampling locations in the...

chapter

Delving into Salient Object Subitizing and Detection

Shengfeng He, Jianbo Jiao, Xiaodan Zhang, Guoqiang Han, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1059 - 1067

2017 IEEE International Conference on Computer Vision (ICCV)

Subitizing (i.e., instant judgement on the number) and detection of salient objects are human inborn abilities. These two tasks influence each other in the human visual system. In this paper, we delve into the complementarity of these two tasks. We propose a multi-task deep neural network with weight prediction for salient object detection, where the parameters of an adaptive weight layer are dynamically...

chapter

Learning Discriminative Data Fitting Functions for Blind Image Deblurring

Jinshan Pan, Jiangxin Dong, Yu-Wing Tai, Zhixun Su, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1077 - 1085

2017 IEEE International Conference on Computer Vision (ICCV)

Solving blind image deblurring usually requires defining a data fitting function and image priors. While existing algorithms mainly focus on developing image priors for blur kernel estimation and non-blind deconvolution, only a few methods consider the effect of data fitting functions. In contrast to the state-of-the-art methods that use a single or a fixed data fitting term, we propose a data-driven...

chapter

Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation

Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1221 - 1230

2017 IEEE International Conference on Computer Vision (ICCV)

For human pose estimation in monocular images, joint occlusions and overlapping upon human bodies often result in deviated pose predictions. Under these circumstances, biologically implausible pose predictions may be produced. In contrast, human vision is able to predict poses by exploiting geometric constraints of joint inter-connectivity. To address the problem by incorporating priors about the...

chapter

An Empirical Study of Language CNN for Image Captioning

Jiuxiang Gu, Gang Wang, Jianfei Cai, Tsuhan Chen

2017 IEEE International Conference on Computer Vision (ICCV) > 1231 - 1240

2017 IEEE International Conference on Computer Vision (ICCV)

Language models based on recurrent neural networks have dominated recent image caption generation tasks. In this paper, we introduce a language CNN model which is suitable for statistical language modeling tasks and shows competitive performance in image captioning. In contrast to previous models which predict next word based on one previous word and hidden state, our language CNN is fed with all...

chapter

Recurrent Multimodal Interaction for Referring Image Segmentation

Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1280 - 1289

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper we are interested in the problem of image segmentation given natural language descriptions, i.e. referring expressions. Existing works tackle this problem by first modeling images and sentences independently and then segment images by combining these two types of representations. We argue that learning word-to-image interaction is more native in the sense of jointly modeling two modalities...

chapter

Learning Feature Pyramids for Human Pose Estimation

Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1290 - 1299

2017 IEEE International Conference on Computer Vision (ICCV)

Articulated human pose estimation is a fundamental yet challenging task in computer vision. The difficulty is particularly pronounced in scale variations of human body parts when camera view changes or severe foreshortening happens. Although pyramid methods are widely used to handle scale changes at inference time, learning feature pyramids in deep convolutional neural networks (DCNNs) is still not...

chapter

Cascaded Feature Network for Semantic Segmentation of RGB-D Images

Di Lin, Guangyong Chen, Daniel Cohen-Or, Pheng-Ann Heng, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1320 - 1328

2017 IEEE International Conference on Computer Vision (ICCV)

Fully convolutional network (FCN) has been successfully applied in semantic segmentation of scenes represented with RGB images. Images augmented with depth channel provide more understanding of the geometric information of the scene in the image. The question is how to best exploit this additional information to improve the segmentation performance.,,In this paper, we present a neural network with...

chapter

Genetic CNN

Lingxi Xie, Alan Yuille

2017 IEEE International Conference on Computer Vision (ICCV) > 1388 - 1397

2017 IEEE International Conference on Computer Vision (ICCV)

The deep convolutional neural network (CNN) is the state-of-the-art solution for large-scale visual recognition. Following some basic principles such as increasing network depth and constructing highway connections, researchers have manually designed a lot of fixed network architectures and verified their effectiveness.,,In this paper, we discuss the possibility of learning deep network structures...

chapter

Self-Paced Kernel Estimation for Robust Blind Image Deblurring

Dong Gong, Mingkui Tan, Yanning Zhang, Anton van den Hengel, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1670 - 1679

2017 IEEE International Conference on Computer Vision (ICCV)

The challenge in blind image deblurring is to remove the effects of blur with limited prior information about the nature of the blur process. Existing methods often assume that the blur image is produced by linear convolution with additive Gaussian noise. However, including even a small number of outliers to this model in the kernel estimation process can significantly reduce the resulting image quality...

chapter

Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation

Shuhang Gu, Deyu Meng, Wangmeng Zuo, Lei Zhang

2017 IEEE International Conference on Computer Vision (ICCV) > 1717 - 1725

2017 IEEE International Conference on Computer Vision (ICCV)

Analysis sparse representation (ASR) and synthesis sparse representation (SSR) are two representative approaches for sparsity-based image modeling. An image is described mainly by the non-zero coefficients in SSR, while is mainly characterized by the indices of zeros in ASR. To exploit the complementary representation mechanisms of ASR and SSR, we integrate the two models and propose a joint convolutional...

chapter

DSOD: Learning Deeply Supervised Object Detectors from Scratch

Zhiqiang Shen, Zhuang Liu, Jianguo Li, Yu-Gang Jiang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1937 - 1945

2017 IEEE International Conference on Computer Vision (ICCV)

We present Deeply Supervised Object Detector (DSOD), a framework that can learn object detectors from scratch. State-of-the-art object objectors rely heavily on the off the-shelf networks pre-trained on large-scale classification datasets like Image Net, which incurs learning bias due to the difference on both the loss functions and the category distributions between classification and detection tasks...

chapter

Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs

Maxim Tatarchenko, Alexey Dosovitskiy, Thomas Brox

2017 IEEE International Conference on Computer Vision (ICCV) > 2107 - 2115

2017 IEEE International Conference on Computer Vision (ICCV)

We present a deep convolutional decoder architecture that can generate volumetric 3D outputs in a compute- and memory-efficient manner by using an octree representation. The network learns to predict both the structure of the octree, and the occupancy values of individual cells. This makes it a particularly valuable technique for generating 3D shapes. In contrast to standard decoders acting on regular...

chapter

Performance Guaranteed Network Acceleration via High-Order Residual Quantization

Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2603 - 2611

2017 IEEE International Conference on Computer Vision (ICCV)

Input binarization has shown to be an effective way for network acceleration. However, previous binarization scheme could be regarded as simple pixel-wise thresholding operations (i.e., order-one approximation) and suffers a big accuracy loss. In this paper, we propose a highorder binarization scheme, which achieves more accurate approximation while still possesses the advantage of binary operation...

chapter

Directionally Convolutional Networks for 3D Shape Segmentation

Haotian Xu, Ming Dong, Zichun Zhong

2017 IEEE International Conference on Computer Vision (ICCV) > 2717 - 2726

2017 IEEE International Conference on Computer Vision (ICCV)

Previous approaches on 3D shape segmentation mostly rely on heuristic processing and hand-tuned geometric descriptors. In this paper, we propose a novel 3D shape representation learning approach, Directionally Convolutional Network (DCN), to solve the shape segmentation problem. DCN extends convolution operations from images to the surface mesh of 3D shapes. With DCN, we learn effective shape representations...

chapter

Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention

Jinkyu Kim, John Canny

2017 IEEE International Conference on Computer Vision (ICCV) > 2961 - 2969

2017 IEEE International Conference on Computer Vision (ICCV)

Deep neural perception and control networks are likely to be a key component of self-driving vehicles. These models need to be explainable - they should provide easy-tointerpret rationales for their behavior - so that passengers, insurance companies, law enforcement, developers etc., can understand what triggered a particular behavior. Here we explore the use of visual explanations. These explanations...

chapter

A Spatiotemporal Oriented Energy Network for Dynamic Texture Recognition

Isma Hadji, Richard P. Wildes

2017 IEEE International Conference on Computer Vision (ICCV) > 3085 - 3093

2017 IEEE International Conference on Computer Vision (ICCV)

This paper presents a novel hierarchical spatiotemporal orientation representation for spacetime image analysis. It is designed to combine the benefits of the multilayer architecture of ConvNets and a more controlled approach to spacetime analysis. A distinguishing aspect of the approach is that unlike most contemporary convolutional networks no learning is involved; rather, all design decisions are...

chapter

Semantic Line Detection and Its Applications

Jun-Tae Lee, Han-Ul Kim, Chul Lee, Chang-Su Kim

2017 IEEE International Conference on Computer Vision (ICCV) > 3249 - 3257

2017 IEEE International Conference on Computer Vision (ICCV)

Semantic lines characterize the layout of an image. Despite their importance in image analysis and scene understanding, there is no reliable research for semantic line detection. In this paper, we propose a semantic line detector using a convolutional neural network with multi-task learning, by regarding the line detection as a combination of classification and regression tasks. We use convolution...

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV)

Zero-Order Reverse Filtering

Video Frame Interpolation via Adaptive Separable Convolution

Deformable Convolutional Networks

Delving into Salient Object Subitizing and Detection

Learning Discriminative Data Fitting Functions for Blind Image Deblurring

Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation

An Empirical Study of Language CNN for Image Captioning

Recurrent Multimodal Interaction for Referring Image Segmentation

Learning Feature Pyramids for Human Pose Estimation

Cascaded Feature Network for Semantic Segmentation of RGB-D Images

Genetic CNN

Self-Paced Kernel Estimation for Robust Blind Image Deblurring

Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation

DSOD: Learning Deeply Supervised Object Detectors from Scratch

Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs

Performance Guaranteed Network Acceleration via High-Order Residual Quantization

Directionally Convolutional Networks for 3D Shape Segmentation

Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention

A Spatiotemporal Oriented Energy Network for Dynamic Texture Recognition

Semantic Line Detection and Its Applications

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Computer Vision (ICCV)