2017 IEEE International Conference on Computer Vision (ICCV)

chapter

End-to-End Learning of Geometry and Context for Deep Stereo Regression

Alex Kendall, Hayk Martirosyan, Saumitro Dasgupta, Peter Henry

2017 IEEE International Conference on Computer Vision (ICCV) > 66 - 75

We propose a novel deep learning architecture for regressing disparity from a rectified pair of stereo images. We leverage knowledge of the problem’s geometry to form a cost volume using deep feature representations. We learn to incorporate contextual information using 3-D convolutions over this volume. Disparity values are regressed from the cost volume using a proposed differentiable soft argmin...

chapter

Deep Occlusion Reasoning for Multi-camera Multi-target Detection

Pierre Baque, Francois Fleuret, Pascal Fua

2017 IEEE International Conference on Computer Vision (ICCV) > 271 - 279

2017 IEEE International Conference on Computer Vision (ICCV)

People detection in single 2D images has improved greatly in recent years. However, comparatively little of this progress has percolated into multi-camera multipeople tracking algorithms, whose performance still degrades severely when scenes become very crowded. In this work, we introduce a new architecture that combines Convolutional Neural Nets and Conditional Random Fields to explicitly model those...

chapter

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta

2017 IEEE International Conference on Computer Vision (ICCV) > 843 - 852

2017 IEEE International Conference on Computer Vision (ICCV)

The success of deep learning in vision can be attributed to: (a) models with high capacity; (b) increased computational power; and (c) availability of large-scale labeled data. Since 2012, there have been significant advances in representation capabilities of the models and computational capabilities of GPUs. But the size of the biggest dataset has surprisingly remained constant. What will happen...

chapter

Ensemble Deep Learning for Skeleton-Based Action Recognition Using Temporal Sliding LSTM Networks

Inwoong Lee, Doyoung Kim, Seoungyoon Kang, Sanghoon Lee

2017 IEEE International Conference on Computer Vision (ICCV) > 1012 - 1020

2017 IEEE International Conference on Computer Vision (ICCV)

This paper addresses the problems of feature representation of skeleton joints and the modeling of temporal dynamics to recognize human actions. Traditional methods generally use relative coordinate systems dependent on some joints, and model only the long-term dependency, while excluding short-term and medium term dependencies. Instead of taking raw skeletons as the input, we transform the skeletons...

chapter

On-demand Learning for Deep Image Restoration

Ruohan Gao, Kristen Grauman

2017 IEEE International Conference on Computer Vision (ICCV) > 1095 - 1104

2017 IEEE International Conference on Computer Vision (ICCV)

While machine learning approaches to image restoration offer great promise, current methods risk training models fixated on performing well only for image corruption of a particular level of difficulty—such as a certain level of noise or blur. First, we examine the weakness of conventional “fixated” models and demonstrate that training general models to handle arbitrary levels of corruption is indeed...

chapter

Robust Object Tracking Based on Temporal and Spatial Deep Networks

Zhu Teng, Junliang Xing, Qiang Wang, Congyan Lang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1153 - 1162

2017 IEEE International Conference on Computer Vision (ICCV)

Recently deep neural networks have been widely employed to deal with the visual tracking problem. In this work, we present a new deep architecture which incorporates the temporal and spatial information to boost the tracking performance. Our deep architecture contains three networks, a Feature Net, a Temporal Net, and a Spatial Net. The Feature Net extracts general feature representations of the target...

chapter

Unsupervised Learning of Stereo Matching

Chao Zhou, Hong Zhang, Xiaoyong Shen, Jiaya Jia

2017 IEEE International Conference on Computer Vision (ICCV) > 1576 - 1584

2017 IEEE International Conference on Computer Vision (ICCV)

Convolutional neural networks showed the ability in stereo matching cost learning. Recent approaches learned parameters from public datasets that have ground truth disparity maps. Due to the difficulty of labeling ground truth depth, usable data for system training is rather limited, making it difficult to apply the system to real applications. In this paper, we present a framework for learning stereo...

chapter

Unsupervised Adaptation for Deep Stereo

Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano

2017 IEEE International Conference on Computer Vision (ICCV) > 1614 - 1622

2017 IEEE International Conference on Computer Vision (ICCV)

Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs. Computer generated imagery is deployed to gather the large data corpus required to train such networks, an additional fine-tuning allowing to adapt the model to work well also on real and possibly diverse environments. Yet, besides a few public datasets...

chapter

Modelling the Scene Dependent Imaging in Cameras with a Deep Neural Network

Seonghyeon Nam, Seon Joo Kim

2017 IEEE International Conference on Computer Vision (ICCV) > 1726 - 1734

2017 IEEE International Conference on Computer Vision (ICCV)

We present a novel deep learning framework that models the scene dependent image processing inside cameras. Often called as the radiometric calibration, the process of recovering RAWimages from processed images (JPEG format in the sRGB color space) is essential for many computer vision tasks that rely on physically accurate radiance values. All previous works rely on the deterministic imaging model...

chapter

PanNet: A Deep Network Architecture for Pan-Sharpening

Junfeng Yang, Xueyang Fu, Yuwen Hu, Yue Huang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1753 - 1761

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a deep network architecture for the pan-sharpening problem called PanNet. We incorporate domain-specific knowledge to design our PanNet architecture by focusing on the two aims of the pan-sharpening problem: spectral and spatial preservation. For spectral preservation, we add up-sampled multispectral images to the network output, which directly propagates the spectral information to the...

chapter

High Order Tensor Formulation for Convolutional Sparse Coding

Adel Bibi, Bernard Ghanem

2017 IEEE International Conference on Computer Vision (ICCV) > 1790 - 1798

2017 IEEE International Conference on Computer Vision (ICCV)

Convolutional sparse coding (CSC) has gained attention for its successful role as a reconstruction and a classification tool in the computer vision and machine learning community. Current CSC methods can only reconstruct singlefeature 2D images independently. However, learning multidimensional dictionaries and sparse codes for the reconstruction of multi-dimensional data is very important, as it examines...

chapter

SCNet: Learning Semantic Correspondence

Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1849 - 1858

2017 IEEE International Conference on Computer Vision (ICCV)

This paper addresses the problem of establishing semantic correspondences between images depicting different instances of the same object or scene category. Previous approaches focus on either combining a spatial regularizer with hand-crafted features, or learning a correspondence model for appearance only. We propose instead a convolutional neural network architecture, called SCNet, for learning...

chapter

Class Rectification Hard Mining for Imbalanced Deep Learning

Qi Dong, Shaogang Gong, Xiatian Zhu

2017 IEEE International Conference on Computer Vision (ICCV) > 1869 - 1878

2017 IEEE International Conference on Computer Vision (ICCV)

Recognising detailed facial or clothing attributes in images of people is a challenging task for computer vision, especially when the training data are both in very large scale and extremely imbalanced among different attribute classes. To address this problem, we formulate a novel scheme for batch incremental hard sample mining of minority attribute classes from imbalanced large scale training data...

chapter

Identity-Aware Textual-Visual Matching with Latent Co-attention

Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1908 - 1917

2017 IEEE International Conference on Computer Vision (ICCV)

Textual-visual matching aims at measuring similarities between sentence descriptions and images. Most existing methods tackle this problem without effectively utilizing identity-level annotations. In this paper, we propose an identity-aware two-stage framework for the textual-visual matching problem. Our stage-1 CNN-LSTM network learns to embed cross-modal features with a novel Cross-Modal Cross-Entropy...

chapter

Deep Cropping via Attention Box Prediction and Aesthetics Assessment

Wenguan Wang, Jianbing Shen

2017 IEEE International Conference on Computer Vision (ICCV) > 2205 - 2213

2017 IEEE International Conference on Computer Vision (ICCV)

We model the photo cropping problem as a cascade of attention box regression and aesthetic quality classification, based on deep learning. A neural network is designed that has two branches for predicting attention bounding box and analyzing aesthetics, respectively. The predicted attention box is treated as an initial crop window where a set of cropping candidates are generated around it, without...

chapter

Neural EPI-Volume Networks for Shape from Light Field

Stefan Heber, Wei Yu, Thomas Pock

2017 IEEE International Conference on Computer Vision (ICCV) > 2271 - 2279

2017 IEEE International Conference on Computer Vision (ICCV)

This paper presents a novel deep regression network to extract geometric information from Light Field (LF) data. Our network builds upon u-shaped network architectures. Those networks involve two symmetric parts, an encoding and a decoding part. In the first part the network encodes relevant information from the given input into a set of high-level feature maps. In the second part the generated feature...

chapter

Multi-stage Multi-recursive-input Fully Convolutional Networks for Neuronal Boundary Detection

Wei Shen, Bin Wang, Yuan Jiang, Yan Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2410 - 2419

2017 IEEE International Conference on Computer Vision (ICCV)

In the field of connectomics, neuroscientists seek to identify cortical connectivity comprehensively. Neuronal boundary detection from the Electron Microscopy (EM) images is often done to assist the automatic reconstruction of neuronal circuit. But the segmentation of EM images is a challenging problem, as it requires the detector to be able to detect both filament-like thin and blob-like thick membrane,...

chapter

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

Rui Huang, Shu Zhang, Tianyu Li, Ran He

2017 IEEE International Conference on Computer Vision (ICCV) > 2458 - 2467

2017 IEEE International Conference on Computer Vision (ICCV)

Photorealistic frontal view synthesis from a single face image has a wide range of applications in the field of face recognition. Although data-driven deep learning methods have been proposed to address this problem by seeking solutions from ample face data, this problem is still challenging because it is intrinsically ill-posed. This paper proposes a Two-Pathway Generative Adversarial Network (TP-GAN)...

chapter

Group Re-identification via Unsupervised Transfer of Sparse Features Encoding

Giuseppe Lisanti, Niki Martinel, Alberto Del Bimbo, Gian Luca Foresti

2017 IEEE International Conference on Computer Vision (ICCV) > 2468 - 2477

2017 IEEE International Conference on Computer Vision (ICCV)

Person re-identification is best known as the problem of associating a single person that is observed from one or more disjoint cameras. The existing literature has mainly addressed such an issue, neglecting the fact that people usually move in groups, like in crowded scenarios. We believe that the additional information carried by neighboring individuals provides a relevant visual context that can...

chapter

Revisiting IM2GPS in the Deep Learning Era

Nam Vo, Nathan Jacobs, James Hays

2017 IEEE International Conference on Computer Vision (ICCV) > 2640 - 2649

2017 IEEE International Conference on Computer Vision (ICCV)

Image geolocalization, inferring the geographic location of an image, is a challenging computer vision problem with many potential applications. The recent state-of-the-art approach to this problem is a deep image classification approach in which the world is spatially divided into cells and a deep network is trained to predict the correct cell for a given image. We propose to combine this approach...

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV)

End-to-End Learning of Geometry and Context for Deep Stereo Regression

Deep Occlusion Reasoning for Multi-camera Multi-target Detection

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Ensemble Deep Learning for Skeleton-Based Action Recognition Using Temporal Sliding LSTM Networks

On-demand Learning for Deep Image Restoration

Robust Object Tracking Based on Temporal and Spatial Deep Networks

Unsupervised Learning of Stereo Matching

Unsupervised Adaptation for Deep Stereo

Modelling the Scene Dependent Imaging in Cameras with a Deep Neural Network

PanNet: A Deep Network Architecture for Pan-Sharpening

High Order Tensor Formulation for Convolutional Sparse Coding

SCNet: Learning Semantic Correspondence

Class Rectification Hard Mining for Imbalanced Deep Learning

Identity-Aware Textual-Visual Matching with Latent Co-attention

Deep Cropping via Attention Box Prediction and Aesthetics Assessment

Neural EPI-Volume Networks for Shape from Light Field

Multi-stage Multi-recursive-input Fully Convolutional Networks for Neuronal Boundary Detection

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

Group Re-identification via Unsupervised Transfer of Sparse Features Encoding

Revisiting IM2GPS in the Deep Learning Era

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Computer Vision (ICCV)