2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence

Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 616 - 625

We present a descriptor, called fully convolutional self-similarity (FCSS), for dense semantic correspondence. To robustly match points among different instances within the same object class, we formulate FCSS using local self-similarity (LSS) within a fully convolutional network. In contrast to existing CNN-based descriptors, FCSS is inherently insensitive to intra-class appearance variations because...

chapter

Xception: Deep Learning with Depthwise Separable Convolutions

Francois Chollet

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1800 - 1807

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an interpretation of Inception modules in convolutional neural networks as being an intermediate step in-between regular convolution and the depthwise separable convolution operation (a depthwise convolution followed by a pointwise convolution). In this light, a depthwise separable convolution can be understood as an Inception module with a maximally large number of towers. This observation...

chapter

Active Convolution: Learning the Shape of Convolution for Image Classification

Yunho Jeon, Junmo Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1846 - 1854

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In recent years, deep learning has achieved great success in many computer vision applications. Convolutional neural networks (CNNs) have lately emerged as a major approach to image classification. Most research on CNNs thus far has focused on developing architectures such as the Inception and residual networks. The convolution layer is the core of the CNN, but few studies have addressed the convolution...

chapter

Densely Connected Convolutional Networks

Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2261 - 2269

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we embrace this observation and introduce the Dense Convolutional Network (DenseNet), which connects each layer to every other layer in a feed-forward fashion. Whereas...

chapter

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2511 - 2519

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely used as the underlying representations to build complex 3D shapes, however, voxel-based representations suffer from high memory requirements, and parts-based models require a large collection of cached or richly parametrized parts. We take an alternative approach: learning a generative model over multi-view...

chapter

Image Super-Resolution via Deep Recursive Residual Network

Ying Tai, Jian Yang, Xiaoming Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2790 - 2798

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, Convolutional Neural Network (CNN) based models have achieved great success in Single Image Super-Resolution (SISR). Owing to the strength of deep networks, these CNN models learn an effective nonlinear mapping from the low-resolution input image to the high-resolution target image, at the cost of requiring enormous parameters. This paper proposes a very deep CNN model (up to 52 convolutional...

chapter

Deep TEN: Texture Encoding Network

Hang Zhang, Jia Xue, Kristin Dana

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2896 - 2905

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a Deep Texture Encoding Network (Deep-TEN) with a novel Encoding Layer integrated on top of convolutional layers, which ports the entire dictionary learning and encoding pipeline into a single model. Current methods build from distinct components, using standard encoders with separate off-the-shelf features such as SIFT descriptors or pre-trained CNN features for material recognition. Our...

chapter

Deep Cross-Modal Hashing

Qing-Yuan Jiang, Wu-Jun Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3270 - 3278

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Due to its low storage cost and fast query speed, cross-modal hashing (CMH) has been widely used for similarity search in multimedia retrieval applications. However, most existing CMH methods are based on hand-crafted features which might not be optimally compatible with the hash-code learning procedure. As a result, existing CMH methods with hand-crafted features may not achieve satisfactory performance...

chapter

Unambiguous Text Localization and Retrieval for Cluttered Scenes

Xuejian Rong, Chucai Yi, Yingli Tian

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3279 - 3287

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Text instance as one category of self-described objects provides valuable information for understanding and describing cluttered scenes. In this paper, we explore the task of unambiguous text localization and retrieval, to accurately localize a specific targeted text instance in a cluttered image given a natural language description that refers to it. To address this issue, first a novel recurrent...

chapter

Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis

Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4105 - 4113

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The recent work of Gatys et al., who characterized the style of an image by the statistics of convolutional neural network filters, ignited a renewed interest in the texture generation and image stylization problems. While their image generation technique uses a slow optimization process, recently several authors have proposed to learn generator neural networks that can produce similar outputs in...

chapter

Deep Feature Flow for Video Recognition

Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4141 - 4150

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep convolutional neutral networks have achieved great success on image recognition tasks. Yet, it is non-trivial to transfer the state-of-the-art image recognition networks to videos as per-frame evaluation is too slow and unaffordable. We present deep feature flow, a fast and accurate framework for video recognition. It runs the expensive convolutional sub-network only on sparse key frames and...

chapter

FFTLasso: Large-Scale LASSO in the Fourier Domain

Adel Bibi, Hani Itani, Bernard Ghanem

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4371 - 4380

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we revisit the LASSO sparse representation problem, which has been studied and used in a variety of different areas, ranging from signal processing and information theory to computer vision and machine learning. In the vision community, it found its way into many important applications, including face recognition, tracking, super resolution, image denoising, to name a few. Despite advances...

chapter

Fully Convolutional Instance-Aware Semantic Segmentation

Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4438 - 4446

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present the first fully convolutional end-to-end solution for instance-aware semantic segmentation task. It inherits all the merits of FCNs for semantic segmentation [29] and instance mask proposal [5]. It performs instance mask prediction and classification jointly. The underlying convolutional representation is fully shared between the two sub-tasks, as well as between all regions of interest...

chapter

Oriented Response Networks

Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4961 - 4970

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep Convolution Neural Networks (DCNNs) are capable of learning unprecedentedly effective image representations. However, their ability in handling significant local and global image rotations remains limited. In this paper, we propose Active Rotating Filters (ARFs) that actively rotate during convolution and produce feature maps with location and orientation explicitly encoded. An ARF acts as a...

chapter

Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders

Xin Yu, Fatih Porikli

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5367 - 5375

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most of the conventional face hallucination methods assume the input image is sufficiently large and aligned, and all require the input image to be noise-free. Their performance degrades drastically if the input image is tiny, unaligned, and contaminated by noise. In this paper, we introduce a novel transformative discriminative autoencoder to 8X super-resolve unaligned noisy and tiny (16X16) low-resolution...

chapter

Full Resolution Image Compression with Recurrent Neural Networks

George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5435 - 5443

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a set of full-resolution lossy image compression methods based on neural networks. Each of the architectures we describe can provide variable compression rates during deployment without requiring retraining of the network: each network need only be trained once. All of our architectures consist of a recurrent neural network (RNN)-based encoder and decoder, a binarizer, and a neural...

chapter

Multi-context Attention for Human Pose Estimation

Xiao Chu, Wei Yang, Wanli Ouyang, Cheng Ma, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5669 - 5678

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we propose to incorporate convolutional neural networks with a multi-context attention mechanism into an end-to-end framework for human pose estimation. We adopt stacked hourglass networks to generate attention maps from features at multiple resolutions with various semantics. The Conditional Random Field (CRF) is utilized to model the correlations among neighboring regions in the attention...

chapter

Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images Using Weakly-Supervised Joint Convolutional Sparse Coding

Yawen Huang, Ling Shao, Alejandro F. Frangi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5787 - 5796

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Magnetic Resonance Imaging (MRI) offers high-resolution in vivo imaging and rich functional and anatomical multimodality tissue contrast. In practice, however, there are challenges associated with considerations of scanning costs, patient comfort, and scanning time that constrain how much data can be acquired in clinical or research studies. In this paper, we explore the possibility of generating...

chapter

Deep Co-occurrence Feature Learning for Visual Object Recognition

Ya-Fang Shih, Yang-Ming Yeh, Yen-Yu Lin, Ming-Fang Weng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7302 - 7311

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses three issues in integrating part-based representations into convolutional neural networks (CNNs) for object recognition. First, most part-based models rely on a few pre-specified object parts. However, the optimal object parts for recognition often vary from category to category. Second, acquiring training data with part-level annotation is labor-intensive. Third, modeling spatial...

chapter

Learning Detection with Diverse Proposals

Samaneh Azadi, Jiashi Feng, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7369 - 7377

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

To predict a set of diverse and informative proposals with enriched representations, this paper introduces a differentiable Determinantal Point Process (DPP) layer that is able to augment the object detection architectures. Most modern object detection architectures, such as Faster R-CNN, learn to localize objects by minimizing deviations from the ground truth, but ignore correlation between multiple...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence

Xception: Deep Learning with Depthwise Separable Convolutions

Active Convolution: Learning the Shape of Convolution for Image Classification

Densely Connected Convolutional Networks

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

Image Super-Resolution via Deep Recursive Residual Network

Deep TEN: Texture Encoding Network

Deep Cross-Modal Hashing

Unambiguous Text Localization and Retrieval for Cluttered Scenes

Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis

Deep Feature Flow for Video Recognition

FFTLasso: Large-Scale LASSO in the Fourier Domain

Fully Convolutional Instance-Aware Semantic Segmentation

Oriented Response Networks

Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders

Full Resolution Image Compression with Recurrent Neural Networks

Multi-context Attention for Human Pose Estimation

Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images Using Weakly-Supervised Joint Convolutional Sparse Coding

Deep Co-occurrence Feature Learning for Visual Object Recognition

Learning Detection with Diverse Proposals

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)