2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Scene Graph Generation by Iterative Message Passing

Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3097 - 3106

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their relationships using scene graphs, a visually-grounded graphical structure of an image. We propose a novel end-to-end model that generates such structured scene representation...

chapter

BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition

Jacob Chan, Jimmy Addison Lee, Qian Kemao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3020 - 3028

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents BIND (Binary Integrated Net Descriptor), a texture-less object detector that encodes multi-layered binary-represented nets for high precision edge-based description. Our proposed concept aligns layers of object-sized patches (nets) onto highly fragmented occlusion resistant line-segment midpoints (linelets) to encode regional information into efficient binary strings. These lightweight...

chapter

Learning Random-Walk Label Propagation for Weakly-Supervised Semantic Segmentation

Paul Vernaza, Manmohan Chandraker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2953 - 2961

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Large-scale training for semantic segmentation is challenging due to the expense of obtaining training data for this task relative to other vision tasks. We propose a novel training approach to address this difficulty. Given cheaply-obtained sparse image labelings, we propagate the sparse labels to produce guessed dense labelings. A standard CNN-based segmentation network is trained to mimic these...

chapter

Semantic Amodal Segmentation

Yan Zhu, Yuandong Tian, Dimitris Metaxas, Piotr Dollar

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3001 - 3009

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Common visual recognition tasks such as classification, object detection, and semantic segmentation are rapidly reaching maturity, and given the recent rate of progress, it is not unreasonable to conjecture that techniques for many of these problems will approach human levels of performance in the next few years. In this paper we look to the future: what is the next frontier in visual recognition?...

chapter

A Unified Approach of Multi-scale Deep and Hand-Crafted Features for Defocus Estimation

Jinsun Park, Yu-Wing Tai, Donghyeon Cho, In So Kweon

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2760 - 2769

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we introduce robust and synergetic hand-crafted features and a simple but efficient deep feature from a convolutional neural network (CNN) architecture for defocus estimation. This paper systematically analyzes the effectiveness of different features, and shows how each feature can compensate for the weaknesses of other features when they are concatenated. For a full defocus map estimation,...

chapter

PoseTrack: Joint Multi-person Pose Estimation and Tracking

Umar Iqbal, Anton Milan, Juergen Gall

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4654 - 4663

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we introduce the challenging problem of joint multi-person pose estimation and tracking of an unknown number of persons in unconstrained videos. Existing methods for multi-person pose estimation in images cannot be applied directly to this problem, since it also requires to solve the problem of person association over time in addition to the pose estimation for each person. We therefore...

chapter

Image Deblurring via Extreme Channels Prior

Yanyang Yan, Wenqi Ren, Yuanfang Guo, Rui Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6978 - 6986

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Camera motion introduces motion blur, affecting many computer vision tasks. Dark Channel Prior (DCP) helps the blind deblurring on scenes including natural, face, text, and low-illumination images. However, it has limitations and is less likely to support the kernel estimation while bright pixels dominate the input image. We observe that the bright pixels in the clear images are not likely to be bright...

chapter

InstanceCut: From Edges to Instances with MultiCut

Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7322 - 7331

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This work addresses the task of instance-aware semantic segmentation. Our key motivation is to design a simple method with a new modelling-paradigm, which therefore has a different trade-off between advantages and disadvantages compared to known approaches. Our approach, we term InstanceCut, represents the problem by two output modalities: (i) an instance-agnostic semantic segmentation and (ii) all...

chapter

Image-to-Image Translation with Conditional Adversarial Networks

Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5967 - 5976

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems. These networks not only learn the mapping from input image to output image, but also learn a loss function to train this mapping. This makes it possible to apply the same generic approach to problems that traditionally would require very different loss formulations. We demonstrate...

chapter

Improving RANSAC-Based Segmentation through CNN Encapsulation

Dustin Morley, Hassan Foroosh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2661 - 2670

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we present a method for improving a random sample consensus (RANSAC) based image segmentation algorithm by encapsulating it within a convolutional neural network (CNN). The improvements are gained by gradient descent training on the set of pre-RANSAC filtering and thresholding operations using a novel RANSAC-based loss function, which is geared toward optimizing the strength of the correct...

chapter

Deep Network Flow for Multi-object Tracking

Samuel Schulter, Paul Vernaza, Wongun Choi, Manmohan Chandraker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2730 - 2739

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Data association problems are an important component of many computer vision applications, with multi-object tracking being one of the most prominent examples. A typical approach to data association involves finding a graph matching or network flow that minimizes a sum of pairwise association costs, which are often either hand-crafted or learned as linear functions of fixed features. In this work,...

chapter

Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval

Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2298 - 2307

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Free-hand sketch-based image retrieval (SBIR) is a specific cross-view retrieval task, in which queries are abstract and ambiguous sketches while the retrieval database is formed with natural images. Work in this area mainly focuses on extracting representative and shared features for sketches and natural images. However, these can neither cope well with the geometric distortion between sketches and...

chapter

Deeply Supervised Salient Object Detection with Short Connections

Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5300 - 5309

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent progress on saliency detection is substantial, benefiting mostly from the explosive development of Convolutional Neural Networks (CNNs). Semantic segmentation and saliency detection algorithms developed lately have been mostly based on Fully Convolutional Neural Networks (FCNs). There is still a large room for improvement over the generic FCN models that do not explicitly deal with the scale-space...

chapter

MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features

Youssef Tamaazousti, Herve Le Borgne, Celine Hudelot

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5282 - 5291

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In a transfer-learning scheme, the intermediate layers of a pre-trained CNN are employed as universal image representation to tackle many visual classification problems. The current trend to generate such representation is to learn a CNN on a large set of images labeled among the most specific categories. Such processes ignore potential relations between categories, as well as the categorical-levels...

chapter

Single Image Reflection Suppression

Nikolaos Arvanitopoulos, Radhakrishna Achanta, Sabine Susstrunk

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1752 - 1760

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Reflections are a common artifact in images taken through glass windows. Automatically removing the reflection artifacts after the picture is taken is an ill-posed problem. Attempts to solve this problem using optimization schemes therefore rely on various prior assumptions from the physical world. Instead of removing reflections from a single image, which has met with limited success so far, we propose...

chapter

Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization

Anil Armagan, Martin Hirzer, Peter M. Roth, Vincent Lepetit

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4590 - 4597

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an efficient method for geolocalization in urban environments starting from a coarse estimate of the location provided by a GPS and using a simple untextured 2.5D model of the surrounding buildings. Our key contribution is a novel efficient and robust method to optimize the pose: We train a Deep Network to predict the best direction to improve a pose estimate, given a semantic segmentation...

chapter

Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context

Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 152 - 160

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A perennial problem in structure from motion (SfM) is visual ambiguity posed by repetitive structures. Recent disambiguating algorithms infer ambiguities mainly via explicit background context, thus face limitations in highly ambiguous scenes which are visually indistinguishable. Instead of analyzing local visual information, we propose a novel algorithm for SfM disambiguation that explores the global...

chapter

Co-occurrence Filter

Roy J. Jevnisek, Shai Avidan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3816 - 3824

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Co-occurrence Filter (CoF) is a boundary preserving filter. It is based on the Bilateral Filter (BF) but instead of using a Gaussian on the range values to preserve edges it relies on a co-occurrence matrix. Pixel values that co-occur frequently in the image (i.e., inside textured regions) will have a high weight in the co-occurrence matrix. This, in turn, means that such pixel pairs will be averaged...

chapter

InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation

Shay Zweig, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6363 - 6372

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Sparse-to-dense interpolation for optical flow is a fundamental phase in the pipeline of most of the leading optical flow estimation algorithms. The current state-of-the-art method for interpolation, EpicFlow, is a local average method based on an edge aware geodesic distance. We propose a new data-driven sparse-to-dense interpolation algorithm based on a fully convolutional network. We draw inspiration...

chapter

What Can Help Pedestrian Detection?

Jiayuan Mao, Tete Xiao, Yuning Jiang, Zhimin Cao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6034 - 6043

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Aggregating extra features has been considered as an effective approach to boost traditional pedestrian detection methods. However, there is still a lack of studies on whether and how CNN-based pedestrian detectors can benefit from these extra features. The first contribution of this paper is exploring this issue by aggregating extra features into CNN-based pedestrian detection framework. Through...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene Graph Generation by Iterative Message Passing

BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition

Learning Random-Walk Label Propagation for Weakly-Supervised Semantic Segmentation

Semantic Amodal Segmentation

A Unified Approach of Multi-scale Deep and Hand-Crafted Features for Defocus Estimation

PoseTrack: Joint Multi-person Pose Estimation and Tracking

Image Deblurring via Extreme Channels Prior

InstanceCut: From Edges to Instances with MultiCut

Image-to-Image Translation with Conditional Adversarial Networks

Improving RANSAC-Based Segmentation through CNN Encapsulation

Deep Network Flow for Multi-object Tracking

Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval

Deeply Supervised Salient Object Detection with Short Connections

MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features

Single Image Reflection Suppression

Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization

Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context

Co-occurrence Filter

InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation

What Can Help Pedestrian Detection?

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)