2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Instance-Level Salient Object Segmentation

Guanbin Li, Yuan Xie, Liang Lin, Yizhou Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 247 - 256

Image saliency detection has recently witnessed rapid progress due to deep convolutional neural networks. However, none of the existing methods is able to identify object instances in the detected salient regions. In this paper, we present a salient instance segmentation method that produces a saliency mask with distinct object instance labels for an input image. Our method consists of three steps,...

chapter

Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes from 2D Ones in RGB-Depth Images

Zhuo Deng, Longin Jan Latecki

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 398 - 406

This paper addresses the problem of amodal perception of 3D object detection. The task is to not only find object localizations in the 3D world, but also estimate their physical sizes and poses, even if only parts of them are visible in the RGB-D image. Recent approaches have attempted to harness point cloud from depth channel to exploit 3D features directly in the 3D space and demonstrated the superiority...

chapter

Deep Level Sets for Salient Object Detection

Ping Hu, Bing Shuai, Jun Liu, Gang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 540 - 549

Deep learning has been applied to saliency detection in recent years. The superior performance has proved that deep networks can model the semantic properties of salient objects. Yet it is difficult for a deep network to discriminate pixels belonging to similar receptive fields around the object boundaries, thus deep networks may output maps with blurred saliency and inaccurate boundaries. To tackle...

chapter

Variational Bayesian Multiple Instance Learning with Gaussian Processes

Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 810 - 819

Gaussian Processes (GPs) are effective Bayesian predictors. We here show for the first time that instance labels of a GP classifier can be inferred in the multiple instance learning (MIL) setting using variational Bayes. We achieve this via a new construction of the bag likelihood that assumes a large value if the instance predictions obey the MIL constraints and a small value otherwise. This construction...

chapter

Pixelwise Instance Segmentation with a Dynamically Instantiated Network

Anurag Arnab, Philip H. S. Torr

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 879 - 888

Semantic segmentation and object detection research have recently achieved rapid progress. However, the former task has no notion of different instances of the same object, and the latter operates at a coarse, bounding-box level. We propose an Instance Segmentation system that produces a segmentation map where each pixel is assigned an object class and instance identity label. Most approaches adapt...

chapter

Object Detection in Videos with Tubelet Proposal Networks

Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 889 - 897

Object detection in videos has drawn increasing attention recently with the introduction of the large-scale ImageNet VID dataset. Different from object detection in static images, temporal information in videos is vital for object detection. To fully utilize temporal information, state-of-the-art methods [15, 14] are based on spatiotemporal tubelets, which are essentially sequences of associated bounding...

chapter

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, Piotr Dollar, Ross Girshick, Kaiming He, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 936 - 944

Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But pyramid representations have been avoided in recent object detectors that are based on deep convolutional networks, partially because they are slow to compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct...

chapter

Spatially Adaptive Computation Time for Residual Networks

Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1790 - 1799

This paper proposes a deep learning architecture based on Residual Network that dynamically adjusts the number of executed layers for the regions of the image. This architecture is end-to-end trainable, deterministic and problem-agnostic. It is therefore applicable without any modifications to a wide range of computer vision problems such as image classification, object detection and image segmentation...

chapter

Deep MANTA: A Coarse-to-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis from Monocular Image

Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Celine Teuliere, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1827 - 1836

In this paper, we present a novel approach, called Deep MANTA (Deep Many-Tasks), for many-task vehicle analysis from a given image. A robust convolutional network is introduced for simultaneous vehicle detection, part localization, visibility characterization and 3D dimension estimation. Its architecture is based on a new coarse-to-fine object proposal that boosts the vehicle detection. Moreover,...

chapter

Perceptual Generative Adversarial Networks for Small Object Detection

Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1951 - 1959

Detecting small objects is notoriously challenging due to their low resolution and noisy representation. Existing object detection pipelines usually detect small objects through learning representations of all the objects at multiple scales. However, the performance gain of such ad hoc architectures is usually limited to pay off the computational cost. In this work, we address the small object detection...

chapter

Dense Captioning with Joint Inference and Visual Context

Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1978 - 1987

Dense captioning is a newly emerging computer vision topic for understanding images with dense language descriptions. The goal is to densely detect visual concepts (e.g., objects, object parts, and interactions between them) from images, labeling each with a short descriptive phrase. We identify two key challenges of dense captioning that need to be properly addressed when tackling the problem. First,...

chapter

Semantic Amodal Segmentation

Yan Zhu, Yuandong Tian, Dimitris Metaxas, Piotr Dollar

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3001 - 3009

Common visual recognition tasks such as classification, object detection, and semantic segmentation are rapidly reaching maturity, and given the recent rate of progress, it is not unreasonable to conjecture that techniques for many of these problems will approach human levels of performance in the next few years. In this paper we look to the future: what is the next frontier in visual recognition?...

chapter

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection

Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3039 - 3048

How do we learn an object detector that is invariant to occlusions and deformations? Our current solution is to use a data-driven strategy – collect large-scale datasets which have object instances under different conditions. The hope is that the final classifier can use these examples to learn invariances. But is it really possible to see all the occlusions in a dataset? We argue that...

chapter

Multiple Instance Detection Network with Online Instance Classifier Refinement

Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3059 - 3067

Of late, weakly supervised object detection is of great importance in object recognition. Based on deep learning, weakly supervised detectors have achieved many promising results. However, compared with fully supervised detection, it is more challenging to train deep network based detectors in a weakly supervised manner. Here we formulate weakly supervised detection as a Multiple Instance Learning...

chapter

Visual Translation Embedding Network for Visual Relation Detection

Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3107 - 3115

Visual relations, such as person ride bike and bike next to car, offer a comprehensive scene understanding of an image, and have already shown their great utility in connecting computer vision and natural language. However, due to the challenging combinatorial complexity of modeling subject-predicate-object relation triplets, very little work has been done to localize and predict visual relations...

chapter

Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors

Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3296 - 3297

The goal of this paper is to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance for a given application and platform. To this end, we investigate various ways to trade accuracy for speed and memory usage in modern convolutional object detection systems. A number of successful systems have been proposed in recent years, but apples-to-apples...

chapter

Detecting Oriented Text in Natural Images by Linking Segments

Baoguang Shi, Xiang Bai, Serge Belongie

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3482 - 3490

Most state-of-the-art text detection methods are specific to horizontal Latin text and are not fast enough for real-time applications. We introduce Segment Linking (SegLink), an oriented text detection method. The main idea is to decompose text into two locally detectable elements, namely segments and links. A segment is an oriented box covering a part of a word or text line; a link connects two adjacent...

chapter

Learning to Detect Salient Objects with Image-Level Supervision

Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3796 - 3805

Deep Neural Networks (DNNs) have substantially improved the state-of-the-art in salient object detection. However, training DNNs requires costly pixel-level annotations. In this paper, we leverage the observation that image-level tags provide important cues of foreground salient objects, and develop a weakly supervised learning method for saliency detection using image-level tags only. The Foreground...

chapter

Mining Object Parts from CNNs via Active Question-Answering

Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3890 - 3899

Given a convolutional neural network (CNN) that is pre-trained for object classification, this paper proposes to use active question-answering to semanticize neural patterns in conv-layers of the CNN and mine part concepts. For each part concept, we mine neural patterns in the pre-trained CNN, which are related to the target part, and use these patterns to construct an And-Or graph (AOG) to represent...

chapter

Polyhedral Conic Classifiers for Visual Object Detection and Classification

Hakan Cevikalp, Bill Triggs

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4114 - 4122

We propose a family of quasi-linear discriminants that outperform current large-margin methods in sliding window visual object detection and open set recognition tasks. In these tasks the classification problems are both numerically imbalanced – positive (object class) training and test windows are much rarer than negative (non-class) ones – and geometrically asymmetric –...
