2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Learning Non-maximum Suppression

Jan Hosang, Rodrigo Benenson, Bernt Schiele

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6469 - 6477

Object detectors have hugely profited from moving towards an end-to-end learning paradigm: proposals, features, and the classifier becoming one neural network improved results two-fold on general object detection. One indispensable component is non-maximum suppression (NMS), a post-processing algorithm responsible for merging all detections that belong to the same object. The de facto standard NMS...

chapter

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection

Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3039 - 3048

How do we learn an object detector that is invariant to occlusions and deformations? Our current solution is to use a data-driven strategy – collect large-scale datasets which have object instances under different conditions. The hope is that the final classifier can use these examples to learn invariances. But is it really possible to see all the occlusions in a dataset? We argue that...

chapter

Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model

Qixiang Ye, Tianliang Zhang, Wei Ke, Qiang Qiu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2057 - 2066

In this paper, a self-learning approach is proposed towards solving scene-specific pedestrian detection problem without any human annotation involved. The self-learning approach is deployed as progressive steps of object discovery, object enforcement, and label propagation. In the learning procedure, object locations in each frame are treated as latent variables that are solved with a progressive...

chapter

Scale-Aware Face Detection

Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1913 - 1922

Convolutional neural network (CNN) based face detectors are inefficient in handling faces of diverse scales. They rely on either fitting a large single model to faces across a large scale range or multi-scale testing. Both are computationally expensive. We propose Scale-aware Face Detection (SAFD) to handle scale explicitly using CNN, and achieve better performance with less computation cost. Prior...

chapter

What is and What is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors

Changqun Xia, Jia Li, Xiaowu Chen, Anlin Zheng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4399 - 4407

Finding what is and what is not a salient object can be helpful in developing better features and models in salient object detection (SOD). In this paper, we investigate the images that are selected and discarded in constructing a new SOD dataset and find that many similar candidates, complex shape and low objectness are three main attributes of many non-salient objects. Moreover, objects may have...

chapter

Deep Self-Taught Learning for Weakly Supervised Object Localization

Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4294 - 4302

Most existing weakly supervised localization (WSL) approaches learn detectors by finding positive bounding boxes based on features learned with image-level supervision. However, those features do not contain spatial location related information and usually provide poor-quality positive samples for training a detector. To overcome this issue, we propose a deep self-taught learning approach, which makes...

chapter

Towards Accurate Multi-person Pose Estimation in the Wild

George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3711 - 3719

We propose a method for multi-person detection and 2-D pose estimation that achieves state-of-art results on the challenging COCO keypoints task. It is a simple, yet powerful, top-down approach consisting of two stages. In the first stage, we predict the location and scale of boxes which are likely to contain people, for this we use the Faster RCNN detector. In the second stage, we estimate the keypoints...

chapter

Joint Detection and Identification Feature Learning for Person Search

Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3376 - 3385

Existing person re-identification benchmarks and methods mainly focus on matching cropped pedestrian images between queries and candidates. However, it is different from real-world scenarios where the annotations of pedestrian bounding boxes are unavailable and the target person needs to be searched from a gallery of whole scene images. To close the gap, we propose a new deep learning framework for...

chapter

RON: Reverse Connection with Objectness Prior Networks for Object Detection

Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5244 - 5252

We present RON, an efficient and effective framework for generic object detection. Our motivation is to smartly associate the best of the region-based (e.g., Faster R-CNN) and region-free (e.g., SSD) methodologies. Under fully convolutional architecture, RON mainly focuses on two fundamental problems: (a) multi-scale object localization and (b) negative sample mining. To address (a), we design the...

chapter

Accurate Single Stage Detector Using Recurrent Rolling Convolution

Jimmy Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 752 - 760

Most of the recent successful methods in accurate object detection and localization used some variants of R-CNN style two stage Convolutional Neural Networks (CNN) where plausible regions were proposed in the first stage then followed by a second stage for decision refinement. Despite the simplicity of training and the efficiency in deployment, the single stage detection methods have not been as competitive...

chapter

Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly

Hao Jiang, Kristen Grauman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3435 - 3443

Today's person detection methods work best when people are in common upright poses and appear reasonably well spaced out in the image. However, in many real images, that's not what people do. People often appear quite close to each other, e.g., with limbs linked or heads touching, and their poses are often not pedestrian-like. We propose an approach to detangle people in multi-person images. We formulate...

chapter

Mimicking Very Efficient Network for Object Detection

Quanquan Li, Shengying Jin, Junjie Yan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7341 - 7349

Current CNN based object detectors need initialization from pre-trained ImageNet classification models, which are usually time-consuming. In this paper, we present a fully convolutional feature mimic framework to train very efficient CNN based detectors, which do not need ImageNet pre-training and achieve competitive performance as the large and slow models. We add supervision from high-level features...

chapter

Multiple Instance Detection Network with Online Instance Classifier Refinement

Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3059 - 3067

Of late, weakly supervised object detection is with great importance in object recognition. Based on deep learning, weakly supervised detectors have achieved many promising results. However, compared with fully supervised detection, it is more challenging to train deep network based detectors in a weakly supervised manner. Here we formulate weakly supervised detection as a Multiple Instance Learning...

chapter

ArtTrack: Articulated Multi-Person Tracking in the Wild

Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1293 - 1301

In this paper we propose an approach for articulated tracking of multiple people in unconstrained videos. Our starting point is a model that resembles existing architectures for single-frame pose estimation but is substantially faster. We achieve this in two ways: (1) by simplifying and sparsifying the body-part relationship graph and leveraging recent methods for faster inference, and (2) by offloading...

chapter

Detecting Masked Faces in the Wild with LLE-CNNs

Shiming Ge, Jia Li, Qiting Ye, Zhao Luo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 426 - 434

Detecting faces with occlusions is a challenging task due to two main reasons: 1) the absence of large datasets of masked faces, and 2) the absence of facial cues from the masked regions. To address these two issues, this paper first introduces a dataset, denoted as MAFA, with 30,811 Internet images and 35,806 masked faces. Faces in the dataset have various orientations and occlusion degrees, while...

chapter

Learning Cross-Modal Deep Representations for Robust Pedestrian Detection

Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4236 - 4244

This paper presents a novel method for detecting pedestrians under adverse illumination conditions. Our approach relies on a novel cross-modality learning framework and it is based on two main phases. First, given a multimodal dataset, a deep convolutional network is employed to learn a non-linear mapping, modeling the relations between RGB and thermal data. Then, the learned feature representations...

chapter

Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors

Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3296 - 3297

The goal of this paper is to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance for a given application and platform. To this end, we investigate various ways to trade accuracy for speed and memory usage in modern convolutional object detection systems. A number of successful systems have been proposed in recent years, but apples-to-apples...

chapter

Discover and Learn New Objects from Documentaries

Kai Chen, Hang Song, Chen Change Loy, Dahua Lin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1111 - 1120

Despite the remarkable progress in recent years, detecting objects in a new context remains a challenging task. Detectors learned from a public dataset can only work with a fixed list of categories, while training from scratch usually requires a large amount of training data with detailed annotations. This work aims to explore a novel approach – learning object detectors from documentary...

chapter

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, Piotr Dollar, Ross Girshick, Kaiming He, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 936 - 944

Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But pyramid representations have been avoided in recent object detectors that are based on deep convolutional networks, partially because they are slow to compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct...

chapter

Pixelwise Instance Segmentation with a Dynamically Instantiated Network

Anurag Arnab, Philip H. S. Torr

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 879 - 888

Semantic segmentation and object detection research have recently achieved rapid progress. However, the former task has no notion of different instances of the same object, and the latter operates at a coarse, bounding-box level. We propose an Instance Segmentation system that produces a segmentation map where each pixel is assigned an object class and instance identity label. Most approaches adapt...
