Items from 81 to 100 out of 1,339 results

chapter

CityPersons: A Diverse Dataset for Pedestrian Detection

Shanshan Zhang, Rodrigo Benenson, Bernt Schiele

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4457 - 4465

Convnets have enabled significant progress in pedestrian detection recently, but there are still open questions regarding suitable architectures and training data. We revisit CNN design and point out key adaptations, enabling plain FasterRCNN to obtain state-of-the-art results on the Caltech dataset. To achieve further improvement from more and better data, we introduce CityPersons, a new set of person...

chapter

Deep Self-Taught Learning for Weakly Supervised Object Localization

Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4294 - 4302

Most existing weakly supervised localization (WSL) approaches learn detectors by finding positive bounding boxes based on features learned with image-level supervision. However, those features do not contain spatial location related information and usually provide poor-quality positive samples for training a detector. To overcome this issue, we propose a deep self-taught learning approach, which makes...

chapter

Person Re-identification in the Wild

Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3346 - 3355

This paper presents a novel large-scale dataset and comprehensive baselines for end-to-end pedestrian detection and person recognition in raw video frames. Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms for pedestrian detection to help improve overall re-identification (re-ID) accuracy and assessing the effectiveness of different...

chapter

Joint Detection and Identification Feature Learning for Person Search

Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3376 - 3385

Existing person re-identification benchmarks and methods mainly focus on matching cropped pedestrian images between queries and candidates. However, it is different from real-world scenarios where the annotations of pedestrian bounding boxes are unavailable and the target person needs to be searched from a gallery of whole scene images. To close the gap, we propose a new deep learning framework for...

chapter

YOLO9000: Better, Faster, Stronger

Joseph Redmon, Ali Farhadi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6517 - 6525

We introduce YOLO9000, a state-of-the-art, real-time object detection system that can detect over 9000 object categories. First we propose various improvements to the YOLO detection method, both novel and drawn from prior work. The improved model, YOLOv2, is state-of-the-art on standard detection tasks like PASCAL VOC and COCO. Using a novel, multi-scale training method the same YOLOv2 model can run...

chapter

RON: Reverse Connection with Objectness Prior Networks for Object Detection

Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5244 - 5252

We present RON, an efficient and effective framework for generic object detection. Our motivation is to smartly associate the best of the region-based (e.g., Faster R-CNN) and region-free (e.g., SSD) methodologies. Under fully convolutional architecture, RON mainly focuses on two fundamental problems: (a) multi-scale object localization and (b) negative sample mining. To address (a), we design the...

chapter

Learning Discriminative and Transformation Covariant Local Feature Detectors

Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4923 - 4931

Robust covariant local feature detectors are important for detecting local features that are (1) discriminative of the image content and (2) can be repeatably detected at consistent locations when the image undergoes diverse transformations. Such detectors are critical for applications such as image search and scene reconstruction. Many learning-based local feature detectors address one of these two...

chapter

Training Object Class Detectors with Click Supervision

Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 180 - 189

Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate...

chapter

Mimicking Very Efficient Network for Object Detection

Quanquan Li, Shengying Jin, Junjie Yan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7341 - 7349

Current CNN based object detectors need initialization from pre-trained ImageNet classification models, which are usually time-consuming. In this paper, we present a fully convolutional feature mimic framework to train very efficient CNN based detectors, which do not need ImageNet pre-training and achieve competitive performance as the large and slow models. We add supervision from high-level features...

chapter

Network Dissection: Quantifying Interpretability of Deep Visual Representations

David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3319 - 3327

We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a data set of concepts to score the semantics of hidden units at each intermediate convolutional layer. The units with semantics are...

chapter

Multiple Instance Detection Network with Online Instance Classifier Refinement

Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3059 - 3067

Of late, weakly supervised object detection is of great importance in object recognition. Based on deep learning, weakly supervised detectors have achieved many promising results. However, compared with fully supervised detection, it is more challenging to train deep network based detectors in a weakly supervised manner. Here we formulate weakly supervised detection as a Multiple Instance Learning...

chapter

Finding Tiny Faces

Peiyun Hu, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1522 - 1530

Though tremendous strides have been made in object recognition, one of the remaining open challenges is detecting small objects. We explore three aspects of the problem in the context of finding small faces: the role of scale invariance, image resolution, and contextual reasoning. While most recognition approaches aim to be scale-invariant, the cues for recognizing a 3px tall face are fundamentally...

chapter

3D Human Pose Estimation from a Single Image via Distance Matrix Regression

Francesc Moreno-Noguer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1561 - 1570

This paper addresses the problem of 3D human pose estimation from a single image. We follow a standard two-step pipeline by first detecting the 2D position of the N body joints, and then using these observations to infer 3D pose. For the first step, we use a recent CNN-based detector. For the second step, most existing approaches perform 2N-to-3N regression of the Cartesian joint coordinates. We show...

chapter

Fine-Grained Recognition of Thousands of Object Categories with Single-Example Training

Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 965 - 974

We approach the problem of fast detection and recognition of a large number (thousands) of object categories while training on a very limited amount of examples, usually one per category. Examples of this task include: (i) detection of retail products, where we have only one studio image of each product available for training, (ii) detection of brand logos, and (iii) detection of 3D objects and their...

chapter

Detecting Masked Faces in the Wild with LLE-CNNs

Shiming Ge, Jia Li, Qiting Ye, Zhao Luo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 426 - 434

Detecting faces with occlusions is a challenging task due to two main reasons: 1) the absence of large datasets of masked faces, and 2) the absence of facial cues from the masked regions. To address these two issues, this paper first introduces a dataset, denoted as MAFA, with 30,811 Internet images and 35,806 masked faces. Faces in the dataset have various orientations and occlusion degrees, while...

chapter

Learning Cross-Modal Deep Representations for Robust Pedestrian Detection

Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4236 - 4244

This paper presents a novel method for detecting pedestrians under adverse illumination conditions. Our approach relies on a novel cross-modality learning framework and it is based on two main phases. First, given a multimodal dataset, a deep convolutional network is employed to learn a non-linear mapping, modeling the relations between RGB and thermal data. Then, the learned feature representations...

chapter

Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection

Nikolay Savinov, Akihito Seki, Ľubor Ladický, Torsten Sattler, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3929 - 3937

Several machine learning tasks require representing the data using only a sparse set of interest points. An ideal detector is able to find the corresponding interest points even if the data undergo a transformation typical for a given domain. Since the task is of high practical interest in computer vision, many hand-crafted solutions were proposed. In this paper, we ask a fundamental question: can...

chapter

Learning to Detect Salient Objects with Image-Level Supervision

Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3796 - 3805

Deep Neural Networks (DNNs) have substantially improved the state-of-the-art in salient object detection. However, training DNNs requires costly pixel-level annotations. In this paper, we leverage the observation that image-level tags provide important cues of foreground salient objects, and develop a weakly supervised learning method for saliency detection using image-level tags only. The Foreground...

chapter

Vision-Based Traffic Light Detection for Intelligent Vehicles

Xiaoping Du, Yang Li, Yuang Guo, Hui Xiong

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 1323 - 1326

Vision-based traffic light detection has been widely studied over the past decade. However, it is still a challenging task to build a real-time and robust classifier-based detector without a high dependency on prior knowledge. In this paper, we have a deep look at the design of features and detection mechanism in the domain of traffic light detection; propose a multi-scale and multi-phase detector...

chapter

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1302 - 1310

We present an approach to efficiently detect the 2D pose of multiple people in an image. The approach uses a nonparametric representation, which we refer to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image. The architecture encodes global context, allowing a greedy bottom-up parsing step that maintains high accuracy while achieving realtime performance,...

Keywords:
TRAINING
DETECTORS

INFONA - science communication portal
