2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Items from 1 to 20 out of 26 results

chapter

Growing a Brain: Fine-Tuning by Increasing Model Capacity

Yu-Xiong Wang, Deva Ramanan, Martial Hebert

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3029 - 3038

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

CNNs have made an undeniable impact on computer vision through the ability to learn high-capacity models with large annotated training sets. One of their remarkable properties is the ability to transfer knowledge from a large source dataset to a (typically smaller) target dataset. This is usually accomplished through fine-tuning a fixed-size network on new target data. Indeed, virtually every contemporary...

chapter

Deep TEN: Texture Encoding Network

Hang Zhang, Jia Xue, Kristin Dana

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2896 - 2905

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a Deep Texture Encoding Network (Deep-TEN) with a novel Encoding Layer integrated on top of convolutional layers, which ports the entire dictionary learning and encoding pipeline into a single model. Current methods build from distinct components, using standard encoders with separate off-the-shelf features such as SIFT descriptors or pre-trained CNN features for material recognition. Our...

chapter

The World of Fast Moving Objects

Denys Rozumnyi, Jan Kotera, Filip Sroubek, Lukas Novotny, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4838 - 4846

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The notion of a Fast Moving Object (FMO), i.e. an object that moves over a distance exceeding its size within the exposure time, is introduced. FMOs may, and typically do, rotate with high angular speed. FMOs are very common in sports videos, but are not rare elsewhere. In a single frame, such objects are often barely visible and appear as semitransparent streaks. A method for the detection and tracking...

chapter

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters

Shiyu Huang, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4664 - 4673

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

As autonomous vehicles become an every-day reality, high-accuracy pedestrian detection is of paramount practical importance. Pedestrian detection is a highly researched topic with mature methods, but most datasets (for both training and evaluation) focus on common scenes of people engaged in typical walking poses on sidewalks. But performance is most crucial for dangerous scenarios that are rarely...

chapter

Straight to Shapes: Real-Time Detection of Encoded Shapes

Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4207 - 4216

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Current object detection approaches predict bounding boxes that provide little instance-specific information beyond location, scale and aspect ratio. In this work, we propose to regress directly to objects shapes in addition to their bounding boxes and categories. It is crucial to find an appropriate shape representation that is compact and decodable, and in which objects can be compared for higher-order...

chapter

Person Re-identification in the Wild

Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3346 - 3355

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a novel large-scale dataset and comprehensive baselines for end-to-end pedestrian detection and person recognition in raw video frames. Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms for pedestrian detection to help improve overall re-identification (re-ID) accuracy and assessing the effectiveness of different...

chapter

InstanceCut: From Edges to Instances with MultiCut

Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7322 - 7331

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This work addresses the task of instance-aware semantic segmentation. Our key motivation is to design a simple method with a new modelling-paradigm, which therefore has a different trade-off between advantages and disadvantages compared to known approaches. Our approach, we term InstanceCut, represents the problem by two output modalities: (i) an instance-agnostic semantic segmentation and (ii) all...

chapter

Optical Flow Requires Multiple Strategies (but Only One Network)

Tal Schuster, Lior Wolf, David Gadot

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6921 - 6930

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We show that the matching problem that underlies optical flow requires multiple strategies, depending on the amount of image motion and other factors. We then study the implications of this observation on training a deep neural network for representing image patches in the context of descriptor based optical flow. We propose a metric learning method, which selects suitable negative samples based on...

chapter

EAST: An Efficient and Accurate Scene Text Detector

Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2642 - 2651

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Previous approaches for scene text detection have already achieved promising performances across various benchmarks. However, they usually fall short when dealing with challenging scenarios, even when equipped with deep neural network models, because the overall performance is determined by the interplay of multiple stages and components in the pipelines. In this work, we propose a simple yet powerful...

chapter

PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2566 - 2574

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

State-of-the-art computer vision algorithms often achieve efficiency by making discrete choices about which hypotheses to explore next. This allows allocation of computational resources to promising candidates, however, such decisions are non-differentiable. As a result, these algorithms are hard to train in an end-to-end fashion. In this work we propose to learn an efficient algorithm for the task...

chapter

Noise Robust Depth from Focus Using a Ring Difference Filter

Jaeheung Surh, Hae-Gon Jeon, Yunwon Park, Sunghoon Im, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2444 - 2453

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Depth from focus (DfF) is a method of estimating depth of a scene by using the information acquired through the change of the focus of a camera. Within the framework of DfF, the focus measure (FM) forms the foundation on which the accuracy of the output is determined. With the result from the FM, the role of a DfF pipeline is to determine and recalculate unreliable measurements while enhancing those...

chapter

RON: Reverse Connection with Objectness Prior Networks for Object Detection

Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5244 - 5252

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present RON, an efficient and effective framework for generic object detection. Our motivation is to smartly associate the best of the region-based (e.g., Faster R-CNN) and region-free (e.g., SSD) methodologies. Under fully convolutional architecture, RON mainly focuses on two fundamental problems: (a) multi-scale object localization and (b) negative sample mining. To address (a), we design the...

chapter

Webly Supervised Semantic Segmentation

Bin Jin, Maria V. Ortiz Segovia, Sabine Susstrunk

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1705 - 1714

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a weakly supervised semantic segmentation algorithm that uses image tags for supervision. We apply the tags in queries to collect three sets of web images, which encode the clean foregrounds, the common backgrounds, and realistic scenes of the classes. We introduce a novel three-stage training pipeline to progressively learn semantic segmentation models. We first train and refine a class-specific...

chapter

Variational Bayesian Multiple Instance Learning with Gaussian Processes

Manuel HauBmann, Fred A. Hamprecht, Melih Kandemir

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 810 - 819

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Gaussian Processes (GPs) are effective Bayesian predictors. We here show for the first time that instance labels of a GP classifier can be inferred in the multiple instance learning (MIL) setting using variational Bayes. We achieve this via a new construction of the bag likelihood that assumes a large value if the instance predictions obey the MIL constraints and a small value otherwise. This construction...

chapter

Accurate Optical Flow via Direct Cost Volume Processing

Jia Xu, Rene Ranftl, Vladlen Koltun

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5807 - 5815

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an optical flow estimation approach that operates on the full four-dimensional cost volume. This direct approach shares the structural benefits of leading stereo matching pipelines, which are known to yield high accuracy. To this day, such approaches have been considered impractical due to the size of the cost volume. We show that the full four-dimensional cost volume can be constructed...

chapter

Convex Global 3D Registration with Lagrangian Duality

Jesus Briales, Javier Gonzalez-Jimenez

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5612 - 5621

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The registration of 3D models by a Euclidean transformation is a fundamental task at the core of many application in computer vision. This problem is non-convex due to the presence of rotational constraints, making traditional local optimization methods prone to getting stuck in local minima. This paper addresses finding the globally optimal transformation in various 3D registration problems by a...

chapter

3D Human Pose Estimation = 2D Pose Estimation + Matching

Ching-Hang Chen, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5759 - 5767

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We explore 3D human pose estimation from a single RGB image. While many approaches try to directly predict 3D pose from image measurements, we explore a simple architecture that reasons through intermediate 2D pose predictions. Our approach is based on two key observations (1) Deep neural nets have revolutionized 2D pose estimation, producing accurate 2D predictions even for poses with self-occlusions...

chapter

Improved Stereo Matching with Constant Highway Networks and Reflective Confidence Learning

Amit Shaked, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6901 - 6910

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an improved three-step pipeline for the stereo matching problem and introduce multiple novelties at each stage. We propose a new highway network architecture for computing the matching cost at each possible disparity, based on multilevel weighted residual shortcuts, trained with a hybrid loss that supports multilevel comparison of image patches. A novel post-processing step is then introduced,...

chapter

Unsupervised Learning of Depth and Ego-Motion from Video

Tinghui Zhou, Matthew Brown, Noah Snavely, David G. Lowe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6612 - 6619

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an unsupervised learning framework for the task of monocular depth and camera motion estimation from unstructured video sequences. In common with recent work [10, 14, 16], we use an end-to-end learning approach with view synthesis as the supervisory signal. In contrast to the previous work, our method is completely unsupervised, requiring only monocular video sequences for training. Our...

chapter

SGM-Nets: Semi-Global Matching with Neural Networks

Akihito Seki, Marc Pollefeys

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6640 - 6649

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper deals with deep neural networks for predicting accurate dense disparity map with Semi-global matching (SGM). SGM is a widely used regularization method for real scenes because of its high accuracy and fast computation speed. Even though SGM can obtain accurate results, tuning of SGMs penalty-parameters, which control a smoothness and discontinuity of a disparity map, is uneasy and empirical...

Keywords:
PIPELINES

Publication date

Set your own date range

Keywords

TRAINING (13)
CAMERAS (8)
PROPOSALS (7)
COMPUTER VISION (6)
DETECTORS (6)
BENCHMARK TESTING (5)
FEATURE EXTRACTION (5)
THREE-DIMENSIONAL DISPLAYS (5)
NEURAL NETWORKS (4)
OBJECT DETECTION (4)
SEMANTICS (4)
VISUALIZATION (4)
ENCODING (3)
ESTIMATION (3)
IMAGE SEGMENTATION (3)
POSE ESTIMATION (3)
VIDEOS (3)
GEOMETRY (2)
IMAGE COLOR ANALYSIS (2)
IMAGE EDGE DETECTION (2)
MACHINE LEARNING (2)
MEASUREMENT (2)
OPTICAL IMAGING (2)
ROBUSTNESS (2)
SHAPE (2)
STANDARDS (2)
ADAPTATION MODELS (1)
ADAPTIVE OPTICS (1)
AUTOMOBILES (1)
BODY REGIONS (1)
BRAIN MODELING (1)
CINEMATOGRAPHY (1)
COMPUTATIONAL MODELING (1)
COMPUTER ARCHITECTURE (1)
CONVOLUTIONAL CODES (1)
DICTIONARIES (1)
ENGINES (1)
FASTENERS (1)
FREQUENCY MODULATION (1)
GAUSSIAN PROCESSES (1)
HEURISTIC ALGORITHMS (1)
LAPLACE EQUATIONS (1)
LEARNING SYSTEMS (1)
LEGGED LOCOMOTION (1)
LIBRARIES (1)
MERGING (1)
NAVIGATION (1)
NETWORK ARCHITECTURE (1)
NOISE MEASUREMENT (1)
OBSERVERS (1)
OPTICAL LOSSES (1)
OPTICAL NETWORK UNITS (1)
OPTIMIZATION (1)
PATTERN RECOGNITION (1)
POWER DISTRIBUTION (1)
PREDICTION ALGORITHMS (1)
PREDICTIVE MODELS (1)
REAL-TIME SYSTEMS (1)
RENDERING (COMPUTER GRAPHICS) (1)
RESOURCE DESCRIPTION FRAMEWORK (1)
ROAD TRANSPORTATION (1)
SEARCH PROBLEMS (1)
SIMULTANEOUS LOCALIZATION AND MAPPING (1)
SOLID MODELING (1)
SUPERVISED LEARNING (1)
TESTING (1)
TRACKING (1)
TRAJECTORY (1)
TWO DIMENSIONAL DISPLAYS (1)
more

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Growing a Brain: Fine-Tuning by Increasing Model Capacity

Deep TEN: Texture Encoding Network

The World of Fast Moving Objects

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters

Straight to Shapes: Real-Time Detection of Encoded Shapes

Person Re-identification in the Wild

InstanceCut: From Edges to Instances with MultiCut

Optical Flow Requires Multiple Strategies (but Only One Network)

EAST: An Efficient and Accurate Scene Text Detector

PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

Noise Robust Depth from Focus Using a Ring Difference Filter

RON: Reverse Connection with Objectness Prior Networks for Object Detection

Webly Supervised Semantic Segmentation

Variational Bayesian Multiple Instance Learning with Gaussian Processes

Accurate Optical Flow via Direct Cost Volume Processing

Convex Global 3D Registration with Lagrangian Duality

3D Human Pose Estimation = 2D Pose Estimation + Matching

Improved Stereo Matching with Constant Highway Networks and Reflective Confidence Learning

Unsupervised Learning of Depth and Ego-Motion from Video

SGM-Nets: Semi-Global Matching with Neural Networks

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)