Search results

chapter

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5057 - 5065

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Indoor scene understanding is central to applications such as robot navigation and human companion assistance. Over the last years, data-driven deep neural networks have outperformed many traditional approaches thanks to their representation learning capabilities. One of the bottlenecks in training for better representations is the amount of available per-pixel ground truth data that is required for...

chapter

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 257 - 265

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Non-uniform blind deblurring for general dynamic scenes is a challenging computer vision problem as blurs arise not only from multiple object motions but also from camera shake, scene depth variation. To remove these complicated motion blurs, conventional energy optimization based methods rely on simple assumptions such that blur kernel is partially uniform or locally linear. Moreover, recent machine...

chapter

Person Re-identification in the Wild

Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3346 - 3355

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a novel large-scale dataset and comprehensive baselines for end-to-end pedestrian detection and person recognition in raw video frames. Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms for pedestrian detection to help improve overall re-identification (re-ID) accuracy and assessing the effectiveness of different...

chapter

Semi-Supervised Deep Learning for Monocular Depth Map Prediction

Yevhen Kuznietsov, Jorg Stuckler, Bastian Leibe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2215 - 2223

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Supervised deep learning often suffers from the lack of sufficient training data. Specifically in the context of monocular depth map prediction, it is barely possible to determine dense ground truth depth images in realistic dynamic outdoor environments. When using LiDAR sensors, for instance, noise is present in the distance measurements, the calibration between sensors cannot be perfect, and the...

chapter

Fast Video Classification via Adaptive Cascading of Deep Models

Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2197 - 2205

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent advances have enabled oracle classifiers that can classify across many classes and input distributions with high accuracy without retraining. However, these classifiers are relatively heavyweight, so that applying them to classify video is costly. We show that day-to-day video exhibits highly skewed class distributions over the short term, and that these distributions can be classified by much...

chapter

Procedural Generation of Videos to Train Deep Action Recognition Networks

Cesar Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel Lopez

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2594 - 2604

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning for human action recognition in videos is making significant progress, but is slowed down by its dependency on expensive manual labeling of large video collections. In this work, we investigate the generation of synthetic training data for action recognition, as it has recently shown promising results for a variety of other computer vision tasks. We propose an interpretable parametric...

chapter

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks

Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1377 - 1386

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Person re-identification is an open and challenging problem in computer vision. Existing approaches have concentrated on either designing the best feature representation or learning optimal matching metrics in a static setting where the number of cameras are fixed in a network. Most approaches have neglected the dynamic and open world nature of the re-identification problem, where a new camera may...

chapter

Deep Video Deblurring for Hand-Held Cameras

Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 237 - 246

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Motion blur from camera shake is a major problem in videos captured by hand-held devices. Unlike single-image deblurring, video-based approaches can take advantage of the abundant information that exists across neighboring frames. As a result the best performing methods rely on the alignment of nearby frames. However, aligning images is a computationally expensive and fragile procedure, and methods...

chapter

On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 218 - 227

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Camera relocalisation is an important problem in computer vision, with applications in simultaneous localisation and mapping, virtual/augmented reality and navigation. Common techniques either match the current image against keyframes with known poses coming from a tracker, or establish 2D-to-3D correspondences between keypoints in the current image and points in the scene in order to estimate the...

chapter

An improved method for 3D shape estimation using cascade of neural networks

Van-Thanh Hoang, Van-Dung Hoang, Kang-Hyun Jo

2017 IEEE 15th International Conference on Industrial Informatics (INDIN) > 285 - 289

2017 IEEE 15th International Conference on Industrial Informatics (INDIN)

This paper tackles the problem of estimating 3D human poses from given 2D landmarks, which is still an ill-posed problem. The existing works have successfully applied Active Shape Model approach to estimate 3D human poses, but the error is still high. In this paper, we propose an improved method by using the cascade of neural networks to make the estimated shape more alike to the ground truth shape...

chapter

A Dataset for Benchmarking Image-Based Localization

Xun Sun, Yuanfan Xie, Pei Luo, Liang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5641 - 5649

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A novel dataset for benchmarking image-based localization is presented. With increasing research interests in visual place recognition and localization, several datasets have been published in the past few years. One of the evident limitations of existing datasets is that precise ground truth camera poses of query images are not available in a meaningful 3D metric system. This is in part due to the...

chapter

DeMoN: Depth and Motion Network for Learning Monocular Stereo

Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5622 - 5631

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper we formulate structure from motion as a learning problem. We train a convolutional network end-to-end to compute depth and camera motion from successive, unconstrained image pairs. The architecture is composed of multiple stacked encoder-decoder networks, the core part being an iterative network that is able to improve its own predictions. The network estimates not only depth and motion,...

chapter

3D Human Pose Estimation = 2D Pose Estimation + Matching

Ching-Hang Chen, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5759 - 5767

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We explore 3D human pose estimation from a single RGB image. While many approaches try to directly predict 3D pose from image measurements, we explore a simple architecture that reasons through intermediate 2D pose predictions. Our approach is based on two key observations (1) Deep neural nets have revolutionized 2D pose estimation, producing accurate 2D predictions even for poses with self-occlusions...

chapter

One-Shot Metric Learning for Person Re-identification

Slawomir Bak, Peter Carr

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1571 - 1580

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Re-identification of people in surveillance footage must cope with drastic variations in color, background, viewing angle and a persons pose. Supervised techniques are often the most effective, but require extensive annotation which is infeasible for large camera networks. Unlike previous supervised learning approaches that require hundreds of annotated subjects, we learn a metric using a novel one-shot...

chapter

From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur

Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3806 - 3815

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Removing pixel-wise heterogeneous motion blur is challenging due to the ill-posed nature of the problem. The predominant solution is to estimate the blur kernel by adding a prior, but extensive literature on the subject indicates the difficulty in identifying a prior which is suitably informative, and general. Rather than imposing a prior based on theory, we propose instead to learn one from the data...

chapter

Unsupervised Learning of Depth and Ego-Motion from Video

Tinghui Zhou, Matthew Brown, Noah Snavely, David G. Lowe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6612 - 6619

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an unsupervised learning framework for the task of monocular depth and camera motion estimation from unstructured video sequences. In common with recent work [10, 14, 16], we use an end-to-end learning approach with view synthesis as the supervisory signal. In contrast to the previous work, our method is completely unsupervised, requiring only monocular video sequences for training. Our...

chapter

Unsupervised Monocular Depth Estimation with Left-Right Consistency

Clement Godard, Oisin Mac Aodha, Gabriel J. Brostow

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6602 - 6611

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we...

chapter

Coin Recognition Method Based on SIFT Algorithm

Jing Xu, Gongliu Yang, Yuanyuan Liu, Jingjia Zhong

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 229 - 233

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Coin recognition is one of the prime important activities for modern banking and currency processing systems in which machine vision is widely used. The technique at the heart of such systems is object recognition in a digital image. Although it has high recognition speed, the traditional method of coin recognition can not recognize the coins with similar sizes. This paper presents a method based...

chapter

Adaboost-based algorithm for human action recognition

Nabil Zerrouki, Fouzi Harrou, Ying Sun, Amrane Houacine

2017 IEEE 15th International Conference on Industrial Informatics (INDIN) > 189 - 193

2017 IEEE 15th International Conference on Industrial Informatics (INDIN)

This paper presents a computer vision-based methodology for human action recognition. First, the shape based pose features are constructed based on area ratios to identify the human silhouette in images. The proposed features are invariance to translation and scaling. Once the human body features are extracted from videos, different human actions are learned individually on the training frames of...

chapter

Headgear recognition by decomposing human images in the thermal infrared spectrum

Brahmastro Kresnaraman, Yasutomo Kawanishi, Daisuke Deguchi, Tomokazu Takahashi, more

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering > 164 - 168

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering

Surveillance systems play a critical role in security and surveillance. A surveillance system with cameras that work in the visible spectrum is sufficient for most cases. However, problems may arise during the night, or in areas with less than ideal illumination conditions. Cameras with thermal infrared technology can be a better option in these situations since they do not rely on illumination to...

INFONA - science communication portal

Search results

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Person Re-identification in the Wild

Semi-Supervised Deep Learning for Monocular Depth Map Prediction

Fast Video Classification via Adaptive Cascading of Deep Models

Procedural Generation of Videos to Train Deep Action Recognition Networks

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks

Deep Video Deblurring for Hand-Held Cameras

On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

An improved method for 3D shape estimation using cascade of neural networks

A Dataset for Benchmarking Image-Based Localization

DeMoN: Depth and Motion Network for Learning Monocular Stereo

3D Human Pose Estimation = 2D Pose Estimation + Matching

One-Shot Metric Learning for Person Re-identification

From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur

Unsupervised Learning of Depth and Ego-Motion from Video

Unsupervised Monocular Depth Estimation with Left-Right Consistency

Coin Recognition Method Based on SIFT Algorithm

Adaboost-based algorithm for human action recognition

Headgear recognition by decomposing human images in the thermal infrared spectrum

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options