Search results

chapter

An improved method for 3D shape estimation using cascade of neural networks

Van-Thanh Hoang, Van-Dung Hoang, Kang-Hyun Jo

2017 IEEE 15th International Conference on Industrial Informatics (INDIN) > 285 - 289

This paper tackles the problem of estimating 3D human poses from given 2D landmarks, which is still an ill-posed problem. Existing works have successfully applied the Active Shape Model approach to estimate 3D human poses, but the error is still high. In this paper, we propose an improved method that uses a cascade of neural networks to make the estimated shape more similar to the ground-truth shape...

chapter

A Dataset for Benchmarking Image-Based Localization

Xun Sun, Yuanfan Xie, Pei Luo, Liang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5641 - 5649

A novel dataset for benchmarking image-based localization is presented. With increasing research interests in visual place recognition and localization, several datasets have been published in the past few years. One of the evident limitations of existing datasets is that precise ground truth camera poses of query images are not available in a meaningful 3D metric system. This is in part due to the...

chapter

DeMoN: Depth and Motion Network for Learning Monocular Stereo

Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5622 - 5631

In this paper we formulate structure from motion as a learning problem. We train a convolutional network end-to-end to compute depth and camera motion from successive, unconstrained image pairs. The architecture is composed of multiple stacked encoder-decoder networks, the core part being an iterative network that is able to improve its own predictions. The network estimates not only depth and motion,...

chapter

3D Human Pose Estimation = 2D Pose Estimation + Matching

Ching-Hang Chen, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5759 - 5767

We explore 3D human pose estimation from a single RGB image. While many approaches try to directly predict 3D pose from image measurements, we explore a simple architecture that reasons through intermediate 2D pose predictions. Our approach is based on two key observations: (1) Deep neural nets have revolutionized 2D pose estimation, producing accurate 2D predictions even for poses with self-occlusions...

chapter

One-Shot Metric Learning for Person Re-identification

Slawomir Bak, Peter Carr

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1571 - 1580

Re-identification of people in surveillance footage must cope with drastic variations in color, background, viewing angle and a person's pose. Supervised techniques are often the most effective, but require extensive annotation which is infeasible for large camera networks. Unlike previous supervised learning approaches that require hundreds of annotated subjects, we learn a metric using a novel one-shot...

chapter

From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur

Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3806 - 3815

Removing pixel-wise heterogeneous motion blur is challenging due to the ill-posed nature of the problem. The predominant solution is to estimate the blur kernel by adding a prior, but extensive literature on the subject indicates the difficulty in identifying a prior which is suitably informative, and general. Rather than imposing a prior based on theory, we propose instead to learn one from the data...

chapter

Unsupervised Learning of Depth and Ego-Motion from Video

Tinghui Zhou, Matthew Brown, Noah Snavely, David G. Lowe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6612 - 6619

We present an unsupervised learning framework for the task of monocular depth and camera motion estimation from unstructured video sequences. In common with recent work [10, 14, 16], we use an end-to-end learning approach with view synthesis as the supervisory signal. In contrast to the previous work, our method is completely unsupervised, requiring only monocular video sequences for training. Our...

chapter

Unsupervised Monocular Depth Estimation with Left-Right Consistency

Clement Godard, Oisin Mac Aodha, Gabriel J. Brostow

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6602 - 6611

Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we...

chapter

Coin Recognition Method Based on SIFT Algorithm

Jing Xu, Gongliu Yang, Yuanyuan Liu, Jingjia Zhong

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 229 - 233

Coin recognition is one of the most important activities in modern banking and currency processing systems, in which machine vision is widely used. The technique at the heart of such systems is object recognition in a digital image. Although it has high recognition speed, the traditional method of coin recognition cannot distinguish coins of similar sizes. This paper presents a method based...

chapter

Adaboost-based algorithm for human action recognition

Nabil Zerrouki, Fouzi Harrou, Ying Sun, Amrane Houacine

2017 IEEE 15th International Conference on Industrial Informatics (INDIN) > 189 - 193

This paper presents a computer vision-based methodology for human action recognition. First, shape-based pose features are constructed based on area ratios to identify the human silhouette in images. The proposed features are invariant to translation and scaling. Once the human body features are extracted from videos, different human actions are learned individually on the training frames of...

chapter

Headgear recognition by decomposing human images in the thermal infrared spectrum

Brahmastro Kresnaraman, Yasutomo Kawanishi, Daisuke Deguchi, Tomokazu Takahashi, more

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering > 164 - 168

Surveillance systems play a critical role in security and surveillance. A surveillance system with cameras that work in the visible spectrum is sufficient for most cases. However, problems may arise during the night, or in areas with less than ideal illumination conditions. Cameras with thermal infrared technology can be a better option in these situations since they do not rely on illumination to...

chapter

End to end very deep person re-identification

Liviu-Daniel Stefan, Ionut Mironica, Catalin Alexandru Mitrea, Bogdan Ionescu

2017 International Symposium on Signals, Circuits and Systems (ISSCS) > 1 - 4

Convolutional Neural Networks (CNNs) are responsible for major breakthroughs in object recognition in still images. This work presents an end-to-end, very deep architecture with small convolutional kernel sizes and small convolutional strides for person re-identification in video streams. To achieve such a system, several good practices for training were tested, namely:...

chapter

6-DOF object localization by combining monocular vision and robot arm kinematics

Kun Liu, Weiwei Shang, Shuang Du, Shuang Cong

2017 36th Chinese Control Conference (CCC) > 6575 - 6580

A robot needs to localize an unknown object before grasping it. When the robot only has a monocular sensor, how can it get the object pose? In this work, we present a method of localizing the 6-DOF pose of a target object using a robotic arm and a hand-mounted monocular camera. The method includes an object recognition and a localization process. The recognition process uses point features on a surface...

chapter

Estimating relative depth in single images via rankboost

Ralph Ewerth, Matthias Springstein, Eric Muller, Alexander Balz, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 919 - 924

In this paper, we present a novel approach to estimate the relative depth of regions in monocular images. There are several contributions. First, the task of monocular depth estimation is considered as a learning-to-rank problem which offers several advantages compared to regression approaches. Second, monocular depth clues of human perception are modeled in a systematic manner. Third, we show that...

chapter

Large-scale person re-identification as retrieval

Hantao Yao, Shiliang Zhang, Dongming Zhang, Yongdong Zhang, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1440 - 1445

This paper aims to bring together research efforts in two fields that have been growing actively in the past few years: multicamera person Re-Identification (ReID) and large-scale image retrieval. We demonstrate that the essentials of image retrieval and person ReID are the same, i.e., measuring the similarity between images. However, person ReID requires more discriminative and robust features to...

chapter

It’s Written All Over Your Face: Full-Face Appearance-Based Gaze Estimation

Xucong Zhang, Yusuke Sugano, Mario Fritz, Andreas Bulling

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2299 - 2308

Eye gaze is an important non-verbal cue for human affect analysis. Recent gaze estimation work indicated that information from the full face region can benefit performance. Pushing this idea further, we propose an appearance-based method that, in contrast to a long-standing line of work in computer vision, only takes the full face image as input. Our method encodes the face image using a convolutional...

chapter

Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection

Mohammadamin Barekatain, Miquel Marti, Hsueh-Fu Shih, Samuel Murray, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2153 - 2160

Despite significant progress in the development of human action detection datasets and algorithms, no current dataset is representative of real-world aerial view scenarios. We present Okutama-Action, a new video dataset for aerial view concurrent human action detection. It consists of 43 minute-long fully-annotated sequences with 12 action classes. Okutama-Action features many challenges missing in...

chapter

Deep Heterogeneous Face Recognition Networks Based on Cross-Modal Distillation and an Equitable Distance Metric

Christopher Reale, Hyungtae Lee, Heesung Kwon

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 226 - 232

In this work we present three methods to improve a deep convolutional neural network approach to near-infrared heterogeneous face recognition. We first present a method to distill extra information from a pre-trained visible face network through the output logits of the network. Next, we put forth an altered contrastive loss function that uses the ℓ1 norm instead of the ℓ2 norm as a distance metric...

chapter

Person Re-Identification with Deep Features and Transfer Learning

Shengke Wang, Shan Wu, Lianghua Duan, Changyin Yu, more

2017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC) > 1 > 704 - 707

Person re-identification is an important technique for automatically searching for a person's presence in surveillance video. Two fundamental problems are critical for person re-identification: feature representation and metric learning. At present, there are many methods in the study of person re-identification, which have achieved remarkable results. Due to the difference of the data distribution in...

chapter

EDeN: Ensemble of Deep Networks for Vehicle Classification

Rajkumar Theagarajan, Federico Pala, Bir Bhanu

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 906 - 913

Traffic surveillance has always been a challenging task to automate. The main difficulties arise from the high variation of the vehicles appertaining to the same category, low resolution, changes in illumination and occlusions. Due to the lack of large labeled datasets, deep learning techniques still have not shown their full potential. In this paper, we train an Ensemble of Deep Networks (EDeN) to...

INFONA - science communication portal
