Search results

chapter

End to end very deep person re-identification

Liviu-Daniel Stefan, Ionut Mironica, Catalin Alexandru Mitrea, Bogdan Ionescu

2017 International Symposium on Signals, Circuits and Systems (ISSCS) > 1 - 4

Convolutional Neural Networks (CNNs) are responsible for major breakthroughs in object recognition in still images. This work presents an end-to-end, very deep architecture with small convolutional kernels and small convolutional strides for person re-identification in video streams. To achieve such a system, several good training practices were tested, namely:...

chapter

6-DOF object localization by combining monocular vision and robot arm kinematics

Kun Liu, Weiwei Shang, Shuang Du, Shuang Cong

2017 36th Chinese Control Conference (CCC) > 6575 - 6580

A robot needs to localize an unknown object before grasping it. When the robot only has a monocular sensor, how can it get the object pose? In this work, we present a method of localizing the 6-DOF pose of a target object using a robotic arm and a hand-mounted monocular camera. The method includes an object recognition and a localization process. The recognition process uses point features on a surface...

chapter

Estimating relative depth in single images via rankboost

Ralph Ewerth, Matthias Springstein, Eric Muller, Alexander Balz, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 919 - 924

In this paper, we present a novel approach to estimate the relative depth of regions in monocular images. There are several contributions. First, the task of monocular depth estimation is considered as a learning-to-rank problem which offers several advantages compared to regression approaches. Second, monocular depth clues of human perception are modeled in a systematic manner. Third, we show that...

chapter

Large-scale person re-identification as retrieval

Hantao Yao, Shiliang Zhang, Dongming Zhang, Yongdong Zhang, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1440 - 1445

This paper aims to bring together research efforts in two fields that have grown actively in the past few years: multi-camera person Re-Identification (ReID) and large-scale image retrieval. We demonstrate that the essentials of image retrieval and person ReID are the same, i.e., measuring the similarity between images. However, person ReID requires more discriminative and robust features to...

chapter

It’s Written All Over Your Face: Full-Face Appearance-Based Gaze Estimation

Xucong Zhang, Yusuke Sugano, Mario Fritz, Andreas Bulling

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2299 - 2308

Eye gaze is an important non-verbal cue for human affect analysis. Recent gaze estimation work indicated that information from the full face region can benefit performance. Pushing this idea further, we propose an appearance-based method that, in contrast to a long-standing line of work in computer vision, only takes the full face image as input. Our method encodes the face image using a convolutional...

chapter

Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection

Mohammadamin Barekatain, Miquel Marti, Hsueh-Fu Shih, Samuel Murray, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2153 - 2160

Despite significant progress in the development of human action detection datasets and algorithms, no current dataset is representative of real-world aerial view scenarios. We present Okutama-Action, a new video dataset for aerial view concurrent human action detection. It consists of 43 minute-long fully-annotated sequences with 12 action classes. Okutama-Action features many challenges missing in...

chapter

Deep Heterogeneous Face Recognition Networks Based on Cross-Modal Distillation and an Equitable Distance Metric

Christopher Reale, Hyungtae Lee, Heesung Kwon

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 226 - 232

In this work we present three methods to improve a deep convolutional neural network approach to near-infrared heterogeneous face recognition. We first present a method to distill extra information from a pre-trained visible face network through the output logits of the network. Next, we put forth an altered contrastive loss function that uses the ℓ1 norm instead of the ℓ2 norm as a distance metric...

chapter

Person Re-Identification with Deep Features and Transfer Learning

Shengke Wang, Shan Wu, Lianghua Duan, Changyin Yu, more

2017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC) > 1 > 704 - 707

Person re-identification is an important technique for automatically searching for a person's presence in surveillance video. Two fundamental problems are critical for person re-identification: feature representation and metric learning. At present, there are many methods for person re-identification, and they have achieved remarkable results. Due to the difference of the data distribution in...

chapter

EDeN: Ensemble of Deep Networks for Vehicle Classification

Rajkumar Theagarajan, Federico Pala, Bir Bhanu

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 906 - 913

Traffic surveillance has always been a challenging task to automate. The main difficulties arise from the high variation of the vehicles appertaining to the same category, low resolution, changes in illumination and occlusions. Due to the lack of large labeled datasets, deep learning techniques still have not shown their full potential. In this paper, we train an Ensemble of Deep Networks (EDeN) to...

chapter

Privacy-Preserving Understanding of Human Body Orientation for Smart Meetings

Indrani Bhattacharya, Noam Eshed, Richard J. Radke

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 284 - 292

We present a method for estimating the body orientation of seated people in a smart room by fusing low-resolution range information collected from downward pointed time-of-flight (ToF) sensors with synchronized speaker identification information from microphone recordings. The ToF sensors preserve the privacy of the occupants in that they only return the range to a small set of hit points. We propose...

chapter

Fully Convolutional Region Proposal Networks for Multispectral Person Detection

Daniel Konig, Michael Adam, Christian Jarvers, Georg Layher, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 243 - 250

Multispectral images that combine visual-optical (VIS) and infrared (IR) image information are a promising source of data for automatic person detection. Especially in automotive or surveillance applications, challenging conditions such as insufficient illumination or large distances between camera and object occur regularly and can affect image quality. This leads to weak image contrast or low object...

chapter

An improved method for 3D shape estimation using active shape model

Van-Thanh Hoang, Kang-Hyun Jo

2017 10th International Conference on Human System Interactions (HSI) > 230 - 233

2017 10th International Conference on Human-System Interactions (HSI)

This paper tackles the problem of reconstructing 3D human poses from 2D landmarks, which is still an ill-posed problem. A widely used approach is the active shape model (ASM), which considers an unknown 3D shape as a linear combination of predefined basis shapes. Existing methods often solve an optimization problem to estimate the weights and viewpoints of the basis shapes, but they could fall into a locally-optimal...

chapter

A C3D-Based Convolutional Neural Network for Frame Dropping Detection in a Single Video Shot

Chengjiang Long, Eric Smith, Arslan Basharat, Anthony Hoogs

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1898 - 1906

Frame dropping is a type of video manipulation where consecutive frames are deleted to omit content from the original video. Automatically detecting dropped frames across a large archive of videos while maintaining a low false alarm rate is a challenging task in digital video forensics. We propose a new approach for forensic analysis by exploiting the local spatio-temporal relationships within a portion...

chapter

Detection of Metadata Tampering Through Discrepancy Between Image Content and Metadata Using Multi-task Deep Learning

Bor-Chun Chen, Pallabi Ghosh, Vlad I. Morariu, Larry S. Davis

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1872 - 1880

The availability and ease of use of software for editing image content or metadata have resulted in a high demand for automatic image tamper detection algorithms. Most previous work has focused on detection of tampered image content, whereas we develop techniques to detect metadata tampering in outdoor images using sun altitude angle and other meteorological information like temperature, humidity and weather,...

chapter

A Counter-Forensic Method for CNN-Based Camera Model Identification

David Guera, Yu Wang, Luca Bondi, Paolo Bestagini, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1840 - 1847

An increasing number of digital images are being shared and accessed through websites, media, and social applications. Many of these images have been modified and are not authentic. Recent advances in the use of deep convolutional neural networks (CNNs) have facilitated the task of analyzing the veracity and authenticity of largely distributed image datasets. We examine in this paper the problem of...

chapter

Geographic information use in weakly-supervised deep learning for landmark recognition

Yifang Yin, Zhenguang Liu, Roger Zimmermann

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1015 - 1020

The successful deep convolutional neural networks for visual object recognition typically rely on a massive number of training images that are well annotated by class labels or object bounding boxes with great human efforts. Here we explore the use of the geographic metadata, which are automatically retrieved from sensors such as GPS and compass, in weakly-supervised learning techniques for landmark...

chapter

Person Re-identification by Deep Learning Attribute-Complementary Information

Arne Schumann, Rainer Stiefelhagen

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1435 - 1443

Automatic person re-identification (re-id) across camera boundaries is a challenging problem. Approaches have to be robust against many factors which influence the visual appearance of a person but are not relevant to the person's identity. Examples of such factors are pose, camera angles, and lighting conditions. Person attributes are semantic, high-level information that is invariant across many...

chapter

3D Pose Regression Using Convolutional Neural Networks

Siddharth Mahendran, Haider Ali, Rene Vidal

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 494 - 495

3D pose estimation is a key component of many important computer vision tasks like autonomous navigation and robot manipulation. Current state-of-the-art approaches for 3D object pose estimation, like Viewpoints & Keypoints and Render for CNN, solve this problem by discretizing the pose space into bins and solving a pose-classification task. We argue that 3D pose is continuous and can be solved...

chapter

Protecting Visual Secrets Using Adversarial Nets

Nisarg Raval, Ashwin Machanavajjhala, Landon P. Cox

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1329 - 1332

Protecting visual secrets is an important problem due to the prevalence of cameras that continuously monitor our surroundings. Any viable solution to this problem should also minimize the impact on the utility of applications that use images. In this work, we build on the existing work of adversarial learning to design a perturbation mechanism that jointly optimizes privacy and utility objectives...

chapter

Improving triplet-wise training of convolutional neural network for vehicle re-identification

Yiheng Zhang, Dong Liu, Zheng-Jun Zha

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1386 - 1391

Vehicle re-identification (re-id) plays an important role in the automatic analysis of the rapidly growing volume of urban surveillance video. Similar to other image retrieval problems, vehicle re-id suffers from difficulties caused by varied vehicle poses, diverse illumination, and complicated environments. Triplet-wise training of convolutional neural networks (CNNs) has been studied...

INFONA - science communication portal
