Search results

chapter

Evaluating aerial vessel detector in multiple maritime surveillance scenarios

Goncalo Cruz, Alexandre Bernardino

OCEANS 2017 – Anchorage > 1 - 9

OCEANS 2017 - Anchorage

In this paper we present an autonomous detection approach for airborne surveillance in maritime scenarios. This approach is robust to sun glare, waves and scale variation. Additionally, we introduce a new metric to evaluate detection and tracking results that is more adequate for these scenarios. The proposed detection method is evaluated using videos from different monitoring missions and its results...

chapter

RMPE: Regional Multi-person Pose Estimation

Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu

2017 IEEE International Conference on Computer Vision (ICCV) > 2353 - 2362

2017 IEEE International Conference on Computer Vision (ICCV)

Multi-person pose estimation in the wild is challenging. Although state-of-the-art human detectors have demonstrated good performance, small errors in localization and recognition are inevitable. These errors can cause failures for a single-person pose estimator (SPPE), especially for methods that solely depend on human detection results. In this paper, we propose a novel regional multi-person pose...

chapter

Human Pose Estimation Using Global and Local Normalization

Ke Sun, Cuiling Lan, Junliang Xing, Wenjun Zeng, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 5600 - 5608

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we address the problem of estimating the positions of human joints, i.e., articulated pose estimation. Recent state-of-the-art solutions model two key issues, joint detection and spatial configuration refinement, together using convolutional neural networks. Our work mainly focuses on spatial configuration refinement by reducing variations of human poses statistically, which is motivated...

chapter

Occlusion detector using convolutional neural network for person re-identification

Sejeong Lee, Yoojin Hong, Moongu Jeon

2017 International Conference on Control, Automation and Information Sciences (ICCAIS) > 140 - 144

2017 International Conference on Control, Automation and Information Sciences (ICCAIS)

The technique of comparing pedestrian images observed by different cameras to determine whether they show the same person is important in surveillance systems. This technique is called person re-identification. Most person re-identification research assumes that occlusion does not occur. However, since occlusion occurs frequently in surveillance systems and affects accuracy, it is necessary...

chapter

Single Shot Text Detector with Regional Attention

Pan He, Weilin Huang, Tong He, Qile Zhu, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 3066 - 3074

2017 IEEE International Conference on Computer Vision (ICCV)

We present a novel single-shot text detector that directly outputs word-level bounding boxes in a natural image. We propose an attention mechanism which roughly identifies text regions via an automatically learned attentional map. This substantially suppresses background interference in the convolutional features, which is the key to producing accurate inference of words, particularly at extremely...

chapter

Boosting Image Captioning with Attributes

Ting Yao, Yingwei Pan, Yehao Li, Zhaofan Qiu, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4904 - 4912

2017 IEEE International Conference on Computer Vision (ICCV)

Automatically describing an image with a natural language has been an emerging challenge in both fields of computer vision and natural language processing. In this paper, we present Long Short-Term Memory with Attributes (LSTM-A) - a novel architecture that integrates attributes into the successful Convolutional Neural Networks (CNNs) plus Recurrent Neural Networks (RNNs) image captioning framework,...

chapter

Focal Loss for Dense Object Detection

Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 2999 - 3007

2017 IEEE International Conference on Computer Vision (ICCV)

The highest accuracy object detectors to date are based on a two-stage approach popularized by R-CNN, where a classifier is applied to a sparse set of candidate object locations. In contrast, one-stage detectors that are applied over a regular, dense sampling of possible object locations have the potential to be faster and simpler, but have trailed the accuracy of two-stage detectors thus far. In...

chapter

Joint Discovery of Object States and Manipulation Actions

Jean-Baptiste Alayrac, Josef Sivic, Ivan Laptev, Simon Lacoste-Julien

2017 IEEE International Conference on Computer Vision (ICCV) > 2146 - 2155

2017 IEEE International Conference on Computer Vision (ICCV)

Many human activities involve object manipulations aiming to modify the object state. Examples of common state changes include full/empty bottle, open/closed door, and attached/detached car wheel. In this work, we seek to automatically discover the states of objects and the associated manipulation actions. Given a set of videos for a particular task, we propose a joint model that learns to identify...

chapter

Feature matching for underwater image via superpixel tracking

Shu Zhang, Junyu Dong, Hui Yu

2017 23rd International Conference on Automation and Computing (ICAC) > 1 - 5

2017 23rd International Conference on Automation and Computing (ICAC)

Feature matching is fundamental to many vision tasks. Due to the low visibility of images in underwater environments, traditional pixel-based matching methods suffer from mismatches or erroneous matches. Recently, superpixel-based features have been applied to image feature analysis. However, most existing methods are dedicated to rectified stereo matching with images captured in the air. This paper...

chapter

Learning Discriminative and Transformation Covariant Local Feature Detectors

Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4923 - 4931

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Robust covariant local feature detectors are important for detecting local features that are (1) discriminative of the image content and (2) can be repeatably detected at consistent locations when the image undergoes diverse transformations. Such detectors are critical for applications such as image search and scene reconstruction. Many learning-based local feature detectors address one of these two...

chapter

Training Object Class Detectors with Click Supervision

Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 180 - 189

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate...

chapter

Harmonic Networks: Deep Translation and Rotation Equivariance

Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7168 - 7177

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Translating or rotating an input image should not affect the results of many computer vision tasks. Convolutional neural networks (CNNs) are already translation equivariant: input image translations produce proportionate feature map translations. This is not the case for rotations. Global rotation equivariance is typically sought through data augmentation, but patch-wise equivariance is more difficult...

chapter

Comparative Evaluation of Hand-Crafted and Learned Local Features

Johannes L. Schonberger, Hans Hardmeier, Torsten Sattler, Marc Pollefeys

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6959 - 6968

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Matching local image descriptors is a key step in many computer vision applications. For more than a decade, hand-crafted descriptors such as SIFT have been used for this task. Recently, multiple new descriptors learned from data have been proposed and shown to improve on SIFT in terms of discriminative power. This paper is dedicated to an extensive experimental evaluation of learned local features...

chapter

Detecting Visual Relationships with Deep Relational Networks

Bo Dai, Yuqi Zhang, Dahua Lin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3298 - 3308

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Relationships among objects play a crucial role in image understanding. Despite the great success of deep learning techniques in recognizing individual objects, reasoning about the relationships among objects remains a challenging task. Previous methods often treat this as a classification problem, considering each type of relationship (e.g. ride) or each distinct visual phrase (e.g. person-ride-horse)...

chapter

Viraliency: Pooling Local Virality

Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 484 - 492

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In our overly-connected world, the automatic recognition of virality – the quality of an image or video to be rapidly and widely spread in social networks – is of crucial importance, and has recently awakened the interest of the computer vision community. Concurrently, recent progress in deep learning architectures showed that global pooling strategies allow the extraction of activation...

chapter

Video based pedestrian detection and tracking at night-time

Geun-Hoo Lee, Gyu-Yeong Kim, Jong-Kwan Song, Omer Faruk Ince, et al.

2017 10th International Conference on Human System Interactions (HSI) > 69 - 72

2017 10th International Conference on Human-System Interactions (HSI)

This paper presents an approach for pedestrian detection and tracking with infrared imagery. The detection phase is performed by the AdaBoost algorithm based on Haar-like features. The AdaBoost classifier is trained with datasets generated from infrared images. The number of negative images used for training with the AdaBoost algorithm is 3000. For positive training, 1000 samples are used. After detecting the pedestrian...

chapter

Sterile zone monitoring with human verification

Ajmal Shahbaz, Wahyono, Kang-Hyun Jo

2017 10th International Conference on Human System Interactions (HSI) > 60 - 63

2017 10th International Conference on Human-System Interactions (HSI)

This paper proposes an efficient real-time method for sterile zone monitoring with human verification. The proposed method consists of two main parts: a motion detection module and a human verification module. The role of the motion detection module is to segment out foreground objects from the background. A probabilistic foreground detector based on a Gaussian Mixture Model (GMM) is used. Region of interest (ROI) obtained...

chapter

Comparison of Four Local Invariant Characteristics Based on Palm Vein

Wei Lu, Wei-qi Yuan

2017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC) > 1 > 850 - 853

2017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC)

Palm vein recognition is a new biometric identification technology. Horizontal rotation, translation, tilting and loss of local vein information in palm vein images greatly affect the recognition rate. To solve these problems, this paper extracts four kinds of local invariant features: Scale Invariant Feature Transform (SIFT), Affine-SIFT (ASIFT), Harris-Laplace and Maximally Stable Extremal...

chapter

Ground Truth Accuracy and Performance of the Matching Pipeline

Josef Maier, Martin Humenberger, Oliver Zendel, Markus Vincze

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 969 - 979

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Feature matching quality strongly influences the accuracy of most computer vision tasks. This led to impressive advances in keypoint detection, descriptor calculation, and feature matching itself. To compare different approaches and evaluate their quality, datasets from related tasks are used. Unfortunately, none of these datasets actually provide ground truth (GT) feature matches. Thus, matches can...

chapter

The human detection in images using the depth map

Dmitriy Tatarenkov, Dmitry Podolsky

2017 Systems of Signal Synchronization, Generating and Processing in Telecommunications (SINKHROINFO) > 1 - 4

2017 Systems of Signal Synchronization, Generating and Processing in Telecommunications (SINKHROINFO)

In today's world the need for autonomous mobile robots and vehicles is increasing. Safe autonomous movement demands reliable and fast detection algorithms. Histogram of Oriented Gradients (HOG) descriptors significantly outperform the existing feature sets for human detection. However, the method produces many type I errors. The number of these errors can be decreased...

INFONA - science communication portal

