2016 23rd International Conference on Pattern Recognition (ICPR)

chapter

A multi-modal RGB-D object recognizer

Thomas Faulhammer, Michael Zillich, Johann Prankl, Markus Vincze

2016 23rd International Conference on Pattern Recognition (ICPR) > 733 - 738

In this paper we propose a multi-modal object recognition system that uses a two-step hypothesis verification approach to improve runtime efficiency. The system uses local and global appearance and shape features, generating many possibly competing hypotheses, which are then verified such that the scene can be optimally explained in terms of recognized object models. The introduced modification in...

chapter

Detection and localization with multi-scale models

Eshed Ohn-Bar, Mohan M. Trivedi

2016 23rd International Conference on Pattern Recognition (ICPR) > 1382 - 1387

2016 23rd International Conference on Pattern Recognition (ICPR)

Object detection and localization in images involve a multi-scale reasoning process. First, responses of object detectors are known to vary with image scale. Second, contextual relationships on a part-level, object-level, and scene-level appear at different scales of the image. This paper studies efficient modeling of these two components by training multi-scale template models. The input to the proposed...

chapter

Boosting VLAD with double assignment using deep features for action recognition in videos

Ionut C. Duta, Tuan A. Nguyen, Kiyoharu Aizawa, Bogdan Ionescu, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 2210 - 2215

2016 23rd International Conference on Pattern Recognition (ICPR)

The encoding method is an important factor for an action recognition pipeline. One of the key points for the encoding method is the assignment step. A very widely used super-vector encoding method is the vector of locally aggregated descriptors (VLAD), with very competitive results in many tasks. However, it considers only hard assignment and the criteria for the assignment is performed only from...

chapter

Leveraging multiple tasks to regularize fine-grained classification

Riddhiman Dasgupta, Anoop M. Namboodiri

2016 23rd International Conference on Pattern Recognition (ICPR) > 3476 - 3481

2016 23rd International Conference on Pattern Recognition (ICPR)

Fine-grained classification is an extremely challenging problem in computer vision, compounded by subtle differences in shape, pose, illumination and appearance. While convolutional neural networks have become the versatile jack-of-all-trades tool in modern computer vision, approaches for fine-grained recognition still rely on localization of keypoints and parts to learn discriminative features for...

chapter

An attention model based on spatial transformers for scene recognition

Shuxuan Guo, Li Liu, Wei Wang, Songyang Lao, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 3757 - 3762

2016 23rd International Conference on Pattern Recognition (ICPR)

Scene recognition is an important and challenging task in computer vision. We propose an end-to-end pipeline by combing convolutional neural networks (CNNs) with explicit attention model to determine several meaningful regions of original images for scene recognition. In the proposed pipeline, the spatial transformer network is leveraged as the attention module, which can automatically learn the scales...

INFONA - science communication portal

2016 23rd International Conference on Pattern Recognition (ICPR)

A multi-modal RGB-D object recognizer

Detection and localization with multi-scale models

Boosting VLAD with double assignment using deep features for action recognition in videos

Leveraging multiple tasks to regularize fine-grained classification

An attention model based on spatial transformers for scene recognition

Filter options

Publication date

Keywords

INFONA - science communication portal

2016 23rd International Conference on Pattern Recognition (ICPR) $("#expandableTitles").expandable();

A multi-modal RGB-D object recognizer

Detection and localization with multi-scale models

Boosting VLAD with double assignment using deep features for action recognition in videos

Leveraging multiple tasks to regularize fine-grained classification

An attention model based on spatial transformers for scene recognition

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2016 23rd International Conference on Pattern Recognition (ICPR)