2012 21st International Conference on Pattern Recognition (ICPR)

chapter

Logo spotting for document categorization

Viet Phuong Le, Muriel Visani, Cao De Tran, Jean-Marc Ogier

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3484 - 3487

Logo spotting is of great interest because it makes it possible to categorize the document images of a digital library of scanned documents according to their sources, without any costly semantic analysis of their textual transcript. In this paper, we present an approach for logo spotting, based on the matching of keypoints extracted both from the query document images and a given set of logos (gallery) using...

chapter

DFlow and DField: New features for capturing object and image relationships

Pavel Kisilev, Daniel Freedman, Eugene Wallach, Asaf Tzadok, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3590 - 3593

In this paper we propose two new types of features useful for problems in which one wants to describe object or image relationships rather than objects or images themselves. The features are based on the notion of distribution flow, as derived from the classic Transportation Problem. Two variants of such features, the Distribution Flow (DFlow) and Displacement Field (DField), are defined and studied...

chapter

Learning robust color name models from web images

Boris Schauerte, Rainer Stiefelhagen

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3598 - 3601

We use images that have been collected using an Internet search engine to train color name models for color naming and recognition tasks. Considering color histogram bands as being words of an image and the color names as classes, we use the supervised latent Dirichlet allocation to train our model. To pre-process the training data, we use state-of-the-art salient object detection and a Kullback-Leibler...

chapter

Action recognition with discriminative mid-level features

Cuiwei Liu, Yu Kong, Xinxiao Wu, Yunde Jia

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3366 - 3369

This paper presents a novel random forest learning framework to construct a discriminative and informative mid-level feature from low-level features. Since a single low-level feature based representation is not enough to capture the variations of human appearance, multiple low-level features (i.e., optical flow and histogram of gradient 3D features) are fused to further improve recognition performance...

chapter

HEp-2 cell classification in IIF images using Shareboost

I. Ersoy, F. Bunyak, J. Peng, K. Palaniappan

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3362 - 3365

Indirect immunofluorescence (IIF) imaging is a method used for detection of antinuclear auto-antibodies (ANA) for the diagnosis of autoimmune diseases. We present a feature extraction and classification scheme to classify the fluorescence staining patterns of HEp-2 cells in IIF images. We propose a set of complementary features that are sensitive to staining pattern variations among classes. Our feature...

chapter

Efficient semantic segmentation with Gaussian processes and histogram intersection kernels

Alexander Freytag, Bjorn Frohlich, Erik Rodner, Joachim Denzler

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3313 - 3316

Semantic interpretation and understanding of images is an important goal of visual recognition research and offers a large variety of possible applications. One step towards this goal is semantic segmentation, which aims for automatic labeling of image regions and pixels with category names. Since typical images contain several million pixels, the use of kernel-based methods for the task of semantic...

chapter

Improving texture description in remote sensing image multi-scale classification tasks by using visual words

J. A. dos Santos, O. A. B. Penatti, R. da S. Torres, P-H. Gosselin, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3090 - 3093

Although texture features are important for region-based classification of remote sensing images, the literature shows that texture descriptors usually have poor performance when compared and combined with color descriptors. In this paper, we propose a bag-of-visual-words (BOW) “propagation” approach to extract texture features from a hierarchy of regions. This strategy improves efficacy of feature...

chapter

Multi-modality movie scene detection using Kernel Canonical Correlation Analysis

Guangyu Gao, Huadong Ma

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 3074 - 3077

Scene detection is the fundamental step for efficiently accessing and browsing videos. In this paper, we propose to segment a movie into scenes using fused visual and audio features. The movie is first segmented into shots by an accelerating algorithm, and the key frames are extracted later. While feature movies are often filmed in open and dynamic environments using moving cameras and have continuously...

chapter

Sparse coding for histograms of local binary patterns applied for image categorization: Toward a Bag-of-Scenes analysis

Sebastien Paris, Xanadu Halkias, Herve Glotin

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 2817 - 2820

In this work, we propose a novel approach for image categorization, which we will refer to as Bag-of-Scenes (BoS). It is based on the association of Sparse coding (Sc) and pooling techniques applied to histograms of multi-scale Local Binary Patterns (LBP) and its improved variant. This approach can be considered as a 2-layer hierarchical architecture. The first layer encodes general local patch's...

chapter

Learning-based deformable registration using weighted mutual information

Yongning Lu, Rui Liao, Li Zhang, Ying Sun, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 2626 - 2629

Deformable registration of multi-modality medical images remains a challenging research topic. The incorporation of prior information on the expected joint distribution has been shown to noticeably improve registration accuracy and robustness. However, direct application of the learned joint histogram makes the algorithm sensitive to the difference between the training data and the test image. This paper...

chapter

Manhattan-Pyramid Distance: A solution to an anomaly in pyramid matching by minimization

Aneesh Chauhan, Luis Seabra Lopes

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 2668 - 2672

In the field of computer vision, pyramid matching by minimization has gained increasing popularity. This paper points out and discusses an inherent anomaly in pyramid matching by minimization that can affect the performance of classification approaches based on this type of matching. As a solution, a new multiresolution measure, called Manhattan-Pyramid Distance (MPD), is proposed. Systematic evaluations...

chapter

Feature-aligned 4D spatiotemporal image registration

Huanhuan Xu, Peizhi Chen, Wuyi Yu, Amit Sawant, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 2639 - 2642

In this paper, we develop a feature-aware 4D spatiotemporal image registration method. Our model is based on a 4D (3D+time) free-form B-spline deformation model which has both spatial and temporal smoothness. We first introduce an automatic 3D feature extraction and matching method based on an improved 3D SIFT descriptor, which is scale- and rotation-invariant. Then we use the results of feature...

chapter

Invariant signatures for omnidirectional visual place recognition and robot localization in unknown environments

Romain Marie, Ouiddad Labbani-Igbida, El Mustapha Mouaddib

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 2537 - 2540

The paper introduces a novel approach to place representation for robot localization and mapping. It uses classical invariance theory while proposing an adaptive kernel to omnidirectional images and exploiting only the main significant visual information in the images. The approach is validated in real world robot exploration and localization and compared to color histograms.

chapter

Color Maximal-Dissimilarity Pattern for pedestrian detection

Qingyuan Wang, Junbiao Pang, Guoyi Liu, Lei Qin, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1952 - 1955

Features play an important role in pedestrian detection, and considerable progress has been made on shape-based descriptors. However, color cues have barely been applied to detection tasks, seemingly due to the variable appearance of pedestrians. In this paper, Color Maximal-Dissimilarity Pattern (CMDP) is proposed to encode color cues by two core operations, i.e., oriented filtering and max-pooling,...

chapter

Detection of eyes by circular Hough transform and histogram of gradient

Yasutaka Ito, Wataru Ohyama, Tetsushi Wakabayashi, Fumitaka Kimura

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1795 - 1798

In order to achieve high accuracy in face recognition, detection of facial parts such as eyes, nose, and mouth is essential. In this paper, we propose a method to detect eyes from frontal face images. The proposed method consists of two major steps. The first is a two-dimensional Hough transformation for detecting circles of unknown radius. The circular Hough transform first generates two...

chapter

ARMA-HMM: A new approach for early recognition of human activity

Kang Li, Yun Fu

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1779 - 1782

Early recognition of human activities is a highly desirable functionality for many visual intelligent systems. However, in computer vision, very little work has been devoted to this challenging and interesting task. In this paper, we address human activity early recognition as a pattern recognition problem of time series data. A new model called ARMA-HMM is introduced to integrate both the predictive...

chapter

Facial emotion recognition in continuous video

Albert Cruz, Bir Bhanu, Ninad Thakoor

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1880 - 1883

Facial emotion recognition, the detection of emotion states from video of facial expressions, has applications in video games, medicine, and affective computing. While there have been many advances, an approach has yet to be revealed that performs well on the non-trivial Audio/Visual Emotion Challenge 2011 data set. A majority of approaches still employ single frame classification, or temporally aggregate...

chapter

Supporting ground-truth annotation of image datasets using clustering

Bastiaan J. Boom, Phoenix X. Huang, Jiyin He, Robert B. Fisher

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1542 - 1545

As more subject-specific image datasets (medical images, birds, etc.) become available, high quality labels associated with these datasets are essential for building statistical models and method evaluation. Obtaining these annotations is a time-consuming and thus costly business. We propose a clustering method to support this annotation task, making the task easier and more efficient to perform...

chapter

Corner-surround Contrast for saliency detection

Quan Zhou, Nianyi Li, Yi Yang, Pan Chen, more

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1423 - 1426

Center-surround measurements are widely used for saliency detection but have some disadvantages: 1) the center-surround operation may cause inaccurate segmentation and even lead to incorrect detection results; 2) in most situations, using only center-surround features is not an efficient way to encode object saliency. To overcome these disadvantages, we describe a novel measurement, namely Corner-Surround Contrast...

chapter

Scale-invariant sampling for supervised image segmentation

Yan Li, Marco Loog

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1399 - 1402

Scale invariance is a desirable property for many vision tasks such as image segmentation and classification. One way to achieve such invariance is to collect images containing objects of all scales and then train a classifier. In practice, however, only a finite number of images at a finite number of scales can be collected, and this poses the problem of scale sampling. In this paper, we focus on...
