Search results

Items from 1 to 20 out of 21 results

chapter

Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization

Sijia Cai, Wangmeng Zuo, Lei Zhang

2017 IEEE International Conference on Computer Vision (ICCV) > 511 - 520

2017 IEEE International Conference on Computer Vision (ICCV)

The success of fine-grained visual categorization (FGVC) extremely relies on the modeling of appearance and interactions of various semantic parts. This makes FGVC very challenging because: (i) part annotation and detection require expert guidance and are very expensive; (ii) parts are of different sizes; and (iii) the part interactions are complex and of higher-order. To address these issues, we...

chapter

Generalized Orderless Pooling Performs Implicit Salient Matching

Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4970 - 4979

2017 IEEE International Conference on Computer Vision (ICCV)

Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to “α-pooling”, allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions...

chapter

Flower classification using fusion descriptor and SVM

Wei Liu, Yunbo Rao, Baijiang Fan, Jiali Song, more

2017 International Smart Cities Conference (ISC2) > 1 - 4

2017 International Smart Cities Conference (ISC2)

This paper aims to develop an effective flower classification approach using the technology of feature extraction. With this regard, a fused descriptor based on Pyramid Histogram of Visual Words (PHOW) is used to extract the color, texture and contour information of flower image. Secondly, Dictionary Learning and Locality-constrained Linear Coding (LLC) are operated on PHOW feature and then images...

chapter

Spatial collaborative representation for image categorization

Mouna Dammak, Chokri Ben Amar

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 3776 - 3781

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

A novel proposed approach, collaborative representation-based classification, has been developed for face recognition and recently used in image classification task owing to its simplicity and effectiveness. The major drawback of this method is the neglect of the spatial structure among the image representations. Inspired by the success of this technique and motivated by the power of spatial information...

chapter

Action Recognition Based on Local Fisher Discriminant Analysis and Mix Encoding

Lijun Li, Shuling Dai

2016 International Conference on Virtual Reality and Visualization (ICVRV) > 16 - 23

2016 International Conference on Virtual Reality and Visualization (ICVRV)

Action recognition has been one of the most popular fields of computer vision. This paper presents a novel approach to action recognition problem using the dimension reduction method, local fisher discriminant analysis, to reduce the dimension of feature descriptors as the preprocessing step after feature extraction. We propose to use sparse matrix and randomized kd-tree to modify and accelerate the...

chapter

RGB-D Visual Search with Compact Binary Codes

Alioscia Petrelli, Danilo Pau, Emanuele Plebani, Luigi Di Stefano

2015 International Conference on 3D Vision > 82 - 90

2015 International Conference on 3D Vision (3DV)

As integration of depth sensing into mobile devices is likely forthcoming, we investigate on merging appearance and shape information for mobile visual search. Accordingly, we propose an RGB-D search engine architecture that can attain high recognition rates with peculiarly moderate bandwidth requirements. Our experiments include a comparison to the CDVS (Compact Descriptors for Visual Search) pipeline,...

chapter

A Performance Evaluation on Action Recognition with Local Features

Xiantong Zhen, Ling Shao

2014 22nd International Conference on Pattern Recognition > 4495 - 4500

2014 22nd International Conference on Pattern Recognition (ICPR)

Local features have played an important role in visual recognition. Methods based on local features, e.g., the bag-of-words (BoW) model and sparse coding, have shown their effectiveness in image and object recognition in the past decades. Recently, many new techniques, including the improvements of BoW and sparse coding as well as the non-parametric naive bayes nearest neighbor (NBNN) classifier,...

chapter

A spectral based visual matching method for image classification

Yan Song, Wu Guo, Li-Rong Dai, Ian Vince Mcloughlin

2014 International Conference on Audio, Language and Image Processing > 666 - 670

2014 International Conference on Audio, Language and Image Processing (ICALIP)

Visual matching algorithms can be described in terms of visual content representation and similarity measure. With local feature based representations, visual matching can be restated as: 1) how to obtain visual similarity from the local kernel matrix, and 2) how to calculate the local kernel matrix effectively and efficiently. Existing methods mostly focus on the former, and use Euclidean distance...

chapter

Robust Feature Encoding with Neighborhood Information for Image Classification

Bingyuan Liu, Jing Liu, Chunjie Zhang, Maolin Chen, more

2013 Seventh International Conference on Image and Graphics > 880 - 885

2013 Seventh International Conference on Image and Graphics (ICIG)

The bag of visual words (BoW) model is one of the most successful model in image classification task. However, the major problem of the BoW model lies in the determination of visual words, which consists of codebook training and feature encoding phases. The traditional K-means and hard-assignment method completely ignore the structure of the local feature space, leading to high loss of information...

chapter

Fuzzy clustering based encoding for Visual Object Classification

Danilo Dell'Agnello, Gustavo Carneiro, Tat-Jun Chin, Giovanna Castellano, more

2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS) > 1439 - 1444

2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS)

Nowadays the bag-of-visual-words is a very popular approach to perform the task of Visual Object Classification (VOC). Two key phases of VOC are the vocabulary building step, i.e. the construction of a ‘visual dictionary’ including common codewords in the image corpus, and the assignment step, i.e. the encoding of the images by means of these codewords. Hard assignment of image descriptors to visual...

chapter

Large-scale web video event classification by use of Fisher Vectors

Chen Sun, Ram Nevatia

2013 IEEE Workshop on Applications of Computer Vision (WACV) > 15 - 22

2013 IEEE Workshop on Applications of Computer Vision (WACV)

Event recognition has been an important topic in computer vision research due to its many applications. However, most of the work has focused on videos taken from a fixed camera, known environments and basic events. Here, we focus on classification of unconstrained, web videos into much higher level activities. We follow the approach of constructing fixed length feature vectors from local feature...

chapter

Classification of Urban Scenes from Geo-referenced Images in Urban Street-View Context

Corina Iovan, David Picard, Nicolas Thome, Matthieu Cord

2012 11th International Conference on Machine Learning and Applications > 2 > 339 - 344

2012 Eleventh International Conference on Machine Learning and Applications (ICMLA)

This paper addresses the challenging problem of scene classification in street-view georeferenced images of urban environments. More precisely, the goal of this task is semantic image classification, consisting in predicting in a given image, the presence or absence of a pre-defined class (e.g. shops, vegetation, etc.). The approach is based on the BOSSA representation, which enriches the Bag of Words...

chapter

Hierarchical matching with side information for image classification

Qiang Chen, Zheng Song, Yang Hua, Zhongyang Huang, more

2012 IEEE Conference on Computer Vision and Pattern Recognition > 3426 - 3433

2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we introduce a hierarchical matching framework with so-called side information for image classification based on bag-of-words representation. Each image is expressed as a bag of orderless pairs, each of which includes a local feature vector encoded over a visual dictionary, and its corresponding side information from priors or contexts. The side information is used for hierarchical clustering...

chapter

Using spatial pyramids with compacted VLAT for image categorization

Romain Negrel, David Picard, Philippe-Henri Gosselin

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 2460 - 2463

2012 21st International Conference on Pattern Recognition (ICPR)

In this paper, we propose a compact image signature based on VLAT. Our method integrates spatial information while significantly reducing the size of original VLAT by using two pojection steps. we carry out experiments showing our approach is competitive with state of the art signatures.

chapter

Indian Classical Dance classification by learning dance pose bases

Soumitra Samanta, Pulak Purkait, Bhabatosh Chanda

2012 IEEE Workshop on the Applications of Computer Vision (WACV) > 265 - 270

2012 IEEE Workshop on Applications of Computer Vision (WACV)

In this paper, we address an interesting application of computer vision technique, namely classification of Indian Classical Dance (ICD). With the best of our knowledge, the problem has not been addressed so far in computer vision domain. To deal with this problem, we use a sparse representation based dictionary learning technique. First, we represent each frame of a dance video by a pose descriptor...

chapter

Compact correlation coding for visual object categorization

Nobuyuki Morioka, Shin'ichi Satoh

2011 International Conference on Computer Vision > 1639 - 1646

2011 IEEE International Conference on Computer Vision (ICCV)

Spatial relationships between local features are thought to play a vital role in representing object categories. However, learning a compact set of higher-order spatial features based on visual words, e.g., doublets and triplets, remains a challenging problem as possible combinations of visual words grow exponentially. While the local pairwise codebook achieves a compact codebook of pairs of spatially...

chapter

Spatial Coordinate Coding to reduce histogram representations, Dominant Angle and Colour Pyramid Match

Piotr Koniusz, Krystian Mikolajczyk

2011 18th IEEE International Conference on Image Processing > 661 - 664

2011 18th IEEE International Conference on Image Processing (ICIP 2011)

Spatial Pyramid Match lies at a heart of modern object category recognition systems. Once image descriptors are expressed as histograms of visual words, they are further deployed across spatial pyramid with coarse-to-fine spatial location grids. However, such representation results in extreme histogram vectors of 200K or more elements increasing computational and memory requirements. This paper investigates...

chapter

Soft assignment of visual words as Linear Coordinate Coding and optimisation of its reconstruction error

Piotr Koniusz, Krystian Mikolajczyk

2011 18th IEEE International Conference on Image Processing > 2413 - 2416

2011 18th IEEE International Conference on Image Processing (ICIP 2011)

Visual Word Uncertainty also referred to as Soft Assignment is a well established technique for representing images as histograms by flexible assignment of image descriptors to a visual vocabulary. Recently, an attention of the community dealing with the object category recognition has been drawn to Linear Coordinate Coding methods. In this work, we focus on Soft Assignment as it yields good results...

chapter

Large-scale image retrieval with compressed Fisher vectors

F Perronnin, Yan Liu, J Sánchez, H Poirier

2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition > 3384 - 3391

2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The problem of large-scale image search has been traditionally addressed with the bag-of-visual-words (BOV). In this article, we propose to use as an alternative the Fisher kernel framework. We first show why the Fisher representation is well-suited to the retrieval problem: it describes an image by what makes it different from other images. One drawback of the Fisher vector is that it is high-dimensional...

chapter

Human Action Recognition Using Salient Opponent-Based Motion Features

Amir-Hossein Shabani, John S Zelek, David A Clausi

2010 Canadian Conference on Computer and Robot Vision > 362 - 369

2010 Seventh Canadian Conference on Computer and Robot Vision (CRV 2010)

Human action recognition can be performed using multiscale salient features which encode the local events in the video. Existing feature extraction methods use non-causal spatio-temporal filtering, and hence, they are not biologically plausible. To address this inconsistency, new features extracted from a biologically plausible perception model are introduced. In this model, the opponent-based motion...

Data set:
ieee
Keywords:
KERNEL
VISUALIZATION
ENCODING
Publication type:
book

Publication date

Set your own date range

Keywords

FEATURE EXTRACTION (10)
IMAGE CODING (8)
VECTORS (8)
HISTOGRAMS (5)
IMAGE CLASSIFICATION (4)
TRAINING (4)
BAGS-OF-WORDS (2)
COMPUTER VISION (2)
DICTIONARIES (2)
FEATURE ENCODING (2)
HUMAN ACTION RECOGNITION (2)
IMAGE MOTION ANALYSIS (2)
IMAGE REPRESENTATION (2)
SOFT ASSIGNMENT (2)
SPARSE CODING (2)
STANDARDS (2)
SUPPORT VECTOR MACHINES (2)
TENSILE STRESS (2)
VOCABULARY (2)
3D INTEREST POINTS EXTRACTION (1)
ACCURACY (1)
ACTION RECOGNITION (1)
APPROXIMATION METHODS (1)
BAG OF WORDS REPRESENTATION (1)
BAG-OF-VISUAL WORDS (1)
BAG-OF-WORDS APPROACH (1)
BAG-OF-WORDS MODEL (1)
BENCHMARK TESTING (1)
BINARIZATION APPROACH (1)
BIO-INSPIRED TIME-CAUSAL FILTERING (1)
BIOLOGICAL SYSTEM MODELING (1)
BIRDS (1)
CAMERAS (1)
CAUSAL SCALE-SPACE FILTERING (1)
CLUSTERING ALGORITHMS (1)
COLLABORATION (1)
COLOUR (1)
COMPRESSED FISHER VECTORS (1)
COMPUTATIONAL MODELING (1)
CONTEXT (1)
CONVOLUTIONAL CODES (1)
COORDINATE CODING (1)
CORRELATION (1)
DATA COMPRESSION (1)
DESCRIPTOR RECONSTRUCTION ERROR (1)
DIMENSION REDUCTION (1)
DOMINANT ANGLE (1)
EQUATIONS (1)
FISHER KERNEL FRAMEWORK (1)
FISHER REPRESENTATION (1)
FLOWER CLASSIFICATION (1)
GESTURE RECOGNITION (1)
GRAPH EMBEDDING (1)
HEAD (1)
HUMANS (1)
IMAGE COLOR ANALYSIS (1)
IMAGE EDGE DETECTION (1)
IMAGE RECOGNITION (1)
IMAGE RETRIEVAL (1)
JOINTS (1)
K-MEANS ALGORITHM (1)
KERNEL-BASED MACHINE LEARNING (1)
KTH ACTION DATASET (1)
LARGE-SCALE IMAGE RETRIEVAL (1)
LARGE-SCALE IMAGE SEARCH (1)
LAYOUT (1)
LLC (1)
MATCH KERNEL (1)
MATHEMATICAL MODEL (1)
MEMORY FOOTPRINT (1)
MOBILE COMMUNICATION (1)
MOBILE HANDSETS (1)
MOTION ENERGY MAP (1)
MULTISCALE SALIENT FEATURES (1)
OBJECT RECOGNITION (1)
OPPONENT-BASED MOTION ENERGY (1)
OPPONENT-BASED MOTION FEATURES (1)
ORIENTED MOTION FILTERS (1)
PHOW (1)
PIPELINES (1)
PROTOTYPES (1)
RGB-D VISUAL SEARCH (1)
ROBUSTNESS (1)
SALIENT OPPONENT-BASED MOTION FEATURES (1)
SEARCH ENGINES (1)
SEMANTIC IMAGE CLASSIFICATION (1)
SEMANTICS (1)
SENSORS (1)
SIMILARITY-PRESERVING HASHING (1)
SPATIAL PYRAMID MATCH (1)
SPATIAL PYRAMID MATCHING (1)
SPATIO-TEMPORAL SALINET FEATURES (1)
SPECTRAL ANALYSIS (1)
SPP (1)
STANDARD COMPRESSION TECHNIQUE (1)
STREET-LEVEL IMAGES (1)
SVM (1)
more

INFONA - science communication portal

Search results

Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization

Generalized Orderless Pooling Performs Implicit Salient Matching

Flower classification using fusion descriptor and SVM

Spatial collaborative representation for image categorization

Action Recognition Based on Local Fisher Discriminant Analysis and Mix Encoding

RGB-D Visual Search with Compact Binary Codes

A Performance Evaluation on Action Recognition with Local Features

A spectral based visual matching method for image classification

Robust Feature Encoding with Neighborhood Information for Image Classification

Fuzzy clustering based encoding for Visual Object Classification

Large-scale web video event classification by use of Fisher Vectors

Classification of Urban Scenes from Geo-referenced Images in Urban Street-View Context

Hierarchical matching with side information for image classification

Using spatial pyramids with compacted VLAT for image categorization

Indian Classical Dance classification by learning dance pose bases

Compact correlation coding for visual object categorization

Spatial Coordinate Coding to reduce histogram representations, Dominant Angle and Colour Pyramid Match

Soft assignment of visual words as Linear Coordinate Coding and optimisation of its reconstruction error

Large-scale image retrieval with compressed Fisher vectors

Human Action Recognition Using Salient Opponent-Based Motion Features

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options