2017 IEEE International Conference on Computer Vision (ICCV)

chapter

Temporal Tessellation: A Unified Approach for Video Analysis

Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf

2017 IEEE International Conference on Computer Vision (ICCV) > 94 - 104

We present a general approach to video understanding, inspired by semantic transfer techniques that have been successfully used for 2D image analysis. Our method considers a video to be a 1D sequence of clips, each one associated with its own semantics. The nature of these semantics – natural language captions or other labels – depends on the task at hand. A test video is processed by forming correspondences...

chapter

A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework

Weixin Luo, Wen Liu, Shenghua Gao

2017 IEEE International Conference on Computer Vision (ICCV) > 341 - 349

Motivated by the capability of sparse coding based anomaly detection, we propose a Temporally-coherent Sparse Coding (TSC) where we enforce that similar neighbouring frames be encoded with similar reconstruction coefficients. Then we map the TSC with a special type of stacked Recurrent Neural Network (sRNN). By taking advantage of sRNN in learning all parameters simultaneously, the nontrivial hyper-parameter...

chapter

Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization

Sijia Cai, Wangmeng Zuo, Lei Zhang

2017 IEEE International Conference on Computer Vision (ICCV) > 511 - 520

The success of fine-grained visual categorization (FGVC) relies heavily on the modeling of appearance and interactions of various semantic parts. This makes FGVC very challenging because: (i) part annotation and detection require expert guidance and are very expensive; (ii) parts are of different sizes; and (iii) the part interactions are complex and of higher-order. To address these issues, we...

chapter

SuBiC: A Supervised, Structured Binary Code for Image Search

Himalaya Jain, Joaquin Zepeda, Patrick Perez, Remi Gribonval

2017 IEEE International Conference on Computer Vision (ICCV) > 833 - 842

For large-scale visual search, highly compressed yet meaningful representations of images are essential. Structured vector quantizers based on product quantization and its variants are usually employed to achieve such compression while minimizing the loss of accuracy. Yet, unlike binary hashing schemes, these unsupervised methods have not yet benefited from the supervision, end-to-end learning and...

chapter

Genetic CNN

Lingxi Xie, Alan Yuille

2017 IEEE International Conference on Computer Vision (ICCV) > 1388 - 1397

The deep convolutional neural network (CNN) is the state-of-the-art solution for large-scale visual recognition. Following some basic principles such as increasing network depth and constructing highway connections, researchers have manually designed a lot of fixed network architectures and verified their effectiveness. In this paper, we discuss the possibility of learning deep network structures...

chapter

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

Amir Mazaheri, Dong Zhang, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 1416 - 1425

Given a video and a description sentence with one missing word (the “source sentence”), the Video-Fill-In-the-Blank (VFIB) problem is to find the missing word automatically. The contextual information of the sentence, as well as visual cues from the video, is important to infer the missing word accurately. Since the source sentence is broken into two fragments: the sentence’s left fragment (before the blank)...

chapter

Anchored Regression Networks Applied to Age Estimation and Super Resolution

Eirikur Agustsson, Radu Timofte, Luc Van Gool

2017 IEEE International Conference on Computer Vision (ICCV) > 1652 - 1661

We propose the Anchored Regression Network (ARN), a nonlinear regression network which can be seamlessly integrated into various networks or can be used stand-alone when the features have already been fixed. Our ARN is a smoothed relaxation of a piecewise linear regressor through the combination of multiple linear regressors over soft assignments to anchor points. When the anchor points are fixed...

chapter

Group Re-identification via Unsupervised Transfer of Sparse Features Encoding

Giuseppe Lisanti, Niki Martinel, Alberto Del Bimbo, Gian Luca Foresti

2017 IEEE International Conference on Computer Vision (ICCV) > 2468 - 2477

Person re-identification is best known as the problem of associating a single person that is observed from one or more disjoint cameras. The existing literature has mainly addressed such an issue, neglecting the fact that people usually move in groups, like in crowded scenarios. We believe that the additional information carried by neighboring individuals provides a relevant visual context that can...

chapter

Scene Parsing with Global Context Embedding

Wei-Chih Hung, Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 2650 - 2658

We present a scene parsing method that utilizes global context information based on both the parametric and nonparametric models. Compared to previous methods that only exploit the local relationship between objects, we train a context network based on scene similarities to generate feature representations for global contexts. In addition, these learned features are utilized to generate global and...

chapter

DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding

Dieu Linh Tran, Robert Walecki, Ognjen Rudovic, Stefanos Eleftheriadis, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 3209 - 3218

The human face exhibits an inherent hierarchy in its representations (i.e., holistic facial expressions can be encoded via a set of facial action units (AUs) and their intensity). Variational (deep) auto-encoders (VAE) have shown great results in unsupervised extraction of hierarchical latent representations from large amounts of image data, while being robust to noise and other undesired artifacts. Potentially,...

chapter

Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval

Yuming Shen, Li Liu, Ling Shao, Jingkuan Song

2017 IEEE International Conference on Computer Vision (ICCV) > 4117 - 4126

Cross-modal hashing is usually regarded as an effective technique for large-scale textual-visual cross retrieval, where data from different modalities are mapped into a shared Hamming space for matching. Most of the traditional textual-visual binary encoding methods only consider holistic image representations and fail to model descriptive sentences. This renders existing methods inappropriate to...

chapter

Consensus Convolutional Sparse Coding

Biswarup Choudhury, Robin Swanson, Felix Heide, Gordon Wetzstein, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4290 - 4298

Convolutional sparse coding (CSC) is a promising direction for unsupervised learning in computer vision. In contrast to recent supervised methods, CSC allows for convolutional image representations to be learned that are equally useful for high-level vision tasks and low-level image reconstruction and can be applied to a wide range of tasks without problem-specific retraining. Due to their extreme...

chapter

AMTnet: Action-Micro-Tube Regression by End-to-end Trainable Deep Architecture

Suman Saha, Gurkirt Singh, Fabio Cuzzolin

2017 IEEE International Conference on Computer Vision (ICCV) > 4424 - 4433

Dominant approaches to action detection can only provide sub-optimal solutions to the problem, as they rely on seeking frame-level detections, to later compose them into ‘action tubes’ in a post-processing step. With this paper we radically depart from current practice, and take a first step towards the design and implementation of a deep network architecture able to classify and regress whole video...

chapter

Constrained Convolutional Sparse Coding for Parametric Based Reconstruction of Line Drawings

Sara Shaheen, Lama Affara, Bernard Ghanem

2017 IEEE International Conference on Computer Vision (ICCV) > 4434 - 4442

Convolutional sparse coding (CSC) plays an essential role in many computer vision applications ranging from image compression to deep learning. In this work, we shed light on a new application where CSC can effectively serve, namely line drawing analysis. The process of drawing a line drawing can be approximated as the sparse spatial localization of a number of typical basic strokes, which in...

chapter

AnnArbor: Approximate Nearest Neighbors Using Arborescence Coding

Artem Babenko, Victor Lempitsky

2017 IEEE International Conference on Computer Vision (ICCV) > 4895 - 4903

To compress large datasets of high-dimensional descriptors, modern quantization schemes learn multiple codebooks and then represent individual descriptors as combinations of codewords. Once the codebooks are learned, these schemes encode descriptors independently. In contrast to that, we present a new coding scheme that arranges dataset descriptors into a set of arborescence graphs, and then encodes...

chapter

Locally-Transferred Fisher Vectors for Texture Classification

Yang Song, Fan Zhang, Qing Li, Heng Huang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4922 - 4930

Texture classification has been extensively studied in computer vision. Recent research shows that the combination of Fisher vector (FV) encoding and convolutional neural network (CNN) provides significant improvement in texture classification over the previous feature representation methods. However, by truncating the CNN model at the last convolutional layer, the CNN-based FV descriptors would not...

chapter

Generalized Orderless Pooling Performs Implicit Salient Matching

Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4970 - 4979

Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to “α-pooling”, allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions...

chapter

Rotation Equivariant Vector Field Networks

Diego Marcos, Michele Volpi, Nikos Komodakis, Devis Tuia

2017 IEEE International Conference on Computer Vision (ICCV) > 5058 - 5067

In many computer vision tasks, we expect a particular behavior of the output with respect to rotations of the input image. If this relationship is explicitly encoded, instead of treated as any other variation, the complexity of the problem is decreased, leading to a reduction in the size of the required model. In this paper, we propose the Rotation Equivariant Vector Field Networks (RotEqNet), a Convolutional...

chapter

Offline Handwritten Signature Modeling and Verification Based on Archetypal Analysis

Elias N. Zois, Ilias Theodorakopoulos, George Economou

2017 IEEE International Conference on Computer Vision (ICCV) > 5515 - 5524

The handwritten signature is perhaps the most accustomed way for the acknowledgement of the consent of an individual or the authentication of the identity of a person in numerous transactions. In addition, the authenticity of a questioned offline or static handwritten signature still poses a case of interest, especially in forensic related applications. A common approach in offline signature verification...

chapter

HashNet: Deep Learning to Hash by Continuation

Zhangjie Cao, Mingsheng Long, Jianmin Wang, Philip S. Yu

2017 IEEE International Conference on Computer Vision (ICCV) > 5609 - 5618

Learning to hash has been widely applied to approximate nearest neighbor search for large-scale multimedia retrieval, due to its computation efficiency and retrieval quality. Deep learning to hash, which improves retrieval quality by end-to-end representation learning and hash encoding, has received increasing attention recently. Subject to the ill-posed gradient difficulty in the optimization with...
