2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Temporal Action Localization by Structured Maximal Sums

Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3215 - 3223

We address the problem of temporal action localization in videos. We pose action localization as a structured prediction over arbitrary-length temporal windows, where each window is scored as the sum of frame-wise classification scores. Additionally, our model classifies the start, middle, and end of each action as separate components, allowing our system to explicitly model each actions temporal...

chapter

Visual Translation Embedding Network for Visual Relation Detection

Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3107 - 3115

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Visual relations, such as person ride bike and bike next to car, offer a comprehensive scene understanding of an image, and have already shown their great utility in connecting computer vision and natural language. However, due to the challenging combinatorial complexity of modeling subject-predicate-object relation triplets, very little work has been done to localize and predict visual relations...

chapter

Growing a Brain: Fine-Tuning by Increasing Model Capacity

Yu-Xiong Wang, Deva Ramanan, Martial Hebert

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3029 - 3038

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

CNNs have made an undeniable impact on computer vision through the ability to learn high-capacity models with large annotated training sets. One of their remarkable properties is the ability to transfer knowledge from a large source dataset to a (typically smaller) target dataset. This is usually accomplished through fine-tuning a fixed-size network on new target data. Indeed, virtually every contemporary...

chapter

Image Super-Resolution via Deep Recursive Residual Network

Ying Tai, Jian Yang, Xiaoming Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2790 - 2798

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, Convolutional Neural Network (CNN) based models have achieved great success in Single Image Super-Resolution (SISR). Owing to the strength of deep networks, these CNN models learn an effective nonlinear mapping from the low-resolution input image to the high-resolution target image, at the cost of requiring enormous parameters. This paper proposes a very deep CNN model (up to 52 convolutional...

chapter

Direct Photometric Alignment by Mesh Deformation

Kaimo Lin, Nianjuan Jiang, Shuaicheng Liu, Loong-Fah Cheong, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2701 - 2709

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The choice of motion models is vital in applications like image/video stitching and video stabilization. Conventional methods explored different approaches ranging from simple global parametric models to complex per-pixel optical flow. Mesh-based warping methods achieve a good balance between computational complexity and model flexibility. However, they typically require high quality feature correspondences...

chapter

Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs

Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodola, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5425 - 5434

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning has achieved a remarkable performance breakthrough in several fields, most notably in speech recognition, natural language processing, and computer vision. In particular, convolutional neural network (CNN) architectures currently produce state-of-the-art performance on a variety of image analysis tasks such as object detection and recognition. Most of deep learning research has so far...

chapter

Generative Hierarchical Learning of Sparse FRAME Models

Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1933 - 1941

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes a method for generative learning of hierarchical random field models. The resulting model, which we call the hierarchical sparse FRAME (Filters, Random field, And Maximum Entropy) model, is a generalization of the original sparse FRAME model by decomposing it into multiple parts that are allowed to shift their locations, scales and rotations, so that the resulting model becomes...

chapter

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Miguel A. Bautista, Artsiom Sanakoyeu, Bjorn Ommer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1923 - 1932

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Unsupervised learning of visual similarities is of paramount importance to computer vision, particularly due to lacking training data for fine-grained similarities. Deep learning of similarities is often based on relationships between pairs or triplets of samples. Many of these relations are unreliable and mutually contradicting, implying inconsistencies when trained without supervision information...

chapter

Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework

Jongyoo Kim, Sanghoon Lee

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1969 - 1977

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Since human observers are the ultimate receivers of digital images, image quality metrics should be designed from a human-oriented perspective. Conventionally, a number of full-reference image quality assessment (FR-IQA) methods adopted various computational models of the human visual system (HVS) from psychological vision science research. In this paper, we propose a novel convolutional neural networks...

chapter

More is Less: A More Complicated Network with Less Inference Complexity

Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1895 - 1903

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we present a novel and general network structure towards accelerating the inference process of convolutional neural networks, which is more complicated in network structure yet with less inference complexity. The core idea is to equip each original convolutional layer with another low-cost collaborative layer (LCCL), and the element-wise multiplication of the ReLU outputs of these two...

chapter

Robust Interpolation of Correspondences for Large Displacement Optical Flow

Yinlin Hu, Yunsong Li, Rui Song

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4791 - 4799

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The interpolation of correspondences (EpicFlow) was widely used for optical flow estimation in most-recent works. It has the advantage of edge-preserving and efficiency. However, it is vulnerable to input matching noise, which is inevitable in modern matching techniques. In this paper, we present a Robust Interpolation method of Correspondences (called RicFlow) to overcome the weakness. First, the...

chapter

Semantic Autoencoder for Zero-Shot Learning

Elyor Kodirov, Tao Xiang, Shaogang Gong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4447 - 4456

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing zero-shot learning (ZSL) models typically learn a projection function from a feature space to a semantic embedding space (e.g. attribute space). However, such a projection function is only concerned with predicting the training seen class semantic representation (e.g. attribute prediction) or classification. When applied to test data, which in the context of ZSL contains different (unseen)...

chapter

Probabilistic Temporal Subspace Clustering

Behnam Gholami, Vladimir Pavlovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4313 - 4322

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Subspace clustering is a common modeling paradigm used to identify constituent modes of variation in data with locally linear structure. These structures are common to many problems in computer vision, including modeling time series of complex human motion. However classical subspace clustering algorithms learn the relationships within a set of data without considering the temporal dependency and...

chapter

Compact Matrix Factorization with Dependent Subspaces

Viktor Larsson, Carl Olsson

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4361 - 4370

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Traditional matrix factorization methods approximate high dimensional data with a low dimensional subspace. This imposes constraints on the matrix elements which allow for estimation of missing entries. A lower rank provides stronger constraints and makes estimation of the missing entries less ambiguous at the cost of measurement fit. In this paper we propose a new factorization model that further...

chapter

Low-Rank Bilinear Pooling for Fine-Grained Classification

Shu Kong, Charless Fowlkes

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7025 - 7034

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Pooling second-order local feature statistics to form a high-dimensional bilinear feature has been shown to achieve state-of-the-art performance on a variety of fine-grained classification tasks. To address the computational demands of high feature dimensionality, we propose to represent the covariance features as a matrix and apply a low-rank bilinear classifier. The resulting classifier can be evaluated...

chapter

Seeing into Darkness: Scotopic Visual Recognition

Bo Chen, Pietro Perona

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7292 - 7301

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Images are formed by counting how many photons traveling from a given set of directions hit an image sensor during a given time interval. When photons are few and far in between, the concept of image breaks down and it is best to consider directly the flow of photons. Computer vision in this regime, which we call scotopic, is radically different from the classical image-based paradigm in that visual...

chapter

SST: Single-Stream Temporal Action Proposals

Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6373 - 6382

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Our paper presents a new approach for temporal detection of human actions in long, untrimmed video sequences. We introduce Single-Stream Temporal Action Proposals (SST), a new effective and efficient deep architecture for the generation of temporal action proposals. Our network can run continuously in a single stream over very long input video sequences, without the need to divide input into short...

chapter

Adversarially Tuned Scene Generation

Vsr Veeravasarapu, Constantin Rothkopf, Ramesh Visvanathan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6441 - 6449

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Generalization performance of trained computer vision (CV) systems that use computer graphics (CG) generated data is not yet effective due to the concept of domain-shift between virtual and real data. Although simulated data augmented with a few real-world samples has been shown to mitigate domain shift and improve transferability of trained models, guiding or bootstrapping the virtual data generation...

chapter

SCC: Semantic Context Cascade for Efficient Action Detection

Fabian Caba Heilbron, Wayner Barrios, Victor Escorcia, Bernard Ghanem

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3175 - 3184

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Despite the recent advances in large-scale video analysis, action detection remains as one of the most challenging unsolved problems in computer vision. This snag is in part due to the large volume of data that needs to be analyzed to detect actions in videos. Existing approaches have mitigated the computational cost, but still, these methods lack rich high-level semantics that helps them to localize...

chapter

Knowledge Acquisition for Visual Question Answering via Iterative Querying

Yuke Zhu, Joseph J. Lim, Li Fei-Fei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6146 - 6155

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Humans possess an extraordinary ability to learn new skills and new knowledge for problem solving. Such learning ability is also required by an automatic model to deal with arbitrary, open-ended questions in the visual world. We propose a neural-based approach to acquiring task-driven information for visual question answering (VQA). Our model proposes queries to actively acquire relevant information...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Temporal Action Localization by Structured Maximal Sums

Visual Translation Embedding Network for Visual Relation Detection

Growing a Brain: Fine-Tuning by Increasing Model Capacity

Image Super-Resolution via Deep Recursive Residual Network

Direct Photometric Alignment by Mesh Deformation

Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs

Generative Hierarchical Learning of Sparse FRAME Models

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework

More is Less: A More Complicated Network with Less Inference Complexity

Robust Interpolation of Correspondences for Large Displacement Optical Flow

Semantic Autoencoder for Zero-Shot Learning

Probabilistic Temporal Subspace Clustering

Compact Matrix Factorization with Dependent Subspaces

Low-Rank Bilinear Pooling for Fine-Grained Classification

Seeing into Darkness: Scotopic Visual Recognition

SST: Single-Stream Temporal Action Proposals

Adversarially Tuned Scene Generation

SCC: Semantic Context Cascade for Efficient Action Detection

Knowledge Acquisition for Visual Question Answering via Iterative Querying

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)