2017 IEEE International Conference on Computer Vision (ICCV)

chapter

CAD Priors for Accurate and Flexible Instance Reconstruction

Tolga Birdal, Slobodan Ilic

2017 IEEE International Conference on Computer Vision (ICCV) > 133 - 142

We present an efficient and automatic approach for accurate instance reconstruction of big 3D objects from multiple, unorganized and unstructured point clouds, in presence of dynamic clutter and occlusions. In contrast to conventional scanning, where the background is assumed to be rather static, we aim at handling dynamic clutter where the background drastically changes during object scanning. Currently,...

chapter

Hard-Aware Deeply Cascaded Embedding

Yuhui Yuan, Kuiyuan Yang, Chao Zhang

2017 IEEE International Conference on Computer Vision (ICCV) > 814 - 823

2017 IEEE International Conference on Computer Vision (ICCV)

Riding on the waves of deep neural networks, deep metric learning has achieved promising results in various tasks by using triplet network or Siamese network. Though the basic goal of making images from the same category closer than the ones from different categories is intuitive, it is hard to optimize the objective directly due to the quadratic or cubic sample size. Hard example mining is widely...

chapter

Factorized Bilinear Models for Image Recognition

Yanghao Li, Naiyan Wang, Jiaying Liu, Xiaodi Hou

2017 IEEE International Conference on Computer Vision (ICCV) > 2098 - 2106

2017 IEEE International Conference on Computer Vision (ICCV)

Although Deep Convolutional Neural Networks (CNNs) have liberated their power in various computer vision tasks, the most important components of CNN, convolutional layers and fully connected layers, are still limited to linear transformations. In this paper, we propose a novel Factorized Bilinear (FB) layer to model the pairwise feature interactions by considering the quadratic terms in the transformations...

chapter

What Actions are Needed for Understanding Human Actions in Videos?

Gunnar A. Sigurdsson, Olga Russakovsky, Abhinav Gupta

2017 IEEE International Conference on Computer Vision (ICCV) > 2156 - 2165

2017 IEEE International Conference on Computer Vision (ICCV)

What is the right way to reason about human activities? What directions forward are most promising? In this work, we analyze the current state of human activity understanding in videos. The goal of this paper is to examine datasets, evaluation metrics, algorithms, and potential future directions. We look at the qualitative attributes that define activities such as pose variability, brevity, and density...

chapter

MUTAN: Multimodal Tucker Fusion for Visual Question Answering

Hedi Ben-younes, Remi Cadene, Matthieu Cord, Nicolas Thome

2017 IEEE International Conference on Computer Vision (ICCV) > 2631 - 2639

2017 IEEE International Conference on Computer Vision (ICCV)

Bilinear models provide an appealing framework for mixing and merging information in Visual Question Answering (VQA) tasks. They help to learn high level associations between question meaning and visual concepts in the image, but they suffer from huge dimensionality issues.,,We introduce MUTAN, a multimodal tensor-based Tucker decomposition to efficiently parametrize bilinear interactions between...

chapter

SGN: Sequential Grouping Networks for Instance Segmentation

Shu Liu, Jiaya Jia, Sanja Fidler, Raquel Urtasun

2017 IEEE International Conference on Computer Vision (ICCV) > 3516 - 3524

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we propose Sequential Grouping Networks (SGN) to tackle the problem of object instance segmentation. SGNs employ a sequence of neural networks, each solving a sub-grouping problem of increasing semantic complexity in order to gradually compose objects out of pixels. In particular, the first network aims to group pixels along each image row and column by predicting horizontal and vertical...

chapter

From Square Pieces to Brick Walls: The Next Challenge in Solving Jigsaw Puzzles

Shir Gur, Ohad Ben-Shahar

2017 IEEE International Conference on Computer Vision (ICCV) > 4049 - 4057

2017 IEEE International Conference on Computer Vision (ICCV)

Research into computational jigsaw puzzle solving, an emerging theoretical problem with numerous applications, has focused in recent years on puzzles that constitute square pieces only. In this paper we wish to extend the scientific scope of appearance-based puzzle solving and consider ’’brick wall” jigsaw puzzles – rectangular pieces who may have different sizes, and could be placed next to each...

chapter

Unsupervised Video Understanding by Reconciliation of Posture Similarities

Timo Milbich, Miguel Bautista, Ekaterina Sutter, Bjorn Ommer

2017 IEEE International Conference on Computer Vision (ICCV) > 4404 - 4414

2017 IEEE International Conference on Computer Vision (ICCV)

Understanding human activity and being able to explain it in detail surpasses mere action classification by far in both complexity and value. The challenge is thus to describe an activity on the basis of its most fundamental constituents, the individual postures and their distinctive transitions. Supervised learning of such a fine-grained representation based on elementary poses is very tedious and...

chapter

Learning from Video and Text via Large-Scale Discriminative Clustering

Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, more

2017 IEEE International Conference on Computer Vision (ICCV) > 5267 - 5276

2017 IEEE International Conference on Computer Vision (ICCV)

Discriminative clustering has been successfully applied to a number of weakly supervised learning tasks. Such applications include person and action recognition, text-to-video alignment, object co-segmentation and co-localization in videos and images. One drawback of discriminative clustering, however, is its limited scalability. We address this issue and propose an online optimization algorithm based...

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV)

CAD Priors for Accurate and Flexible Instance Reconstruction

Hard-Aware Deeply Cascaded Embedding

Factorized Bilinear Models for Image Recognition

What Actions are Needed for Understanding Human Actions in Videos?

MUTAN: Multimodal Tucker Fusion for Visual Question Answering

SGN: Sequential Grouping Networks for Instance Segmentation

From Square Pieces to Brick Walls: The Next Challenge in Solving Jigsaw Puzzles

Unsupervised Video Understanding by Reconciliation of Posture Similarities

Learning from Video and Text via Large-Scale Discriminative Clustering

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Computer Vision (ICCV)