Search results

Learning Video Object Segmentation with Visual Memory

Pavel Tokmakov, Karteek Alahari, Cordelia Schmid

2017 IEEE International Conference on Computer Vision (ICCV) > 4491 - 4500

This paper addresses the task of segmenting moving objects in unconstrained videos. We introduce a novel two-stream neural network with an explicit memory module to achieve this. The two streams of the network encode spatial and temporal features in a video sequence respectively, while the memory module captures the evolution of objects over time. The module to build a “visual memory” in video, i...

Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks

Jae Shin Yoon, Francois Rameau, Junsik Kim, Seokju Lee, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2186 - 2195

We propose a novel video object segmentation algorithm based on pixel-level matching using Convolutional Neural Networks (CNN). Our network aims to distinguish the target area from the background on the basis of the pixel-level similarity between two object units. The proposed network represents a target object using features from different depth layers in order to take advantage of both the spatial...

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection

Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1819 - 1828

In this paper, we investigate a weakly-supervised object detection framework. Most existing frameworks focus on using static images to learn object detectors. However, these detectors often fail to generalize to videos because of the existing domain shift. Therefore, we investigate learning these detectors directly from boring videos of daily activities. Instead of using bounding boxes, we explore...

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

Amir Mazaheri, Dong Zhang, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 1416 - 1425

Given a video and a description sentence with one missing word (the “source sentence”), the Video-Fill-In-the-Blank (VFIB) problem is to find the missing word automatically. The contextual information of the sentence, as well as visual cues from the video, are important to infer the missing word accurately. Since the source sentence is broken into two fragments: the sentence’s left fragment (before the blank)...

Genetic CNN

Lingxi Xie, Alan Yuille

2017 IEEE International Conference on Computer Vision (ICCV) > 1388 - 1397

The deep convolutional neural network (CNN) is the state-of-the-art solution for large-scale visual recognition. Following some basic principles such as increasing network depth and constructing highway connections, researchers have manually designed a lot of fixed network architectures and verified their effectiveness. In this paper, we discuss the possibility of learning deep network structures...

Look, Listen and Learn

Relja Arandjelovic, Andrew Zisserman

2017 IEEE International Conference on Computer Vision (ICCV) > 609 - 617

We consider the question: what can be learnt by looking at and listening to a large number of unlabelled videos? There is a valuable, but so far untapped, source of information contained in the video itself – the correspondence between the visual and the audio streams, and we introduce a novel “Audio-Visual Correspondence” learning task that makes use of this. Training visual and audio networks from...

Personalized Image Aesthetics

Jian Ren, Xiaohui Shen, Zhe Lin, Radomir Mech, more

2017 IEEE International Conference on Computer Vision (ICCV) > 638 - 647

Automatic image aesthetics rating has received a growing interest with the recent breakthrough in deep learning. Although many studies exist for learning a generic or universal aesthetics model, investigation of aesthetics models incorporating individual user’s preference is quite limited. We address this personalized aesthetics problem by showing that individual’s aesthetic preferences exhibit strong...

A Read-Write Memory Network for Movie Story Understanding

Seil Na, Sangho Lee, Jisung Kim, Gunhee Kim

2017 IEEE International Conference on Computer Vision (ICCV) > 677 - 685

We propose a novel memory network model named Read-Write Memory Network (RWMN) to perform question and answering tasks for large-scale, multimodal movie story understanding. The key focus of our RWMN model is to design the read network and the write network that consist of multiple convolutional layers, which enable memory read and write operations to have high capacity and flexibility. While existing...

Learning Gaze Transitions from Depth to Improve Video Saliency Estimation

George Leifman, Dmitry Rudoy, Tristan Swedish, Eduardo Bayro-Corrochano, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1707 - 1716

In this paper we introduce a novel Depth-Aware Video Saliency approach to predict human focus of attention when viewing videos that contain a depth map (RGBD) on a 2D screen. Saliency estimation in this scenario is highly important since in the near future 3D video content will be easily acquired yet hard to display. Despite considerable progress in 3D display technologies, most are still expensive...

Learning Background-Aware Correlation Filters for Visual Tracking

Hamed Kiani Galoogahi, Ashton Fagg, Simon Lucey

2017 IEEE International Conference on Computer Vision (ICCV) > 1144 - 1152

Correlation Filters (CFs) have recently demonstrated excellent performance in terms of rapidly tracking objects under challenging photometric and geometric variations. The strength of the approach comes from its ability to efficiently learn - on the fly - how the object is changing over time. A fundamental drawback to CFs, however, is that the background of the target is not modeled over time which...

MarioQA: Answering Questions by Watching Gameplay Videos

Jonghwan Mun, Paul Hongsuck Seo, Ilchae Jung, Bohyung Han

2017 IEEE International Conference on Computer Vision (ICCV) > 2886 - 2894

We present a framework to analyze various aspects of models for video question answering (VideoQA) using customizable synthetic datasets, which are constructed automatically from gameplay videos. Our work is motivated by the fact that existing models are often tested only on datasets that require excessively high-level reasoning or mostly contain instances accessible through single frame inferences...

Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning

Berkan Demirel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis

2017 IEEE International Conference on Computer Vision (ICCV) > 1241 - 1250

We propose a novel approach for unsupervised zero-shot learning (ZSL) of classes based on their names. Most existing unsupervised ZSL methods aim to learn a model for directly comparing image features and class names. However, this proves to be a difficult task due to dominance of non-visual semantics in underlying vector-space embeddings of class names. To address this issue, we discriminatively...

Areas of Attention for Image Captioning

Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek

2017 IEEE International Conference on Computer Vision (ICCV) > 1251 - 1259

We propose “Areas of Attention”, a novel attention-based model for automatic image captioning. Our approach models the dependencies between image regions, caption words, and the state of an RNN language model, using three pairwise interactions. In contrast to previous attention-based approaches that associate image regions only to the RNN state, our method allows a direct association between caption...

CREST: Convolutional Residual Learning for Visual Tracking

Yibing Song, Chao Ma, Lijun Gong, Jiawei Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2574 - 2583

Discriminative correlation filters (DCFs) have been shown to perform superiorly in visual tracking. They only need a small set of training samples from the initial frame to generate an appearance model. However, existing DCFs learn the filters separately from feature extraction, and update these filters using a moving average operation with an empirical weight. These DCF trackers hardly benefit from...

Increasing CNN Robustness to Occlusions by Reducing Filter Support

Elad Osherov, Michael Lindenbaum

2017 IEEE International Conference on Computer Vision (ICCV) > 550 - 561

Convolutional neural networks (CNNs) provide the current state of the art in visual object classification, but they are far less accurate when classifying partially occluded objects. A straightforward way to improve classification under occlusion conditions is to train the classifier using partially occluded object examples. However, training the network on many combinations of object instances and...

VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization

Saihui Hou, Yushan Feng, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 541 - 549

In this paper, we propose a novel domain-specific dataset named VegFru for fine-grained visual categorization (FGVC). While the existing datasets for FGVC are mainly focused on animal breeds or man-made objects with limited labelled data, VegFru is a larger dataset consisting of vegetables and fruits which are closely associated with the daily life of everyone. Aiming at domestic cooking and food...

Learning Policies for Adaptive Tracking with Deep Feature Cascades

Chen Huang, Simon Lucey, Deva Ramanan

2017 IEEE International Conference on Computer Vision (ICCV) > 105 - 114

Visual object tracking is a fundamental and time-critical vision task. Recent years have seen many shallow tracking methods based on real-time pixel-based correlation filters, as well as deep methods that have top performance but need a high-end GPU. In this paper, we learn to improve the speed of deep trackers without losing accuracy. Our fundamental insight is to take an adaptive approach, where...

Detecting defects in sub-skin-depth metallic layers by a thermo-elastic sensor

Shant Arakelyan, Hanju Lee, Kiejin Lee, more

2017 IEEE SENSORS > 1 - 3

The visualization of the microwave radiation intensity in the middle-field region by the thermo-elastic optical indicator microscopy (TEOIM) system for the sub-skin-depth metallic layers was performed. An indicator with the 10 nm Al deposition for the 50 GHz microwave radiation intensity visualization was used. Modeled simulation results were in good agreement with the experimental data. A defect...

A quasi-panoramic bio-inspired eye for flying parallel to walls

Erik Vanhoutte, Franck Ruffier, Julien Serres

2017 IEEE SENSORS > 1 - 3

In this study, a quasi-panoramic bio-inspired eye dedicated to optic flow measurement on board micro flying robots is presented. It will allow future micro flying robots to mimic honeybees' navigational tasks which work without any metric sensors. An innovative optic flow-based algorithm was tested in the horizontal plane to measure the robot's incidence angle when travelling along a wall. Experimental...

Live demonstration: High-fidelity brain electrical activity from automatic noise cancelling tripolar concentric ring electrode sensor

Walter Besio

2017 IEEE SENSORS > 1

Tripolar concentric ring electrode (TCRE) sensors have unique properties. These sensors have been used to acquire various bio-signals such as electroencephalography (EEG), electrocardiography (ECG), and electromyography (EMG). Compared to conventional disc electrode signals, TCRE EEG (tEEG) has four times better signal-to-noise ratio, eleven times better mutual information and spatial resolution....
