Search results

chapter

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection

Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1819 - 1828

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we investigate a weakly-supervised object detection framework. Most existing frameworks focus on using static images to learn object detectors. However, these detectors often fail to generalize to videos because of the existing domain shift. Therefore, we investigate learning these detectors directly from boring videos of daily activities. Instead of using bounding boxes, we explore...

chapter

Areas of Attention for Image Captioning

Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek

2017 IEEE International Conference on Computer Vision (ICCV) > 1251 - 1259

2017 IEEE International Conference on Computer Vision (ICCV)

We propose “Areas of Attention”, a novel attentionbased model for automatic image captioning. Our approach models the dependencies between image regions, caption words, and the state of an RNN language model, using three pairwise interactions. In contrast to previous attentionbased approaches that associate image regions only to the RNN state, our method allows a direct association between caption...

chapter

Soft Proposal Networks for Weakly Supervised Object Localization

Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1859 - 1868

2017 IEEE International Conference on Computer Vision (ICCV)

Weakly supervised object localization remains challenging, where only image labels instead of bounding boxes are available during training. Object proposal is an effective component in localization, but often computationally expensive and incapable of joint optimization with some of the remaining modules. In this paper, to the best of our knowledge, we for the first time integrate weakly supervised...

chapter

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4146 - 4154

2017 IEEE International Conference on Computer Vision (ICCV)

The region-based Convolutional Neural Network (CNN) detectors such as Faster R-CNN or R-FCN have already shown promising results for object detection by combining the region proposal subnetwork and the classification subnetwork together. Although R-FCN has achieved higher detection speed while keeping the detection performance, the global structure information is ignored by the position-sensitive...

chapter

What is and What is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors

Changqun Xia, Jia Li, Xiaowu Chen, Anlin Zheng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4399 - 4407

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Finding what is and what is not a salient object can be helpful in developing better features and models in salient object detection (SOD). In this paper, we investigate the images that are selected and discarded in constructing a new SOD dataset and find that many similar candidates, complex shape and low objectness are three main attributes of many non-salient objects. Moreover, objects may have...

chapter

Discover and Learn New Objects from Documentaries

Kai Chen, Hang Song, Chen Change Loy, Dahua Lin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1111 - 1120

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Despite the remarkable progress in recent years, detecting objects in a new context remains a challenging task. Detectors learned from a public dataset can only work with a fixed list of categories, while training from scratch usually requires a large amount of training data with detailed annotations. This work aims to explore a novel approach – learning object detectors from documentary...

chapter

Webly-supervised visual concept learning with cardinality guided instance mining and clustered multitask refinement

Saijie Ni, Xiaopeng Zhang, Botao Wang, Hongkai Xiong

2017 IEEE International Conference on Multimedia and Expo (ICME) > 979 - 984

2017 IEEE International Conference on Multimedia and Expo (ICME)

Conventional image classification and object detection methods depend on manual annotations, such as image-level labels and bounding boxes. However, the acquisition of such annotations for millions of images is trivial. This paper addresses the problem of webly-supervised visual concept learning, and develops an automatic algorithm using parallel text and visual corpora to discover informative visual...

chapter

Visual search guided by an efficient top-down attention approach

R. G. Mesquita, C. A. B. Mello, P. L. Castilho

2016 IEEE International Conference on Image Processing (ICIP) > 679 - 683

2016 IEEE International Conference on Image Processing (ICIP)

This paper introduces a method to guide the visual search towards a searched object, analogously to what is performed by the top-down visual attention mechanism. This is done by prioritizing scene descriptors based on their Hamming distance to the descriptors of the target. The proposal has constant space and time complexity in relation to the number of descriptors of the searched object. Moreover,...

chapter

Self-taught object localization with deep networks

Loris Bazzani, Alessandra Bergamo, Dragomir Anguelov, Lorenzo Torresani

2016 IEEE Winter Conference on Applications of Computer Vision (WACV) > 1 - 9

2016 IEEE Winter Conference on Applications of Computer Vision (WACV)

This paper introduces self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in images without additional human supervision, i.e., without using any ground-truth bounding boxes for training. The key idea is to analyze the change in the recognition scores when artificially masking out different regions of...

article

Query-Adaptive Multiple Instance Learning for Video Instance Retrieval

Ting-Chu Lin, Min-Chun Yang, Chia-Yin Tsai, Yu-Chiang Frank Wang

IEEE Transactions on Image Processing > 2015 > 24 > 4 > 1330 - 1340

Given a query image containing the object of interest (OOI), we propose a novel learning framework for retrieving relevant frames from the input video sequence. While techniques based on object matching have been applied to solve this task, their performance would be typically limited due to the lack of capabilities in handling variations in visual appearances of the OOI across video frames. Our proposed...

INFONA - science communication portal

Search results

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection

Areas of Attention for Image Captioning

Soft Proposal Networks for Weakly Supervised Object Localization

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

What is and What is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors

Discover and Learn New Objects from Documentaries

Webly-supervised visual concept learning with cardinality guided instance mining and clustered multitask refinement

Visual search guided by an efficient top-down attention approach

Self-taught object localization with deep networks

Query-Adaptive Multiple Instance Learning for Video Instance Retrieval

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options