Search results for: Arun Mallya

Items 1 to 5 of 5 results

article

Combining Multiple Cues for Visual Madlibs Question Answering

Tatiana Tommasi, Arun Mallya, Bryan Plummer, Svetlana Lazebnik, et al.

International Journal of Computer Vision > 2019 > 127 > 1 > 38-60

This paper presents an approach for answering fill-in-the-blank multiple choice questions from the Visual Madlibs dataset. Instead of generic and commonly used representations trained on the ImageNet classification task, our approach employs a combination of networks trained for specialized tasks such as scene recognition, person activity classification, and attribute prediction. We also present a...

chapter

Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues

Bryan A. Plummer, Arun Mallya, Christopher M. Cervantes, Julia Hockenmaier, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1946 - 1955

This paper presents a framework for localization or grounding of phrases in images using a large collection of linguistic and visual cues. We model the appearance, size, and position of entity bounding boxes, adjectives that contain attribute information, and spatial relationships between pairs of entities connected by verbs or prepositions. Special attention is given to relationships between people...

chapter

Recurrent Models for Situation Recognition

Arun Mallya, Svetlana Lazebnik

2017 IEEE International Conference on Computer Vision (ICCV) > 455 - 463

This work proposes Recurrent Neural Network (RNN) models to predict structured ‘image situations’ – actions and noun entities fulfilling semantic roles related to the action. In contrast to prior work relying on Conditional Random Fields (CRFs), we use a specialized action prediction network followed by an RNN for noun prediction. Our system obtains state-of-the-art accuracy on the challenging recent...

chapter

Unsupervised network pretraining via encoding human design

Ming-Yu Liu, Arun Mallya, Oncel Tuzel, Xi Chen

2016 IEEE Winter Conference on Applications of Computer Vision (WACV) > 1 - 9

Over the years, computer vision researchers have spent an immense amount of effort on designing image features for the visual object recognition task. We propose to incorporate this valuable experience to guide the task of training deep neural networks. Our idea is to pretrain the network through the task of replicating the process of hand-designed feature extraction. By learning to replicate the...

chapter

Learning Informative Edge Maps for Indoor Scene Layout Prediction

Arun Mallya, Svetlana Lazebnik

2015 IEEE International Conference on Computer Vision (ICCV) > 936 - 944

In this paper, we introduce new edge-based features for the task of recovering the 3D layout of an indoor scene from a single image. Indoor scenes have certain edges that are very informative about the spatial layout of the room, namely, the edges formed by the pairwise intersections of room faces (two walls, wall and ceiling, wall and floor). In contrast with previous approaches that rely on area-based...

INFONA - science communication portal
