Stefan Lee

chapter

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

Abhishek Das, Satwik Kottur, Jose M. F. Moura, Stefan Lee, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 2970 - 2979

We introduce the first goal-driven training for visual question answering and dialog agents. Specifically, we pose a cooperative ‘image guessing’ game between two agents – Q-BOT and A-BOT – who communicate in natural language dialog so that Q-BOT can select an unseen image from a lineup of images. We use deep reinforcement learning (RL) to learn the policies of these agents end-to-end – from pixels...

chapter

Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning

Qing Sun, Stefan Lee, Dhruv Batra

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7215 - 7223

We develop the first approximate inference algorithm for 1-Best (and M-Best) decoding in bidirectional neural sequence models by extending Beam Search (BS) to reason about both forward and backward time dependencies. Beam Search (BS) is a widely used approximate inference algorithm for decoding sequences from unidirectional neural sequence models. Interestingly, approximate inference in bidirectional...

chapter

Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions

Sven Bambach, Stefan Lee, David J. Crandall, Chen Yu

2015 IEEE International Conference on Computer Vision (ICCV) > 1949 - 1957

Hands appear very often in egocentric video, and their appearance and pose give important cues about what people are doing and what they are paying attention to. But existing work in hand detection has made strong assumptions that work well in only simple scenarios, such as with limited interaction with other people or in lab settings. We develop methods to locate and distinguish between hands in...

chapter

Linking Past to Present: Discovering Style in Two Centuries of Architecture

Stefan Lee, Nicolas Maisonneuve, David Crandall, Alexei A. Efros, et al.

2015 IEEE International Conference on Computational Photography (ICCP) > 1 - 10

With vast quantities of imagery now available online, researchers have begun to explore whether visual patterns can be discovered automatically. Here we consider the particular domain of architecture, using huge collections of street-level imagery to find visual patterns that correspond to semantic-level architectural elements distinctive to particular time periods. We use this analysis both to date...

chapter

Predicting Geo-informative Attributes in Large-Scale Image Collections Using Convolutional Neural Networks

Stefan Lee, Haipeng Zhang, David J. Crandall

2015 IEEE Winter Conference on Applications of Computer Vision (WACV) > 550 - 557

Geographic location is a powerful property for organizing large-scale photo collections, but only a small fraction of online photos are geo-tagged. Most work in automatically estimating geo-tags from image content is based on comparison against models of buildings or landmarks, or on matching to large reference collections of geotagged images. These approaches work well for frequently photographed...

chapter

Estimating bedrock and surface layer boundaries and confidence intervals in ice sheet radar imagery using MCMC

Stefan Lee, Jerome Mitchell, David J. Crandall, Geoffrey C. Fox

2014 IEEE International Conference on Image Processing (ICIP) > 111 - 115

Climate models that predict polar ice sheet behavior require accurate measurements of the bedrock-ice and ice-air boundaries in ground-penetrating radar imagery. Identifying these features is typically performed by hand, which can be tedious and error prone. We propose an approach for automatically estimating layer boundaries by viewing this task as a probabilistic inference problem. Our solution...

chapter

This Hand Is My Hand: A Probabilistic Approach to Hand Disambiguation in Egocentric Video

Stefan Lee, Sven Bambach, David J. Crandall, John M. Franchak, et al.

2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 557 - 564

Egocentric cameras are becoming more popular, introducing increasing volumes of video in which the biases and framing of traditional photography are replaced with those of natural viewing tendencies. This paradigm enables new applications, including novel studies of social interaction and human development. Recent work has focused on identifying the camera wearer's hands as a first step towards more...

INFONA - science communication portal

Search results for: Stefan Lee
