Search results for: Bryan Russell

Items from 1 to 4 out of 4 results

chapter

Localizing Moments in Video with Natural Language

Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, et al.

2017 IEEE International Conference on Computer Vision (ICCV), pp. 5804-5813

We consider retrieving a specific temporal segment, or moment, from a video given a natural language text description. Methods designed to retrieve whole video clips with natural language determine what occurs in a video but not when. To address this issue, we propose the Moment Context Network (MCN) which effectively localizes natural language queries in videos by integrating local and global video...

chapter

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification

Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3165-3174

In this work, we introduce a new video representation for action classification that aggregates local convolutional features across the entire spatio-temporal extent of the video. We do so by integrating state-of-the-art two-stream networks [42] with learnable spatio-temporal feature aggregation [6]. The resulting architecture is end-to-end trainable for whole-video classification. We investigate...

chapter

Marr Revisited: 2D-3D Alignment via Surface Normal Prediction

Aayush Bansal, Bryan Russell, Abhinav Gupta

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5965-5974

We introduce an approach that leverages surface normal predictions, along with appearance cues, to retrieve 3D models for objects depicted in 2D still images from a large CAD object library. Critical to the success of our approach is the ability to recover accurate surface normals for objects in the depicted scene. We introduce a skip-network model built on the pre-trained Oxford VGG convolutional...

chapter

LabelMe video: Building a video database with human annotations

Jenny Yuen, Bryan Russell, Ce Liu, Antonio Torralba

2009 IEEE 12th International Conference on Computer Vision (ICCV), pp. 1451-1458

Currently, video analysis algorithms suffer from lack of information regarding the objects present, their interactions, as well as from missing comprehensive annotated video databases for benchmarking. We designed an online and openly accessible video annotation system that allows anyone with a browser and internet access to efficiently annotate object category, shape, motion, and activity information...

INFONA - science communication portal
