Li Fei-Fei

chapter

Dense-Captioning Events in Videos

Ranjay Krishna, Kenji Hata, Frederic Ren, Li Fei-Fei, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 706 - 715

Most natural videos contain numerous events. For example, in a video of a “man playing a piano”, the video might also contain “another man dancing” or “a crowd clapping”. We introduce the task of dense-captioning events, which involves both detecting and describing events in a video. We propose a new model that is able to identify all events in a single pass of the video while simultaneously describing...

chapter

Characterizing and Improving Stability in Neural Style Transfer

Agrim Gupta, Justin Johnson, Alexandre Alahi, Li Fei-Fei

2017 IEEE International Conference on Computer Vision (ICCV) > 4087 - 4096

Recent progress in style transfer on images has focused on improving the quality of stylized images and speed of methods. However, real-time methods are highly unstable, resulting in visible flickering when applied to videos. In this work we characterize the instability of these methods by examining the solution set of the style transfer objective. We show that the trace of the Gram matrix representing...

chapter

Learning to Learn from Noisy Web Videos

Serena Yeung, Vignesh Ramanathan, Olga Russakovsky, Liyue Shen, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7455 - 7463

Understanding the simultaneously very diverse and intricately fine-grained set of possible human actions is a critical open problem in computer vision. Manually labeling training videos is feasible for some action classes but doesn't scale to the full long-tailed distribution of actions. A promising way to address this is to leverage noisy data from web queries to learn new actions, using semi-supervised...

chapter

Unsupervised Learning of Long-Term Motion Dynamics for Videos

Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7101 - 7110

We present an unsupervised representation learning approach that compactly encodes the motion dependencies in videos. Given a pair of images from a video clip, our framework learns to predict the long-term 3D motions. To reduce the complexity of the learning framework, we propose to describe the motion as a sequence of atomic 3D flows computed with RGB-D modality. We use a Recurrent Neural Network...

chapter

Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos

De-An Huang, Joseph J. Lim, Li Fei-Fei, Juan Carlos Niebles

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1032 - 1041

We propose an unsupervised method for reference resolution in instructional videos, where the goal is to temporally link an entity (e.g., dressing) to the action (e.g., mix yogurt) that produced it. The key challenge is the inevitable visual-linguistic ambiguities arising from the changes in both visual appearance and referring expression of an entity in the video. This challenge is amplified by the...

chapter

Learning latent temporal structure for complex event detection

Kevin Tang, Li Fei-Fei, Daphne Koller

2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1250 - 1257

In this paper, we tackle the problem of understanding the temporal structure of complex events in highly varying videos obtained from the Internet. Towards this goal, we utilize a conditional model trained in a max-margin framework that is able to automatically discover discriminative and interesting segments of video, while simultaneously achieving competitive accuracies on difficult detection and...

INFONA - science communication portal

Search results for: Li Fei-Fei
