Behrooz Mahasseni

chapter

Unsupervised Video Summarization with Adversarial LSTM Networks

Behrooz Mahasseni, Michael Lam, Sinisa Todorovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2982 - 2991

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses the problem of unsupervised video summarization, formulated as selecting a sparse subset of video frames that optimally represent the input video. Our key idea is to learn a deep summarizer network to minimize distance between training videos and a distribution of their summarizations, in an unsupervised way. Such a summarizer can then be applied on a new video for estimating...

chapter

Fine-Grained Recognition as HSnet Search for Informative Image Parts

Michael Lam, Behrooz Mahasseni, Sinisa Todorovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6497 - 6506

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This work addresses fine-grained image classification. Our work is based on the hypothesis that when dealing with subtle differences among object classes it is critical to identify and only account for a few informative image parts, as the remaining image context may not only be uninformative but may also hurt recognition. This motivates us to formulate our problem as a sequential search for informative...

chapter

Budget-Aware Deep Semantic Video Segmentation

Behrooz Mahasseni, Sinisa Todorovic, Alan Fern

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2077 - 2086

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we study a poorly understood trade-off between accuracy and runtime costs for deep semantic video segmentation. While recent work has demonstrated advantages of learning to speed-up deep activity detection, it is not clear if similar advantages will hold for our very different segmentation loss function, which is defined over individual pixels across the frames. In deep video segmentation,...

chapter

Regularizing Long Short Term Memory with 3D Human-Skeleton Sequences for Action Recognition

Behrooz Mahasseni, Sinisa Todorovic

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3054 - 3062

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper argues that large-scale action recognition in video can be greatly improved by providing an additional modality in training data – namely, 3D human-skeleton sequences – aimed at complementing poorly represented or missing features of human actions in the training videos. For recognition, we use Long Short Term Memory (LSTM) grounded via a deep Convolutional Neural Network (CNN) onto the...

chapter

Play type recognition in real-world football video

Sheng Chen, Zhongyuan Feng, Qingkai Lu, Behrooz Mahasseni, more

IEEE Winter Conference on Applications of Computer Vision > 652 - 659

2014 IEEE Winter Conference on Applications of Computer Vision (WACV)

This paper presents a vision system for recognizing the sequence of plays in amateur videos of American football games (e.g. offense, defense, kickoff, punt, etc). The system is aimed at reducing user effort in annotating football videos, which are posted on a web service used by over 13,000 high school, college, and professional football teams. Recognizing football plays is particularly challenging...

chapter

Latent Multitask Learning for View-Invariant Action Recognition

Behrooz Mahasseni, Sinisa Todorovic

2013 IEEE International Conference on Computer Vision > 3128 - 3135

2013 IEEE International Conference on Computer Vision (ICCV)

This paper presents an approach to view-invariant action recognition, where human poses and motions exhibit large variations across different camera viewpoints. When each viewpoint of a given set of action classes is specified as a learning task then multitask learning appears suitable for achieving view invariance in recognition. We extend the standard multitask learning to allow identifying: (1)...

INFONA - science communication portal

Search results for: Behrooz Mahasseni

Unsupervised Video Summarization with Adversarial LSTM Networks

Fine-Grained Recognition as HSnet Search for Informative Image Parts

Budget-Aware Deep Semantic Video Segmentation

Regularizing Long Short Term Memory with 3D Human-Skeleton Sequences for Action Recognition

Play type recognition in real-world football video

Latent Multitask Learning for View-Invariant Action Recognition

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Behrooz Mahasseni

Unsupervised Video Summarization with Adversarial LSTM Networks

Fine-Grained Recognition as HSnet Search for Informative Image Parts

Budget-Aware Deep Semantic Video Segmentation

Regularizing Long Short Term Memory with 3D Human-Skeleton Sequences for Action Recognition

Play type recognition in real-world football video

Latent Multitask Learning for View-Invariant Action Recognition

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options