S. Davis

chapter

Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation

Ruichi Yu, Ang Li, Vlad I. Morariu, Larry S. Davis

2017 IEEE International Conference on Computer Vision (ICCV) > 1068 - 1076

2017 IEEE International Conference on Computer Vision (ICCV)

Understanding the visual relationship between two objects involves identifying the subject, the object, and a predicate relating them. We leverage the strong correlations between the predicate and the hsubj; obji pair (both semantically and spatially) to predict predicates conditioned on the subjects and the objects. Modeling the three entities jointly more accurately reflects their relationships...

chapter

Automatic Spatially-Aware Fashion Concept Discovery

Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1472 - 1480

2017 IEEE International Conference on Computer Vision (ICCV)

This paper proposes an automatic spatially-aware concept discovery approach using weakly labeled image-text data from shopping websites. We first fine-tune GoogleNet by jointly modeling clothing images and their corresponding descriptions in a visual-semantic embedding space. Then, for each attribute (word), we generate its spatiallyaware representation by combining its semantic word vector representation...

chapter

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval

Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1942 - 1950

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Spatial relationships between objects provide important information for text-based image retrieval. As users are more likely to describe a scene from a real world perspective, using 3D spatial relationships rather than 2D relationships that assume a particular viewing direction, one of the main challenges is to infer the 3D structure that bridges images with users text descriptions. However, direct...

article

VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products

Xintong Han, Bharat Singh, Vlad I. Morariu, Larry S. Davis

IEEE Transactions on Multimedia > 2017 > 19 > 7 > 1583 - 1595

On-the-fly video retrieval using web images and fast Fisher Vector products (VRFP) is a real-time video retrieval framework based on short text input queries, which obtains weakly labeled training images from the web after the query is known. The retrieved web images representing the query and each database video are treated as unordered collections of images, and each collection is represented using...

chapter

Clauselets: Leveraging Temporally Related Actions for Video Event Analysis

Hyungtae Lee, Vlad I. Morariu, Larry S. Davis

2015 IEEE Winter Conference on Applications of Computer Vision > 1161 - 1168

2015 IEEE Winter Conference on Applications of Computer Vision (WACV)

We propose clause lets, sets of concurrent actions and their temporal relationships, and explore their application to video event analysis. We train clause lets in two stages. We initially train first level clause let detectors that find a limited set of actions in particular qualitative temporal configurations based on Allen's interval relations. In the second stage, we apply the first level detectors...

chapter

Multimedia user feedback based on augmenting user tags with EEG emotional states

S. Davis, E. Cheng, I. Burnett, C. Ritz

2011 Third International Workshop on Quality of Multimedia Experience > 143 - 148

2011 Third International Workshop on Quality of Multimedia Experience (QoMEX 2011)

Efficient content-based access to large multimedia collections requires annotations that are human-meaningful, and user tagging of media is one means to obtain such semantic metadata. Tags can also act as user feedback essential for quality of multimedia experience assessment; however, tags can lack user context and become ambiguous between different users. Further, user tagging is a deliberate and...

INFONA - science communication portal

Search results for: S. Davis

Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation

Automatic Spatially-Aware Fashion Concept Discovery

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval

VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products

Clauselets: Leveraging Temporally Related Actions for Video Event Analysis

Multimedia user feedback based on augmenting user tags with EEG emotional states

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results for: S. Davis

Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation

Automatic Spatially-Aware Fashion Concept Discovery

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval

VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products

Clauselets: Leveraging Temporally Related Actions for Video Event Analysis

Multimedia user feedback based on augmenting user tags with EEG emotional states

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options