Search results

Showing items 1 to 7 of 7 results

chapter

Dense Captioning with Joint Inference and Visual Context

Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1978 - 1987

Dense captioning is a newly emerging computer vision topic for understanding images with dense language descriptions. The goal is to densely detect visual concepts (e.g., objects, object parts, and interactions between them) from images, labeling each with a short descriptive phrase. We identify two key challenges of dense captioning that need to be properly addressed when tackling the problem. First,...

chapter

Mining Object Parts from CNNs via Active Question-Answering

Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3890 - 3899

Given a convolutional neural network (CNN) that is pre-trained for object classification, this paper proposes to use active question-answering to semanticize neural patterns in conv-layers of the CNN and mine part concepts. For each part concept, we mine neural patterns in the pre-trained CNN, which are related to the target part, and use these patterns to construct an And-Or graph (AOG) to represent...

article

Reading between the Lines: Object Localization Using Implicit Cues from Image Tags

Sung Ju Hwang, Kristen Grauman

IEEE Transactions on Pattern Analysis and Machine Intelligence > 2012 > 34 > 6 > 1145 - 1158

Current uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image's tags—the relative prominence of each object as signified...

chapter

Video Concept Detection Using Support Vector Machine with Augmented Features

Xinxing Xu, Dong Xu, I W Tsang

2010 Fourth Pacific-Rim Symposium on Image and Video Technology > 381 - 385

2010 Fourth Pacific-Rim Symposium on Image and Video Technology (PSIVT)

In this paper, we present a direct application of Support Vector Machine with Augmented Features (AFSVM) for video concept detection. For each visual concept, we learn an adapted classifier by leveraging the pre-learnt SVM classifiers of other concepts. The solution of AFSVM is to re-train the SVM classifier using augmented feature, which concatenates the original feature vector with the decision...

chapter

On the Use of Visual Soft Semantics for Video Temporal Decomposition to Scenes

V Mezaris, P Sidiropoulos, A Dimou, I Kompatsiaris

2010 IEEE Fourth International Conference on Semantic Computing > 141 - 148

2010 IEEE Fourth International Conference on Semantic Computing (ICSC)

This work examines the possibility of exploiting, for the purpose of video segmentation to scenes, semantic information coming from the analysis of the visual modality. This information, in contrast to the low-level visual features typically used in previous approaches, is obtained by application of trained visual concept detectors such as those developed and evaluated as part of the TRECVID High-Level...

chapter

Semantic Detection of Adult Image Using Semantic Features

Jae-Hyun Jeon, Se Min Kim, Jae-Young Choi, Hyun Suk Min, et al.

2010 4th International Conference on Multimedia and Ubiquitous Engineering > 1 - 4

2010 4th International Conference on Multimedia and Ubiquitous Engineering (MUE 2010)

Recently, in the fields of internet and social networking, the classification and filtering of naked images has been receiving a significant amount of attention. In this paper, we propose a novel naked image classification which can make effective use of semantic features of a naked image. In addition, a novel measurement, termed accumulated distance ratio (ADR), is proposed in order to systematically...

chapter

Multi-modal characteristics analysis and fusion for TV commercial detection

Nan Liu, Yao Zhao, Zhenfeng Zhu, Hanqing Lu

2010 IEEE International Conference on Multimedia and Expo > 831 - 836

2010 IEEE International Conference on Multimedia and Expo (ICME)

Automatic TV commercial detection has become an indispensable part of content-based video analysis technique due to the explosive growth in TV commercial volume. In this paper, a multi-modal (i.e. visual, audio and textual modalities) commercial digesting scheme is proposed to alleviate two challenges in commercial detection, which are the generation of mid-level semantic descriptor and the application...

Filter options

Data set: ieee
Keywords: TRAINING, SEMANTICS, OBJECT DETECTION, VISUALIZATION

INFONA - science communication portal
