Search results

chapter

Blog Article Summarization with Image-Text Alignment Techniques

Wei-Ta Chu, Ming-Chih Kao

2017 IEEE International Symposium on Multimedia (ISM) > 244 - 247

2017 IEEE International Symposium on Multimedia (ISM)

We propose an image-text alignment framework to match images with text, and take blog article summarization as the main application. Objects in an image are first detected, from them deep features are extracted and transformed into a space commonly shared with the text. On the other hand, sentences of a blog article are represented as vectors, and are also embedded into the common space. With these...

chapter

Hyper-Feature Based Tracking with the Fully-Convolutional Siamese Network

Yangliu Kuai, Gongjian Wen, Dongdong Li

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 7

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Convolutional neural network (CNN) has drawn increasing interest in visual tracking, among which fully-convolutional Siamese network based method (SiamFC) is quite popular due to its competitive performance in both precision and efficiency. Generally, SiamFC captures robust semantics from high-level features in the last layer but ignores detailed spatial features in earlier layers, thus tending to...

chapter

Semantic Visualization Support for Innovators Marketplace on Data Jackets

Qi Wang

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 599 - 604

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

Following the trend of big data, the business value of data is becoming a hot research field in recent years. The novel concept of Data Jacket introduced by Ohsawa et al. solved the difficult problem of data transactions due to the particular characteristic of data, i.e. the safeguarding privacy. In order to make sure the mechanism of the market of data, there are some researchers proposed a gamified...

chapter

Assessing the Intuitiveness of Qualitative Contribution Relationships in Goal Models: An Exploratory Experiment

Sotirios Liaskos, Alexis Ronse, Mehrnaz Zhian

2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) > 466 - 471

2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)

[Background]: Developing conceptual models is an integral part of the requirements engineering (RE) process. Goal models are requirements engineering conceptual models that allow diagrammatic representation of stakeholder intentions and how they affect each other. A specific goal modeling language construct, the contribution of goal satisfaction of one goal to another, plays a central role in supporting...

chapter

Beyond Boxes and Lines: Creating and Empirically Evaluating Alternative Visualizations for Requirements Conceptual Models

Sotirios Liaskos, Teodora Dundjerovic, Norah Alothman

2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) > 476 - 477

2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)

[Background]: Conceptual modeling languages have been widely studied in requirements engineering as tools for capturing, representing and reasoning about domain problems. One of these languages, goal models, has been proposed for representing the structure of stakeholder intentions. Like most other conceptual modeling languages, goal models are visualized using box-and-line diagrammatic notations...

chapter

Image2song: Song Retrieval via Bridging Image Content and Lyric Words

Xuelong Li, Di Hu, Xiaoqiang Lu

2017 IEEE International Conference on Computer Vision (ICCV) > 5650 - 5659

2017 IEEE International Conference on Computer Vision (ICCV)

Image is usually taken for expressing some kinds of emotions or purposes, such as love, celebrating Christmas. There is another better way that combines the image and relevant song to amplify the expression, which has drawn much attention in the social network recently. Hence, the automatic selection of songs should be expected. In this paper, we propose to retrieve semantic relevant songs just by...

chapter

Learning Robust Visual-Semantic Embeddings

Yao-Hung Hubert Tsai, Liang-Kang Huang, Ruslan Salakhutdinov

2017 IEEE International Conference on Computer Vision (ICCV) > 3591 - 3600

2017 IEEE International Conference on Computer Vision (ICCV)

Many of the existing methods for learning joint embedding of images and text use only supervised information from paired images and its textual attributes. Taking advantage of the recent success of unsupervised learning in deep neural networks, we propose an end-to-end learning framework that is able to extract more robust multi-modal representations across domains. The proposed method combines representation...

chapter

Referring Expression Generation and Comprehension via Attributes

Jingyu Liu, Liang Wang, Ming-Hsuan Yang

2017 IEEE International Conference on Computer Vision (ICCV) > 4866 - 4874

2017 IEEE International Conference on Computer Vision (ICCV)

Referring expression is a kind of language expression that used for referring to particular objects. To make the expression without ambiguation, people often use attributes to describe the particular object. In this paper, we explore the role of attributes by incorporating them into both referring expression generation and comprehension. We first train an attribute learning model from visual objects...

chapter

Sketching with Style: Visual Search with Sketches and Aesthetic Context

John Collomosse, Tu Bui, Michael Wilber, Chen Fang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2679 - 2687

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a novel measure of visual similarity for image retrieval that incorporates both structural and aesthetic (style) constraints. Our algorithm accepts a query as sketched shape, and a set of one or more contextual images specifying the desired visual aesthetic. A triplet network is used to learn a feature embedding capable of measuring style similarity independent of structure, delivering...

chapter

Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks

Jae Shin Yoon, Francois Rameau, Junsik Kim, Seokju Lee, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2186 - 2195

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a novel video object segmentation algorithm based on pixel-level matching using Convolutional Neural Networks (CNN). Our network aims to distinguish the target area from the background on the basis of the pixel-level similarity between two object units. The proposed network represents a target object using features from different depth layers in order to take advantage of both the spatial...

chapter

Look, Listen and Learn

Relja Arandjelovic, Andrew Zisserman

2017 IEEE International Conference on Computer Vision (ICCV) > 609 - 617

2017 IEEE International Conference on Computer Vision (ICCV)

We consider the question: what can be learnt by looking at and listening to a large number of unlabelled videos? There is a valuable, but so far untapped, source of information contained in the video itself – the correspondence between the visual and the audio streams, and we introduce a novel “Audio-Visual Correspondence” learning task that makes use of this. Training visual and audio networks from...

chapter

MarioQA: Answering Questions by Watching Gameplay Videos

Jonghwan Mun, Paul Hongsuck Seo, Ilchae Jung, Bohyung Han

2017 IEEE International Conference on Computer Vision (ICCV) > 2886 - 2894

2017 IEEE International Conference on Computer Vision (ICCV)

We present a framework to analyze various aspects of models for video question answering (VideoQA) using customizable synthetic datasets, which are constructed automatically from gameplay videos. Our work is motivated by the fact that existing models are often tested only on datasets that require excessively high-level reasoning or mostly contain instances accessible through single frame inferences...

chapter

Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning

Berkan Demirel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis

2017 IEEE International Conference on Computer Vision (ICCV) > 1241 - 1250

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a novel approach for unsupervised zero-shot learning (ZSL) of classes based on their names. Most existing unsupervised ZSL methods aim to learn a model for directly comparing image features and class names. However, this proves to be a difficult task due to dominance of non-visual semantics in underlying vector-space embeddings of class names. To address this issue, we discriminatively...

chapter

Interactive visualization toolbox to detect sophisticated android malware

Ganesh Ram Santhanam, Benjamin Holland, Suresh Kothari, Jon Mathews

2017 IEEE Symposium on Visualization for Cyber Security (VizSec) > 1 - 8

2017 IEEE Symposium on Visualization for Cyber Security (VizSec)

Detecting zero-day sophisticated malware is like searching for a needle in the haystack, not knowing what the needle looks like. This paper describes Android Malicious Flow Visualization Toolbox that empowers a human analyst to detect such malware. Detecting sophisticated malware requires systematic exploration of the code to identify potentially malignant code, conceiving plausible malware hypotheses,...

chapter

On the Performance of Visual Semantics for Improving Texture-Based Blind Image Quality Assessment

Pedro Garcia Freitas, Mylene Christine Queiroz De Farias

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 330 - 337

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

Blind image quality assessment (BIQA) methods aim to estimate the quality of a given test image without referring to the corresponding reference (original) image. Most BIQA methods use visual sensitivity models, which take into consideration intrinsic image characteristics (e.g. contrast, luminance, and texture) to identify degradations and estimate quality. For example, texture-based BIQA methods...

chapter

Visualizing OWL 2 using diagrams

Gem Stapleton, Michael Compton, John Howse

2017 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC) > 245 - 253

2017 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC)

Diagrams can be an effective means of communicating complex ideas and can aid ontology engineering. Indeed, domain experts often do not have the expertise required to understand or create the complex logical statements of an ontology in description logic (DL). This paper presents a visualisation method, concept diagrams, geared toward expressing assertions and class expression axioms alongside providing...

chapter

Visualizing the Bias of Enterprise Metamodels towards Nuanced Concepts

David Naranjo, Mario Sanchez, Jorge Villalobos

2017 IEEE 21st International Enterprise Distributed Object Computing Conference (EDOC) > 30 - 39

2017 IEEE 21st International Enterprise Distributed Object Computing Conference (EDOC)

In Enterprise Modeling, we use several languages for designing, analyzing, and communicating the different domains of an enterprise. Two important criteria for choosing a domainspecific language are its appropriateness to the requirements of the enterprise, as well as the accuracy of the language in describing the domain at hand. However, in some business domains, core concepts --such as Capability,...

chapter

[POSTER] Composite Realism: Effects of Object Knowledge and Mismatched Feature Type on Observer Gaze and Subjective Quality

Alan Dolhasz, Maite Frutos-Pascual, Ian Williams

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 9 - 14

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

We report on the results of the first visual search and rating study (N60) evaluating human gaze when assessing the realism of image composites. The effects of object identity knowledge and mismatched feature type on observers' gaze and subjective realism scores are studied. Gaze metrics used include: fixation count, fixation duration, time and duration of first fixation on target object, as well...

chapter

The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes

Gerhard Neuhold, Tobias Ollmann, Samuel Rota Bulo, Peter Kontschieder

2017 IEEE International Conference on Computer Vision (ICCV) > 5000 - 5009

2017 IEEE International Conference on Computer Vision (ICCV)

The Mapillary Vistas Dataset is a novel, large-scale street-level image dataset containing 25000 high-resolution images annotated into 66 object categories with additional, instance-specific labels for 37 classes. Annotation is performed in a dense and fine-grained style by using polygons for delineating individual objects. Our dataset is 5× larger than the total amount of fine annotations for Cityscapes...

chapter

A Stagewise Refinement Model for Detecting Salient Objects in Images

Tiantian Wang, Ali Borji, Lihe Zhang, Pingping Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4039 - 4048

2017 IEEE International Conference on Computer Vision (ICCV)

Deep convolutional neural networks (CNNs) have been successfully applied to a wide variety of problems in computer vision, including salient object detection. To detect and segment salient objects accurately, it is necessary to extract and combine high-level semantic features with low-levelfine details simultaneously. This happens to be a challenge for CNNs as repeated subsampling operations such...

INFONA - science communication portal

Search results

Blog Article Summarization with Image-Text Alignment Techniques

Hyper-Feature Based Tracking with the Fully-Convolutional Siamese Network

Semantic Visualization Support for Innovators Marketplace on Data Jackets

Assessing the Intuitiveness of Qualitative Contribution Relationships in Goal Models: An Exploratory Experiment

Beyond Boxes and Lines: Creating and Empirically Evaluating Alternative Visualizations for Requirements Conceptual Models

Image2song: Song Retrieval via Bridging Image Content and Lyric Words

Learning Robust Visual-Semantic Embeddings

Referring Expression Generation and Comprehension via Attributes

Sketching with Style: Visual Search with Sketches and Aesthetic Context

Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks

Look, Listen and Learn

MarioQA: Answering Questions by Watching Gameplay Videos

Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning

Interactive visualization toolbox to detect sophisticated android malware

On the Performance of Visual Semantics for Improving Texture-Based Blind Image Quality Assessment

Visualizing OWL 2 using diagrams

Visualizing the Bias of Enterprise Metamodels towards Nuanced Concepts

[POSTER] Composite Realism: Effects of Object Knowledge and Mismatched Feature Type on Observer Gaze and Subjective Quality

The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes

A Stagewise Refinement Model for Detecting Salient Objects in Images

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options