Search results

chapter

Exploiting Spatial Structure for Localizing Manipulated Image Regions

Jawadul H. Bappy, Amit K. Roy-Chowdhury, Jason Bunk, Lakshmanan Nataraj, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4980 - 4989

2017 IEEE International Conference on Computer Vision (ICCV)

The advent of high-tech journaling tools facilitates an image to be manipulated in a way that can easily evade state-of-the-art image tampering detection approaches. The recent success of the deep learning approaches in different recognition tasks inspires us to develop a high confidence detection framework which can localize manipulated regions in an image. Unlike semantic object segmentation where...

chapter

Situation Recognition with Graph Neural Networks

Ruiyu Li, Makarand Tapaswi, Renjie Liao, Jiaya Jia, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4183 - 4192

2017 IEEE International Conference on Computer Vision (ICCV)

We address the problem of recognizing situations in images. Given an image, the task is to predict the most salient verb (action), and fill its semantic roles such as who is performing the action, what is the source and target of the action, etc. Different verbs have different roles (e.g. attacking has weapon), and each role can take on many possible values (nouns). We propose a model based on Graph...

chapter

Recurrent Topic-Transition GAN for Visual Paragraph Generation

Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3382 - 3391

2017 IEEE International Conference on Computer Vision (ICCV)

A natural image usually conveys rich semantic content and can be viewed from different angles. Existing image description methods are largely restricted by small sets of biased visual paragraph annotations, and fail to cover rich underlying semantics. In this paper, we investigate a semi-supervised paragraph generative framework that is able to synthesize diverse and semantically coherent paragraph...

chapter

Open Vocabulary Scene Parsing

Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2021 - 2029

2017 IEEE International Conference on Computer Vision (ICCV)

Recognizing arbitrary objects in the wild has been a challenging problem due to the limitations of existing classification models and datasets. In this paper, we propose a new task that aims at parsing scenes with a large and open vocabulary, and several evaluation metrics are explored for this problem. Our approach is a joint image pixel and word concept embeddings framework, where word concepts...

chapter

Unsupervised Representation Learning by Sorting Sequences

Hsin-Ying Lee, Jia-Bin Huang, Maneesh Singh, Ming-Hsuan Yang

2017 IEEE International Conference on Computer Vision (ICCV) > 667 - 676

2017 IEEE International Conference on Computer Vision (ICCV)

We present an unsupervised representation learning approach using videos without semantic labels. We leverage the temporal coherence as a supervisory signal by formulating representation learning as a sequence sorting task. We take temporally shuffled frames (i.e., in non-chronological order) as inputs and train a convolutional neural network to sort the shuffled sequences. Similar to comparison-based...

chapter

Automatic Spatially-Aware Fashion Concept Discovery

Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1472 - 1480

2017 IEEE International Conference on Computer Vision (ICCV)

This paper proposes an automatic spatially-aware concept discovery approach using weakly labeled image-text data from shopping websites. We first fine-tune GoogleNet by jointly modeling clothing images and their corresponding descriptions in a visual-semantic embedding space. Then, for each attribute (word), we generate its spatiallyaware representation by combining its semantic word vector representation...

chapter

Leveraging Weak Semantic Relevance for Complex Video Event Classification

Heng Tao Shen, Chao Li, Jiewei Cao, Zi Huang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3667 - 3676

2017 IEEE International Conference on Computer Vision (ICCV)

Existing video event classification approaches suffer from limited human-labeled semantic annotations. Weak semantic annotations can be harvested from Web-knowledge without involving any human interaction. However such weak annotations are noisy, thus can not be effectively utilized without distinguishing its reliability. In this paper, we propose a novel approach to automatically maximize the utility...

chapter

Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning

Tae-Hyun Oh, Kyungdon Joo, Neel Joshi, Baoyuan Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 5170 - 5179

2017 IEEE International Conference on Computer Vision (ICCV)

Cinemagraphs are a compelling way to convey dynamic aspects of a scene. In these media, dynamic and still elements are juxtaposed to create an artistic and narrative experience. Creating a high-quality, aesthetically pleasing cinemagraph requires isolating objects in a semantically meaningful way and then selecting good start times and looping periods for those objects to minimize visual artifacts...

chapter

Unsupervised Learning of Important Objects from First-Person Videos

Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi

2017 IEEE International Conference on Computer Vision (ICCV) > 1974 - 1982

2017 IEEE International Conference on Computer Vision (ICCV)

A first-person camera, placed at a person's head, captures, which objects are important to the camera wearer. Most prior methods for this task learn to detect such important objects from the manually labeled first-person data in a supervised fashion. However, important objects are strongly related to the camera wearer's internal state such as his intentions and attention, and thus, only the person...

chapter

VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation

Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1829 - 1838

2017 IEEE International Conference on Computer Vision (ICCV)

Rich and dense human labeled datasets are among the main enabling factors for the recent advance on visionlanguage understanding. Many seemingly distant annotations (e.g., semantic segmentation and visual question answering (VQA)) are inherently connected in that they reveal different levels and perspectives of human understandings about the same visual scenes — and even the same set of images (e...

chapter

Identity-Aware Textual-Visual Matching with Latent Co-attention

Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1908 - 1917

2017 IEEE International Conference on Computer Vision (ICCV)

Textual-visual matching aims at measuring similarities between sentence descriptions and images. Most existing methods tackle this problem without effectively utilizing identity-level annotations. In this paper, we propose an identity-aware two-stage framework for the textual-visual matching problem. Our stage-1 CNN-LSTM network learns to embed cross-modal features with a novel Cross-Modal Cross-Entropy...

chapter

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning

Soravit Changpinyo, Wei-Lun Chao, Fei Sha

2017 IEEE International Conference on Computer Vision (ICCV) > 3496 - 3505

2017 IEEE International Conference on Computer Vision (ICCV)

Leveraging class semantic descriptions and examples of known objects, zero-shot learning makes it possible to train a recognition model for an object class whose examples are not available. In this paper, we propose a novel zero-shot learning model that takes advantage of clustering structures in the semantic embedding space. The key idea is to impose the structural constraint that semantic representations...

chapter

Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 202 - 211

2017 IEEE International Conference on Computer Vision (ICCV)

Fully convolutional neural networks (FCNs) have shown outstanding performance in many dense labeling problems. One key pillar of these successes is mining relevant information from features in convolutional layers. However, how to better aggregate multi-level convolutional feature maps for salient object detection is underexplored. In this work, we present Amulet, a generic aggregating multi-level...

chapter

HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis

Xihui Liu, Haiyu Zhao, Maoqing Tian, Lu Sheng, more

2017 IEEE International Conference on Computer Vision (ICCV) > 350 - 359

2017 IEEE International Conference on Computer Vision (ICCV)

Pedestrian analysis plays a vital role in intelligent video surveillance and is a key component for security-centric computer vision systems. Despite that the convolutional neural networks are remarkable in learning discriminative features from images, the learning of comprehensive features of pedestrians for fine-grained tasks remains an open problem. In this study, we propose a new attentionbased...

chapter

A Framework for Improving the Verifiability of Visual Notation Design Grounded in the Physics of Notations

Dirk van der Linden, Anna Zamansky, Irit Hadar

2017 IEEE 25th International Requirements Engineering Conference (RE) > 41 - 50

2017 IEEE 25th International Requirements Engineering Conference (RE)

This paper proposes a systematic framework for applying the Physics of Notations (PoN), a theory for the design of cognitively effective visual notations. The PoN consists of nine principles, but not all principles lend themselves equally to a clear and unambiguous operationalization. As a result, many visual notations designed according to the PoN apply it in different ways. The proposed framework...

chapter

Semantic visual SLAM in populated environments

L. Riazuelo, L. Montano, J. M. M. Montiel

2017 European Conference on Mobile Robots (ECMR) > 1 - 7

2017 European Conference on Mobile Robots (ECMR)

We propose a visual SLAM (Simultaneous Localization And Mapping) system able to perform robustly in populated environments. The image stream from a moving RGB-D camera is the only input to the system. The computed map in real-time is composed of two layers: 1) The unpopulated geometrical layer, which describes the geometry of the bare scene as an occupancy grid where pieces of information corresponding...

chapter

Intellectual visualization of complex maps and schemes

Stanislav Belyakov, Marina Savelyeva, Marina Belyakova, Sergey Zubkov

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY) > 303 - 308

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY)

This paper investigates the problem of visual analysis of complex plans, schemes and maps for solving of difficult formalized tasks. It is analyzed the dependence of the level of perception upon the visualization complexity. It is introduced the concept of the utility of the visual image, it is described by the behavior of the empirical utility function. We propose an optimization model of the utility,...

chapter

A realtime sensing-data-triggered news article provision system with 5D world map

Hanako Fujioka, Shiori Sasaki, Yasushi Kiyoki

2017 International Electronics Symposium on Knowledge Creation and Intelligent Computing (IES-KCIC) > 265 - 269

2017 International Electronics Symposium on Knowledge Creation and Intelligent Computing (IES-KCIC)

The most important aim of our study is to realize a multi-database for social sciences and environmental sciences. Our system connects heterogeneous databases about historical phenomena by using common spatiotemporal information and visualize the connected results onto 5D World Map (a set of chronologically ordered global maps). To actualize that, we created a news articles provision system using...

chapter

Sparse decomposition of convolutional features for scene recognition

Lin Xie, Feifei Lee, Yan Yan, Qiu Chen

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 345 - 348

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA)

Scene recognition is an important and challenging problem in the field of computer vision owing to the variations in the same class and the similarities between different classes. This paper presents a novel approach that learns a reasonable dictionary from convolutional features to effectively describe the distinctive and shared properties in scene images. Substantial convolution operations in Deep...

chapter

Towards semantic visual features for malignancy description within medical images

Abir Baazaoui, Walid Barhoumi, Ezzeddine Zagrouba

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 397 - 402

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

Semantic gap, which is the difference between low-level image features and their high-level semantics, has become very popular and witnessed great interest in the last two decades. This paper deals with this problem and proposes a hybrid approach to learn image semantic concepts for modeling visual features in discriminative learning stage. It combines the advantages of human-in-the-loop and discriminative...

INFONA - science communication portal

Search results

Exploiting Spatial Structure for Localizing Manipulated Image Regions

Situation Recognition with Graph Neural Networks

Recurrent Topic-Transition GAN for Visual Paragraph Generation

Open Vocabulary Scene Parsing

Unsupervised Representation Learning by Sorting Sequences

Automatic Spatially-Aware Fashion Concept Discovery

Leveraging Weak Semantic Relevance for Complex Video Event Classification

Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning

Unsupervised Learning of Important Objects from First-Person Videos

VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation

Identity-Aware Textual-Visual Matching with Latent Co-attention

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning

Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis

A Framework for Improving the Verifiability of Visual Notation Design Grounded in the Physics of Notations

Semantic visual SLAM in populated environments

Intellectual visualization of complex maps and schemes

A realtime sensing-data-triggered news article provision system with 5D world map

Sparse decomposition of convolutional features for scene recognition

Towards semantic visual features for malignancy description within medical images

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options