Search results for: Chuang Gan

Items from 1 to 11 out of 11 results

chapter

Recurrent Topic-Transition GAN for Visual Paragraph Generation

Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3382 - 3391

2017 IEEE International Conference on Computer Vision (ICCV)

A natural image usually conveys rich semantic content and can be viewed from different angles. Existing image description methods are largely restricted by small sets of biased visual paragraph annotations, and fail to cover rich underlying semantics. In this paper, we investigate a semi-supervised paragraph generative framework that is able to synthesize diverse and semantically coherent paragraph...

chapter

VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation

Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1829 - 1838

2017 IEEE International Conference on Computer Vision (ICCV)

Rich and dense human labeled datasets are among the main enabling factors for the recent advance on visionlanguage understanding. Many seemingly distant annotations (e.g., semantic segmentation and visual question answering (VQA)) are inherently connected in that they reveal different levels and perspectives of human understandings about the same visual scenes — and even the same set of images (e...

chapter

Semantic Compositional Networks for Visual Captioning

Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1141 - 1150

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A Semantic Compositional Network (SCN) is developed for image captioning, in which semantic concepts (i.e., tags) are detected from the image, and the probability of each tag is used to compose the parameters in a long short-term memory (LSTM) network. The SCN extends each weight matrix of the LSTM to an ensemble of tag-dependent weight matrices. The degree to which each member of the ensemble is...

chapter

StyleNet: Generating Attractive Visual Captions with Styles

Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 955 - 964

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel framework named StyleNet to address the task of generating attractive captions for images and videos with different styles. To this end, we devise a novel model component, named factored LSTM, which automatically distills the style factors in the monolingual text corpus. Then at runtime, we can explicitly control the style in the caption generation process so as to produce attractive...

chapter

Tunnel deformation monitoring based on laser distance measuring and vision assistant

Chuang Gan, Yong Lei

2016 12th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA) > 1 - 6

2016 12th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)

Full Face Rock Tunnel Boring Machine (TBM) has been applied in many underground tunnel projects. During the operation, the safety of the TBM is one of the major concerns in the construction project. One of the major safety concerns is the deformation of the tunnel. Minor tunnel deformation may cause additional delay of the operation due to rock cleaning and equipment replacement, whilst major deformation...

chapter

Learning Attributes Equals Multi-Source Domain Generalization

Chuang Gan, Tianbao Yang, Boqing Gong

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 87 - 97

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Attributes possess appealing properties and benefit many computer vision problems, such as object recognition, learning with humans in the loop, and image retrieval. Whereas the existing work mainly pursues utilizing attributes for various computer vision problems, we contend that the most basic problem—how to accurately and robustly detect attributes from images—has been left under explored. Especially,...

chapter

You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images

Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, more

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 923 - 932

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Video concept learning often requires a large set oftraining samples. In practice, however, acquiring noise-free training labels with sufficient positive examples is very expensive. A plausible solution for training data collection is by sampling from the vast quantities of images and videos on the Web. Such a solution is motivated by the assumption that the retrieved images or videos are highly correlated...

article

Recognizing an Action Using Its Name: A Knowledge-Based Approach

Chuang Gan, Yi Yang, Linchao Zhu, Deli Zhao, more

International Journal of Computer Vision > 2016 > 120 > 1 > 61-77

Existing action recognition algorithms require a set of positive exemplars to train a classifier for each action. However, the amount of action classes is very large and the users’ queries vary dramatically. It is impractical to pre-define all possible action classes beforehand. To address this issue, we propose to perform action recognition with no positive exemplars, which is often known as the...

chapter

Automatic Concept Discovery from Parallel Text and Visual Corpora

Chen Sun, Chuang Gan, Ram Nevatia

2015 IEEE International Conference on Computer Vision (ICCV) > 2596 - 2604

2015 IEEE International Conference on Computer Vision (ICCV)

Humans connect language and vision to perceive the world. How to build a similar connection for computers? One possible way is via visual concepts, which are text terms that relate to visually discriminative entities. We propose an automatic visual concept discovery algorithm using parallel text and visual corpora, it filters text terms based on the visual discriminative power of the associated images,...

chapter

DevNet: A Deep Event Network for multimedia event detection and evidence recounting

Chuang Gan, Naiyan Wang, Yi Yang, Dit-Yan Yeung, more

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2568 - 2577

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we focus on complex event detection in internet videos while also providing the key evidences of the detection results. Convolutional Neural Networks (CNNs) have achieved promising performance in image classification and action recognition tasks. However, it remains an open problem how to use CNNs for video event detection and recounting, mainly due to the complexity and diversity of...

chapter

Salient object detection in image sequences via spatial-temporal cue

Chuang Gan, Zengchang Qin, Jia Xu, Tao Wan

2013 Visual Communications and Image Processing (VCIP) > 1 - 6

2013 Visual Communications and Image Processing (VCIP)

Contemporary video search and categorization are non-trivial tasks due to the massively increasing amount and content variety of videos. We put forward the study of visual saliency models in video. Such a model is employed to identify salient objects from the image background. Starting from the observation that motion information in video often attracts more human attention compared to static images,...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Chuang Gan

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options