2017 IEEE International Conference on Computer Vision (ICCV)

chapter

Rethinking Reprojection: Closing the Loop for Pose-Aware Shape Reconstruction from a Single Image

Rui Zhu, Hamed Kiani Galoogahi, Chaoyang Wang, Simon Lucey

2017 IEEE International Conference on Computer Vision (ICCV) > 57 - 65

An emerging problem in computer vision is the reconstruction of 3D shape and pose of an object from a single image. Hitherto, the problem has been addressed through the application of canonical deep learning methods to regress from the image directly to the 3D shape and pose labels. These approaches, however, are problematic from two perspectives. First, they are minimizing the error between 3D shapes...

chapter

Learning to Super-Resolve Blurry Face and Text Images

Xiangyu Xu, Deqing Sun, Jinshan Pan, Yujin Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 251 - 260

2017 IEEE International Conference on Computer Vision (ICCV)

We present an algorithm to directly restore a clear highresolution image from a blurry low-resolution input. This problem is highly ill-posed and the basic assumptions for existing super-resolution methods (requiring clear input) and deblurring methods (requiring high-resolution input) no longer hold. We focus on face and text images and adopt a generative adversarial network (GAN) to learn a category-specific...

chapter

Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation

Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1221 - 1230

2017 IEEE International Conference on Computer Vision (ICCV)

For human pose estimation in monocular images, joint occlusions and overlapping upon human bodies often result in deviated pose predictions. Under these circumstances, biologically implausible pose predictions may be produced. In contrast, human vision is able to predict poses by exploiting geometric constraints of joint inter-connectivity. To address the problem by incorporating priors about the...

chapter

Be Your Own Prada: Fashion Synthesis with Structural Coherence

Shizhan Zhu, Sanja Fidler, Raquel Urtasun, Dahua Lin, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1689 - 1697

2017 IEEE International Conference on Computer Vision (ICCV)

We present a novel and effective approach for generating new clothing on a wearer through generative adversarial learning. Given an input image of a person and a sentence describing a different outfit, our model “redresses” the person as desired, while at the same time keeping the wearer and her/his pose unchanged. Generating new outfits with precise regions conforming to a language description while...

chapter

Dual Motion GAN for Future-Flow Embedded Video Prediction

Xiaodan Liang, Lisa Lee, Wei Dai, Eric P. Xing

2017 IEEE International Conference on Computer Vision (ICCV) > 1762 - 1770

2017 IEEE International Conference on Computer Vision (ICCV)

Future frame prediction in videos is a promising avenue for unsupervised video representation learning. Video frames are naturally generated by the inherent pixel flows from preceding frames based on the appearance and motion dynamics in the video. However, existing methods focus on directly hallucinating pixel values, resulting in blurry predictions. In this paper, we develop a dual motion Generative...

chapter

GANs for Biological Image Synthesis

Anton Osokin, Anatole Chessel, Rafael E. Carazo Salas, Federico Vaggi

2017 IEEE International Conference on Computer Vision (ICCV) > 2252 - 2261

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we propose a novel application of Generative Adversarial Networks (GAN) to the synthesis of cells imaged by fluorescence microscopy. Compared to natural images, cells tend to have a simpler and more geometric global structure that facilitates image generation. However, the correlation between the spatial pattern of different fluorescent proteins reflects important biological functions,...

chapter

Polynomial Solvers for Saturated Ideals

Viktor Larsson, Kalle Astrom, Magnus Oskarsson

2017 IEEE International Conference on Computer Vision (ICCV) > 2307 - 2316

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper we present a new method for creating polynomial solvers for problems where a (possibly infinite) subset of the solutions are undesirable or uninteresting. These solutions typically arise from simplifications made during modeling, but can also come from degeneracies which are inherent to the geometry of the original problem. The proposed approach extends the standard action matrix method...

chapter

RMPE: Regional Multi-person Pose Estimation

Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu

2017 IEEE International Conference on Computer Vision (ICCV) > 2353 - 2362

2017 IEEE International Conference on Computer Vision (ICCV)

Multi-person pose estimation in the wild is challenging. Although state-of-the-art human detectors have demonstrated good performance, small errors in localization and recognition are inevitable. These errors can cause failures for a single-person pose estimator (SPPE), especially for methods that solely depend on human detection results. In this paper, we propose a novel regional multi-person pose...

chapter

CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training

Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2764 - 2773

2017 IEEE International Conference on Computer Vision (ICCV)

We present variational generative adversarial networks, a general learning framework that combines a variational auto-encoder with a generative adversarial network, for synthesizing images in fine-grained categories, such as faces of a specific person or objects in a category. Our approach models an image as a composition of label and latent attributes in a probabilistic model. By varying the fine-grained...

chapter

Introspective Neural Networks for Generative Modeling

Justin Lazarow, Long Jin, Zhuowen Tu

2017 IEEE International Conference on Computer Vision (ICCV) > 2793 - 2802

2017 IEEE International Conference on Computer Vision (ICCV)

We study unsupervised learning by developing a generative model built from progressively learned deep convolutional neural networks. The resulting generator is additionally a discriminator, capable of "introspection" in a sense — being able to self-evaluate the difference between its generated samples and the given training data. Through repeated discriminative learning, desirable properties...

chapter

Least Squares Generative Adversarial Networks

Xudong Mao, Qing Li, Haoran Xie, Raymond Y.K. Lau, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2813 - 2821

2017 IEEE International Conference on Computer Vision (ICCV)

Unsupervised learning with generative adversarial networks (GANs) has proven hugely successful. Regular GANs hypothesize the discriminator as a classifier with the sigmoid cross entropy loss function. However, we found that this loss function may lead to the vanishing gradients problem during the learning process. To overcome such a problem, we propose in this paper the Least Squares Generative Adversarial...

chapter

Temporal Generative Adversarial Nets with Singular Value Clipping

Masaki Saito, Eiichi Matsumoto, Shunta Saito

2017 IEEE International Conference on Computer Vision (ICCV) > 2849 - 2858

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we propose a generative model, Temporal Generative Adversarial Nets (TGAN), which can learn a semantic representation of unlabeled videos, and is capable of generating videos. Unlike existing Generative Adversarial Nets (GAN)-based methods that generate videos with a single generator consisting of 3D deconvolutional layers, our model exploits two different types of generators: a temporal...

chapter

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation

Zili Yi, Hao Zhang, Ping Tan, Minglun Gong

2017 IEEE International Conference on Computer Vision (ICCV) > 2868 - 2876

2017 IEEE International Conference on Computer Vision (ICCV)

Conditional Generative Adversarial Networks (GANs) for cross-domain image-to-image translation have made much progress recently [7, 8, 21, 12, 4, 18]. Depending on the task complexity, thousands to millions of labeled image pairs are needed to train a conditional GAN. However, human labeling is expensive, even impractical, and large quantities of data may not always be available. Inspired by dual...

chapter

Towards Diverse and Natural Image Descriptions via a Conditional GAN

Bo Dai, Sanja Fidler, Raquel Urtasun, Dahua Lin

2017 IEEE International Conference on Computer Vision (ICCV) > 2989 - 2998

2017 IEEE International Conference on Computer Vision (ICCV)

Despite the substantial progress in recent years, the image captioning techniques are still far from being perfect. Sentences produced by existing methods, e.g. those based on RNNs, are often overly rigid and lacking in variability. This issue is related to a learning principle widely used in practice, that is, to maximize the likelihood of training samples. This principle encourages high resemblance...

chapter

Recurrent Topic-Transition GAN for Visual Paragraph Generation

Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3382 - 3391

2017 IEEE International Conference on Computer Vision (ICCV)

A natural image usually conveys rich semantic content and can be viewed from different angles. Existing image description methods are largely restricted by small sets of biased visual paragraph annotations, and fail to cover rich underlying semantics. In this paper, we investigate a semi-supervised paragraph generative framework that is able to synthesize diverse and semantically coherent paragraph...

chapter

Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses

Christian Rupprecht, Iro Laina, Robert DiPietro, Maximilian Baust

2017 IEEE International Conference on Computer Vision (ICCV) > 3611 - 3620

2017 IEEE International Conference on Computer Vision (ICCV)

Many prediction tasks contain uncertainty. In some cases, uncertainty is inherent in the task itself. In future prediction, for example, many distinct outcomes are equally valid. In other cases, uncertainty arises from the way data is labeled. For example, in object detection, many objects of interest often go unlabeled, and in human pose estimation, occluded joints are often labeled with ambiguous...

chapter

Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro

Zhedong Zheng, Liang Zheng, Yi Yang

2017 IEEE International Conference on Computer Vision (ICCV) > 3774 - 3782

2017 IEEE International Conference on Computer Vision (ICCV)

The main contribution of this paper is a simple semisupervised pipeline that only uses the original training set without collecting extra data. It is challenging in 1) how to obtain more training data only from the training set and 2) how to use the newly generated data. In this work, the generative adversarial network (GAN) is used to generate unlabeled samples. We propose the label smoothing regularization...

chapter

Towards Large-Pose Face Frontalization in the Wild

Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4010 - 4019

2017 IEEE International Conference on Computer Vision (ICCV)

Despite recent advances in face recognition using deep learning, severe accuracy drops are observed for large pose variations in unconstrained environments. Learning pose-invariant features is one solution, but needs expensively labeled large-scale data and carefully designed feature learning algorithms. In this work, we focus on frontalizing faces in the wild under various head poses, including extreme...

chapter

Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training

Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4155 - 4164

2017 IEEE International Conference on Computer Vision (ICCV)

While strong progress has been made in image captioning recently, machine and human captions are still quite distinct. This is primarily due to the deficiencies in the generated word distribution, vocabulary size, and strong bias in the generators towards frequent captions. Furthermore, humans – rightfully so – generate multiple, diverse captions, due to the inherent ambiguity in the captioning task...

chapter

Attention-Based Multimodal Fusion for Video Description

Chiori Hori, Takaaki Hori, Teng-Yok Lee, Ziming Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4203 - 4212

2017 IEEE International Conference on Computer Vision (ICCV)

Current methods for video description are based on encoder-decoder sentence generation using recurrent neural networks (RNNs). Recent work has demonstrated the advantages of integrating temporal attention mechanisms into these models, in which the decoder network predicts each word in the description by selectively giving more weight to encoded features from specific time frames. Such methods typically...

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV)

Rethinking Reprojection: Closing the Loop for Pose-Aware Shape Reconstruction from a Single Image

Learning to Super-Resolve Blurry Face and Text Images

Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation

Be Your Own Prada: Fashion Synthesis with Structural Coherence

Dual Motion GAN for Future-Flow Embedded Video Prediction

GANs for Biological Image Synthesis

Polynomial Solvers for Saturated Ideals

RMPE: Regional Multi-person Pose Estimation

CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training

Introspective Neural Networks for Generative Modeling

Least Squares Generative Adversarial Networks

Temporal Generative Adversarial Nets with Singular Value Clipping

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation

Towards Diverse and Natural Image Descriptions via a Conditional GAN

Recurrent Topic-Transition GAN for Visual Paragraph Generation

Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses

Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro

Towards Large-Pose Face Frontalization in the Wild

Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training

Attention-Based Multimodal Fusion for Video Description

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Computer Vision (ICCV) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Computer Vision (ICCV)