Search results

Items from 141 to 160 out of 2,646 results


chapter

Flexible Spatio-Temporal Networks for Video Prediction

Chaochao Lu, Michael Hirsch, Bernhard Schölkopf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2137 - 2145

We describe a modular framework for video frame prediction. We refer to it as a Flexible Spatio-Temporal Network (FSTN) as it allows the extrapolation of a video sequence as well as the estimation of synthetic frames lying in between observed frames and thus the generation of slow-motion videos. By devising a customized objective function comprising decoding, encoding, and adversarial losses, we are...

chapter

Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF

Falong Shen, Rui Gan, Shuicheng Yan, Gang Zeng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5178 - 5186

This paper describes a fast and accurate semantic image segmentation approach that encodes not only segmentation-specified features but also high-order context compatibilities and boundary guidance constraints. We introduce a structured patch prediction technique to make a trade-off between classification discriminability and boundary sensibility for features. Both label and feature contexts are embedded...

chapter

Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5085 - 5093

Training convolutional networks (CNNs) that fit on a single GPU with minibatch stochastic gradient descent has become effective in practice. However, there is still no effective method for training large networks that do not fit in the memory of a few GPU cards, or for parallelizing CNN training. In this work we show that a simple hard mixture of experts model can be efficiently trained to good effect...

chapter

Infinite Variational Autoencoder for Semi-Supervised Learning

M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 781 - 790

This paper presents an infinite variational autoencoder (VAE) whose capacity adapts to suit the input data. This is achieved using a mixture model where the mixing coefficients are modeled by a Dirichlet process, allowing us to integrate over the coefficients when performing inference. Critically, this then allows us to automatically vary the number of autoencoders in the mixture based on the data...

chapter

Variational Bayesian Multiple Instance Learning with Gaussian Processes

Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 810 - 819

Gaussian Processes (GPs) are effective Bayesian predictors. We here show for the first time that instance labels of a GP classifier can be inferred in the multiple instance learning (MIL) setting using variational Bayes. We achieve this via a new construction of the bag likelihood that assumes a large value if the instance predictions obey the MIL constraints and a small value otherwise. This construction...

chapter

Multi-scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation

Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 161 - 169

This paper addresses the problem of depth estimation from a single still image. Inspired by recent works on multi-scale convolutional neural networks (CNN), we propose a deep model which fuses complementary information derived from multiple CNN side outputs. Different from previous methods, the integration is obtained by means of continuous Conditional Random Fields (CRFs). In particular, we propose...

chapter

Attentional Push: A Deep Convolutional Network for Augmenting Image Salience with Shared Attention Modeling in Social Scenes

Siavash Gorji, James J. Clark

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3472 - 3481

We present a novel visual attention tracking technique based on Shared Attention modeling. By considering the viewer as a participant in the activity occurring in the scene, our model learns the loci of attention of the scene actors and uses them to augment image salience. We go beyond image salience and instead of only computing the power of image regions to pull attention, we also consider the strength...

chapter

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6137 - 6145

Most current semantic segmentation methods rely on fully convolutional networks (FCNs). However, their use of large receptive fields and many pooling layers cause low spatial resolution inside the deep layers. This leads to predictions with poor localization around the boundaries. Prior work has attempted to address this issue by post-processing predictions with CRFs or MRFs. But such models often...

chapter

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

Yang Long, Li Liu, Ling Shao, Fumin Shen, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6165 - 6174

Robust object recognition systems usually rely on powerful feature extraction mechanisms from a large number of real images. However, in many realistic applications, collecting sufficient images for ever-growing new classes is unattainable. In this paper, we propose a new Zero-shot learning (ZSL) framework that can synthesise visual features for unseen classes without acquiring real images. Using...

chapter

Sequential Person Recognition in Photo Albums with a Recurrent Network

Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5660 - 5668

Recognizing the identities of people in everyday photos is still a very challenging problem for machine vision, due to issues such as non-frontal faces, changes in clothing, location, lighting. Recent studies have shown that rich relational information between people in the same photo can help in recognizing their identities. In this work, we propose to model the relational information between people...

chapter

Captioning Images with Diverse Objects

Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1170 - 1178

Recent captioning models are limited in their ability to scale and describe concepts unseen in paired image-text corpora. We propose the Novel Object Captioner (NOC), a deep visual semantic captioning model that can describe a large number of object categories not present in existing image-caption datasets. Our model takes advantage of external sources – labeled images from object recognition...

chapter

Semantic Regularisation for Recurrent Image Annotation

Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4160 - 4168

The CNN-RNN design pattern is increasingly widely applied in a variety of image annotation tasks including multi-label classification and captioning. Existing models use the weakly semantic CNN hidden layer or its transform as the image embedding that provides the interface between the CNN and RNN. This leaves the RNN overstretched with two jobs: predicting the visual concepts and modelling their...

chapter

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3530 - 3538

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or simulation environment. We advocate learning a generic vehicle motion model from large scale crowd-sourced video data, and develop an end-to-end...

chapter

Life prediction of jet engines based on LSTM-recurrent neural networks

Dong Dong, Xiao-Yang Li, Fu-Qiang Sun

2017 Prognostics and System Health Management Conference (PHM-Harbin) > 1 - 6

The issue of remaining useful life (RUL) prediction has already become a quite interesting topic in industrial product. The data driven RUL prediction has been applied to the current research by taking advantage of a long-short term memory (LSTM)-recurrent neural network (RNN) approach. This means that even in a specified long-short term memory bound and limited available data sets, the RUL predictions...

chapter

Unsupervised Monocular Depth Estimation with Left-Right Consistency

Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6602 - 6611

Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we...

chapter

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6335 - 6344

Semantic sparsity is a common challenge in structured visual classification problems; when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects...

chapter

Self-Critical Sequence Training for Image Captioning

Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jerret Ross, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1179 - 1195

Recently it has been shown that policy-gradient methods for reinforcement learning can be utilized to train deep end-to-end systems directly on non-differentiable metrics for the task at hand. In this paper we consider the problem of optimizing image captioning systems using reinforcement learning, and show that by carefully optimizing our systems using the test metrics of the MSCOCO task, significant...

chapter

On Human Motion Prediction Using Recurrent Neural Networks

Julieta Martinez, Michael J. Black, Javier Romero

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4674 - 4683

Human motion modelling is a classical problem at the intersection of graphics and computer vision, with applications spanning human-computer interaction, motion synthesis, and motion prediction for virtual and augmented reality. Following the success of deep learning methods in several computer vision tasks, recent work has focused on using deep recurrent neural networks (RNNs) to model human motion,...

chapter

Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks

Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Qi Zhao, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3539 - 3548

We introduce a new problem of gaze anticipation on egocentric videos. This substantially extends the conventional gaze prediction problem to future frames by no longer confining it on the current frame. To solve this problem, we propose a new generative adversarial neural network based model, Deep Future Gaze (DFG). DFG generates multiple future frames conditioned on the single current frame and anticipates...

chapter

Deep neural network for manufacturing quality prediction

Yun Bai, Chuan Li, Zhenzhong Sun, Haibin Chen

2017 Prognostics and System Health Management Conference (PHM-Harbin) > 1 - 5

Expected product quality is affected by multiple parameters in complex manufacturing processes. Product quality prediction can offer the possibility of designing better system parameters at the early production stage. Many existing approaches fail to provide favorable results due to shallow architectures in the prediction model that cannot sufficiently learn multi-parameter features. To address this...

Data set: ieee
Keywords: TRAINING, PREDICTIVE MODELS

Content availability

Available (2,620)
None (26)

Publication type

book (2,436)
article (210)

Keywords

ARTIFICIAL NEURAL NETWORKS (818)
DATA MODELS (695)
MATHEMATICAL MODEL (448)
SUPPORT VECTOR MACHINES (426)
FORECASTING (400)
NEURAL NETS (384)
COMPUTATIONAL MODELING (351)
ACCURACY (310)
PREDICTION ALGORITHMS (296)
NEURONS (247)
DATA MINING (233)
NEURAL NETWORKS (225)
BACKPROPAGATION (222)
TIME SERIES ANALYSIS (214)
TESTING (186)
FEATURE EXTRACTION (184)
KERNEL (176)
MACHINE LEARNING (166)
BIOLOGICAL SYSTEM MODELING (163)
NEURAL NETWORK (149)
LEARNING (ARTIFICIAL INTELLIGENCE) (146)
ARTIFICIAL NEURAL NETWORK (142)
PREDICTION (142)
OPTIMIZATION (137)
REGRESSION ANALYSIS (131)
ANALYTICAL MODELS (128)
CORRELATION (122)
LOAD MODELING (120)
BP NEURAL NETWORK (118)
TRAINING DATA (115)
GENETIC ALGORITHMS (109)
SUPPORT VECTOR MACHINE (107)
BIOLOGICAL NEURAL NETWORKS (100)
INDEXES (96)
ADAPTATION MODELS (94)
TIME SERIES (92)
CLASSIFICATION ALGORITHMS (88)
HIDDEN MARKOV MODELS (87)
FORECASTING THEORY (81)
LOAD FORECASTING (81)
VECTORS (80)
MEASUREMENT (79)
ESTIMATION (71)
DECISION TREES (70)
SOFTWARE (66)
PARTICLE SWARM OPTIMIZATION (60)
ALGORITHM DESIGN AND ANALYSIS (57)
ATMOSPHERIC MODELING (57)
GENETIC ALGORITHM (57)
PRODUCTION ENGINEERING COMPUTING (57)
RADIAL BASIS FUNCTION NETWORKS (57)
DATABASES (56)
LOGISTICS (55)
MONITORING (55)
PRINCIPAL COMPONENT ANALYSIS (55)
SVM (55)
VISUALIZATION (55)
ADAPTATION MODEL (52)
PATTERN CLASSIFICATION (51)
SEMANTICS (50)
STOCK MARKETS (49)
CONTEXT (46)
EDUCATIONAL INSTITUTIONS (46)
SPEECH (46)
BUILDINGS (45)
GEOPHYSICS COMPUTING (45)
EQUATIONS (44)
PARTICLE SWARM OPTIMISATION (43)
RECURRENT NEURAL NETWORKS (43)
SUPPORT VECTOR REGRESSION (43)
WAVELET TRANSFORMS (43)
CONVERGENCE (42)
STATISTICAL ANALYSIS (42)
CLASSIFICATION (41)
COMPLEXITY THEORY (40)
LEAST SQUARES APPROXIMATIONS (40)
POWER ENGINEERING COMPUTING (40)
BENCHMARK TESTING (39)
STANDARDS (39)
WEATHER FORECASTING (39)
BAYES METHODS (38)
DEEP LEARNING (37)
COMPANIES (36)
COMPUTER ARCHITECTURE (36)
FUZZY NEURAL NETS (35)
NOISE (34)
PRODUCTION (34)
ROADS (34)
FEEDFORWARD NEURAL NETS (33)
MULTILAYER PERCEPTRONS (33)
PREDICTION MODEL (33)
RIVERS (33)
WIND SPEED (33)
AUTOREGRESSIVE PROCESSES (32)
GAUSSIAN PROCESSES (32)
ANN (31)
ARTIFICIAL INTELLIGENCE (31)
CONTEXT MODELING (31)

INFONA - science communication portal
