Search results

Items from 101 to 120 out of 911 results

1 ...
3
4
5
6
7
8
9

chapter

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1647 - 1655

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The FlowNet demonstrated that optical flow estimation can be cast as a learning problem. However, the state of the art with regard to the quality of the flow has still been defined by traditional methods. Particularly on small displacements and real-world data, FlowNet cannot compete with variational methods. In this paper, we advance the concept of end-to-end learning of optical flow and make it...

chapter

UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

Iasonas Kokkinos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5454 - 5463

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work we train in an end-to-end manner a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture. Such a network can act like a swiss knife for vision tasks, we call it an UberNet to indicate its overarching nature. The main contribution of this work consists in handling challenges that emerge when scaling up to many tasks. We...

chapter

Switching Convolutional Neural Network for Crowd Counting

Deepak Babu Sam, Shiv Surya, R. Venkatesh Babu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4031 - 4039

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel crowd counting model that maps a given crowd scene to its density. Crowd analysis is compounded by myriad of factors like inter-occlusion between people due to extreme crowding, high similarity of appearance between people and background elements, and large variability of camera view-points. Current state-of-the art approaches tackle these factors by using multi-scale CNN architectures,...

chapter

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

Zhaofan Qiu, Ting Yao, Tao Mei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4085 - 4094

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep convolutional neural networks (CNNs) have proven highly effective for visual recognition, where learning a universal representation from activations of convolutional layer plays a fundamental problem. In this paper, we present Fisher Vector encoding with Variational Auto-Encoder (FV-VAE), a novel deep architecture that quantizes the local activations of convolutional layer in a deep generative...

chapter

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3530 - 3538

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or simulation environment. We advocate learning a generic vehicle motion model from large scale crowd-sourced video data, and develop an end-to-end...

chapter

Improved Stereo Matching with Constant Highway Networks and Reflective Confidence Learning

Amit Shaked, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6901 - 6910

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an improved three-step pipeline for the stereo matching problem and introduce multiple novelties at each stage. We propose a new highway network architecture for computing the matching cost at each possible disparity, based on multilevel weighted residual shortcuts, trained with a hybrid loss that supports multilevel comparison of image patches. A novel post-processing step is then introduced,...

chapter

Budget-Aware Deep Semantic Video Segmentation

Behrooz Mahasseni, Sinisa Todorovic, Alan Fern

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2077 - 2086

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we study a poorly understood trade-off between accuracy and runtime costs for deep semantic video segmentation. While recent work has demonstrated advantages of learning to speed-up deep activity detection, it is not clear if similar advantages will hold for our very different segmentation loss function, which is defined over individual pixels across the frames. In deep video segmentation,...

chapter

Building a Regular Decision Boundary with Deep Networks

Edouard Oyallon

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1886 - 1894

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we build a generic architecture of Convolutional Neural Networks to discover empirical properties of neural networks. Our first contribution is to introduce a state-of-the-art framework that depends upon few hyper parameters and to study the network when we vary them. It has no max pooling, no biases, only 13 layers, is purely convolutional and yields up to 95.4% and 79.6% accuracy respectively...

chapter

Linking Image and Text with 2-Way Nets

Aviv Eisenschtat, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1855 - 1865

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Linking two data sources is a basic building block in numerous computer vision problems. Canonical Correlation Analysis (CCA) achieves this by utilizing a linear optimizer in order to maximize the correlation between the two views. Recent work makes use of non-linear models, including deep learning techniques, that optimize the CCA loss in some feature space. In this paper, we introduce a novel, bi-directional...

chapter

LCR-Net: Localization-Classification-Regression for Human Pose

Gregory Rogez, Philippe Weinzaepfel, Cordelia Schmid

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1216 - 1224

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose an end-to-end architecture for joint 2D and 3D human pose estimation in natural images. Key to our approach is the generation and scoring of a number of pose proposals per image, which allows us to predict 2D and 3D pose of multiple people simultaneously. Hence, our approach does not require an approximate localization of the humans for initialization. Our architecture, named LCR-Net, contains...

chapter

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1302 - 1310

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an approach to efficiently detect the 2D pose of multiple people in an image. The approach uses a nonparametric representation, which we refer to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image. The architecture encodes global context, allowing a greedy bottom-up parsing step that maintains high accuracy while achieving realtime performance,...

chapter

Deep neural network for manufacturing quality prediction

Yun Bai, Chuan Li, Zhenzhong Sun, Haibin Chen

2017 Prognostics and System Health Management Conference (PHM-Harbin) > 1 - 5

2017 Prognostics and System Health Management Conference (PHM-Harbin)

Expected product quality is affected by multi-parameter in complex manufacturing processes. Product quality prediction can offer the possibility of designing better system parameters at the early production stage. Many existing approaches fail at providing favorable results duo to shallow architecture in prediction model that can not learn multi-parameter's features insufficiently. To address this...

chapter

Improving the lenet with batch normalization and online hard example mining for digits recognition

Yiliang Xie, Hongyuan Jin, Eric C.C. Tsang

2017 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR) > 149 - 153

2017 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)

Nowadays, applications based on digits recognition and characters recognition have become much more reliable thanks to the rapid development of the DNN(deep neural network) architecture and constantly increasing the efficiency to the computing resources. A lot of methods have been proposed to improve the performance of DNNs, such as the ReLU (Rectified Linear Unit) which is a widely used alternative...

chapter

Fusion of statistical and learnt features for SAR images classification

Chu He, Xinlong Liu, Gong Han, Chenyao Kang, more

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 5490 - 5493

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

Deep-learning-based methods often suffer from insufficient training samples when they are directly used in the task of Synthetical Aperture Radar (SAR) images classification, which in turn leads to poor performance. To alleviate this problem, this paper presents a feature-fused approach, in which several statistical features of SAR images are extracted and integrated into the first layer of a typical...

chapter

Deep speckle noise filtering

S. Foucher, M. Beaulieu, M. Dahmane, F. Cavayas

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 5311 - 5314

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

Speckle removal from single-channel and multi-dimensional SAR remains a difficult problem. In this paper, we are investigating the use of a Convolutional Neural Network (CNN), previously applied to the Super-Resolution problem, for speckle removal. Because speckle noise statistics is signal dependent, we are training the neural network on the residual image formed by the ratio of the observed intensity...

chapter

A deep convolutional neural network, with pre-training, for solar photovoltaic array detection in aerial imagery

Jordan M. Malof, Leslie M. Collins, Kyle Bradbury

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 874 - 877

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

In this work we consider the problem of developing algorithms that automatically identify small-scale solar photovoltaic arrays in high resolution aerial imagery. Such algorithms potentially offer a faster and cheaper solution to collecting small-scale photovoltaic (PV) information, such as their location, capacity, and the energy they produce. Here we build on previous algorithmic work by employing...

chapter

Urban land cover classification with missing data using deep convolutional neural networks

Michael Kampffmeyer, Arnt-Borre Salberg, Robert Jenssen

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 5161 - 5164

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

Fusing different sensors with different data modalities is a common technique to improve land cover classification performance in remote sensing. However, all modalities are rarely available for all test data, and this missing data problem poses severe challenges for multi-modal learning. Inspired by recent successes in deep learning, we propose as a remedy a convolutional neural network architecture...

chapter

CATERPILLAR: Coarse Grain Reconfigurable Architecture for accelerating the training of Deep Neural Networks

Yuanfang Li, Ardavan Pedram

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 1 - 10

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Accelerating the inference of a trained DNN is a well studied subject. In this paper we switch the focus to the training of DNNs. The training phase is compute intensive, demands complicated data communication, and contains multiple levels of data dependencies and parallelism. This paper presents an algorithm/architecture space exploration of efficient accelerators to achieve better network convergence...

chapter

Deep learning for multimodal-based video interestingness prediction

Yuesong Shen, Claire-Heiene Demarty, Ngoc Q. K. Duong

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1003 - 1008

2017 IEEE International Conference on Multimedia and Expo (ICME)

Predicting interestingness of media content remains an important, but challenging research subject. The difficulty comes first from the fact that, besides being a high-level semantic concept, interestingness is highly subjective and its global definition has not been agreed yet. This paper presents the use of up-to-date deep learning techniques for solving the task. We perform experiments with both...

chapter

Facial attractiveness computation by label distribution learning with deep CNN and geometric features

Shu Liu, Bo Li, Yang-Yu Fan, Zhe Quo, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1344 - 1349

2017 IEEE International Conference on Multimedia and Expo (ICME)

Facial attractiveness computation is a challenging task because of the lack of labeled data and discriminative features. In this paper, an end-to-end label distribution learning (LDL) framework with deep convolutional neural network (CNN) and geometric features is proposed to meet these two challenges. Different from the previous work, we recast this task as an LDL problem. Compared with the single...

1 ...
3
4
5
6
7
8
9

Keywords:
TRAINING
COMPUTER ARCHITECTURE

Publication date

Set your own date range

Content availability

Available (908)
None (3)

Keywords

FEATURE EXTRACTION (176)
NEURAL NETWORKS (171)
NEURONS (157)
ARTIFICIAL NEURAL NETWORKS (154)
COMPUTATIONAL MODELING (115)
MACHINE LEARNING (101)
SUPPORT VECTOR MACHINES (71)
BIOLOGICAL NEURAL NETWORKS (69)
DEEP LEARNING (67)
KERNEL (66)
ACCURACY (64)
DATABASES (63)
CONVOLUTION (62)
TESTING (59)
HARDWARE (57)
DATA MODELS (54)
LEARNING (ARTIFICIAL INTELLIGENCE) (54)
MICROPROCESSORS (53)
SOFTWARE (51)
NEURAL NETS (45)
DATA MINING (44)
IMAGE SEGMENTATION (44)
CLASSIFICATION ALGORITHMS (43)
MATHEMATICAL MODEL (43)
VISUALIZATION (43)
OPTIMIZATION (40)
TRAINING DATA (40)
ALGORITHM DESIGN AND ANALYSIS (39)
COMPUTER VISION (39)
FIELD PROGRAMMABLE GATE ARRAYS (37)
COMPUTERS (35)
PREDICTIVE MODELS (35)
FACE (31)
NEURAL NETWORK (31)
CONTEXT (29)
ESTIMATION (29)
LOGIC GATES (29)
SERVERS (29)
IMAGE CLASSIFICATION (28)
SEMANTICS (27)
VECTORS (27)
CONVOLUTIONAL NEURAL NETWORKS (26)
CONFERENCES (25)
CORRELATION (25)
HIDDEN MARKOV MODELS (25)
RECURRENT NEURAL NETWORKS (25)
BACKPROPAGATION (24)
BENCHMARK TESTING (24)
IMAGE RECOGNITION (24)
MONITORING (24)
SPEECH RECOGNITION (24)
STANDARDS (24)
CLASSIFICATION (23)
COMPLEXITY THEORY (22)
CONVOLUTIONAL NEURAL NETWORK (22)
MULTILAYER PERCEPTRONS (22)
PATTERN RECOGNITION (22)
ROBUSTNESS (22)
ENCODING (21)
SPEECH (21)
CAMERAS (20)
EDUCATIONAL INSTITUTIONS (20)
FACE RECOGNITION (20)
ARTIFICIAL INTELLIGENCE (19)
BIOLOGICAL SYSTEM MODELING (19)
ELECTRONIC MAIL (19)
GENETIC ALGORITHMS (19)
IMAGE COLOR ANALYSIS (19)
NEURAL NET ARCHITECTURE (19)
SHAPE (19)
MEASUREMENT (18)
OBJECT RECOGNITION (18)
SIGNAL PROCESSING (18)
SOFTWARE ARCHITECTURE (18)
ADAPTATION MODELS (17)
CONVERGENCE (17)
PROPOSALS (17)
TRANSFORMS (17)
ACOUSTICS (16)
ANALYTICAL MODELS (16)
CLUSTERING ALGORITHMS (16)
RADIAL BASIS FUNCTION NETWORKS (16)
SUPPORT VECTOR MACHINE CLASSIFICATION (16)
TIME SERIES ANALYSIS (16)
UNSUPERVISED LEARNING (16)
BUILDINGS (15)
FPGA (15)
GAMES (15)
HISTOGRAMS (15)
INTERNET (15)
NOISE (15)
PRINCIPAL COMPONENT ANALYSIS (15)
ARTIFICIAL NEURAL NETWORK (14)
DETECTORS (14)
EDUCATION (14)
INDEXES (14)
OBJECT DETECTION (14)
PATTERN CLASSIFICATION (14)
more

INFONA - science communication portal

Search results

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

Switching Convolutional Neural Network for Crowd Counting

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Improved Stereo Matching with Constant Highway Networks and Reflective Confidence Learning

Budget-Aware Deep Semantic Video Segmentation

Building a Regular Decision Boundary with Deep Networks

Linking Image and Text with 2-Way Nets

LCR-Net: Localization-Classification-Regression for Human Pose

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Deep neural network for manufacturing quality prediction

Improving the lenet with batch normalization and online hard example mining for digits recognition

Fusion of statistical and learnt features for SAR images classification

Deep speckle noise filtering

A deep convolutional neural network, with pre-training, for solar photovoltaic array detection in aerial imagery

Urban land cover classification with missing data using deep convolutional neural networks

CATERPILLAR: Coarse Grain Reconfigurable Architecture for accelerating the training of Deep Neural Networks

Deep learning for multimodal-based video interestingness prediction

Facial attractiveness computation by label distribution learning with deep CNN and geometric features

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options