Advanced search

Items from 101 to 120 out of 1,013 results

chapter

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3309 - 3318

Semantic image segmentation is an essential component of modern autonomous driving systems, as an accurate understanding of the surrounding scene is crucial to navigation and action planning. Current state-of-the-art approaches in semantic image segmentation rely on pre-trained networks that were initially developed for classifying images as a whole. While these networks exhibit outstanding recognition...

chapter

RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation

Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5168 - 5177

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification problems such as semantic segmentation. However, repeated subsampling operations like pooling or convolution striding in deep CNNs lead to a significant decrease in the initial image resolution. Here, we present RefineNet, a...

chapter

End-to-End Representation Learning for Correlation Filter Based Tracking

Jack Valmadre, Luca Bertinetto, Joao Henriques, Andrea Vedaldi, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5000 - 5008

The Correlation Filter is an algorithm that trains a linear template to discriminate between images and their translations. It is well suited to object tracking because its formulation in the Fourier domain provides a fast solution, enabling the detector to be re-trained once per frame. Previous works that use the Correlation Filter, however, have adopted features that were either manually designed...

chapter

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 257 - 265

Non-uniform blind deblurring for general dynamic scenes is a challenging computer vision problem, as blurs arise not only from multiple object motions but also from camera shake and scene depth variation. To remove these complicated motion blurs, conventional energy-optimization-based methods rely on simple assumptions, such as that the blur kernel is partially uniform or locally linear. Moreover, recent machine...

chapter

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition

Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4255 - 4263

This work is about recognizing human activities occurring in videos at distinct semantic levels, including individual actions, interactions, and group activities. The recognition is realized using a two-level hierarchy of Long Short-Term Memory (LSTM) networks, forming a feed-forward deep architecture, which can be trained end-to-end. In comparison with existing architectures of LSTMs, we make two...

chapter

SST: Single-Stream Temporal Action Proposals

Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6373 - 6382

Our paper presents a new approach for temporal detection of human actions in long, untrimmed video sequences. We introduce Single-Stream Temporal Action Proposals (SST), a new effective and efficient deep architecture for the generation of temporal action proposals. Our network can run continuously in a single stream over very long input video sequences, without the need to divide input into short...

chapter

FASON: First and Second Order Information Fusion Network for Texture Recognition

Xiyang Dai, Joe Yue-Hei Ng, Larry S. Davis

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6100 - 6108

Deep networks have shown impressive performance on many computer vision tasks. Recently, deep convolutional neural networks (CNNs) have been used to learn discriminative texture representations. One of the most successful approaches is the Bilinear CNN model, which explicitly captures the second-order statistics within deep features. However, these networks cut off the first-order information flow in the...

chapter

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach

Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2233 - 2241

We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and network architecture. They simply amount to at most a matrix inversion and multiplication, provided that we know the probability of each class being corrupted...

chapter

Weakly Supervised Cascaded Convolutional Networks

Ali Diba, Vivek Sharma, Ali Pazandeh, Hamed Pirsiavash, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5131 - 5139

Object detection is a challenging task in the visual understanding domain, and even more so if the supervision is weak. Recently, a few efforts to handle the task without expensive human annotations have been made using promising deep neural networks. A new architecture of cascaded networks is proposed to learn a convolutional neural network (CNN) under such conditions. We introduce two such architectures,...

chapter

Exploiting Saliency for Object Segmentation from Image Level Labels

Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5038 - 5047

There have been remarkable improvements in the semantic labelling task in recent years. However, state-of-the-art methods rely on large-scale pixel-level annotations. This paper studies the problem of training a pixel-wise semantic labeller network from image-level annotations of the present object classes. Recently, it has been shown that high quality seeds indicating discriminative object...

chapter

Universal Adversarial Perturbations

Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 86 - 94

Given a state-of-the-art deep neural network classifier, we show the existence of a universal (image-agnostic) and very small perturbation vector that causes natural images to be misclassified with high probability. We propose a systematic algorithm for computing universal perturbations, and show that state-of-the-art deep neural networks are highly vulnerable to such perturbations, albeit being quasi-imperceptible...

chapter

Learning Detection with Diverse Proposals

Samaneh Azadi, Jiashi Feng, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7369 - 7377

To predict a set of diverse and informative proposals with enriched representations, this paper introduces a differentiable Determinantal Point Process (DPP) layer that is able to augment the object detection architectures. Most modern object detection architectures, such as Faster R-CNN, learn to localize objects by minimizing deviations from the ground truth, but ignore correlation between multiple...

chapter

Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer

Xin Wang, Geoffrey Oxholm, Da Zhang, Yuan-Fang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7178 - 7186

Transferring artistic styles onto everyday photographs has become an extremely popular task in both academia and industry. Recently, offline training has replaced online iterative optimization, enabling nearly real-time stylization. When those stylization networks are applied directly to high-resolution images, however, the style of localized regions often appears less similar to the desired artistic...

chapter

Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5977 - 5986

We propose a new method for creating computationally efficient and compact convolutional neural networks (CNNs) using a novel sparse connection structure that resembles a tree root. This allows a significant reduction in computational cost and number of parameters compared to state-of-the-art deep CNNs, without compromising accuracy, by exploiting the sparsity of inter-layer filter dependencies. We...

chapter

DeMoN: Depth and Motion Network for Learning Monocular Stereo

Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5622 - 5631

In this paper we formulate structure from motion as a learning problem. We train a convolutional network end-to-end to compute depth and camera motion from successive, unconstrained image pairs. The architecture is composed of multiple stacked encoder-decoder networks, the core part being an iterative network that is able to improve its own predictions. The network estimates not only depth and motion,...

chapter

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1647 - 1655

The FlowNet demonstrated that optical flow estimation can be cast as a learning problem. However, the state of the art with regard to the quality of the flow has still been defined by traditional methods. Particularly on small displacements and real-world data, FlowNet cannot compete with variational methods. In this paper, we advance the concept of end-to-end learning of optical flow and make it...

chapter

UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

Iasonas Kokkinos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5454 - 5463

In this work we train in an end-to-end manner a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture. Such a network can act like a Swiss knife for vision tasks; we call it an UberNet to indicate its overarching nature. The main contribution of this work consists in handling challenges that emerge when scaling up to many tasks. We...

chapter

Switching Convolutional Neural Network for Crowd Counting

Deepak Babu Sam, Shiv Surya, R. Venkatesh Babu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4031 - 4039

We propose a novel crowd counting model that maps a given crowd scene to its density. Crowd analysis is compounded by a myriad of factors such as inter-occlusion between people due to extreme crowding, high similarity of appearance between people and background elements, and large variability of camera viewpoints. Current state-of-the-art approaches tackle these factors by using multi-scale CNN architectures,...

chapter

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

Zhaofan Qiu, Ting Yao, Tao Mei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4085 - 4094

Deep convolutional neural networks (CNNs) have proven highly effective for visual recognition, where learning a universal representation from the activations of a convolutional layer is a fundamental problem. In this paper, we present Fisher Vector encoding with Variational Auto-Encoder (FV-VAE), a novel deep architecture that quantizes the local activations of a convolutional layer in a deep generative...

chapter

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3530 - 3538

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or simulation environment. We advocate learning a generic vehicle motion model from large scale crowd-sourced video data, and develop an end-to-end...

Keywords:
TRAINING
COMPUTER ARCHITECTURE

Content availability

Available (1,010)
None (3)

Publication type

book (911)
article (102)

Keywords

FEATURE EXTRACTION (195)
NEURAL NETWORKS (190)
NEURONS (175)
ARTIFICIAL NEURAL NETWORKS (169)
COMPUTATIONAL MODELING (124)
MACHINE LEARNING (112)
DEEP LEARNING (82)
SUPPORT VECTOR MACHINES (80)
BIOLOGICAL NEURAL NETWORKS (74)
KERNEL (74)
CONVOLUTION (70)
ACCURACY (67)
HARDWARE (67)
DATABASES (64)
TESTING (64)
LEARNING (ARTIFICIAL INTELLIGENCE) (57)
MICROPROCESSORS (57)
DATA MODELS (56)
SOFTWARE (52)
NEURAL NETS (50)
IMAGE SEGMENTATION (49)
VISUALIZATION (47)
ALGORITHM DESIGN AND ANALYSIS (45)
DATA MINING (45)
MATHEMATICAL MODEL (45)
TRAINING DATA (45)
OPTIMIZATION (44)
CLASSIFICATION ALGORITHMS (43)
FIELD PROGRAMMABLE GATE ARRAYS (43)
COMPUTER VISION (40)
VECTORS (38)
LOGIC GATES (36)
PREDICTIVE MODELS (36)
COMPUTERS (35)
FACE (34)
SEMANTICS (33)
CONTEXT (31)
ESTIMATION (31)
NEURAL NETWORK (31)
CONVOLUTIONAL NEURAL NETWORKS (30)
IMAGE CLASSIFICATION (30)
RECURRENT NEURAL NETWORKS (30)
SERVERS (30)
CLASSIFICATION (29)
BENCHMARK TESTING (27)
CORRELATION (26)
HIDDEN MARKOV MODELS (26)
IMAGE RECOGNITION (26)
MONITORING (26)
ROBUSTNESS (26)
SPEECH RECOGNITION (26)
STANDARDS (26)
BACKPROPAGATION (25)
CONFERENCES (25)
CONVOLUTIONAL NEURAL NETWORK (24)
MULTILAYER PERCEPTRONS (24)
PATTERN RECOGNITION (24)
FACE RECOGNITION (23)
IMAGE COLOR ANALYSIS (23)
SPEECH (23)
ARTIFICIAL INTELLIGENCE (22)
COMPLEXITY THEORY (22)
ENCODING (21)
OBJECT RECOGNITION (21)
SHAPE (21)
CAMERAS (20)
EDUCATIONAL INSTITUTIONS (20)
GENETIC ALGORITHMS (20)
NEURAL NET ARCHITECTURE (20)
SIGNAL PROCESSING (20)
ANALYTICAL MODELS (19)
BIOLOGICAL SYSTEM MODELING (19)
CONVERGENCE (19)
ELECTRONIC MAIL (19)
MEASUREMENT (19)
HISTOGRAMS (18)
PROPOSALS (18)
SOFTWARE ARCHITECTURE (18)
TRANSFORMS (18)
UNSUPERVISED LEARNING (18)
ACOUSTICS (17)
ADAPTATION MODELS (17)
DETECTORS (17)
OBJECT DETECTION (17)
RADIAL BASIS FUNCTION NETWORKS (17)
SENSORS (17)
BUILDINGS (16)
CLUSTERING ALGORITHMS (16)
DEEP NEURAL NETWORKS (16)
GAMES (16)
INTERNET (16)
PATTERN CLASSIFICATION (16)
PRINCIPAL COMPONENT ANALYSIS (16)
PROGRAMMING (16)
RANDOM ACCESS MEMORY (16)
SUPPORT VECTOR MACHINE CLASSIFICATION (16)
TIME SERIES ANALYSIS (16)
EDUCATION (15)

INFONA - science communication portal
