Advanced search

chapter

Improving Training of Deep Neural Networks via Singular Value Bounding

Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3994 - 4002

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning methods achieve great success recently on many computer vision problems. In spite of these practical successes, optimization of deep networks remains an active topic in deep learning research. In this work, we focus on investigation of the network solution properties that can potentially lead to good performance. Our research is inspired by theoretical and empirical results that use...

chapter

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6137 - 6145

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most current semantic segmentation methods rely on fully convolutional networks (FCNs). However, their use of large receptive fields and many pooling layers cause low spatial resolution inside the deep layers. This leads to predictions with poor localization around the boundaries. Prior work has attempted to address this issue by post-processing predictions with CRFs or MRFs. But such models often...

chapter

Adversarial Discriminative Domain Adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2962 - 2971

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They can also improve recognition despite the presence of domain shift or dataset bias: recent adversarial approaches to unsupervised domain adaptation reduce the difference between the training and test domain distributions and thus improve generalization...

chapter

Learning Diverse Image Colorization

Aditya Deshpande, Jiajun Lu, Mao-Chuang Yeh, Min Jin Chong, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2877 - 2885

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Colorization is an ambiguous problem, with multiple viable colorizations for a single grey-level image. However, previous methods only produce the single most probable colorization. Our goal is to model the diversity intrinsic to the problem of colorization and produce multiple colorizations that display long-scale spatial co-ordination. We learn a low dimensional embedding of color fields using a...

chapter

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Guillermo Garcia-Hernando, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 407 - 415

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A human action can be seen as transitions between ones body poses over time, where the transition depicts a temporal relation between two poses. Recognizing actions thus involves learning a classifier sensitive to these pose transitions as well as to static poses. In this paper, we introduce a novel method called transitions forests, an ensemble of decision trees that both learn to discriminate static...

chapter

Local Binary Convolutional Neural Networks

Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4284 - 4293

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose local binary convolution (LBC), an efficient alternative to convolutional layers in standard convolutional neural networks (CNN). The design principles of LBC are motivated by local binary patterns (LBP). The LBC layer comprises of a set of fixed sparse pre-defined binary convolutional filters that are not updated during the training process, a non-linear activation function and a set of...

chapter

Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation

Binghui Chen, Weihong Deng, Junping Du

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4021 - 4030

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Over the past few years, softmax and SGD have become a commonly used component and the default training strategy in CNN frameworks, respectively. However, when optimizing CNNs with SGD, the saturation behavior behind softmax always gives us an illusion of training well and then is omitted. In this paper, we first emphasize that the early saturation behavior of softmax will impede the exploration of...

chapter

AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos

Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5699 - 5708

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel method for temporally pooling frames in a video for the task of human action recognition. The method is motivated by the observation that there are only a small number of frames which, together, contain sufficient information to discriminate an action class present in a video, from the rest. The proposed method learns to pool such discriminative and informative frames, while discarding...

chapter

Building a Regular Decision Boundary with Deep Networks

Edouard Oyallon

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1886 - 1894

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we build a generic architecture of Convolutional Neural Networks to discover empirical properties of neural networks. Our first contribution is to introduce a state-of-the-art framework that depends upon few hyper parameters and to study the network when we vary them. It has no max pooling, no biases, only 13 layers, is purely convolutional and yields up to 95.4% and 79.6% accuracy respectively...

chapter

LCR-Net: Localization-Classification-Regression for Human Pose

Gregory Rogez, Philippe Weinzaepfel, Cordelia Schmid

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1216 - 1224

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose an end-to-end architecture for joint 2D and 3D human pose estimation in natural images. Key to our approach is the generation and scoring of a number of pose proposals per image, which allows us to predict 2D and 3D pose of multiple people simultaneously. Hence, our approach does not require an approximate localization of the humans for initialization. Our architecture, named LCR-Net, contains...

chapter

Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects

Ting Yao, Yingwei Pan, Yehao Li, Tao Mei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5263 - 5271

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Image captioning often requires a large set of training image-sentence pairs. In practice, however, acquiring sufficient training pairs is always expensive, making the recent captioning models limited in their ability to describe objects outside of training corpora (i.e., novel objects). In this paper, we present Long Short-Term Memory with Copying Mechanism (LSTM-C) — a new architecture...

chapter

Evaluation of target segmentation on SAR target recognition

Baiyuan Ding, Gongjian Wen, Conghui Ma, Xiaoliang Yang

2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS) > 663 - 667

2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS)

Target segmentation of synthetic aperture radar (SAR) images is one of the challenging problems in SAR image interpretation, which often serves as a processing step for SAR target recognition. Target segmentation tries to separate the target from the background thus eliminating the interference of background noises or clutters. However, the segmentation may also discard a part of the target characteristics...

chapter

Classification of VHR remote sensing images using local feature-based attribute profiles

Minh-Tan Pham, Sebastien Lefevre, Erchan Aptoula, Bharath Bhushan Damodaran

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 747 - 750

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

The present paper introduces an extension of attribute profiles (APs) by extracting their local features. The so-called local feature-based attribute profiles (LFAPs) are expected to provide a better characterization of each APs' filtered pixel (i.e. APs' sample) within its neighborhood, hence better deal with local texture information from the image's content. In this work, LFAP is constructed by...

chapter

Enhancing Accuracy of Multi-Class Support Vector Machine by Applying Directed Acyclic Graphs

Zhi Li, Zhao Niu, Kun Lu, Yue Ma

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 307 - 311

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Multi-class classification algorithm of support vector machine (SVM) has always been a research hotspot. A new multi-class SVM algorithm, naming recall reordering adaptive directed acyclic graphs (RRADAG), is proposed from the perspective of error detection to solve the error accumulation existed in multi-class SVM algorithm of which is based on Directed Acyclic Graphs (DAG). By detecting the output...

chapter

A Multi-Label Classification Method on Chinese Temporal Expressions Based on Character Embedding

Baosheng Yin, Bowen Jin

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 51 - 54

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Understanding temporal expressions is the important foundation of many NLP tasks. However, the varied representations of temporal expressions is difficulty in analysis and understanding. To parsing expressions, an effective classification method of temporal expressions is significant. A temporal expression may belong to one or more classes, but the classification usually requires manual annotation...

chapter

Loss Max-Pooling for Semantic Image Segmentation

Samuel Rota Bulo, Gerhard Neuhold, Peter Kontschieder

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7082 - 7091

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce a novel loss max-pooling concept for handling imbalanced training data distributions, applicable as alternative loss layer in the context of deep neural networks for semantic image segmentation. Most real-world semantic segmentation datasets exhibit long tail distributions with few object categories comprising the majority of data and consequently biasing the classifiers towards them...

chapter

Nonlinear statistical retrieval of surface emissivity from IASI data

Valero Laparra, Jordi Munoz-Mari, Luis Gomez-Chova, Xavier Calbet, more

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 5450 - 5453

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

Emissivity is one of the most important parameters to improve the determination of the troposphere properties (thermodynamic properties, aerosols and trace gases concentration) and it is essential to estimate the radiative budget. With the second generation of infrared sounders, we can estimate emissivity spectra at high spectral resolution, which gives us a global view and long-term monitoring of...

chapter

Temporally Steered Gaussian Attention for Video Understanding

Shagan Sah, Thang Nguyen, Miguel Dominguez, Felipe Petroski Such, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2208 - 2216

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Recent advances in video understanding are enabling incredible developments in video search, summarization, automatic captioning and human computer interaction. Attention mechanisms are a powerful way to steer focus onto different sections of the video. Existing mechanisms are driven by prior training probabilities and require input instances of identical temporal duration. We introduce an intuitive...

chapter

RATM: Recurrent Attentive Tracking Model

Samira Ebrahimi Kahou, Vincent Michalski, Roland Memisevic, Christopher Pal, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1613 - 1622

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

We present an attention-based modular neural framework for computer vision. The framework uses a soft attention mechanism allowing models to be trained with gradient descent. It consists of three modules: a recurrent attention module controlling where to look in an image or video frame, a feature-extraction module providing a representation of what is seen, and an objective module formalizing why...

chapter

Therapeutic effects of an anti-gravity locomotor training (AlterG) on postural balance and cerebellum structure in children with Cerebral Palsy

A. H. Rasooli, P. M. Birgani, Sh. Azizi, A. Shahrokhi, more

2017 International Conference on Rehabilitation Robotics (ICORR) > 101 - 105

2017 International Conference on Rehabilitation Robotics (ICORR)

We evaluated the therapeutic effects of anti-gravity locomotor treadmill (AlterG) training on postural stability in children with Cerebral Palsy (CP) and spasticity, particularly in the lower extremity. AlterG can facilitate walking by reducing the weight of CP children by up to 80%; it can also help subjects maintain an appropriate posture during the locomotor AlterG training. Thus, we hypothesized...

INFONA - science communication portal

Advanced search

Advanced search in people

Improving Training of Deep Neural Networks via Singular Value Bounding

Convolutional Random Walk Networks for Semantic Image Segmentation

Adversarial Discriminative Domain Adaptation

Learning Diverse Image Colorization

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Local Binary Convolutional Neural Networks

Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation

AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos

Building a Regular Decision Boundary with Deep Networks

LCR-Net: Localization-Classification-Regression for Human Pose

Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects

Evaluation of target segmentation on SAR target recognition

Classification of VHR remote sensing images using local feature-based attribute profiles

Enhancing Accuracy of Multi-Class Support Vector Machine by Applying Directed Acyclic Graphs

A Multi-Label Classification Method on Chinese Temporal Expressions Based on Character Embedding

Loss Max-Pooling for Semantic Image Segmentation

Nonlinear statistical retrieval of surface emissivity from IASI data

Temporally Steered Gaussian Attention for Video Understanding

RATM: Recurrent Attentive Tracking Model

Therapeutic effects of an anti-gravity locomotor training (AlterG) on postural balance and cerebellum structure in children with Cerebral Palsy

Filter options

Publication date

Content availability

Publication type

Publication language

Keywords

Data set

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Publication language

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options