Search results

chapter

Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources

Adrian Bulat, Georgios Tzimiropoulos

2017 IEEE International Conference on Computer Vision (ICCV) > 3726 - 3734

2017 IEEE International Conference on Computer Vision (ICCV)

Our goal is to design architectures that retain the groundbreaking performance of CNNs for landmark localization and at the same time are lightweight, compact and suitable for applications with limited computational resources. To this end, we make the following contributions: (a) we are the first to study the effect of neural network binarization on localization tasks, namely human pose estimation...

chapter

A minimal convolutional neural network for handwritten digit recognition

Matthew Y. W. Teow

2017 7th IEEE International Conference on System Engineering and Technology (ICSET) > 171 - 176

2017 7th IEEE International Conference on System Engineering and Technology (ICSET)

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network using a minimal model. The proposed minimal convolutional neural network is presented using a layering approach. This approach provides a clear understanding of the main mathematical operations in a convolutional neural network. Hence,...

chapter

Curriculum Dropout

Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3564 - 3572

2017 IEEE International Conference on Computer Vision (ICCV)

Dropout is a very effective way of regularizing neural networks. Stochastically “dropping out” units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. Besides, Dropout can be interpreted as an approximate model aggregation technique, where an exponential number of smaller networks are averaged in order...

chapter

DualNet: Learn Complementary Features for Image Recognition

Saihui Hou, Xu Liu, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 502 - 510

2017 IEEE International Conference on Computer Vision (ICCV)

In this work we propose a novel framework named Dual-Net aiming at learning more accurate representation for image recognition. Here two parallel neural networks are coordinated to learn complementary features and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...

chapter

Automatic gender recognition for “in the wild” facial images using convolutional neural networks

Sergiu Cosmin Nistor, Alexandra-Cristina Marina, Adrian Sergiu Darabant, Diana Borza

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 287 - 291

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

Automatic recognition of human demographical attributes has implications in a variety of domains, such as surveillance systems, human computer interaction, marketing etc. In this paper, we present an automatic gender recognition method from facial images based on convolutional neural networks. In order to train the network, we merged together several face databases and also gathered and annotated...

chapter

Classification of dental diseases using CNN and transfer learning

Shreyansh A. Prajapati, R. Nagaraj, Suman Mitra

2017 5th International Symposium on Computational and Business Intelligence (ISCBI) > 70 - 74

2017 5th International Symposium on Computational and Business Intelligence (ISCBI)

Automated medical assistance system is in high demand with the advances in research in the machine learning area. In many such applications, availability of labeled medical dataset is a primary challenge and dataset of dental diseases is not an exception. An attempt towards accurate classification of dental diseases is addressed in this paper. Labeled dataset consisting of 251 Radio Visiography (RVG)...

chapter

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach

Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2233 - 2241

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and network architecture. They simply amount to at most a matrix inversion and multiplication, provided that we know the probability of each class being corrupted...

chapter

Switching Convolutional Neural Network for Crowd Counting

Deepak Babu Sam, Shiv Surya, R. Venkatesh Babu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4031 - 4039

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel crowd counting model that maps a given crowd scene to its density. Crowd analysis is compounded by myriad of factors like inter-occlusion between people due to extreme crowding, high similarity of appearance between people and background elements, and large variability of camera view-points. Current state-of-the art approaches tackle these factors by using multi-scale CNN architectures,...

chapter

Linking Image and Text with 2-Way Nets

Aviv Eisenschtat, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1855 - 1865

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Linking two data sources is a basic building block in numerous computer vision problems. Canonical Correlation Analysis (CCA) achieves this by utilizing a linear optimizer in order to maximize the correlation between the two views. Recent work makes use of non-linear models, including deep learning techniques, that optimize the CCA loss in some feature space. In this paper, we introduce a novel, bi-directional...

chapter

Improving the lenet with batch normalization and online hard example mining for digits recognition

Yiliang Xie, Hongyuan Jin, Eric C.C. Tsang

2017 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR) > 149 - 153

2017 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)

Nowadays, applications based on digits recognition and characters recognition have become much more reliable thanks to the rapid development of the DNN(deep neural network) architecture and constantly increasing the efficiency to the computing resources. A lot of methods have been proposed to improve the performance of DNNs, such as the ReLU (Rectified Linear Unit) which is a widely used alternative...

chapter

When Kernel Methods Meet Feature Learning: Log-Covariance Network for Action Recognition From Skeletal Data

Jacopo Cavazza, Pietro Morerio, Vittorio Murino

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1251 - 1258

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Human action recognition from skeletal data is a hot research topic and important in many open domain applications of computer vision, thanks to recently introduced 3D sensors. In the literature, naive methods simply transfer off-the-shelf techniques from video to the skeletal representation. However, the current state-of-the-art is contended between to different paradigms: kernel-based methods and...

chapter

Delving Deep Into Coarse-to-Fine Framework for Facial Landmark Localization

Xi Chen, Erjin Zhou, Yuchen Mo, Jiancheng Liu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2088 - 2095

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

In this paper we proposed a 4-stage coarse-to-fine framework to tackle the facial landmark localization problem in-the-wild. In our system, we first predict the landmark key points on a coarse level of granularity, which sets a good initialization for the whole framework. Then we group the key points into several components and refine each component with local patches cropped within them. After that...

chapter

Self-Supervised Neural Aggregation Networks for Human Parsing

Jian Zhao, Jianshu Li, Xuecheng Nie, Fang Zhao, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1595 - 1603

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

In this paper, we present a Self-Supervised Neural Aggregation Network (SS-NAN) for human parsing. SS-NAN adaptively learns to aggregate the multi-scale features at each pixel "address". In order to further improve the feature discriminative capacity, a self-supervised joint loss is adopted as an auxiliary learning strategy, which imposes human joint structures into parsing results without...

chapter

Investigating Nuisance Factors in Face Recognition with DCNN Representation

Claudio Ferrari, Giuseppe Lisanti, Stefano Berretti, Alberto Del Bimbo

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 583 - 591

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Deep learning based approaches proved to be dramatically effective to address many computer vision applications, including "face recognition in the wild". It has been extensively demonstrated that methods exploiting Deep Convolutional Neural Networks (DCNN) are powerful enough to overcome to a great extent many problems that negatively affected computer vision algorithms based on hand-crafted...

chapter

Image recognition method based on deep learning

Xin Jia

2017 29th Chinese Control And Decision Conference (CCDC) > 4730 - 4735

2017 29th Chinese Control And Decision Conference (CCDC)

Deep learning algorithms are a subset of the machine learning algorithms, which aim at discovering multiple levels of distributed representations. Recently, numerous deep learning algorithms have been proposed to solve traditional artificial intelligence problems. This work aims to review the state-of-the-art in deep learning algorithms in computer vision by highlighting the contributions and challenges...

chapter

Cross-spectral image patch similarity using convolutional neural network

Patricia L. Suarez, Angel D. Sappa, Boris X. Vintimilla

2017 IEEE International Workshop of Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM) > 1 - 5

2017 IEEE International Workshop of Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM)

The ability to compare image regions (patches) has been the basis of many approaches to core computer vision problems, including object, texture and scene categorization. Hence, developing representations for image patches have been of interest in several works. The current work focuses on learning similarity between cross-spectral image patches with a 2 channel convolutional neural network (CNN)...

chapter

Keynote: small neural nets are beautiful: enabling embedded systems with small deep-neural- network architectures

Forrest Iandola, Kurt Keutzer

2017 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) > 1 - 10

2017 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

Over the last five years Deep Neural Nets have offered more accurate solutions to many problems in speech recognition, and computer vision, and these solutions have surpassed a threshold of acceptability for many applications. As a result, Deep Neural Networks have supplanted other approaches to solving problems in these areas, and enabled many new applications. While the design of Deep Neural Nets...

chapter

Learning local descriptors by optimizing the keypoint-correspondence criterion

Nenad Markus, Igor S. Pandzic, Jorgen Ahlberg

2016 23rd International Conference on Pattern Recognition (ICPR) > 2380 - 2385

2016 23rd International Conference on Pattern Recognition (ICPR)

Current best local descriptors are learned on a large dataset of matching and non-matching keypoint pairs. However, data of this kind is not always available since detailed keypoint correspondences can be hard to establish. On the other hand, we can often obtain labels for pairs of keypoint bags. For example, keypoint bags extracted from two images of the same object under different views form a matching...

chapter

Object Depth Estimation from a Single Image Using Fully Convolutional Neural Network

Ahmed J. Afifi, Olaf Hellwich

2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 7

2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Convolutional Neural Network (CNN) has been used successfully in solving different computer vision tasks such as classification, detection, and segmentation. This paper addresses the problem of estimating object depth from a single RGB image. While stereo depth estimation is a straightforward task, predicting depth map of an object from a single RGB image is a more challenging task due to the lack...

chapter

Fast Computing Framework for Convolutional Neural Networks

Marcin Korytkowski, Pawel Staszewski, Piotr Woldan, Rafal Scherer

2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom) > 118 - 123

2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom)

In the case of building large convolutional neural networks, signal propagation speed is one of priority factors. Training large neural structures requires enormous time for achieving satisfying accuracy. In addition, the networks need to be learn by very large sets of good quality training images, which is another time-consuming factor. The paper presents a fast computing framework with some methods...

INFONA - science communication portal

Search results

Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources

A minimal convolutional neural network for handwritten digit recognition

Curriculum Dropout

DualNet: Learn Complementary Features for Image Recognition

Automatic gender recognition for “in the wild” facial images using convolutional neural networks

Classification of dental diseases using CNN and transfer learning

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach

Switching Convolutional Neural Network for Crowd Counting

Linking Image and Text with 2-Way Nets

Improving the lenet with batch normalization and online hard example mining for digits recognition

When Kernel Methods Meet Feature Learning: Log-Covariance Network for Action Recognition From Skeletal Data

Delving Deep Into Coarse-to-Fine Framework for Facial Landmark Localization

Self-Supervised Neural Aggregation Networks for Human Parsing

Investigating Nuisance Factors in Face Recognition with DCNN Representation

Image recognition method based on deep learning

Cross-spectral image patch similarity using convolutional neural network

Keynote: small neural nets are beautiful: enabling embedded systems with small deep-neural- network architectures

Learning local descriptors by optimizing the keypoint-correspondence criterion

Object Depth Estimation from a Single Image Using Fully Convolutional Neural Network

Fast Computing Framework for Convolutional Neural Networks

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options