Search results

Items from 1 to 20 out of 84 results

chapter

Deep affordance learning for single- and multiple-instance object detection

Jian-Gang Wang, Prabhu Shankar Mahendran, Eam-Khwang Teoh

TENCON 2017 - 2017 IEEE Region 10 Conference > 321 - 326

Affordance learning, in general, is to identify the purpose, use, and ways to interact with an object, based on information gained from observing the object. Most of the existing affordance learning approaches assume the object target has been cropped individually from images. However, the object could not be easily separated from others due to occlusion or noise. Actually, two or more neighboring...

chapter

AutoDIAL: Automatic Domain Alignment Layers

Fabio Maria Carlucci, Lorenzo Porzi, Barbara Caputo, Elisa Ricci, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 5077 - 5085

Classifiers trained on given databases perform poorly when tested on data acquired in different settings. This is explained in domain adaptation through a shift among distributions of the source and target domains. Attempts to align them have traditionally resulted in works reducing the domain shift by introducing appropriate loss terms, measuring the discrepancies between source and target distributions,...

chapter

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta

2017 IEEE International Conference on Computer Vision (ICCV) > 843 - 852

The success of deep learning in vision can be attributed to: (a) models with high capacity; (b) increased computational power; and (c) availability of large-scale labeled data. Since 2012, there have been significant advances in representation capabilities of the models and computational capabilities of GPUs. But the size of the biggest dataset has surprisingly remained constant. What will happen...

chapter

360° view camera based visual assistive technology for contextual scene information

Mazin Ali, Ferat Sahin, Shitij Kumar, Celal Savur

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 2135 - 2140

In this paper, a system to aid the visually impaired by providing contextual information of the surroundings using 360° view camera combined with deep learning is proposed. The system uses a 360° view camera with a mobile device to capture surrounding scene information and provide contextual information to the user in the form of audio. The scene information from the spherical camera feed is classified...

chapter

Towards modeling the learning process of aviators using deep reinforcement learning

Joost van Oijen, Gerald Poppinga, Olaf Brouwer, Andi Aliko, et al.

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 3439 - 3444

In this paper we report on our study of the performance of Deep Reinforcement Learning (DRL) agents in performing tasks that are illustrative for human Sensor Operators (SOs) in Remotely Piloted Aircraft Systems (RPASs). Our hypothesis is that the descriptive and predictive qualities of the agent's learning process potentially allow us to identify human task requirements, training needs, selection...

chapter

A minimal convolutional neural network for handwritten digit recognition

Matthew Y. W. Teow

2017 7th IEEE International Conference on System Engineering and Technology (ICSET) > 171 - 176

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network using a minimal model. The proposed minimal convolutional neural network is presented using a layering approach. This approach provides a clear understanding of the main mathematical operations in a convolutional neural network. Hence,...

chapter

DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning

Lili Pan, Samira Pouyanfar, Hao Chen, Jiaohua Qin, et al.

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC) > 181 - 189

Deep learning has brought a series of breakthroughs in image processing. Specifically, there are significant improvements in the application of food image classification using deep learning techniques. However, very little work has been studied for the classification of food ingredients. Therefore, this paper proposes a new framework, called DeepFood which not only extracts rich and effective features...

chapter

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

Abhishek Das, Satwik Kottur, Jose M. F. Moura, Stefan Lee, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 2970 - 2979

We introduce the first goal-driven training for visual question answering and dialog agents. Specifically, we pose a cooperative ‘image guessing’ game between two agents – Q-BOT and A-BOT – who communicate in natural language dialog so that Q-BOT can select an unseen image from a lineup of images. We use deep reinforcement learning (RL) to learn the policies of these agents end-to-end – from pixels...

chapter

Understanding convolutional neural networks using a minimal model for handwritten digit recognition

Matthew Y. W. Teow

2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS) > 167 - 172

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network (CNN) using a minimal model (Minimal CNN). The proposed minimal CNN is presented using a layering approach. This approach provides a concise and accessible understanding of the main mathematical operations of a CNN. Hence, it benefits...

chapter

Identity-Aware Textual-Visual Matching with Latent Co-attention

Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1908 - 1917

Textual-visual matching aims at measuring similarities between sentence descriptions and images. Most existing methods tackle this problem without effectively utilizing identity-level annotations. In this paper, we propose an identity-aware two-stage framework for the textual-visual matching problem. Our stage-1 CNN-LSTM network learns to embed cross-modal features with a novel Cross-Modal Cross-Entropy...

chapter

Improving the visualisation of 3D textured models via shadow detection and removal

Evangelos Maltezos, Anastasios Doulamis, Charalabos Ioannidis

2017 9th International Conference on Virtual Worlds and Games for Serious Applications (VS-Games) > 161 - 164

Although shadows in images have a constructive role providing a natural view of features of the scene, they also have a destructive role in image processing by hiding significant information. Improving the quality of 3D textured models for serious games and augmented reality applications via shadow detection and removal remains challenging due to the complexity of an image scene. This paper proposes...

chapter

Creation of a deep convolutional auto-encoder in Caffe

Volodymyr Turchenko, Artur Luczak

2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) > 2 > 651 - 659

The development of a deep (stacked) convolutional auto-encoder in the Caffe deep learning framework is presented in this paper. We describe simple principles which we used to create this model in Caffe. The proposed model of convolutional auto-encoder does not have pooling/unpooling layers yet. The results of our experimental research show comparable accuracy of dimensionality reduction in comparison...

chapter

Deep learning algorithm with visual impression

Funan He, Mengduo Yang, Fanzhang Li

2017 International Smart Cities Conference (ISC2) > 1 - 4

In this article, we develop two visual impression models: recognition model and generalization model to simulate the cognition process of human visual systems. We show how the visual impression learned with a deep neural network can be efficiently transferred to other visual recognition tasks. By reusing the hidden layers trained in an unsupervised way, we show that we can largely reduce the number...

chapter

Topological deep learning algorithm with visual impression

Mengduo Yang, Fanzhang Li

2017 International Smart Cities Conference (ISC2) > 1 - 4

We present in this paper a novel approach for training a topological deep neural network with visual impression. We show that by combining denoising auto-encoder model and contractive auto-encoder with Hessian regularization model, we can achieve a deterministic auto-encoder aiming for robustness to small variations of the input. We exploit the tangent propagation algorithm to show how our algorithm...

chapter

Dictionary learning for spontaneous neural activity modeling

Eirini Troullinou, Grigorios Tsagkatakis, Ganna Palagina, Maria Papadopouli, et al.

2017 25th European Signal Processing Conference (EUSIPCO) > 1579 - 1583

Modeling the activity of an ensemble of neurons can provide critical insights into the workings of the brain. In this work we examine if learning based signal modeling can contribute to a high quality modeling of neuronal signal data. To that end, we employ the sparse coding and dictionary learning schemes for capturing the behavior of neuronal responses into a small number of representative prototypical...

chapter

DLNE: A hybridization of deep learning and neuroevolution for visual control

Andreas Precht Poulsen, Mark Thorhauge, Mikkel Hvilshøj Funch, Sebastian Risi

2017 IEEE Conference on Computational Intelligence and Games (CIG) > 256 - 263

This paper investigates the potential of combining deep learning and neuroevolution to create a bot for a simple first person shooter (FPS) game capable of aiming and shooting based on high-dimensional raw pixel input. The deep learning component is responsible for visual recognition and translating raw pixels to compact feature representations, while the evolving network takes those features as inputs...

chapter

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning from Web Data

Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2915 - 2924

Large-scale datasets have driven the rapid development of deep neural networks for visual recognition. However, annotating a massive dataset is expensive and time-consuming. Web images and their labels are, in comparison, much easier to obtain, but direct training on such automatically harvested images can lead to unsatisfactory performance, because the noisy labels of Web images adversely affect the...

chapter

Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning

Zhengming Ding, Ming Shao, Yun Fu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6005 - 6013

Zero-shot learning for visual recognition has received much interest in the most recent years. However, the semantic gap across visual features and their underlying semantics is still the biggest obstacle in zero-shot learning. To fight off this hurdle, we propose an effective Low-rank Embedded Semantic Dictionary learning (LESD) through ensemble strategy. Specifically, we formulate a novel framework...

chapter

DeepPermNet: Visual Permutation Learning

Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6044 - 6052

We present a principled approach to uncover the structure of visual data by solving a novel deep learning task coined visual permutation learning. The goal of this task is to find the permutation that recovers the structure of data from shuffled versions of it. In the case of natural images, this task boils down to recovering the original image from patches shuffled by an unknown permutation matrix...

chapter

Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning

Weifeng Ge, Yizhou Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 10 - 19

Deep neural networks require a large amount of labeled training data during supervised learning. However, collecting and labeling so much data might be infeasible in many cases. In this paper, we introduce a deep transfer learning scheme, called selective joint fine-tuning, for improving the performance of deep learning tasks with insufficient training data. In this scheme, a target learning task...

Keywords:
VISUALIZATION
TRAINING
MACHINE LEARNING

Keywords

FEATURE EXTRACTION (33)
DEEP LEARNING (20)
LEARNING (ARTIFICIAL INTELLIGENCE) (18)
SUPPORT VECTOR MACHINES (11)
DATA MINING (10)
CONVOLUTION (8)
IMAGE CLASSIFICATION (8)
SEMANTICS (8)
COMPUTATIONAL MODELING (7)
COMPUTER VISION (7)
IMAGE RECOGNITION (7)
OBJECT RECOGNITION (7)
CAMERAS (6)
IMAGE RETRIEVAL (6)
NEURAL NETWORKS (6)
VOCABULARY (6)
ACCURACY (5)
CONVOLUTIONAL NEURAL NETWORK (5)
DATA MODELS (5)
ESTIMATION (5)
IMAGE COLOR ANALYSIS (5)
IMAGE SEGMENTATION (5)
KERNEL (5)
LINEAR PROGRAMMING (5)
NEURONS (5)
AUTOMATIC IMAGE ANNOTATION (4)
BIOLOGICAL NEURAL NETWORKS (4)
CONFERENCES (4)
DICTIONARIES (4)
GAMES (4)
HISTOGRAMS (4)
HUMANS (4)
IMAGE ANNOTATION (4)
MATHEMATICAL MODEL (4)
OBJECT DETECTION (4)
OPTIMIZATION (4)
PATTERN RECOGNITION (4)
ROBUSTNESS (4)
SVM (4)
ARTIFICIAL NEURAL NETWORK (3)
ARTIFICIAL NEURAL NETWORKS (3)
COGNITION (3)
COMPUTER ARCHITECTURE (3)
CONTENT-BASED RETRIEVAL (3)
CORRELATION (3)
DECODING (3)
DETECTORS (3)
ENCODING (3)
HANDWRITING RECOGNITION (3)
NOISE MEASUREMENT (3)
PROBABILITY (3)
TEXT ANALYSIS (3)
THREE-DIMENSIONAL DISPLAYS (3)
TRAINING DATA (3)
UNSUPERVISED LEARNING (3)
ADAPTATION MODELS (2)
ARTIFICIAL INTELLIGENCE (2)
BRAIN (2)
BRAIN DECODING (2)
CLASSIFICATION (2)
CLASSIFICATION ALGORITHMS (2)
CNN (2)
DATABASES (2)
ENSEMBLE OF CLASSIFIERS (2)
EQUATIONS (2)
FMRI (2)
GAUSSIAN PROCESSES (2)
HANDWRITTEN DIGIT RECOGNITION (2)
HIDDEN MARKOV MODELS (2)
INTERNET (2)
LEARNING SYSTEMS (2)
MACHINE LEARNING ALGORITHMS (2)
MEASUREMENT (2)
MULTIMEDIA COMMUNICATION (2)
OBJECT TRACKING (2)
ONTOLOGIES (2)
PIXEL (2)
ROBOT VISION (2)
SHAPE (2)
STANDARDS (2)
SURVEILLANCE (2)
TESTING (2)
VISUAL IMPRESSION (2)
VISUAL LOCALIZER (2)
WEB PAGES (2)
3D CONVOLUTIONAL NEURAL NETWORKS (1)
3D TEXTURED MODELLING (1)
ACTION RECOGNITION (1)
ADABOOST (1)
ADAPTIVE DEEP LEARNING (1)
ADAPTIVE NEURO FUZZY INFERENCE SYSTEM (1)
AERIAL REFUELING (1)
AFFORDANCE LEARNING (1)
AIRPLANES (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANALYTICAL MODELS (1)
AND FILTER (1)
