In this work, we present a network architecture to solve a supervised learning problem, the classification of a handwritten dataset, and a reinforcement learning problem, a complex first-person shooter 3D game environment. We used a deep neural network model to solve both problems. For classification, we used softmax regression and a cross-entropy loss to train the network. To play the game, we used...
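As background for the classification setup the abstract above describes, here is a minimal NumPy sketch of softmax outputs combined with a cross-entropy loss; the logits and labels are illustrative values, not the paper's data or code:

```python
import numpy as np

def softmax(logits):
    # Subtract the row max for numerical stability before exponentiating.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def cross_entropy(logits, labels):
    # Mean negative log-likelihood of the true class.
    probs = softmax(logits)
    n = logits.shape[0]
    return -np.log(probs[np.arange(n), labels]).mean()

# Two example samples with three classes each (made-up numbers).
logits = np.array([[2.0, 1.0, 0.1],
                   [0.5, 2.5, 0.3]])
labels = np.array([0, 1])
loss = cross_entropy(logits, labels)
```

Minimizing this loss by gradient descent is the standard training recipe for the classification head of such a network.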
Urban traffic flow is dynamic and uncertain. In this paper, we combine deep learning and reinforcement learning to design an intersection signal controller based on Q-learning and a convolutional neural network. We redefine the state space and the reward function. The training and simulation of the controller are carried out in the traffic micro-simulator SUMO. Compared with timing control, the...
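The abstract above pairs Q-learning with a convolutional network over a redefined state space. As an illustration of the underlying update rule only, here is a tabular Q-learning sketch; the discretized state (queue-length bins), the phase actions, and the waiting-time reward are hypothetical placeholders, not the paper's definitions:

```python
import numpy as np

# Hypothetical discretization: states index queue-length bins at the
# intersection, actions are the signal phases to activate next.
N_STATES, N_ACTIONS = 16, 2
ALPHA, GAMMA = 0.1, 0.95   # learning rate, discount factor

Q = np.zeros((N_STATES, N_ACTIONS))

def q_update(s, a, r, s_next):
    # Standard Q-learning target: r + gamma * max_a' Q(s', a').
    td_target = r + GAMMA * Q[s_next].max()
    Q[s, a] += ALPHA * (td_target - Q[s, a])

# Example transition; the reward here is a stand-in such as the negative
# change in total waiting time (the paper redefines its own reward).
q_update(s=3, a=1, r=-2.0, s_next=5)
```

In the deep variant, the table `Q` is replaced by a convolutional network that maps the raw state representation to action values.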
With the development of modern training, advanced requirements for the training process, such as creativity, economy, realism, and safety, have been proposed. It is hard for traditional training methods to satisfy the training requirements of new technology and equipment against the background of modern industry. Traditional training methods are facing severe challenges. In order to solve the complex technical training...
In this paper, a reinforcement learning approach is proposed to detect unexpected faults, where the noise-to-signal ratio of the data series is minimized to achieve robustness. The model parameter is taken as a special action of the reinforcement learning, and policy evaluation and policy improvement are used to find the parameters that make the estimated model consistent with the real-time...
Recently, a state-of-the-art algorithm called deep deterministic policy gradient (DDPG) has achieved good performance in many continuous control tasks in the MuJoCo simulator. To further improve the efficiency of the experience replay mechanism in DDPG, and thus speed up the training process, in this paper a prioritized experience replay method is proposed for the DDPG algorithm, where prioritized...
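Prioritized experience replay replaces the uniform sampling of DDPG's replay buffer with sampling proportional to each transition's TD error, so that surprising transitions are replayed more often. A minimal proportional-priority buffer sketch (a simplification for illustration, not the paper's implementation):

```python
import random

class PrioritizedReplay:
    """Minimal proportional prioritized replay buffer (illustrative sketch)."""

    def __init__(self, capacity, alpha=0.6):
        # alpha controls how strongly priorities skew the sampling.
        self.capacity, self.alpha = capacity, alpha
        self.buffer, self.priorities = [], []

    def add(self, transition, td_error):
        # Priority from TD error; the small constant keeps it nonzero.
        p = (abs(td_error) + 1e-6) ** self.alpha
        if len(self.buffer) >= self.capacity:
            self.buffer.pop(0)
            self.priorities.pop(0)
        self.buffer.append(transition)
        self.priorities.append(p)

    def sample(self, batch_size):
        # Sample transitions with probability proportional to priority.
        total = sum(self.priorities)
        weights = [p / total for p in self.priorities]
        idx = random.choices(range(len(self.buffer)), weights=weights,
                             k=batch_size)
        return [self.buffer[i] for i in idx]
```

Production implementations typically use a sum-tree instead of a list so that sampling and priority updates stay logarithmic in buffer size.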
In this paper we report on our study of the performance of Deep Reinforcement Learning (DRL) agents in performing tasks that are illustrative for human Sensor Operators (SOs) in Remotely Piloted Aircraft Systems (RPASs). Our hypothesis is that the descriptive and predictive qualities of the agent's learning process potentially allow us to identify human task requirements, training needs, selection...
Reinforcement learning is one of the best methods to train autonomous robots. Using this method, a robot can learn to make optimal decisions without detailed programming and hard-coded instructions, which makes it useful for learning complex robotic behaviors. For example, in RoboCup competitions this method can be very useful for learning different behaviors. We propose a method for training...
We formulate tracking as an online decision-making process, where a tracking agent must follow an object despite ambiguous image frames and a limited computational budget. Crucially, the agent must decide where to look in the upcoming frames, when to reinitialize because it believes the target has been lost, and when to update its appearance model for the tracked object. Such decisions are typically...
Training of artificial neural networks (ANNs) using reinforcement learning (RL) techniques is being widely discussed in the robot learning literature. The high model complexity of ANNs along with the model-free nature of RL algorithms provides a desirable combination for many robotics applications. There is a huge need for algorithms that generalize using raw sensory inputs, such as vision, without...
This paper investigates the potential of combining deep learning and neuroevolution to create a bot for a simple first person shooter (FPS) game capable of aiming and shooting based on high-dimensional raw pixel input. The deep learning component is responsible for visual recognition and translating raw pixels to compact feature representations, while the evolving network takes those features as inputs...
In this paper, we study the model of human trust where an operator controls a robotic swarm remotely for a search mission. Existing trust models in human-in-the-loop systems are based on task performance of robots. However, we find that humans tend to make their decisions based on physical characteristics of the swarm rather than its performance since task performance of swarms is not clearly perceivable...
Robot navigation is a central problem in extraterrestrial environments and a suitable navigation algorithm that allows the robot to quickly but precisely avoid initially unknown obstacles is important for efficient navigation. In this paper, we consider a well-known machine learning-based framework called reinforcement learning for robot navigation and investigate a technique for adaptively adjusting...
Image captioning is a challenging problem owing to the complexity in understanding the image content and diverse ways of describing it in natural language. Recent advances in deep neural networks have substantially improved the performance of this task. Most state-of-the-art approaches follow an encoder-decoder framework, which generates captions using a sequential recurrent prediction model. However,...
Recently it has been shown that policy-gradient methods for reinforcement learning can be utilized to train deep end-to-end systems directly on non-differentiable metrics for the task at hand. In this paper we consider the problem of optimizing image captioning systems using reinforcement learning, and show that by carefully optimizing our systems using the test metrics of the MSCOCO task, significant...
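The key point of such policy-gradient training is that the evaluation metric (e.g. CIDEr on MSCOCO) only needs to score sampled captions, not be differentiated. Here is a minimal REINFORCE-style gradient estimate for one decoding step, with a baseline for variance reduction; the numbers and the single-step setting are illustrative, not the authors' system:

```python
import numpy as np

def reinforce_grad(logits, sampled, reward, baseline):
    """Gradient of the REINFORCE loss w.r.t. the logits for one step.

    The reward can come from any non-differentiable metric, since only
    sampling and scoring are required, never the metric's gradient.
    """
    z = logits - logits.max()
    probs = np.exp(z) / np.exp(z).sum()
    one_hot = np.zeros_like(probs)
    one_hot[sampled] = 1.0
    # Gradient of -log p(sampled) w.r.t. logits is (probs - one_hot);
    # scaling by the advantage (reward - baseline) gives the estimator.
    return (reward - baseline) * (probs - one_hot)
```

Using the score of a greedily decoded caption as the baseline gives the self-critical variant that is common in this line of work.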
This paper proposes a novel tracker which is controlled by sequentially pursuing actions learned by deep reinforcement learning. In contrast to the existing trackers using deep networks, the proposed tracker is designed to achieve a light computation as well as satisfactory tracking accuracy in both location and scale. The deep network to control actions is pre-trained using various training sequences...
The increasing popularity of deep convolutional neural networks in many different areas has led to their increased use in reinforcement learning. Training a huge deep neural network structure by simple gradient descent can take quite a long time, so additional learning techniques should be used to address this problem. One of these techniques is the use of momentum, which...
In this work, a novel approach to transfer learning for use in deep reinforcement learning is introduced. The agent is realized as an actor-critic framework, namely the Deep Deterministic Policy Gradient algorithm. The Q-function and the policy are represented as deep feed-forward networks that are trained by minimizing the mean squared Bellman error and by maximizing the expected reward, respectively...
The aim of this paper is to solve the control problem of trajectory tracking for Autonomous Underwater Vehicles (AUVs) by using and improving deep reinforcement learning (DRL). The DRL-based underwater motion control system is composed of two neural networks: one network selects the action and the other evaluates whether the selected action is accurate, and they modify themselves...
In many real-world scenarios, rewards extrinsic to the agent are extremely sparse, or absent altogether. In such cases, curiosity can serve as an intrinsic reward signal to enable the agent to explore its environment and learn skills that might be useful later in its life. We formulate curiosity as the error in an agent's ability to predict the consequence of its own actions in a visual feature space...
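A curiosity signal of this kind can be computed as the forward model's prediction error in the learned visual feature space: the agent is rewarded where its own dynamics model is surprised. A minimal sketch with made-up feature vectors standing in for the learned features:

```python
import numpy as np

def intrinsic_reward(phi_next_pred, phi_next, eta=0.5):
    # Curiosity bonus: scaled squared error between the forward model's
    # predicted next-state features and the observed next-state features.
    return eta * np.sum((phi_next_pred - phi_next) ** 2)

# Hypothetical 3-dimensional feature embeddings of the next observation.
phi_true = np.array([0.2, -0.1, 0.4])
phi_pred = np.array([0.0,  0.0, 0.5])
r_int = intrinsic_reward(phi_pred, phi_true)
```

This intrinsic term is added to (or substituted for) the sparse extrinsic reward, so exploration continues even when the environment itself gives no feedback.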
We address the problem of autonomous race car driving. Using a recent rally game (WRC6) with realistic physics and graphics, we train an Asynchronous Advantage Actor-Critic (A3C) agent in an end-to-end fashion and propose an improved reward function to learn faster. The network is trained simultaneously on three very different tracks (snow, mountain, and coast) with various road structures, graphics and physics. Despite...