Search results

Items from 1 to 20 out of 145 results

chapter

Towards Playing a 3D First-Person Shooter Game Using a Classification Deep Neural Network Architecture

Yuri Lenon Barbosa Nogueira, Creto Augusto Vidal, Joaquim Bento Cavalcante-Neto

2017 19th Symposium on Virtual and Augmented Reality (SVR) > 120 - 126

In this work, we present a network architecture to solve a supervised learning problem, the classification of a handwritten dataset, and a reinforcement learning problem, a complex First-Person Shooter 3D game environment. We used a Deep Neural Network model to solve both problems. For classification, we used a Softmax regression and cross entropy loss to train the network. To play the game, we used...

chapter

An Intersection Signal Control Method Based on Deep Reinforcement Learning

Pang Ha-li, Ding Ke

2017 10th International Conference on Intelligent Computation Technology and Automation (ICICTA) > 344 - 348

Urban traffic flow is dynamic and uncertain. In this paper, we combine the deep learning and the reinforcement learning, and design an intersection signal controller based on Q-learning and convolutional neural network. We redefine the state space and the reward function. The training and simulation of the controller are carried out in traffic micro-simulator SUMO. Compared with timing control, the...

chapter

A novel DDPG method with prioritized experience replay

Yuenan Hou, Lifeng Liu, Qing Wei, Xudong Xu, et al.

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 316 - 321

Recently, a state-of-the-art algorithm, called deep deterministic policy gradient (DDPG), has achieved good performance in many continuous control tasks in the MuJoCo simulator. To further improve the efficiency of the experience replay mechanism in DDPG and thus speeding up the training process, in this paper, a prioritized experience replay method is proposed for the DDPG algorithm, where prioritized...

chapter

Analysis of Q-learning on ANNs for robot control using live video feed

Nihal Murali, Kunal Gupta, Surekha Bhanot

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 524 - 529

Training of artificial neural networks (ANNs) using reinforcement learning (RL) techniques is being widely discussed in the robot learning literature. The high model complexity of ANNs along with the model-free nature of RL algorithms provides a desirable combination for many robotics applications. There is a huge need for algorithms that generalize using raw sensory inputs, such as vision, without...

chapter

DLNE: A hybridization of deep learning and neuroevolution for visual control

Andreas Precht Poulsen, Mark Thorhauge, Mikkel Hvilshøj Funch, Sebastian Risi

2017 IEEE Conference on Computational Intelligence and Games (CIG) > 256 - 263

This paper investigates the potential of combining deep learning and neuroevolution to create a bot for a simple first person shooter (FPS) game capable of aiming and shooting based on high-dimensional raw pixel input. The deep learning component is responsible for visual recognition and translating raw pixels to compact feature representations, while the evolving network takes those features as inputs...

chapter

Deep Reinforcement Learning-Based Image Captioning with Embedding Reward

Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1151 - 1159

Image captioning is a challenging problem owing to the complexity in understanding the image content and diverse ways of describing it in natural language. Recent advances in deep neural networks have substantially improved the performance of this task. Most state-of-the-art approaches follow an encoder-decoder framework, which generates captions using a sequential recurrent prediction model. However,...

chapter

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1349 - 1358

This paper proposes a novel tracker which is controlled by sequentially pursuing actions learned by deep reinforcement learning. In contrast to the existing trackers using deep networks, the proposed tracker is designed to achieve a light computation as well as satisfactory tracking accuracy in both location and scale. The deep network to control actions is pre-trained using various training sequences...

chapter

Performance comparision of different momentum techniques on deep reinforcement learning

Mehmet Sarigul, Mutlu Avci

2017 IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA) > 302 - 306

Increase in popularity of deep convolutional neural networks in many different areas leads to increase in the use of these networks in reinforcement learning. Training a huge deep neural network structure by using simple gradient descent learning can take quite a long time. Some additional learning approaches should be utilized to solve this problem. One of these techniques is use of momentum which...
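The momentum idea mentioned in this abstract can be illustrated with a minimal sketch. The abstract is truncated and does not say which momentum variants the paper compares, so the code below assumes the classical (heavy-ball) form only; the function names, learning rate, and momentum coefficient are hypothetical choices for illustration, not values from the paper.

```python
def momentum_step(params, velocity, grads, lr=0.01, beta=0.9):
    """One classical momentum update (assumed variant, not taken from the paper).

    velocity <- beta * velocity - lr * grad
    params   <- params + velocity
    """
    new_velocity = [beta * v - lr * g for v, g in zip(velocity, grads)]
    new_params = [p + v for p, v in zip(params, new_velocity)]
    return new_params, new_velocity


def minimize_quadratic(steps=200, lr=0.1, beta=0.9):
    """Toy problem: minimize f(x) = x**2, whose gradient is 2*x."""
    params, velocity = [5.0], [0.0]
    for _ in range(steps):
        grads = [2.0 * p for p in params]
        params, velocity = momentum_step(params, velocity, grads, lr, beta)
    return params[0]
```

Calling `minimize_quadratic()` drives the toy parameter towards the minimum at zero; the velocity term accumulates past gradients, which is what lets momentum move faster than plain gradient descent along consistent descent directions.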

chapter

Learning to walk with prior knowledge

Martin Gottwald, Dominik Meyer, Hao Shen, Klaus Diepold

2017 IEEE International Conference on Advanced Intelligent Mechatronics (AIM) > 1369 - 1374

In this work a novel approach to Transfer Learning for the use in Deep Reinforcement Learning is introduced. The agent is realized as an actor-critic framework, namely the Deep Deterministic Policy Gradient algorithm. The Q-function and the policy are represented as deep feed-forward networks, that are trained by minimizing the mean squared Bellman error and by maximizing the expected reward, respectively...
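For orientation, the two training objectives named in this abstract can be written in the form commonly used for Deep Deterministic Policy Gradient. This is a sketch under assumptions: the symbols ($Q_\theta$ for the critic, $\mu_\phi$ for the policy, $\gamma$ for the discount factor) are introduced here for illustration, and the paper's exact formulation, including any target networks, may differ.

```latex
% Critic: minimize the mean squared Bellman error over transitions (s, a, r, s')
L(\theta) = \mathbb{E}_{(s,a,r,s')}\Big[\big(r + \gamma\, Q_\theta\big(s', \mu_\phi(s')\big) - Q_\theta(s,a)\big)^2\Big]

% Actor: maximize the expected value of the actions the policy selects
J(\phi) = \mathbb{E}_{s}\Big[Q_\theta\big(s, \mu_\phi(s)\big)\Big]
```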

chapter

Deep reinforcement learning based optimal trajectory tracking control of autonomous underwater vehicle

Runsheng Yu, Zhenyu Shi, Chaoxing Huang, Tenglong Li, et al.

2017 36th Chinese Control Conference (CCC) > 4958 - 4965

The aim of this paper is to solve the control problem of trajectory tracking of Autonomous Underwater Vehicles (AUVs) through using and improving deep reinforcement learning (DRL). The deep reinforcement learning of an underwater motion control system is composed of two neural networks: one network selects action and the other evaluates whether the selected action is accurate, and they modify themselves...

chapter

Curiosity-Driven Exploration by Self-Supervised Prediction

Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 488 - 489

In many real-world scenarios, rewards extrinsic to the agent are extremely sparse, or absent altogether. In such cases, curiosity can serve as an intrinsic reward signal to enable the agent to explore its environment and learn skills that might be useful later in its life. We formulate curiosity as the error in an agent's ability to predict the consequence of its own actions in a visual feature space...
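The formulation of curiosity as a prediction error in feature space can be sketched as follows. This is a minimal illustration only: the feature vectors are assumed to be plain lists of floats produced by some encoder and forward model that are not shown, and the function name and the `scale` factor are hypothetical rather than taken from the paper.

```python
def curiosity_reward(predicted_features, actual_features, scale=0.5):
    """Intrinsic reward as the scaled squared error between the features the
    agent predicted for the next state and the features actually observed.

    The default scale of 0.5 is an illustrative assumption.
    """
    if len(predicted_features) != len(actual_features):
        raise ValueError("feature vectors must have the same length")
    squared_error = sum(
        (p - a) ** 2 for p, a in zip(predicted_features, actual_features)
    )
    return scale * squared_error
```

A perfectly predicted transition yields zero intrinsic reward, so under this sketch the agent is rewarded only for visiting transitions whose outcome it cannot yet predict.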

chapter

Developing game AI agent behaving like human by mixing reinforcement learning and supervised learning

Shohei Miyashita, Xinyu Lian, Xiao Zeng, Takashi Matsubara, et al.

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 489 - 494

Artificial intelligence (AI) agent created with Deep Q-Networks (DQN) can defeat human agents in video games. Despite its high performance, DQN often exhibits odd behaviors, which could be immersion-breaking against the purpose of creating game AI. Moreover, DQN is capable of reacting to the game environment much faster than humans, making itself invincible (thus not fun to play with) in certain types...

chapter

Autonomous lane keeping based on approximate Q-learning

Jonggu Lee, Taewan Kim, H. Jin Kim

2017 14th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) > 402 - 405

Obstacle avoidance is one of the most important problems in autonomous robots. This paper suggests a collision avoidance system using reinforcement learning. Hand-crafted features are used to approximate Q value. With off-line learning, we develop a general collision avoidance system and use this system to unknown environment. Simulation results show that our mobile robot agent using reinforcement...

chapter

PLATO: Policy learning using adaptive trajectory optimization

Gregory Kahn, Tianhao Zhang, Sergey Levine, Pieter Abbeel

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3342 - 3349

Policy search can in principle acquire complex strategies for control of robots and other autonomous systems. When the policy is trained to process raw sensory inputs, such as images and depth maps, it can also acquire a strategy that combines perception and control. However, effectively processing such complex inputs requires an expressive policy class, such as a large neural network. These high-dimensional...

chapter

Reset-free guided policy search: Efficient deep reinforcement learning with stochastic initial states

William Montgomery, Anurag Ajay, Chelsea Finn, Pieter Abbeel, et al.

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3373 - 3380

Autonomous learning of robotic skills can allow general-purpose robots to learn wide behavioral repertoires without extensive manual engineering. However, robotic skill learning must typically make trade-offs to enable practical real-world learning, such as requiring manually designed policy or value function representations, initialization from human demonstrations, instrumentation of the training...

chapter

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

Shixiang Gu, Ethan Holly, Timothy Lillicrap, Sergey Levine

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3389 - 3396

Reinforcement learning holds the promise of enabling autonomous robots to learn large repertoires of behavioral skills with minimal human intervention. However, robotic applications of reinforcement learning often compromise the autonomy of the learning process in favor of achieving training times that are practical for real physical systems. This typically involves introducing hand-engineered policy...

chapter

Scaling up deep reinforcement learning for multi-domain dialogue systems

Heriberto Cuayahuitl, Seunghak Yu, Ashley Williamson, Jacob Carse

2017 International Joint Conference on Neural Networks (IJCNN) > 3339 - 3346

Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems due to large search spaces. This paper proposes a three-stage method for multi-domain dialogue policy learning, termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage does multi-policy...

chapter

Training neural networks with policy gradient

Sourabh Bose, Manfred Huber

2017 International Joint Conference on Neural Networks (IJCNN) > 3998 - 4005

Neural networks are a powerful function approximation tool which has the ability to model any function with arbitrary precision. For any function as a black box, it is able to reconstruct the function given the target and the input data. However, there are problems where the target is at least partially unknown. In such cases it is impossible for a traditional neural network to compute the gradient...

chapter

Learning modular neural network policies for multi-task and multi-robot transfer

Coline Devin, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, et al.

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2169 - 2176

Reinforcement learning (RL) can automate a wide variety of robotic skills, but learning each new skill requires considerable real-world data collection and manual representation engineering to design policy classes or features. Using deep reinforcement learning to train general purpose neural network policies alleviates some of the burden of manual representation engineering by using expressive policy...

chapter

Learning of binocular fixations using anomaly detection with deep reinforcement learning

Francois de La Bourdonnaye, Celine Teuliere, Thierry Chateau, Jochen Triesch

2017 International Joint Conference on Neural Networks (IJCNN) > 760 - 767

Due to its ability to learn complex behaviors in high-dimensional state-action spaces, deep reinforcement learning algorithms have attracted much interest in the robotics community. For a practical reinforcement learning implementation on a robot, it has to be provided with an informative reward signal that makes it easy to discriminate the values of nearby states. To address this issue, prior information,...

Keywords:
NEURAL NETWORKS
LEARNING (ARTIFICIAL INTELLIGENCE)
