Search results

Items from 21 to 40 out of 3,201 results

chapter

Learning how to drive in a real world simulation with deep Q-Networks

Peter Wolf, Christian Hubschneider, Michael Weber, Andre Bauer, more

2017 IEEE Intelligent Vehicles Symposium (IV) > 244 - 250

2017 IEEE Intelligent Vehicles Symposium (IV)

We present a reinforcement learning approach using Deep Q-Networks to steer a vehicle in a 3D physics simulation. Relying solely on camera image input the approach directly learns steering the vehicle in an end-to-end manner. The system is able to learn human driving behavior without the need of any labeled training data. An action-based reward function is proposed, which is motivated by a potential...

chapter

Developing game AI agent behaving like human by mixing reinforcement learning and supervised learning

Shohei Miyashita, Xinyu Lian, Xiao Zeng, Takashi Matsubara, more

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 489 - 494

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

Artificial intelligence (AI) agent created with Deep Q-Networks (DQN) can defeat human agents in video games. Despite its high performance, DQN often exhibits odd behaviors, which could be immersion-breaking against the purpose of creating game AI. Moreover, DQN is capable of reacting to the game environment much faster than humans, making itself invincible (thus not fun to play with) in certain types...

chapter

Autonomous lane keeping based on approximate Q-learning

Jonggu Lee, Taewan Kim, H. Jin Kim

2017 14th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) > 402 - 405

2017 14th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI)

Obstacle avoidance is one of the most important problems in autonomous robots. This paper suggests a collision avoidance system using reinforcement learning. Hand-crafted features are used to approximate Q value. With off-line learning, we develop a general collision avoidance system and use this system to unknown environment. Simulation results show that our mobile robot agent using reinforcement...

chapter

Better deep visual attention with reinforcement learning in action recognition

Gang Wang, Wenmin Wang, Jingzhuo Wang, Yaohua Bu

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

Deep visual attention in computer vision has attracted much attention over the past years, which achieves great contributions especially in image classification, image caption and action recognition. However, due to taking BP training wholly or partially, they can not show the true power of attention in computational efficiency and focusing accuracy. Our intuition is that attention mechanism should...

chapter

Target-driven visual navigation in indoor scenes using deep reinforcement learning

Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, Joseph J. Lim, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3357 - 3364

2017 IEEE International Conference on Robotics and Automation (ICRA)

Two less addressed issues of deep reinforcement learning are (1) lack of generalization capability to new goals, and (2) data inefficiency, i.e., the model requires several (and often costly) episodes of trial and error to converge, which makes it impractical to be applied to real-world scenarios. In this paper, we address these two issues and apply our model to target-driven visual navigation. To...

chapter

PLATO: Policy learning using adaptive trajectory optimization

Gregory Kahn, Tianhao Zhang, Sergey Levine, Pieter Abbeel

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3342 - 3349

2017 IEEE International Conference on Robotics and Automation (ICRA)

Policy search can in principle acquire complex strategies for control of robots and other autonomous systems. When the policy is trained to process raw sensory inputs, such as images and depth maps, it can also acquire a strategy that combines perception and control. However, effectively processing such complex inputs requires an expressive policy class, such as a large neural network. These high-dimensional...

chapter

Reset-free guided policy search: Efficient deep reinforcement learning with stochastic initial states

William Montgomery, Anurag Ajay, Chelsea Finn, Pieter Abbeel, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3373 - 3380

2017 IEEE International Conference on Robotics and Automation (ICRA)

Autonomous learning of robotic skills can allow general-purpose robots to learn wide behavioral repertoires without extensive manual engineering. However, robotic skill learning must typically make trade-offs to enable practical real-world learning, such as requiring manually designed policy or value function representations, initialization from human demonstrations, instrumentation of the training...

chapter

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

Shixiang Gu, Ethan Holly, Timothy Lillicrap, Sergey Levine

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3389 - 3396

2017 IEEE International Conference on Robotics and Automation (ICRA)

Reinforcement learning holds the promise of enabling autonomous robots to learn large repertoires of behavioral skills with minimal human intervention. However, robotic applications of reinforcement learning often compromise the autonomy of the learning process in favor of achieving training times that are practical for real physical systems. This typically involves introducing hand-engineered policy...

chapter

Multilateral surgical pattern cutting in 2D orthotropic gauze with deep reinforcement learning policies for tensioning

Brijen Thananjeyan, Animesh Garg, Sanjay Krishnan, Carolyn Chen, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2371 - 2378

2017 IEEE International Conference on Robotics and Automation (ICRA)

In the Fundamentals of Laparoscopic Surgery (FLS) standard medical training regimen, the Pattern Cutting task requires residents to demonstrate proficiency by maneuvering two tools, surgical scissors and tissue gripper, to accurately cut a circular pattern on surgical gauze suspended at the corners. Accuracy of cutting depends on tensioning, wherein the gripper pinches a point on the gauze in R³ and...

chapter

Scaling up deep reinforcement learning for multi-domain dialogue systems

Heriberto Cuayahuitl, Seunghak Yu, Ashley Williamson, Jacob Carse

2017 International Joint Conference on Neural Networks (IJCNN) > 3339 - 3346

2017 International Joint Conference on Neural Networks (IJCNN)

Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems due to large search spaces. This paper proposes a three-stage method for multi-domain dialogue policy learning-termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage does multi-policy...

chapter

Training neural networks with policy gradient

Sourabh Bose, Manfred Huber

2017 International Joint Conference on Neural Networks (IJCNN) > 3998 - 4005

2017 International Joint Conference on Neural Networks (IJCNN)

Neural networks are a powerful function approximation tool which has the ability to model any function with arbitrary precision. For any function as a black box, it is able to reconstruct the function given the target and the input data. However, there are problems where the target is at least partially unknown. In such cases it is impossible for a traditional neural network to compute the gradient...

chapter

Learning task-parametrized assistive strategies for exoskeleton robots by multi-task reinforcement learning

Masashi Hamaya, Takamitsu Matsubara, Tomoyuki Noda, Tatsuya Teramae, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 5907 - 5912

2017 IEEE International Conference on Robotics and Automation (ICRA)

Recent studies suggest that reinforcement learning has great potential for generating assistive strategies in exoskeletons through physical interactions between a user and a robot. Previous methods focused on a task-specific assistive strategy, where for every single task (situation/context), the user needs to interact with a robot to learn an appropriate assistive strategy. Therefore, the learned...

chapter

Supervision via competition: Robot adversaries for learning tasks

Lerrel Pinto, James Davidson, Abhinav Gupta

2017 IEEE International Conference on Robotics and Automation (ICRA) > 1601 - 1608

2017 IEEE International Conference on Robotics and Automation (ICRA)

There has been a recent paradigm shift in robotics to data-driven learning for planning and control. Due to large number of experiences required for training, most of these approaches use a self-supervised paradigm: using sensors to measure success/failure. However, in most cases, these sensors provide weak supervision at best. In this work, we propose an adversarial learning framework that pits an...

chapter

Learning modular neural network policies for multi-task and multi-robot transfer

Coline Devin, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2169 - 2176

2017 IEEE International Conference on Robotics and Automation (ICRA)

Reinforcement learning (RL) can automate a wide variety of robotic skills, but learning each new skill requires considerable real-world data collection and manual representation engineering to design policy classes or features. Using deep reinforcement learning to train general purpose neural network policies alleviates some of the burden of manual representation engineering by using expressive policy...

chapter

Learning of binocular fixations using anomaly detection with deep reinforcement learning

Francois de La Bourdonnaye, Celine Teuliere, Thierry Chateau, Jochen Triesch

2017 International Joint Conference on Neural Networks (IJCNN) > 760 - 767

2017 International Joint Conference on Neural Networks (IJCNN)

Due to its ability to learn complex behaviors in high-dimensional state-action spaces, deep reinforcement learning algorithms have attracted much interest in the robotics community. For a practical reinforcement learning implementation on a robot, it has to be provided with an informative reward signal that makes it easy to discriminate the values of nearby states. To address this issue, prior information,...

chapter

OS-λ₁-ELM: Online sequential λ₁-regularized-ELM based on ADMM

Dazi Li, Zhiyin Liu, Qibing Jin

2017 6th International Symposium on Advanced Control of Industrial Processes (AdCONIP) > 371 - 376

2017 6th International Symposium on Advanced Control of Industrial Processes (AdCONIP)

As business data and scientific data become larger and larger, the study of incremental learning algorithms becomes more and more important. Online sequential extreme learning machine (OS-ELM) algorithm is an incremental learning algorithm that can learn data one by one. On the basis of OS-ELM, an online sequential extreme learning machine incremental learning algorithm is proposed based on the λ₁...

chapter

Can a reinforcement learning agent practice before it starts learning?

Minwoo Lee, Charles W. Anderson

2017 International Joint Conference on Neural Networks (IJCNN) > 4006 - 4013

2017 International Joint Conference on Neural Networks (IJCNN)

A reinforcement learning (RL) agent needs a fair amount of experience to find a near-optimal policy. Transfer learning has been investigated as a means to reduce the amount of experience required. Transfer learning, however, requires another similar reinforcement learning task as a transfer source, which can also be costly in the amount of experience required. In this research, we examine the possible...

chapter

Deep reward shaping from demonstrations

Ahmed Hussein, Eyad Elyan, Mohamed Medhat Gaber, Chrisina Jayne

2017 International Joint Conference on Neural Networks (IJCNN) > 510 - 517

2017 International Joint Conference on Neural Networks (IJCNN)

Deep reinforcement learning is rapidly gaining attention due to recent successes in a variety of problems. The combination of deep learning and reinforcement learning allows for a generic learning process that does not consider specific knowledge of the task. However, learning from scratch becomes more difficult when tasks involve long trajectories with delayed rewards. The chances of finding the...

chapter

Air-to-ground shepherd problem: An action-delay reinforcement learning approach

Jiangcheng Zhu, Chao Xu

2017 American Control Conference (ACC) > 3771 - 3776

2017 American Control Conference (ACC)

In this paper, we consider an optimization problem motivated by the International Aerial Robotics Competition (IARC) Mission-7, or the shepherd action. IARC Mission-7 requires an autonomous drone (i.e. the shepherd dog) to drive ground vehicle (sheep) across the green-line boundary of an competition arena of 20m × 20m within 10 mins. There are two actions, either top touch or collision to change the...

chapter

Combining Machine-Learning with Invariants Assurance Techniques for Autonomous Systems

Piergiuseppe Mallozzi

2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C) > 485 - 486

2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C)

Autonomous Systems are systems situated in some environment and are able of taking decision autonomously. The environment is not precisely known at design-time and it might be full of unforeseeable events that the autonomous system has to deal with at run-time. This brings two main problems to be addressed. One is that the uncertainty of the environment makes it difficult to model all the behaviours...

Data set:
ieee
Keywords:
TRAINING
LEARNING (ARTIFICIAL INTELLIGENCE)

Publication date

Set your own date range

Content availability

Available (3,128)
None (73)

Keywords

ARTIFICIAL NEURAL NETWORKS (821)
CLASSIFICATION ALGORITHMS (791)
SUPPORT VECTOR MACHINES (762)
DATA MINING (755)
MACHINE LEARNING (739)
ACCURACY (685)
PATTERN CLASSIFICATION (654)
FEATURE EXTRACTION (640)
KERNEL (403)
NEURAL NETS (389)
IMAGE CLASSIFICATION (338)
TRAINING DATA (296)
NEURONS (273)
TESTING (258)
SUPPORT VECTOR MACHINE (252)
ALGORITHM DESIGN AND ANALYSIS (223)
DATA MODELS (218)
DATABASES (215)
OPTIMIZATION (206)
FACE (183)
FACE RECOGNITION (178)
MATHEMATICAL MODEL (177)
COMPUTATIONAL MODELING (171)
OBJECT DETECTION (167)
CLASSIFICATION (164)
PIXEL (162)
REGRESSION ANALYSIS (150)
BOOSTING (147)
SUPERVISED LEARNING (142)
PREDICTION ALGORITHMS (141)
PREDICTIVE MODELS (141)
NEURAL NETWORKS (140)
PRINCIPAL COMPONENT ANALYSIS (140)
HIDDEN MARKOV MODELS (136)
SVM (136)
HUMANS (131)
NEURAL NETWORK (129)
STATISTICAL ANALYSIS (129)
PATTERN CLUSTERING (127)
OPTIMISATION (126)
PATTERN RECOGNITION (123)
LEARNING SYSTEMS (121)
SUPPORT VECTOR MACHINE CLASSIFICATION (121)
GENETIC ALGORITHMS (120)
LEARNING (120)
RADIAL BASIS FUNCTION NETWORKS (120)
FUZZY SET THEORY (118)
TEXT ANALYSIS (117)
PROBABILITY (113)
VISUALIZATION (113)
CLUSTERING ALGORITHMS (110)
IMAGE SEGMENTATION (110)
ADAPTATION MODEL (109)
DECISION TREES (108)
BAYES METHODS (107)
EQUATIONS (103)
ERROR ANALYSIS (103)
NOISE (102)
SHAPE (102)
DISTANCE MEASUREMENT (101)
MACHINE LEARNING ALGORITHMS (100)
ESTIMATION (99)
CORRELATION (98)
DETECTORS (94)
NATURAL LANGUAGE PROCESSING (91)
IMAGE COLOR ANALYSIS (89)
CONVERGENCE (86)
ARTIFICIAL NEURAL NETWORK (85)
COMPUTER VISION (85)
GAUSSIAN PROCESSES (85)
INTERNET (85)
ROBOTS (83)
IMAGE RECOGNITION (82)
SPEECH (82)
ENTROPY (81)
FEATURE SELECTION (81)
LABELING (81)
OBJECT RECOGNITION (81)
ADABOOST (80)
MANIFOLDS (80)
PROBABILITY DENSITY FUNCTION (80)
LEARNING ALGORITHM (78)
MULTILAYER PERCEPTRONS (78)
BAYESIAN METHODS (76)
SEMI-SUPERVISED LEARNING (76)
CONTEXT (75)
HISTOGRAMS (74)
SPEECH RECOGNITION (74)
FUZZY NEURAL NETS (73)
BAGGING (72)
IMAGE RETRIEVAL (71)
INCREMENTAL LEARNING (71)
REINFORCEMENT LEARNING (70)
ACTIVE LEARNING (68)
MEDICAL IMAGE PROCESSING (68)
APPROXIMATION METHODS (67)
BIOLOGICAL SYSTEM MODELING (67)
COMPUTATIONAL COMPLEXITY (67)
more

INFONA - science communication portal

Search results

Learning how to drive in a real world simulation with deep Q-Networks

Developing game AI agent behaving like human by mixing reinforcement learning and supervised learning

Autonomous lane keeping based on approximate Q-learning

Better deep visual attention with reinforcement learning in action recognition

Target-driven visual navigation in indoor scenes using deep reinforcement learning

PLATO: Policy learning using adaptive trajectory optimization

Reset-free guided policy search: Efficient deep reinforcement learning with stochastic initial states

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

Multilateral surgical pattern cutting in 2D orthotropic gauze with deep reinforcement learning policies for tensioning

Scaling up deep reinforcement learning for multi-domain dialogue systems

Training neural networks with policy gradient

Learning task-parametrized assistive strategies for exoskeleton robots by multi-task reinforcement learning

Supervision via competition: Robot adversaries for learning tasks

Learning modular neural network policies for multi-task and multi-robot transfer

Learning of binocular fixations using anomaly detection with deep reinforcement learning

OS-λ₁-ELM: Online sequential λ₁-regularized-ELM based on ADMM

Can a reinforcement learning agent practice before it starts learning?

Deep reward shaping from demonstrations

Air-to-ground shepherd problem: An action-delay reinforcement learning approach

Combining Machine-Learning with Invariants Assurance Techniques for Autonomous Systems

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options