Search results

chapter

Experimental study on decentralized concurrent learning for multi-agent system with complex dynamics

Ting Fei, Xin Chen, Min Wu, Chi Wang

2017 36th Chinese Control Conference (CCC) > 8373 - 8378

2017 36th Chinese Control Conference (CCC)

A cooperative multi-agent system entitles some independent agents to complete complex tasks through coordination and cooperation. Since the dynamics of physical agents are so complex that the environment of learning is indeed stochastic, the paper introduces the decentralized multi-agent reinforcement learning (MARL) algorithm, named as Decentralized Concurrent Learning with Cooperative Policy Exploration...

chapter

Research on Q-ELM algorithm in robot path planning

Hongge Ren, Rui Yin, Fujin Li, Wei Wang, more

2016 Chinese Control and Decision Conference (CCDC) > 5975 - 5979

2016 Chinese Control and Decision Conference (CCDC)

In view of high dimension, the difficulty of training, the problem of slow learning speed in the application of BP neural network in mobile robot path planning, an algorithm of reinforcement Q learning based on extreme learning machine (Q-ELM algorithm) is proposed in this paper. Firstly, the characteristic of reinforcement learning is combining the dynamic network with supervised learning, and the...

chapter

Multi-Agent Path Planning for Unmanned Aerial Vehicle Based on Threats Analysis

Lei Gang, Dong Min-zhou, Xu Tao, Wang Liang

2011 3rd International Workshop on Intelligent Systems and Applications > 1 - 4

2011 3rd International Workshop on Intelligent Systems and Applications (ISA)

This paper focuses on the flight path planning process with multi-agent for Unmanned Aerial Vehicle (UAV) based on threats analysis and path length constraint. Path planner agent searches the path with global view considering path length constraint and information collector agent deals with path planning in the zone of threats. Scoring function is presented based on analysis the threats' attributes...

chapter

Autonomous Navigation in Dynamic Environments with Reinforcement Learning and Heuristic

E D S Costa, Maury M Gouvea

2010 Ninth International Conference on Machine Learning and Applications > 37 - 42

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Researchers have created machines which operate autonomously in complex and changing environments. An important problem that has been widely studied is that of autonomous navigation systems, through which attempts have been made to create mechanisms with their own decision making in complex environments. Ideally, an autonomous navigation agent must have an ability to learn while working in its environment...

chapter

Imitation learning of hand gestures and its evaluation for humanoid robots

Anand Thobbi, Weihua Sheng

The 2010 IEEE International Conference on Information and Automation > 60 - 65

2010 International Conference on Information and Automation (ICIA 2010)

This paper presents a platform to implement and evaluate a learning by imitation framework which enables humanoid robots to learn hand gestures from human beings. A marker based system is used to capture human motion data. From this data we extract the shoulder and elbow joint angles, which uniquely characterize a particular hand gesture. The proposed imitation learning framework aims to generalize...

chapter

Reinforcement learning accelerated with artificial neural network for maze and search problems

Mehmet Hacibeyoglu, Ahmet Arslan

3rd International Conference on Human System Interaction > 124 - 127

2010 3rd International Conference on Human System Interactions (HSI)

Reinforcement learning is the problem faced by an agent that must learn behaviour through trial and error interactions with a dynamic environment that lacks the educational examples. Q-learning is one of the most popular algorithms among the reinforcement learning methods. Artificial neural network, as in reinforcement learning, is a sub-entry of machine learning, which can be applied on real frames,...

chapter

Q-learning policies for a single agent foraging tasks

Yogeswaran Mohan, S G Ponnambalam

7th International Symposium on Mechatronics and its Applications > 1 - 6

7th International Symposium on Mechatronics and its Applications (ISMA 2010)

Policies play an important role in balancing the trade-off between exploration and exploitation problem in q-learning. Pure exploration degrades the performance of the q-learning but increases the flexibility to adapt in a dynamic environment. On the other hand pure exploitation drives the learning process to locally optimal solutions. In this paper, a single agent foraging task has been modeled incorporating...

chapter

Research on Convergence of Robot Path Planning Based on LCS

Jie Shao, Jing yu Yang

2009 Chinese Conference on Pattern Recognition > 1 - 5

2009 Chinese Conference on Pattern Recognition. (CCPR 2009) and the First CJK Joint Workshop on Pattern Recognition (CJKPR)

A path planning algorithm of robot is proposed based on ensemble algorithm of the learning classifier system, which design fitness function in dynamic environment. The paper derived and proved that ensemble algorithm is convergence and provided a theoretical guarantee for the path planning algorithm. Simulation results also showed that genetic algorithms and learning classifier system combination...

chapter

Learning moving objects in a multi-target tracking scenario for mobile robots that use laser range measurements

P. Kondaxakis, H. Baltzakis, P. Trahanias

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems > 1667 - 1672

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2009)

This paper addresses the problem of real-time moving-object detection, classification and tracking in populated and dynamic environments. In this scenario, a mobile robot uses 2D laser range data to recognize, track and avoid moving targets. Most previous approaches either rely on pre-defined data features or off-line training of a classifier for specific data sets, thus eliminating the possibility...

chapter

Sliding Mode Control Design of Cleaning Robot's Mobile Manipulator Used in Large Condenser Based on Neural Networks

Tang Hong, Yang Qing-xuan

2009 IITA International Conference on Control, Automation and Systems Engineering (case 2009) > 446 - 449

2009 IITA International Conference on Control, Automation and Systems Engineering, CASE 2009

Sliding mode control (SMC) of cleaning robot's mobile manipulator based on neural networks which have nonlinear approximation ability is put forward in this article. The controller reduces inherent chattering phenomenon sharply when the uncertainties and external disturbances are unknown. Structure of sliding mode control and neural networkspsila learning algorithms using Lyapunov theorem are designed...

chapter

Vision-based reinforcement learning using approximate policy iteration

M.R. Shaker, Shigang Yue, T. Duckett

2009 International Conference on Advanced Robotics > 1 - 6

2009 14th International Conference on Advanced Robotics (ICAR 2009)

A major issue for reinforcement learning (RL) applied to robotics is the time required to learn a new skill. While RL has been used to learn mobile robot control in many simulated domains, applications involving learning on real robots are still relatively rare. In this paper, the Least-Squares Policy Iteration (LSPI) reinforcement learning algorithm and a new model-based algorithm Least-Squares Policy...

chapter

Development of a memetic algorithm for Dynamic Multi-Objective Optimization and its applications for online neural network modeling of UAVs

A. Isaacs, V. Puttige, T. Ray, W. Smith, more

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) > 548 - 554

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence)

Dynamic multi-objective optimization (DMO) is one of the most challenging class of optimization problems where the objective functions change over time and the optimization algorithm is required to identify the corresponding Pareto optimal solutions with minimal time lag. DMO has received very little attention in the past and none of the existing multi-objective algorithms perform satisfactorily on...

INFONA - science communication portal

Search results

Experimental study on decentralized concurrent learning for multi-agent system with complex dynamics

Research on Q-ELM algorithm in robot path planning

Multi-Agent Path Planning for Unmanned Aerial Vehicle Based on Threats Analysis

Autonomous Navigation in Dynamic Environments with Reinforcement Learning and Heuristic

Imitation learning of hand gestures and its evaluation for humanoid robots

Reinforcement learning accelerated with artificial neural network for maze and search problems

Q-learning policies for a single agent foraging tasks

Research on Convergence of Robot Path Planning Based on LCS

Learning moving objects in a multi-target tracking scenario for mobile robots that use laser range measurements

Sliding Mode Control Design of Cleaning Robot's Mobile Manipulator Used in Large Condenser Based on Neural Networks

Vision-based reinforcement learning using approximate policy iteration

Development of a memetic algorithm for Dynamic Multi-Objective Optimization and its applications for online neural network modeling of UAVs

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options