A cooperative multi-agent system enables several independent agents to complete complex tasks through coordination and cooperation. Because the dynamics of physical agents are complex, the learning environment is effectively stochastic; this paper therefore introduces a decentralized multi-agent reinforcement learning (MARL) algorithm, named Decentralized Concurrent Learning with Cooperative Policy Exploration...
This paper presents a distributed algorithm for mobile sensor networks to monitor the environment. With this algorithm, multiple mobile sensor nodes can collectively sample the environmental field and recover the environmental field function via machine learning approaches. The mobile sensor nodes are able to self-organise so that the distribution of mobile sensor nodes matches the estimated environmental...
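This excerpt does not specify which machine learning approach recovers the field function; one plausible choice is kernel ridge regression over the sampled locations. The sketch below is a hypothetical illustration of that recovery step only (the field `f(x, y) = sin(x) + cos(y)`, the RBF length scale, and the sampling scheme are all assumptions, not details from the paper):

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.5):
    """Squared-exponential kernel between two sets of 2-D sample locations."""
    d2 = np.sum((a[:, None, :] - b[None, :, :]) ** 2, axis=-1)
    return np.exp(-d2 / (2 * length_scale ** 2))

def fit_field(locations, readings, reg=1e-6):
    """Fit kernel-ridge weights from sensor locations and scalar readings."""
    K = rbf_kernel(locations, locations)
    return np.linalg.solve(K + reg * np.eye(len(locations)), readings)

def predict_field(query, locations, weights):
    """Estimate the field value at arbitrary query points."""
    return rbf_kernel(query, locations) @ weights

# Toy example: recover f(x, y) = sin(x) + cos(y) from 50 random samples.
rng = np.random.default_rng(0)
pts = rng.uniform(0, 3, size=(50, 2))
vals = np.sin(pts[:, 0]) + np.cos(pts[:, 1])
w = fit_field(pts, vals)
est = predict_field(np.array([[1.0, 1.0]]), pts, w)
```

In a distributed setting, each node would contribute its own `(location, reading)` pairs; the estimated field could then steer the nodes toward under-sampled regions.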
“Gain-Based Separation” is a novel heuristic that modifies the standard multiclass decision tree learning algorithm to produce forests that can describe an example or object with multiple classifications. When the information gain at a node would be higher if all examples of a particular classification were removed, those examples are reserved for another tree. In this way, the algorithm performs...
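The reservation step described above can be sketched as follows. This is a hypothetical reconstruction of just the class-removal test (the split criterion, the multi-tree bookkeeping, and the function names `info_gain` and `best_class_to_reserve` are assumptions; the excerpt does not give the paper's actual formulation):

```python
from collections import Counter
import math

def entropy(labels):
    """Shannon entropy (bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def info_gain(labels, split):
    """Information gain of a boolean split over the label list."""
    left = [y for y, s in zip(labels, split) if s]
    right = [y for y, s in zip(labels, split) if not s]
    n = len(labels)
    gain = entropy(labels)
    if left:
        gain -= (len(left) / n) * entropy(left)
    if right:
        gain -= (len(right) / n) * entropy(right)
    return gain

def best_class_to_reserve(labels, split):
    """Return the class whose removal most improves the split's gain, or None."""
    best, best_gain = None, info_gain(labels, split)
    for cls in set(labels):
        keep = [i for i, y in enumerate(labels) if y != cls]
        if len({labels[i] for i in keep}) < 2:
            continue  # removal would leave nothing left to separate
        g = info_gain([labels[i] for i in keep], [split[i] for i in keep])
        if g > best_gain:
            best, best_gain = cls, g
    return best
```

For example, with labels `['a','a','b','b','c','c']` and a split that isolates classes `a` and `b` but scatters `c` across both sides, removing `c` raises the gain, so `c`'s examples would be reserved for another tree in the forest.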
Two methods for behavior recognition are presented and evaluated. Both methods are based on the dynamic temporal difference algorithm Predictive Sequence Learning (PSL), which has previously been proposed as a learning algorithm for robot control. One strength of the proposed recognition methods is that the model PSL builds to recognize behaviors is identical to that used for control, implying that...
In many practical robotics problems, knowledge of the team configuration and capabilities is crucial for coordinating multiple heterogeneous robots. In a challenging environment with costly, sporadic, or absent communication, inference based on observed spatio-temporal state transitions is necessary for learning and reasoning. In this paper, we present a general-purpose inference engine that takes...
Traditional approaches to programming robots are generally inaccessible to non-robotics-experts. A promising exception is the learning from demonstration paradigm, in which a policy mapping world observations to action selection is learned by generalizing from task demonstrations given by a teacher. Most learning from demonstration work to date considers data from a single teacher. In this paper, we consider...
A major issue for reinforcement learning (RL) applied to robotics is the time required to learn a new skill. While RL has been used to learn mobile robot control in many simulated domains, applications involving learning on real robots are still relatively rare. In this paper, the Least-Squares Policy Iteration (LSPI) reinforcement learning algorithm and a new model-based algorithm, Least-Squares Policy...
Reinforcement learning has been commonly used in multi-robot decision making to cope with uncertainties in the environment. A shortcoming of this approach is the need for the robots to change their actions quite frequently, which is not feasible in a physical multi-robot system. This paper focuses on the development of a modified Q-learning algorithm with minimal switching of actions. By introducing...
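The excerpt cuts off before the modified update rule, so the following is only a minimal sketch of one plausible variant: tabular Q-learning on a toy chain MDP where the greedy step subtracts a fixed penalty from any action that differs from the previous one, discouraging frequent switches. The environment, `SWITCH_COST`, and all constants are illustrative assumptions, not the paper's algorithm:

```python
import numpy as np

rng = np.random.default_rng(1)

N_STATES, N_ACTIONS = 3, 2        # chain MDP: action 1 moves right, 0 stays
GAMMA, ALPHA, EPS = 0.9, 0.2, 0.2
SWITCH_COST = 0.3                 # assumed penalty for changing actions

def step(state, action):
    """Reward 1.0 for reaching the rightmost state, else 0."""
    nxt = min(state + 1, N_STATES - 1) if action == 1 else state
    return nxt, 1.0 if nxt == N_STATES - 1 else 0.0

Q = np.zeros((N_STATES, N_ACTIONS))
for _ in range(500):
    s, prev_a = 0, None
    for _ in range(10):
        if rng.random() < EPS:
            a = int(rng.integers(N_ACTIONS))
        else:
            # penalise actions that differ from the previous one,
            # so the agent switches only when it clearly pays off
            scores = Q[s].copy()
            if prev_a is not None:
                scores -= SWITCH_COST * (np.arange(N_ACTIONS) != prev_a)
            a = int(np.argmax(scores))
        s2, r = step(s, a)
        Q[s, a] += ALPHA * (r + GAMMA * Q[s2].max() - Q[s, a])
        s, prev_a = s2, a
        if s == N_STATES - 1:
            break
```

The switching penalty only shapes action selection here; the Q-update itself is standard, which keeps the learned values consistent with the underlying rewards.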