Wyniki wyszukiwania dla: Haibo Liu

Pozycje od 1 do 4 spośród 4 wyników

rozdział

Multi-robot Cooperation Based on Hierarchical Reinforcement Learning

Xiaobei Cheng, Jing Shen, Haibo Liu, Guochang Gu

Lecture Notes in Computer Science > Computational Science – ICCS 2007 > 90-97

Multi-agent reinforcement learning for multi-robot systems is a challenging issue in both robotics and artificial intelligence. But multi-agent reinforcement learning is bedeviled by the curse of dimensionality. In this paper, a novel hierarchical reinforcement learning approach named MOMQ is presented for multi-robot cooperation. The performance of MOMQ is demonstrated in three-robot trash collection...

rozdział

Hierarchical Reinforcement Learning with OMQ

Jing Shen, Haibo Liu, Guochang Gu

2006 5th IEEE International Conference on Cognitive Informatics > 1 > 584 - 588

2006 5th IEEE International Conference on Cognitive Informatics

A novel method of hierarchical reinforcement learning, named OMQ, by integrating options into MAXQ is presented. In OMQ, the MAXQ is used as basic framework to design hierarchies experientially and learn online, and the option is used to construct hierarchies automatically. The performance of OMQ is demonstrated in taxi domain and compared with Option and MAXQ. The simulation results show that the...

rozdział

Multi-Agent Hierarchical Reinforcement Learning by Integrating Options into MAXQ

Jing Shen, Guochang Gu, Haibo Liu

First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'6) > 1 > 676 - 682

First International on Computer and Computational Sciences

MAXQ is a new framework for multi-agent reinforcement learning. But the MAXQ framework cannot decompose all subtasks into more refined hierarchies and the hierarchies are difficult to be discovered automatically. In this paper, a multi-agent hierarchical reinforcement learning approach, named OptMAXQ, by integrating Options into MAXQ is presented. In the OptMAXQ framework, the MAXQ framework is used...

rozdział

Automatic option generation in hierarchical reinforcement learning via immune clustering

Jing Shen, Guochang Gu, Haibo Liu

2006 1st International Symposium on Systems and Control in Aerospace and Astronautics > 4 pp. - 500

First International Symposium on Systems and Control in Aerospace and Astronautics

An open problem in hierarchical reinforcement learning is how to automatically generate hierarchies, e.g. options. We consider an immune clustering approach for automatic construction of options in a dynamic environment. The learning agent generates an undirected edge-weighted topological graph of the environment state transitions online. An immune clustering algorithm is then used to partition the...

Opcje filtrowania

Słowa kluczowe:
HIERARCHICAL REINFORCEMENT LEARNING

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

LEARNING (ARTIFICIAL INTELLIGENCE) (3)
MAXQ (2)
ALGORITHM DESIGN AND ANALYSIS (1)
AUTOMATIC OPTION GENERATION (1)
COMPUTERS (1)
COOPERATION (1)
FUNCTION APPROXIMATION (1)
IMMUNE CLUSTERING (1)
LEARNING (1)
LEARNING AGENT (1)
LEARNING SYSTEMS (1)
MAXQ FRAMEWORK (1)
MULTI-AGENT REINFORCEMENT LEARNING (1)
MULTI-AGENT SYSTEMS (1)
MULTI-ROBOT (1)
MULTIAGENT HIERARCHICAL REINFORCEMENT LEARNING APPROACH (1)
NAVIGATION (1)
OMQ (1)
OPTION (1)
OPTION FRAMEWORK (1)
OPTIONS (1)
OPTMAXQ (1)
ROBOT KINEMATICS (1)
ROBOT TRASH COLLECTION TASK (1)
ROBOTS (1)
SECOND IMMUNE RESPONSE (1)
więcej

Zbiór danych

ieee (3)
Springer (1)

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Haibo Liu

Multi-robot Cooperation Based on Hierarchical Reinforcement Learning

Hierarchical Reinforcement Learning with OMQ

Multi-Agent Hierarchical Reinforcement Learning by Integrating Options into MAXQ

Automatic option generation in hierarchical reinforcement learning via immune clustering

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zbiór danych

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu