Wyniki wyszukiwania dla: R. Babuska

Pozycje od 1 do 2 spośród 2 wyników

rozdział

Control delay in Reinforcement Learning for real-time dynamic systems: A memoryless approach

E Schuitema, L Busoniu, R Babuska, P Jonker

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 3226 - 3231

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

Robots controlled by Reinforcement Learning (RL) are still rare. A core challenge to the application of RL to robotic systems is to learn despite the existence of control delay - the delay between measuring a system's state and acting upon it. Control delay is always present in real systems. In this work, we present two novel temporal difference (TD) learning algorithms for problems with control delay...

rozdział

Policy search with cross-entropy optimization of basis functions

L. Busoniu, D. Ernst, B. De Schutter, R. Babuska

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning > 153 - 160

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning

This paper introduces a novel algorithm for approximate policy search in continuous-state, discrete-action Markov decision processes (MDPs). Previous policy search approaches have typically used ad-hoc parameterizations developed for specific MDPs. In contrast, the novel algorithm employs a flexible policy parameterization, suitable for solving general discrete-action MDPs. The algorithm looks for...

Opcje filtrowania

Słowa kluczowe:
MARKOV PROCESSES

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

APPROXIMATE POLICY SEARCH ALGORITHM (1)
BASIS FUNCTION (1)
CLOSED-LOOP POLICY (1)
COMPUTATIONAL MODELING (1)
CONTINUOUS-STATE DISCRETE-ACTION MARKOV DECISION PROCESS (1)
CONTROL DELAY (1)
CROSS-ENTROPY OPTIMIZATION (1)
DATA MINING (1)
DECISION THEORY (1)
DELAY (1)
DELAYS (1)
ENTROPY (1)
FLEXIBLE POLICY PARAMETERIZATION (1)
FUNCTION APPROXIMATION (1)
GRIDWORLD (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
MEMORYLESS APPROACH (1)
MONTE CARLO METHODS (1)
OPTIMISATION (1)
OPTIMIZATION (1)
PREDICTIVE MODELS (1)
PROBABILITY DENSITY FUNCTION (1)
PROCESS CONTROL (1)
REAL-TIME DYNAMIC SYSTEMS (1)
REINFORCEMENT LEARNING (1)
ROBOT CONTROL (1)
ROBOTIC SYSTEM SIMULATION (1)
ROBOTS (1)
SEARCH PROBLEMS (1)
TEMPORAL DIFFERENCE (1)
TRAJECTORY (1)
VALUE FUNCTION APPROXIMATION (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: R. Babuska

Control delay in Reinforcement Learning for real-time dynamic systems: A memoryless approach

Policy search with cross-entropy optimization of basis functions

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu