Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their training and the validation of final policies can be cumbersome, as neural networks can suffer from problems like local minima or overfitting. When using iterative methods, such as neural fitted Q-iteration, the problem becomes...
A common drawback of standard reinforcement learning algorithms is their inability to scale up to real-world problems. For this reason, an important current research trend is (state-action) value function approximation. A prominent value function approximator is the least-squares temporal difference (LSTD) algorithm. However, for technical reasons, linearity is mandatory: the parameterization of...
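The abstract above mentions LSTD with a linear parameterization. As context, a minimal sketch of the standard batch LSTD computation (not necessarily the method this paper develops): collect transitions, accumulate the matrix A and vector b, and solve for the value-function weights. The names `lstd`, `phi`, and `transitions` are illustrative, and the small ridge term is an assumption added here for numerical invertibility.

```python
import numpy as np

def lstd(transitions, phi, gamma=0.95, reg=1e-6):
    """Least-squares temporal difference: solve A w = b for the weights
    of a linear value function V(s) ~ phi(s) @ w.

    transitions: iterable of (state, reward, next_state) samples
    phi:         feature map, state -> 1-D numpy array
    """
    k = phi(transitions[0][0]).shape[0]
    A = reg * np.eye(k)          # small ridge term so A stays invertible
    b = np.zeros(k)
    for s, r, s_next in transitions:
        f, f_next = phi(s), phi(s_next)
        A += np.outer(f, f - gamma * f_next)   # accumulate TD structure
        b += r * f
    return np.linalg.solve(A, b)
```

On a two-state chain with one-hot features (state 1 self-loops with reward 1, state 0 transitions to state 1 with reward 0), the solved weights match the discounted values V(1) = 1/(1-gamma) and V(0) = gamma * V(1).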
Two common hazards in supervised learning of neural networks are local minima and overfitting. The momentum technique has proved effective at escaping local optima but is vulnerable to overfitting. In contrast, the early stopping technique can avoid overfitting but sometimes terminates in a local minimum. This paper proposes a hybrid...
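Since the abstract is truncated, the paper's hybrid itself is not shown; as background, a minimal sketch of the two ingredients it combines, momentum SGD with early stopping on a held-out loss. The function names and hyperparameter values here are illustrative assumptions, not the paper's method.

```python
import numpy as np

def train_with_early_stopping(grad_fn, val_loss_fn, w, lr=0.1, beta=0.9,
                              patience=20, max_steps=5000):
    """Momentum SGD combined with early stopping.

    grad_fn(w)     -> gradient of the training loss at w
    val_loss_fn(w) -> held-out loss used for the stopping criterion
    """
    v = np.zeros_like(w)
    best_w, best_loss, since_best = w.copy(), val_loss_fn(w), 0
    for _ in range(max_steps):
        v = beta * v - lr * grad_fn(w)   # momentum smooths over shallow minima
        w = w + v
        loss = val_loss_fn(w)
        if loss < best_loss:
            best_w, best_loss, since_best = w.copy(), loss, 0
        else:
            since_best += 1
            if since_best >= patience:   # held-out loss stopped improving
                break
    return best_w                        # weights at the best held-out loss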
This paper considers on-line training of feedforward neural networks. Training examples are only available sampled randomly from a given generator. What emerges in this setting is the problem of adapting step sizes, or learning rates. A scheme for determining step sizes is introduced here that satisfies the following requirements: (i) it does not need any auxiliary problem-dependent parameters,...
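The scheme itself is cut off above; for orientation, one widely used family of step-size adaptation (AdaGrad-style, not necessarily the paper's scheme) shrinks each parameter's rate with its accumulated squared gradient, so no hand-tuned decay schedule is needed. Names and constants below are illustrative assumptions.

```python
import numpy as np

def adagrad(grad_fn, w, steps=500, eta=0.5, eps=1e-8):
    """Per-parameter step-size adaptation (AdaGrad-style sketch):
    each weight's effective rate is eta / sqrt(sum of squared gradients),
    so frequently updated weights automatically get smaller steps."""
    g2 = np.zeros_like(w)
    for _ in range(steps):
        g = grad_fn(w)
        g2 += g * g                          # accumulate squared gradients
        w = w - eta * g / (np.sqrt(g2) + eps)
    return w
```

On a simple quadratic objective the per-parameter rates decay on their own and the iterate converges without any manually scheduled step size.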
A range of different value systems have been proposed for self-motivated agents, including biologically and cognitively inspired approaches. Likewise, these value systems have been integrated with different behavioral systems including reflexive architectures, reward-based learning and supervised learning. However, there is little literature comparing the performance of different value systems for...