Search results

Items from 1 to 6 out of 6 results

chapter

The QV family compared to other reinforcement learning algorithms

M.A. Wiering, H. van Hasselt

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning > 101 - 108

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning

This paper describes several new online model-free reinforcement learning (RL) algorithms. We designed three new reinforcement algorithms, namely: QV2, QVMAX, and QVMAX2, that are all based on the QV-learning algorithm, but in contrary to QV-learning, QVMAX and QVMAX2 are off-policy RL algorithms and QV2 is a new on-policy RL algorithm. We experimentally compare these algorithms to a large number...

chapter

A New Learning Algorithm Based on Trust Region Optimization Theory for Neural Networks

Yunsheng Liu, Xin Liu, Tian Ba

2008 International Conference on Computer Science and Software Engineering > 4 > 788 - 793

2008 International Conference on Computer Science and Software Engineering (CSSE 2008)

Neural network techniques have been widely applied to areas of such as data mining, information integration and grid computing. This paper proposes a new learning algorithm based on trust region optimization theory. In the paper, the Dogleg-algorithm to obtain the valid trust region steps is presented, and a self-adjustable method with variable coefficients is given to resolve the problem of oscillatory...

chapter

Dynamic correlation matrix based multi-Q learning for a multi-robot system

Hongliang Guo, Yan Meng

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems > 840 - 845

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selections, and difficulty in merging learned experiences from other robots. In this paper, we propose a dynamic correlation matrix based multi-Q learning (DCM-MultiQ) method for a distributed multi-robot system. A novel dynamic correlation...

chapter

Research of New Learning Method of Feedforward Neural Network

Jinghong Wang, Bi Li, Chenguang Liu, Jiaomin Liu

2008 International Symposiums on Information Processing > 102 - 106

2008 International Symposiums on Information Processing - ISIP 2008; 2008 International Pacific Workshop on Web Mining and Web-Based Application - WMWA 2008

This paper discussed the sparsed feed-forward neural network, namely, how to determine and delete the redundant neurons and connections in the network. To begin with, the author gives the mathematical definition of feed-forward neural network, and then introduces the partial and topological order to the sparsed algorithm and the learning algorithm of the feed-forward neural network. As a result, the...

chapter

FPGA implementation of a self-organized map with on-chip learning

A. Tisan, S. Oniga, C. Gavrincea, A. Buchman

2008 11th International Conference on Optimization of Electrical and Electronic Equipment > 81 - 86

11th International Conference on Optimization of Electrical and Electronic Equipment. OPTIM 2008

In this paper we propose a method to implement SOM neural network in FPGA circuits: a self organized map neural network with on-chip learning algorithm. The method implies the building of a neural network by generic blocks designed in Mathworks' Simulink environment. The main characteristics of this solution are onchip learning algorithm implementation and high reconfiguration capability and operation...

article

Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks

Qinmin Yang, J.B. Vance, S. Jagannathan

IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) > 2008 > 38 > 4 > 994 - 1001

A nonaffine discrete-time system represented by the nonlinear autoregressive moving average with eXogenous input (NARMAX) representation with unknown nonlinear system dynamics is considered. An equivalent affinelike representation in terms of the tracking error dynamics is first obtained from the original nonaffine nonlinear discrete-time system so that reinforcement-learning-based near-optimal neural...

Filter options

Data set:
ieee
Keywords:
ARTIFICIAL NEURAL NETWORKS
LEARNING (ARTIFICIAL INTELLIGENCE)
EQUATIONS
ALGORITHM DESIGN AND ANALYSIS

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

LEARNING (3)
APPROXIMATION ALGORITHMS (2)
CONVERGENCE (2)
DATA MINING (2)
HARDWARE (2)
MATHEMATICAL MODEL (2)
NEURONS (2)
TUNING (2)
ACCURACY (1)
ACTOR-CRITIC (1)
ADAPTATION MODEL (1)
ADAPTIVE CONTROL (1)
ADAPTIVE CRITIC (1)
ADAPTIVE DYNAMIC PROGRAMMING (1)
ADAPTIVE SYSTEMS (1)
AEROSPACE ELECTRONICS (1)
APPROXIMATION METHODS (1)
AUTOREGRESSIVE MOVING AVERAGE PROCESSES (1)
BACKPROPAGATION (1)
CART POLE BALANCING PROBLEM (1)
CLASSIFICATION ALGORITHMS (1)
CLOSED LOOP SYSTEMS (1)
CLOSED-LOOP SYSTEM STABILITY (1)
COLOR (1)
COMPUTATIONAL MODELING (1)
COMPUTERS (1)
CONTROL DESIGN (1)
CONTROL SYSTEMS (1)
CORRELATION (1)
CORRELATION METHODS (1)
COST FUNCTION (1)
COST FUNCTION MINIMIZATION (1)
CYBERNETICS (1)
DELAY (1)
DESIGN METHODOLOGY (1)
DISCRETE TIME SYSTEMS (1)
DISPERSE DEGREE (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTED MULTIROBOT SYSTEM (1)
DYNAMIC CORRELATION MATRIX (1)
DYNAMIC PROGRAMMING (1)
EDUCATION (1)
ENCODING (1)
FAST LEARNING (1)
FEED FORWARD NEURAL NETWORK (1)
FEEDBACK (1)
FEEDBACK MATRIX CONTROL THEORY (1)
FEEDFORWARD NEURAL NETS (1)
FEEDFORWARD NEURAL NETWORK (1)
FIELD PROGRAMMABLE GATE ARRAYS (1)
FITTING (1)
FPGA CIRCUITS IMPLEMENTATION (1)
HELIUM (1)
HEURISTIC ALGORITHMS (1)
HISTORY (1)
INDEXES (1)
INFORMATION INTEGRATION (1)
INTELLIGENT SYSTEMS (1)
ITERATIVE METHODS (1)
LEARNING ALGORITHM (1)
LINEAR PARAMETERIZED NEURAL NETWORK (1)
LINEAR SYSTEMS (1)
LYAPUNOV APPROACH (1)
LYAPUNOV METHODS (1)
LYAPUNOV STABILITY (1)
MATHWORKS SIMULINK (1)
MINIMISATION (1)
MULTI-ROBOT SYSTEMS (1)
MULTIQ LEARNING (1)
NEAR-OPTIMAL CONTROL SIGNAL (1)
NEURAL NETS (1)
NEURAL NETWORK (1)
NEURAL NETWORK CONTROL (1)
NEURAL NETWORKS (1)
NEUROCONTROLLERS (1)
NONAFFINE NONLINEAR DISCRETE-TIME SYSTEM CONTROL (1)
NONHOMOGENEOUS MEDIA (1)
NONLINEAR AUTOREGRESSIVE MOVING AVERAGE-EXOGENOUS INPUT (1)
NONLINEAR CONTROL SYSTEMS (1)
NONLINEAR DYNAMICAL SYSTEMS (1)
NONLINEAR SYSTEM DYNAMICS (1)
NONLINEAR SYSTEMS (1)
OBJECT RECOGNITION (1)
ONCHIP LEARNING (1)
OPTIMAL CONTROL (1)
OPTIMISATION (1)
OSCILLATORY BEHAVIORS (1)
PARALLEL PROCESSING (1)
QV- MAX2 (1)
QV-LEARNING (1)
QV2 (1)
QVMAX (1)
R-LEARNING (1)
REDUNDANT NEURONS (1)
REINFORCEMENT LEARNING (1)
REINFORCEMENT LEARNING ALGORITHMS (1)
more

INFONA - science communication portal

Search results

The QV family compared to other reinforcement learning algorithms

A New Learning Algorithm Based on Trust Region Optimization Theory for Neural Networks

Dynamic correlation matrix based multi-Q learning for a multi-robot system

Research of New Learning Method of Feedforward Neural Network

FPGA implementation of a self-organized map with on-chip learning

Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options