Search results

Items from 1 to 7 out of 7 results

chapter

Can a reinforcement learning agent practice before it starts learning?

Minwoo Lee, Charles W. Anderson

2017 International Joint Conference on Neural Networks (IJCNN) > 4006 - 4013

2017 International Joint Conference on Neural Networks (IJCNN)

A reinforcement learning (RL) agent needs a fair amount of experience to find a near-optimal policy. Transfer learning has been investigated as a means to reduce the amount of experience required. Transfer learning, however, requires another similar reinforcement learning task as a transfer source, which can also be costly in the amount of experience required. In this research, we examine the possible...

chapter

Self-adaptive multi-objective optimization method design based on agent reinforcement learning for elevator group control systems

Fanlin Zeng, Qun Zong, Zhengya Sun, Liqian Dou

2010 8th World Congress on Intelligent Control and Automation > 2577 - 2582

2010 8th World Congress on Intelligent Control and Automation (WCICA 2010)

This paper study the multi-objective optimization problem of elevator group control systems by using the Markov Decision Process model. Define the Agent to be the leaner and decision-maker of the MDP model. And then using reinforcement learning Algorithm combined with generic method defines the elements of this model. Moreover we use SARSA(λ) value iteration algorithm which was selected to iterative...

chapter

An online approach towards self-generating fuzzy neural networks with applications

Fan Liu, Meng Joo Er, L Rutkowski

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 7

2010 International Joint Conference on Neural Networks (IJCNN 2010)

In this paper, a novel approach towards self-generating fuzzy neural network (SGFNN) is proposed. The proposed approach is simple and effective and is able to generate a fuzzy neural network with high accuracy and compact structure. The structure learning algorithm of the proposed SGFNN combines criteria of rule generation with a pruning technology. The Kalman filter (KF) algorithm is used to adjust...

chapter

The research of classification based on improved RBF neural network

Wang Lin-shuang, Zhou Li-juan, Ge Xue-bin, Shi Qian

2009 4th International Conference on Computer Science&Education > 792 - 796

2009 4th International Conference on Computer Science & Education (ICCSE 2009)

The approximation accuracy of RBF network constructed by the incremental learning algorithm to the target was not high. For function approximation or other requirements of high accuracy, such accuracy of RBF network model can not meet the requirements. We have improved this network model focused on three aspects to improve the bottleneck, and have an experiment and comparatively analyze these improvements...

chapter

Approximate dynamic programming using Bellman residual elimination and Gaussian process regression

B. Bethke, J.P. How

2009 American Control Conference > 745 - 750

2009 American Control Conference (ACC-09)

This paper presents an approximate policy iteration algorithm for solving infinite-horizon, discounted Markov decision processes (MDPs) for which a model of the system is available. The algorithm is similar in spirit to Bellman residual minimization methods. However, by using Gaussian process regression with nondegenerate kernel functions as the underlying cost-to-go function approximation architecture,...

chapter

Training Feedforward Neural Networks by Pruning Algorithm Based on Grey Incidence Analysis

Yan Xiong, Li Wang, Dawei Li

2008 Second International Symposium on Intelligent Information Technology Application > 3 > 535 - 539

2008 Second International Symposium on Intelligent Information Technology Application

In this paper, a new pruning algorithm based on grey incidence analysis for feedforward neural networks is presented.The pruned network has the optimal topology with avoiding over training and obtaining good generalization.The removed connections and the incorporated connections are chosen according to the degree of grey incidence of each output sequence of the network units. The simulation results...

chapter

Historical Temporal Difference Learning: Some Initial Results

Hengshuai Yao, Diao Dongcui, Zengqi Sun

First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'6) > 2 > 678 - 685

First International on Computer and Computational Sciences

In this paper, we develop a multi-step prediction algorithm that is guaranteed to converge when using general function approximation. Besides, the new algorithm should satisfy the following requirements: first, it does not have to be faster than TD(0) in the look-up table representation; however, the new algorithm should be faster than residual gradient method. Second, the new algorithm should learn...

Filter options

Keywords:
HEURISTIC ALGORITHMS
LEARNING (ARTIFICIAL INTELLIGENCE)
FUNCTION APPROXIMATION

Publication date

Set your own date range

Keywords

ALGORITHM DESIGN AND ANALYSIS (4)
APPROXIMATION ALGORITHMS (4)
TRAINING (3)
APPROXIMATION METHODS (2)
ARTIFICIAL NEURAL NETWORKS (2)
DATA MINING (2)
ITERATIVE METHODS (2)
MARKOV PROCESSES (2)
REINFORCEMENT LEARNING (2)
ACCURACY (1)
ADAPTIVE MULTI-OBJECTIVE OPTIMIZATION (1)
AGENT (1)
AGENT REINFORCEMENT LEARNING (1)
APPROXIMATE DYNAMIC PROGRAMMING (1)
APPROXIMATE POLICY ITERATION ALGORITHM (1)
APPROXIMATION ACCURACY (1)
BELLMAN RESIDUAL ELIMINATION ALGORITHM (1)
BELLMAN RESIDUAL MINIMIZATION METHOD (1)
BENCHMARK TESTING (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHMS (1)
CONTROL SYSTEM SYNTHESIS (1)
CONTROL SYSTEMS (1)
CONVERGENCE (1)
CORRELATION (1)
COST-TO-GO FUNCTION APPROXIMATION ARCHITECTURE (1)
DECISION MAKING (1)
DECISION THEORY (1)
DYNAMIC PROGRAMMING (1)
ELEVATOR GROUP CONTROL SYSTEM (1)
ELEVATOR GROUP CONTROL SYSTEMS (1)
ENCODING (1)
ERROR BOUND (1)
ERROR CORRECTION (1)
FEEDFORWARD NEURAL NETS (1)
FEEDFORWARD NEURAL NETWORK (1)
FEEDFORWARD NEURAL NETWORK TRAINING (1)
FUZZY NEURAL NETS (1)
FUZZY NEURAL NETWORKS (1)
GAUSSIAN PROCESS REGRESSION (1)
GAUSSIAN PROCESSES (1)
GRADIENT METHODS (1)
GREY INCIDENCE ANALYSIS (1)
GREY SYSTEMS (1)
IDENTIFICATION (1)
INCREMENTAL LEARNING ALGORITHM (1)
INFINITE HORIZON (1)
INPUT VARIABLES (1)
ITERATION ALGORITHM (1)
ITERATIVE ESTIMATION (1)
KALMAN FILTER ALGORITHM (1)
KALMAN FILTERS (1)
KERNEL (1)
LEARNING (1)
LEAST-SQUARES METHOD (1)
LIFTS (1)
LOOK-UP TABLE REPRESENTATION (1)
MARKOV DECISION PROCESS (1)
MARKOV DECISION PROCESS MODEL (1)
MDP (1)
MINIMISATION (1)
MULTI-STEP PREDICTION (1)
MULTISTEP PREDICTION ALGORITHM (1)
NEURAL NETWORKS (1)
NEURONS (1)
NONDEGENERATE KERNEL FUNCTION (1)
NONLINEAR SYSTEM IDENTIFICATION (1)
ONLINE APPROACH (1)
OPTIMISATION (1)
OPTIMIZING PARAMETERS OF THE EVALUATION FUNCTION (1)
PATTERN CLASSIFICATION (1)
PREDICTION ALGORITHMS (1)
PRUNING ALGORITHM (1)
PRUNING TECHNOLOGY (1)
RADIAL BASIS FUNCTION NETWORKS (1)
RADIAL BASIS FUNCTION NEURAL NETWORK (1)
RBF NEURAL NETWORK (1)
REGRESSION ANALYSIS (1)
REINFORCEMENT LEARNING ALGORITHM (1)
RESIDUAL GRADIENT METHOD (1)
RULE GENERATION (1)
SAMPLING METHODS (1)
SARSA (1)
SARSA(λ ) ALGORITHM (1)
SELF ADAPTIVE MULTIOBJECTIVE OPTIMIZATION METHOD DESIGN (1)
SELF GENERATING FUZZY NEURAL NETWORKS (1)
SILICON (1)
SIMULATION (1)
STATE SPACE SAMPLING (1)
STRUCTURE LEARNING ALGORITHM (1)
SUPPORT VECTOR MACHINES (1)
TABLE LOOKUP (1)
TEMPORAL DIFFERENCE LEARNING (1)
TILE CODING FUNCTION APPROXIMATION (1)
TILES (1)
TIME SERIES (1)
TIME SERIES PREDICTION (1)
more

INFONA - science communication portal

Search results

Can a reinforcement learning agent practice before it starts learning?

Self-adaptive multi-objective optimization method design based on agent reinforcement learning for elevator group control systems

An online approach towards self-generating fuzzy neural networks with applications

The research of classification based on improved RBF neural network

Approximate dynamic programming using Bellman residual elimination and Gaussian process regression

Training Feedforward Neural Networks by Pruning Algorithm Based on Grey Incidence Analysis

Historical Temporal Difference Learning: Some Initial Results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options