Search results

Items from 1 to 6 out of 6 results

chapter

Towards enabling deep learning techniques for adaptive dynamic programming

Zhen Ni, Naresh Malla, Xiangnan Zhong

2017 International Joint Conference on Neural Networks (IJCNN) > 2828 - 2835

2017 International Joint Conference on Neural Networks (IJCNN)

Human-level control through deep learning and deep reinforcement learning have revealed the unique and powerful potentials through a very complex Go game. The AlphaGo, developed by Google DeepMind, has beat the top Go game player early this year. The scientific and technological advancement behind the success of AlphaGo attracted researchers from multiple areas, including machine learning, artificial...

chapter

Tutor learning using linear constraints in approximate dynamic programming

Dotan Di Castro, S Mannor

2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton) > 1384 - 1390

2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

In adaptive control, agents interacting with Markov Decision Processes typically face two types of setups. In the first setup, the environment's model is known and dynamic programming and related methods are used to obtain the optimal control. In the second setup, the environment's model is unknown and reinforcement learning methods are used. In this work we investigate a new setup that is a mix of...

chapter

A Survey of Approximate Dynamic Programming

Wang Lin, Peng Hui, Zhu Hua-yong, Shen Lin-cheng

2009 International Conference on Intelligent Human-Machine Systems and Cybernetics > 2 > 396 - 399

2009 International Conference on Intelligent Human-Machine Systems and Cybernetics. IHMSC 2009

Multi-stage decision problems under uncertainty are abundant in process industries. Markov decision process (MDP) is a general mathematical formulation of such problems. Whereas stochastic programming and dynamic programming are the standard methods to solve MDPs, their unwieldy computational requirements limit their usefulness in real applications. Approximate dynamic programming (ADP) combines simulation...

chapter

Approximate dynamic programming using Bellman residual elimination and Gaussian process regression

B. Bethke, J.P. How

2009 American Control Conference > 745 - 750

2009 American Control Conference (ACC-09)

This paper presents an approximate policy iteration algorithm for solving infinite-horizon, discounted Markov decision processes (MDPs) for which a model of the system is available. The algorithm is similar in spirit to Bellman residual minimization methods. However, by using Gaussian process regression with nondegenerate kernel functions as the underlying cost-to-go function approximation architecture,...

chapter

Cover

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning > c1

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning

The following topics are dealt with: adaptive dynamic programming; reinforcement learning and Markov decision process.

chapter

Learning Policies for Efficiently Identifying Objects of Many Classes

R. Isukapalli, A. Elgammal, R. Greiner

18th International Conference on Pattern Recognition (ICPR'6) > 3 > 356 - 361

2006 18th International Conference on Pattern Recognition

Viola and Jones (VJ) cascade classification methods have proven to be very successful in detecting objects belonging to a single class - e.g., faces. This paper addresses the more challenging "many class detection" problem: detecting and identifying objects that belong to any of a set of classes. We use a set of learned weights (corresponding to the parameters of a set of binary linear separators)...

Filter options

Content availability:
Available
Keywords:
MARKOV DECISION PROCESS
LEARNING (ARTIFICIAL INTELLIGENCE)

Publication date

Set your own date range

Keywords

MARKOV PROCESSES (5)
FUNCTION APPROXIMATION (3)
REINFORCEMENT LEARNING (3)
APPROXIMATE DYNAMIC PROGRAMMING (2)
APPROXIMATION ALGORITHMS (2)
DECISION MAKING (2)
ADAPTIVE CONTROL (1)
ADAPTIVE DYNAMIC PROGRAMMING (1)
ADAPTIVE DYNAMIC PROGRAMMING (ADP) (1)
ALGORITHM DESIGN AND ANALYSIS (1)
APPROXIMATE POLICY ITERATION ALGORITHM (1)
APPROXIMATION METHODS (1)
APPROXIMATION THEORY (1)
ARTIFICIAL NEURAL NETWORKS (1)
BELLMAN RESIDUAL ELIMINATION ALGORITHM (1)
BELLMAN RESIDUAL MINIMIZATION METHOD (1)
BINARY LINEAR SEPARATORS (1)
COMPUTATIONAL INTELLIGENCE (1)
COST-TO-GO FUNCTION APPROXIMATION ARCHITECTURE (1)
CURSE-OF-DIMENSIONALITY (1)
DATA MINING (1)
DECISION THEORY (1)
DECISION TREE CLASSIFIER (1)
DECISION TREES (1)
DEEP LEARNING (1)
DEEP REINFORCEMENT LEARNING (DRL) (1)
ENVELOPE THEOREM (1)
ERROR BOUND (1)
EXPERIENCE REPLAY (1)
GAUSSIAN PROCESS REGRESSION (1)
GAUSSIAN PROCESSES (1)
HEURISTIC ALGORITHMS (1)
IMAGE CLASSIFICATION (1)
INFINITE HORIZON (1)
ITERATIVE METHODS (1)
KERNEL (1)
LEARNING (1)
LEARNING POLICIES (1)
LEAST SQUARES APPROXIMATION (1)
LINEAR CONSTRAINT (1)
LINEAR FUNCTION APPROXIMATION (1)
MACHINE LEARNING (1)
MANY CLASS DETECTION (1)
MARKOV DECISION PROCESSES (1)
MATHEMATICAL FORMULATION (1)
MATHEMATICAL MODEL (1)
MDP (1)
MINIMISATION (1)
MULTISTAGE DECISION PROBLEMS (1)
NONDEGENERATE KERNEL FUNCTION (1)
OBJECT IDENTIFICATION (1)
OPTIMAL CONTROL (1)
OPTIMIZATION (1)
PROCESS INDUSTRIES (1)
PROGRAMMING (1)
REGRESSION ANALYSIS (1)
REINFORCEMENT LEARNING ALGORITHM (1)
SAMPLING METHODS (1)
STATE SPACE (1)
STATE SPACE SAMPLING (1)
STOCHASTIC PROGRAMMING (1)
TUTOR LEARNING (1)
more

INFONA - science communication portal

Search results

Towards enabling deep learning techniques for adaptive dynamic programming

Tutor learning using linear constraints in approximate dynamic programming

A Survey of Approximate Dynamic Programming

Approximate dynamic programming using Bellman residual elimination and Gaussian process regression

Cover

Learning Policies for Efficiently Identifying Objects of Many Classes

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options