Search results

Items from 1 to 5 out of 5 results

chapter

An Approximate Stochastic Annealing algorithm for finite horizon Markov decision processes

Jiaqiao Hu, Hyeong Soo Chang

2010 49th IEEE Conference on Decision and Control (CDC 2010) > 5338 - 5343

We present a simulation-based algorithm called Approximate Stochastic Annealing (ASA) for solving finite-horizon Markov decision processes (MDPs). The algorithm iteratively estimates the optimal policy by sampling from a sequence of probability distribution functions over the policy space. By exploiting a novel connection of ASA to the stochastic approximation method, we show that the sequence of...

chapter

Eigenfunction approximation methods for linearly-solvable optimal control problems

E. Todorov

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning > 161 - 168

We have identified a general class of nonlinear stochastic optimal control problems which can be reduced to computing the principal eigenfunction of a linear operator. Here we develop function approximation methods exploiting this inherent linearity. First we discretize the time axis in a novel way, yielding an integral operator that approximates not only our control problems but also more general...

chapter

Iterative local dynamic programming

E. Todorov, Y. Tassa

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning > 90 - 95

We develop an iterative local dynamic programming method (iLDP) applicable to stochastic optimal control problems in continuous high-dimensional state and action spaces. Such problems are common in the control of biological movement, but cannot be handled by existing methods. iLDP can be considered a generalization of differential dynamic programming, inasmuch as: (a) we use general basis functions...

chapter

Practical numerical methods for stochastic optimal control of biological systems in continuous time and space

A. Simpkins, E. Todorov

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning > 212 - 218

Previous studies have suggested that optimal control is one suitable model for biological movement. In some cases, solutions to optimal control problems are known, such as in the Linear Quadratic Gaussian setting. However, more general cost functionals and nonlinear stochastic systems lead to optimal control problems to which direct solutions are presently unknown, but these solutions would theoretically...

chapter

Decentralized approximate dynamic programming for dynamic networks of agents

H. Lakshmanan, D.P. de Farias

2006 American Control Conference > 6 pp.

We consider control systems consisting of teams of agents operating in stochastic environments and communicating through a network with dynamic topology. An optimal centralized control policy can be derived from the Q-function associated with the problem. However, computing and storing the Q-function is intractable for systems of practical scale, and having a centralized policy may lead to prohibitive...

Filter options

Keywords:
APPROXIMATION METHODS
FUNCTION APPROXIMATION

