Policy iteration, as an adaptive/approximate dynamic programming approach to optimal control, is investigated. The context is optimal control of discrete-time nonlinear dynamics with undiscounted cost functions. Convergence of the learning iterations and uniqueness of the solution to the corresponding Bellman equation are established, leading to the optimality of the limit function to which the learning converges. Moreover, motivated by the empirically faster convergence of learning under policy iteration compared with value iteration-based algorithms, theoretical results are developed proving that, starting from the same initial guess, policy iteration does not converge more slowly than value iteration. Finally, numerical analyses are presented to demonstrate the theoretical results in practice.
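
The comparison between the two schemes can be illustrated on the linear-quadratic special case, where both Bellman recursions reduce to matrix updates. The sketch below is an assumption-laden illustration, not the paper's algorithm: the dynamics x_{k+1} = A x_k + B u_k, the cost matrices Q and R, and the choice of the zero policy as the common (admissible) initial guess are all placeholders chosen for demonstration.

```python
# Minimal sketch: policy iteration (PI) vs. value iteration (VI) on a
# discrete-time linear-quadratic problem with undiscounted cost
# sum_k (x_k^T Q x_k + u_k^T R u_k).  For quadratic value functions
# V(x) = x^T P x, both recursions become updates of the matrix P.
# All matrices below are illustrative assumptions, not from the paper.
import numpy as np
from scipy.linalg import solve_discrete_lyapunov, solve_discrete_are

A = np.array([[0.9, 0.2],
              [0.0, 0.8]])        # Schur stable, so u = 0 is admissible
B = np.array([[0.0],
              [1.0]])
Q, R = np.eye(2), np.array([[1.0]])

def greedy_gain(P):
    """Policy improvement: minimizing feedback gain for value matrix P."""
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

def vi_step(P):
    """Value iteration: one Bellman backup of the value matrix."""
    K = greedy_gain(P)
    Ac = A - B @ K
    return Q + K.T @ R @ K + Ac.T @ P @ Ac

def pi_step(P):
    """Policy iteration: improve the policy, then evaluate it exactly by
    solving the discrete Lyapunov equation for the closed-loop system."""
    K = greedy_gain(P)
    Ac = A - B @ K
    return solve_discrete_lyapunov(Ac.T, Q + K.T @ R @ K)

# Common initial guess: the exact cost of the admissible policy u = 0.
P0 = solve_discrete_lyapunov(A.T, Q)
P_star = solve_discrete_are(A, B, Q, R)   # optimal value matrix, for reference

P_vi, P_pi = P0.copy(), P0.copy()
for k in range(1, 8):
    P_vi, P_pi = vi_step(P_vi), pi_step(P_pi)
    print(k, np.linalg.norm(P_vi - P_star), np.linalg.norm(P_pi - P_star))
# The PI error column shrinks at least as fast as the VI column, consistent
# with PI converging no more slowly than VI from the same initial guess.
```

In this setting the policy-evaluation step of PI solves for the closed-loop cost exactly (a Lyapunov equation), whereas VI performs only a single Bellman backup per iteration, which is the mechanism behind the relative convergence speeds the abstract refers to.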