Search results for: Shalabh Bhatnagar

Items from 1 to 2 out of 2 results

article

Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes

Mohammed Shahid Abdulla, Shalabh Bhatnagar

Discrete Event Dynamic Systems > 2007 > 17 > 1 > 23-52

This article proposes several two-timescale simulation-based actor-critic algorithms for solution of infinite horizon Markov Decision Processes with finite state-space under the average cost criterion. Two of the algorithms are for the compact (non-discrete) action setting while the rest are for finite-action spaces. On the slower timescale, all the algorithms perform a gradient search over corresponding...

article

A time aggregation approach to Markov decision processes

Xi-Ren Cao, Zhiyuan Ren, Shalabh Bhatnagar, Michael Fu, more

Automatica > 2002 > 38 > 6 > 929-943

We propose a time aggregation approach for the solution of infinite horizon average cost Markov decision processes via policy iteration. In this approach, policy update is only carried out when the process visits a subset of the state space. As in state aggregation, this approach leads to a reduced state space, which may lead to a substantial reduction in computational and storage requirements, especially...

Filter options

Keywords:
POLICY ITERATION

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Shalabh Bhatnagar

Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes

A time aggregation approach to Markov decision processes

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Data set

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options