Advanced search

Advanced search in people

From:

To:

Items from 1 to 2 out of 2 results

chapter

Risk-constrained Markov decision processes

V Borkar, R Jain

49th IEEE Conference on Decision and Control (CDC) > 2664 - 2669

2010 49th IEEE Conference on Decision and Control (CDC 2010)

We propose a new constrained Markov decision process framework with risk-type constraints. The risk metric we use is Conditional Value-at-Risk (CVaR), which is gaining popularity in finance. It is a conditional expectation but the conditioning is defined in terms of the level of the tail probability. We propose an iterative offline algorithm to find the risk-contrained optimal control policy. A stochastic...

chapter

Infinite-Horizon Policy-Gradient Estimation with Variable Discount Factor for Markov Decision Process

Bing-Kun Bao, Bao-Qun Yin, Hong-Sheng Xi

2008 3rd International Conference on Innovative Computing Information and Control > 584

2008 3rd International Conference on Innovative Computing Information and Control (ICICIC)

A novel infinite-horizon policy-gradient estimation method with variable discount factor is proposed in this paper. This method tackles the normal policy-gradient estimation methods' limitations on unbalance of the bias and variance by using an incremental sequence as the discount factor. Numerical experiments conducted on the Markov decision process have shown its effectiveness.

Filter options

Keywords:
CONVERGENCE
MARKOV PROCESSES
APPROXIMATION METHODS
MARKOV DECISION PROCESS

Publication date

Set your own date range

Keywords

APPROXIMATION ALGORITHMS (1)
APPROXIMATION THEORY (1)
COMPUTATIONAL MODELING (1)
CONDITIONAL VALUE-AT-RISK (1)
CONSTRAINED MARKOV DECISION PROCESSES (1)
DECISION THEORY (1)
EIGENVALUES AND EIGENFUNCTIONS (1)
ESTIMATION (1)
GRADIENT METHODS (1)
HEURISTIC ALGORITHMS (1)
INCREMENTAL SEQUENCE (1)
INFINITE HORIZON (1)
INFINITE-HORIZON POLICY-GRADIENT ESTIMATION (1)
ITERATIVE METHODS (1)
ITERATIVE OFFLINE ALGORITHM (1)
OPTIMAL CONTROL (1)
OPTIMIZATION (1)
RISK MEASURES (1)
RISK-TYPE CONSTRAINTS (1)
SIMULATION (1)
STOCHASTIC APPROXIMATION-INSPIRED LEARNING VARIANT (1)
STOCHASTIC APPROXIMATIONS (1)
STOCHASTIC PROGRAMMING (1)
TAIL PROBABILITY (1)
VARIABLE DISCOUNT FACTOR (1)
YTTRIUM (1)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Risk-constrained Markov decision processes

Infinite-Horizon Policy-Gradient Estimation with Variable Discount Factor for Markov Decision Process

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options