Search results for: Lei Jiao

Items from 1 to 1 out of 1 results

article

Online adaptive Q-learning method for fully cooperative linear quadratic dynamic games

Xinxing Li, Zhihong Peng, Lei Jiao, Lele Xi, more

Science China Information Sciences > 2019 > 62 > 12 > 1-14

A model-based offline policy iteration (PI) algorithm and a model-free online Q-learning algorithm are proposed for solving fully cooperative linear quadratic dynamic games. The PI-based adaptive Q-learning method can learn the feedback Nash equilibrium online using the state samples generated by behavior policies, without sending inquiries to the system model. Unlike the existing Q-learning methods,...

Filter options

Journal:
Science China Information Sciences

Publication date

Set your own date range

Keywords

ADAPTIVE DYNAMIC PROGRAMMING (1)
FULLY COOPERATIVE LINEAR QUADRATIC DYNAMIC GAMES (1)
OFF-POLICY (1)
POLICY ITERATION (1)
Q-LEARNING (1)
REINFORCEMENT LEARNING (1)

INFONA - science communication portal

Search results for: Lei Jiao

Online adaptive Q-learning method for fully cooperative linear quadratic dynamic games

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options