A partially observable Markov decision process (POMDP) models a control problem in which the agent can only partially observe the state. The two main approaches to solving such tasks are value-function methods and direct search in policy space. This paper introduces the Sequence Q-learning method, which extends the well-known Q-learning algorithm to POMDPs by adding a special sequence-management framework: action values are generalized to "sequence" values, and a "sequence continuity principle" is introduced.
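As background for the extension described above, the following is a minimal sketch of the standard tabular Q-learning algorithm that the method builds on. The environment interface (`step`), the toy chain MDP, and all hyperparameters are illustrative assumptions; the paper's sequence-value machinery and continuity principle are not reproduced here.

```python
import random

def q_learning(n_states, n_actions, step, episodes=500,
               alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Standard tabular Q-learning with an epsilon-greedy policy.

    `step(state, action) -> (next_state, reward, done)` is a
    user-supplied environment function (an assumption of this sketch).
    """
    rng = random.Random(seed)
    Q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            # Epsilon-greedy action selection over the current Q-table.
            if rng.random() < epsilon:
                a = rng.randrange(n_actions)
            else:
                a = max(range(n_actions), key=lambda i: Q[s][i])
            s2, r, done = step(s, a)
            # One-step temporal-difference update toward the bootstrapped target.
            target = r + (0.0 if done else gamma * max(Q[s2]))
            Q[s][a] += alpha * (target - Q[s][a])
            s = s2
    return Q

# Hypothetical toy chain MDP: states 0..3; action 1 moves right,
# action 0 moves left; reaching state 3 yields reward 1 and ends the episode.
def chain_step(s, a):
    s2 = min(s + 1, 3) if a == 1 else max(s - 1, 0)
    return s2, (1.0 if s2 == 3 else 0.0), s2 == 3

Q = q_learning(4, 2, chain_step)
```

In a fully observable chain like this, the learned values favor moving right in every non-terminal state; the paper's contribution is precisely the case where such state-based values are insufficient because the state is only partially observed.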