Reinforcement Learning for a Human-Following Robot

Yang Wang; D. Lee

doi:10.1109/ROMAN.2006.314435

Reinforcement Learning for a Human-Following Robot

Source

ROMAN 2006 - The 15th IEEE International Symposium on Robot and Human Interactive Communication > 309 - 314

Abstract

This paper discusses the use of a mobile robot following a person. It focuses on the less researched interaction with the human attitude through robot movements. The reward, which indicates the attitude of the human, is used to train the network so that the robot learns an appropriate position relative to the person. The algorithm presented in this study overcomes the difficulty that the feedback reward score given by the human has no gradient throughout large parts of the input space. This network works online and has the ability to adapt to unpredictable changes in the person's preference