Using data collected from human teleoperation, our goal is to learn a control policy that maps perception to actuation. Such policies are potentially multi-valued with respect to perception: a single input may map to multiple outputs, depending on the user's objective at a particular time. We propose a multi-valued function regressor to learn a larger class of robot control policies from human demonstration, extending the Hierarchical Dirichlet Process Hidden Markov Model to discover latent variables that represent the unknown objectives in the demonstrated data and the transitions between them. Each objective requires only a single-valued policy function and can therefore be learned with a Gaussian process function regressor. The learned transitions between objectives determine the correct actuation wherever the complete policy function is multi-valued. We present the results of experiments conducted on the Nao humanoid robot platform.
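As a minimal illustration of the per-objective decomposition described above, the sketch below (not the paper's implementation; the kernel, hyperparameters, and synthetic "track"/"mirror" objectives are assumptions for demonstration) fits one small Gaussian process regressor per latent objective. The combined demonstration data is multi-valued in the perception input, but each objective's data is single-valued, so an ordinary GP suffices once the latent objective is known:

```python
import numpy as np

def rbf_kernel(A, B, length_scale=0.2):
    """Squared-exponential kernel between row-vector input sets A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length_scale ** 2)

class GPRegressor:
    """Minimal GP mean-prediction regressor (noise-regularized kernel solve)."""
    def __init__(self, length_scale=0.2, noise=1e-6):
        self.length_scale = length_scale
        self.noise = noise

    def fit(self, X, y):
        K = rbf_kernel(X, X, self.length_scale) + self.noise * np.eye(len(X))
        self.X_train = X
        self.alpha = np.linalg.solve(K, y)
        return self

    def predict(self, X_new):
        return rbf_kernel(X_new, self.X_train, self.length_scale) @ self.alpha

# Synthetic demonstrations: identical perception x, two hypothetical latent
# objectives. Objective 0 demands a = x; objective 1 demands a = 1 - x.
# Pooled, the data is multi-valued; per objective it is single-valued.
x = np.linspace(0.0, 1.0, 21)[:, None]
policies = {0: GPRegressor().fit(x, x.ravel()),
            1: GPRegressor().fit(x, 1.0 - x.ravel())}

# At perception x = 0.25 the correct action depends on the active objective;
# in the full system the HDP-HMM's inferred latent state makes this selection.
query = np.array([[0.25]])
pred0 = policies[0].predict(query)[0]   # close to 0.25
pred1 = policies[1].predict(query)[0]   # close to 0.75
```

At run time, the inferred latent objective and its learned transition dynamics play the role of the dictionary key here, selecting which single-valued regressor produces the actuation.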