Sergey Levine

chapter

GPLAC: Generalizing Vision-Based Robotic Skills Using Weakly Labeled Images

Avi Singh, Larry Yang, Sergey Levine

2017 IEEE International Conference on Computer Vision (ICCV) > 5852 - 5861

We tackle the problem of learning robotic sensorimotor control policies that can generalize to visually diverse and unseen environments. Achieving broad generalization typically requires large datasets, which are difficult to obtain for task-specific interactive processes such as reinforcement learning or learning from demonstration. However, much of the visual diversity in the world can be captured...

chapter

Collective robot reinforcement learning with distributed asynchronous guided policy search

Ali Yahya, Adrian Li, Mrinal Kalakrishnan, Yevgen Chebotar, more

2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 79 - 86

Policy search methods and, more broadly, reinforcement learning can enable robots to learn highly complex and general skills that may allow them to function amid the complexity and diversity of the real world. However, training a policy that generalizes well across a wide range of real-world conditions requires far greater quantity and diversity of experience than is practical to collect with a single...

chapter

Cognitive Mapping and Planning for Visual Navigation

Saurabh Gupta, James Davidson, Sergey Levine, Rahul Sukthankar, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7272 - 7281

We introduce a neural architecture for navigation in novel environments. Our proposed architecture learns to map from first-person views and plans a sequence of actions towards goals in the environment. The Cognitive Mapper and Planner (CMP) is based on two key ideas: a) a unified joint architecture for mapping and planning, such that the mapping is driven by the needs of the planner, and b) a spatial...

chapter

Time-Contrastive Networks: Self-Supervised Learning from Multi-view Observation

Pierre Sermanet, Corey Lynch, Jasmine Hsu, Sergey Levine

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 486 - 487

We propose a self-supervised approach for learning representations of relationships between humans and their environment, including object interactions, attributes, and body pose, entirely from unlabeled videos recorded from multiple viewpoints (Fig. 2). We train an embedding with a triplet loss that contrasts a pair of simultaneous frames from different viewpoints with temporally adjacent and visually...

chapter

PLATO: Policy learning using adaptive trajectory optimization

Gregory Kahn, Tianhao Zhang, Sergey Levine, Pieter Abbeel

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3342 - 3349

Policy search can in principle acquire complex strategies for control of robots and other autonomous systems. When the policy is trained to process raw sensory inputs, such as images and depth maps, it can also acquire a strategy that combines perception and control. However, effectively processing such complex inputs requires an expressive policy class, such as a large neural network. These high-dimensional...

chapter

Reset-free guided policy search: Efficient deep reinforcement learning with stochastic initial states

William Montgomery, Anurag Ajay, Chelsea Finn, Pieter Abbeel, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3373 - 3380

Autonomous learning of robotic skills can allow general-purpose robots to learn wide behavioral repertoires without extensive manual engineering. However, robotic skill learning must typically make trade-offs to enable practical real-world learning, such as requiring manually designed policy or value function representations, initialization from human demonstrations, instrumentation of the training...

chapter

Path integral guided policy search

Yevgen Chebotar, Mrinal Kalakrishnan, Ali Yahya, Adrian Li, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3381 - 3388

Sergey Levine is with Google Brain, Mountain View, CA 94043, USA. We present a policy search method for learning complex feedback control policies that map from high-dimensional sensory inputs to motor torques, for manipulation tasks with discontinuous contact dynamics. We build on a prior technique called guided policy search (GPS), which iteratively optimizes a set of local policies for specific...

chapter

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

Shixiang Gu, Ethan Holly, Timothy Lillicrap, Sergey Levine

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3389 - 3396

Reinforcement learning holds the promise of enabling autonomous robots to learn large repertoires of behavioral skills with minimal human intervention. However, robotic applications of reinforcement learning often compromise the autonomy of the learning process in favor of achieving training times that are practical for real physical systems. This typically involves introducing hand-engineered policy...

chapter

Deep reinforcement learning for tensegrity robot locomotion

Marvin Zhang, Xinyang Geng, Jonathan Bruce, Ken Caluwaerts, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 634 - 641

Tensegrity robots, composed of rigid rods connected by elastic cables, have a number of unique properties that make them appealing for use as planetary exploration rovers. However, control of tensegrity robots remains a difficult problem due to their unusual structures and complex dynamics. In this work, we show how locomotion gaits can be learned automatically using a novel extension of mirror descent...

chapter

Learning from the hindsight plan — Episodic MPC improvement

Aviv Tamar, Garrett Thomas, Tianhao Zhang, Sergey Levine, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 336 - 343

Model predictive control (MPC) is a popular control method that has proved effective for robotics, among other fields. MPC performs re-planning at every time step. Re-planning is done with a limited horizon per computational and real-time constraints and often also for robustness to potential model errors. However, the limited horizon leads to suboptimal performance. In this work, we consider the...

chapter

Deep visual foresight for planning robot motion

Chelsea Finn, Sergey Levine

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2786 - 2793

A key challenge in scaling up robot learning to many skills and environments is removing the need for human supervision, so that robots can collect their own data and improve their own performance without being limited by the cost of requesting human feedback. Model-based reinforcement learning holds the promise of enabling an agent to learn to predict the effects of its actions, which could provide...

chapter

Learning modular neural network policies for multi-task and multi-robot transfer

Coline Devin, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2169 - 2176

Reinforcement learning (RL) can automate a wide variety of robotic skills, but learning each new skill requires considerable real-world data collection and manual representation engineering to design policy classes or features. Using deep reinforcement learning to train general purpose neural network policies alleviates some of the burden of manual representation engineering by using expressive policy...

chapter

Combining self-supervised learning and imitation for vision-based rope manipulation

Ashvin Nair, Dian Chen, Pulkit Agrawal, Phillip Isola, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2146 - 2153

Manipulation of deformable objects, such as ropes and cloth, is an important but challenging problem in robotics. We present a learning-based system where a robot takes as input a sequence of images of a human manipulating a rope from an initial to goal configuration, and outputs a sequence of actions that can reproduce the human demonstration, using only monocular images as input. To perform this...

chapter

One-shot learning of manipulation skills with online dynamics adaptation and neural network priors

Justin Fu, Sergey Levine, Pieter Abbeel

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 4019 - 4026

One of the key challenges in applying reinforcement learning to complex robotic control tasks is the need to gather large amounts of experience in order to find an effective policy for the task at hand. Model-based reinforcement learning can achieve good sample efficiency, but requires the ability to learn a model of the dynamics that is good enough to learn an effective policy. In this work, we develop...

chapter

Learning dexterous manipulation for a soft robotic hand from human demonstrations

Abhishek Gupta, Clemens Eppner, Sergey Levine, Pieter Abbeel

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 3786 - 3793

Dexterous multi-fingered hands can accomplish fine manipulation behaviors that are infeasible with simple robotic grippers. However, sophisticated multi-fingered hands are often expensive and fragile. Low-cost soft hands offer an appealing alternative to more conventional devices, but present considerable challenges in sensing and actuation, making them difficult to apply to more complex manipulation...

chapter

Deep spatial autoencoders for visuomotor learning

Chelsea Finn, Xin Yu Tan, Yan Duan, Trevor Darrell, more

2016 IEEE International Conference on Robotics and Automation (ICRA) > 512 - 519

Reinforcement learning provides a powerful and flexible framework for automated acquisition of robotic motion skills. However, applying reinforcement learning requires a sufficiently detailed representation of the state, including the configuration of task-relevant objects. We present an approach that automates state-space construction by learning a state representation directly from camera images...

chapter

Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search

Tianhao Zhang, Gregory Kahn, Sergey Levine, Pieter Abbeel

2016 IEEE International Conference on Robotics and Automation (ICRA) > 528 - 535

Model predictive control (MPC) is an effective method for controlling robotic systems, particularly autonomous aerial vehicles such as quadcopters. However, application of MPC can be computationally demanding, and typically requires estimating the state of the system, which can be challenging in complex, unstructured environments. Reinforcement learning can in principle forgo the need for explicit...

chapter

Learning deep neural network policies with continuous memory states

Marvin Zhang, Zoe McCarthy, Chelsea Finn, Sergey Levine, more

2016 IEEE International Conference on Robotics and Automation (ICRA) > 520 - 527

Policy learning for partially observed control tasks requires policies that can remember salient information from past observations. In this paper, we present a method for learning policies with internal memory for high-dimensional, continuous systems, such as robotic manipulators. Our approach consists of augmenting the state and action space of the system with continuous-valued memory states that...

chapter

Model-based reinforcement learning with parametrized physical models and optimism-driven exploration

Chris Xie, Sachin Patil, Teodor Moldovan, Sergey Levine, more

2016 IEEE International Conference on Robotics and Automation (ICRA) > 504 - 511

In this paper, we present a robotic model-based reinforcement learning method that combines ideas from model identification and model predictive control. We use a feature-based representation of the dynamics that allows the dynamics model to be fitted with a simple least squares procedure, and the features are identified from a high-level specification of the robot's morphology, consisting of the...

chapter

Optimal control with learned local models: Application to dexterous manipulation

Vikash Kumar, Emanuel Todorov, Sergey Levine

2016 IEEE International Conference on Robotics and Automation (ICRA) > 378 - 383

We describe a method for learning dexterous manipulation skills with a pneumatically-actuated tendon-driven 24-DoF hand. The method combines iteratively refitted time-varying linear models with trajectory optimization, and can be seen as an instance of model-based reinforcement learning or as adaptive optimal control. Its appeal lies in the ability to handle challenging problems with surprisingly...

INFONA - science communication portal

Search results for: Sergey Levine
