Showing 1–17 of 17 results for author: Erez, T

  1. arXiv:2312.09190

    cs.RO

    Efficient Online Learning of Contact Force Models for Connector Insertion

    Authors: Kevin Tracy, Zachary Manchester, Ajinkya Jain, Keegan Go, Stefan Schaal, Tom Erez, Yuval Tassa

    Abstract: Contact-rich manipulation tasks with stiff frictional elements like connector insertion are difficult to model with rigid-body simulators. In this work, we propose a new approach for modeling these environments by learning a quasi-static contact force model instead of a full simulator. Using a feature vector that contains information about the configuration and control, we find a linear mapping ad…

    Submitted 14 December, 2023; originally announced December 2023.

  2. arXiv:2306.08647

    cs.RO cs.AI cs.LG

    Language to Rewards for Robotic Skill Synthesis

    Authors: Wenhao Yu, Nimrod Gileadi, Chuyuan Fu, Sean Kirmani, Kuang-Huei Lee, Montse Gonzalez Arenas, Hao-Tien Lewis Chiang, Tom Erez, Leonard Hasenclever, Jan Humplik, Brian Ichter, Ted Xiao, Peng Xu, Andy Zeng, Tingnan Zhang, Nicolas Heess, Dorsa Sadigh, Jie Tan, Yuval Tassa, Fei Xia

    Abstract: Large language models (LLMs) have demonstrated exciting progress in acquiring diverse new capabilities through in-context learning, ranging from logical reasoning to code-writing. Robotics researchers have also explored using LLMs to advance the capabilities of robotic control. However, since low-level robot actions are hardware-dependent and underrepresented in LLM training corpora, existing effo…

    Submitted 16 June, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: https://language-to-reward.github.io/

  3. arXiv:2212.00541

    cs.RO eess.SY

    Predictive Sampling: Real-time Behaviour Synthesis with MuJoCo

    Authors: Taylor Howell, Nimrod Gileadi, Saran Tunyasuvunakool, Kevin Zakka, Tom Erez, Yuval Tassa

    Abstract: We introduce MuJoCo MPC (MJPC), an open-source, interactive application and software framework for real-time predictive control, based on MuJoCo physics. MJPC allows the user to easily author and solve complex robotics tasks, and currently supports three shooting-based planners: derivative-based iLQG and Gradient Descent, and a simple derivative-free method we call Predictive Sampling. Predictive…

    Submitted 23 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Minor fixes and formatting

  4. arXiv:2110.10819

    cs.LG cs.AI

    Shaking the foundations: delusions in sequence models for interaction and control

    Authors: Pedro A. Ortega, Markus Kunesch, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Joel Veness, Jonas Buchli, Jonas Degrave, Bilal Piot, Julien Perolat, Tom Everitt, Corentin Tallec, Emilio Parisotto, Tom Erez, Yutian Chen, Scott Reed, Marcus Hutter, Nando de Freitas, Shane Legg

    Abstract: The recent phenomenal success of language models has reinvigorated machine learning research, and large sequence models such as transformers are being applied to a variety of domains. One important problem class that has remained relatively elusive however is purposeful adaptive behavior. Currently there is a common perception that sequence models "lack the understanding of the cause and effect of…

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: DeepMind Tech Report, 16 pages, 4 figures

  5. dm_control: Software and Tasks for Continuous Control

    Authors: Yuval Tassa, Saran Tunyasuvunakool, Alistair Muldal, Yotam Doron, Piotr Trochim, Siqi Liu, Steven Bohez, Josh Merel, Tom Erez, Timothy Lillicrap, Nicolas Heess

    Abstract: The dm_control software package is a collection of Python libraries and task suites for reinforcement learning agents in an articulated-body simulation. A MuJoCo wrapper provides convenient bindings to functions and data structures. The PyMJCF and Composer libraries enable procedural model manipulation and task authoring. The Control Suite is a fixed set of tasks with standardised structure, inten…

    Submitted 7 September, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:1801.00690

  6. arXiv:1911.06636

    cs.AI cs.RO

    Catch & Carry: Reusable Neural Controllers for Vision-Guided Whole-Body Tasks

    Authors: Josh Merel, Saran Tunyasuvunakool, Arun Ahuja, Yuval Tassa, Leonard Hasenclever, Vu Pham, Tom Erez, Greg Wayne, Nicolas Heess

    Abstract: We address the longstanding challenge of producing flexible, realistic humanoid character controllers that can perform diverse whole-body tasks involving object interactions. This challenge is central to a variety of fields, from graphics and animation to robotics and motor neuroscience. Our physics-based environment uses realistic actuation and first-person perception -- including touch sensors a…

    Submitted 16 June, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

  7. arXiv:1910.09471

    cs.RO cs.LG

    Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer

    Authors: Rae Jeong, Jackie Kay, Francesco Romano, Thomas Lampe, Tom Rothorl, Abbas Abdolmaleki, Tom Erez, Yuval Tassa, Francesco Nori

    Abstract: Learning robotic control policies in the real world gives rise to challenges in data efficiency, safety, and controlling the initial condition of the system. On the other hand, simulations are a useful alternative as they provide an abundant source of data without the restrictions of the real world. Unfortunately, simulations often fail to accurately model complex real-world phenomena. Traditional…

    Submitted 21 October, 2019; originally announced October 2019.

  8. arXiv:1812.01647

    cs.LG cs.CR stat.ML

    Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures

    Authors: Jonathan Uesato, Ananya Kumar, Csaba Szepesvari, Tom Erez, Avraham Ruderman, Keith Anderson, Krishnamurthy Dvijotham, Nicolas Heess, Pushmeet Kohli

    Abstract: This paper addresses the problem of evaluating learning systems in safety critical domains such as autonomous driving, where failures can have catastrophic consequences. We focus on two problems: searching for scenarios when learned agents fail and assessing their probability of failure. The standard method for agent evaluation in reinforcement learning, Vanilla Monte Carlo, can miss failures enti…

    Submitted 4 December, 2018; originally announced December 2018.

  9. arXiv:1804.06318

    cs.AI cs.NE cs.RO

    Learning Awareness Models

    Authors: Brandon Amos, Laurent Dinh, Serkan Cabi, Thomas Rothörl, Sergio Gómez Colmenarejo, Alistair Muldal, Tom Erez, Yuval Tassa, Nando de Freitas, Misha Denil

    Abstract: We consider the setting of an agent with a fixed body interacting with an unknown and uncertain external world. We show that models trained to predict proprioceptive information about the agent's body come to represent objects in the external world. In spite of being trained with only internally available signals, these dynamic body models come to represent external objects through the necessity o…

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: Accepted to ICLR 2018

  10. arXiv:1802.09564

    cs.RO cs.AI cs.LG

    Reinforcement and Imitation Learning for Diverse Visuomotor Skills

    Authors: Yuke Zhu, Ziyu Wang, Josh Merel, Andrei Rusu, Tom Erez, Serkan Cabi, Saran Tunyasuvunakool, János Kramár, Raia Hadsell, Nando de Freitas, Nicolas Heess

    Abstract: We propose a model-free deep reinforcement learning method that leverages a small amount of demonstration data to assist a reinforcement learning agent. We apply this approach to robotic manipulation tasks and train end-to-end visuomotor policies that map directly from RGB camera inputs to joint velocities. We demonstrate that our approach can solve a wide variety of visuomotor tasks, for which en…

    Submitted 27 May, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: 13 pages, 6 figures, Published in RSS 2018

  11. arXiv:1801.00690

    cs.AI

    DeepMind Control Suite

    Authors: Yuval Tassa, Yotam Doron, Alistair Muldal, Tom Erez, Yazhe Li, Diego de Las Casas, David Budden, Abbas Abdolmaleki, Josh Merel, Andrew Lefrancq, Timothy Lillicrap, Martin Riedmiller

    Abstract: The DeepMind Control Suite is a set of continuous control tasks with a standardised structure and interpretable rewards, intended to serve as performance benchmarks for reinforcement learning agents. The tasks are written in Python and powered by the MuJoCo physics engine, making them easy to use and modify. We include benchmarks for several learning algorithms. The Control Suite is publicly avail…

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: 24 pages, 7 figures, 2 tables

  12. arXiv:1707.02286

    cs.AI

    Emergence of Locomotion Behaviours in Rich Environments

    Authors: Nicolas Heess, Dhruva TB, Srinivasan Sriram, Jay Lemmon, Josh Merel, Greg Wayne, Yuval Tassa, Tom Erez, Ziyu Wang, S. M. Ali Eslami, Martin Riedmiller, David Silver

    Abstract: The reinforcement learning paradigm allows, in principle, for complex behaviours to be learned directly from simple reward signals. In practice, however, it is common to carefully hand-design the reward function to encourage a particular solution, or to derive it from demonstration data. In this paper we explore how a rich environment can help to promote the learning of complex behavior. Specifically…

    Submitted 10 July, 2017; v1 submitted 7 July, 2017; originally announced July 2017.

  13. arXiv:1704.03073

    cs.LG cs.RO

    Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

    Authors: Ivaylo Popov, Nicolas Heess, Timothy Lillicrap, Roland Hafner, Gabriel Barth-Maron, Matej Vecerik, Thomas Lampe, Yuval Tassa, Tom Erez, Martin Riedmiller

    Abstract: Deep learning and reinforcement learning methods have recently been used to solve a variety of problems in continuous control domains. An obvious application of these techniques is dexterous manipulation tasks in robotics which are difficult to solve using traditional control theory or hand-engineered approaches. One example of such a task is to grasp an object and precisely stack it on another. S…

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: 12 pages, 5 Figures

  14. arXiv:1611.01843

    stat.ML cs.AI cs.CV cs.LG cs.NE physics.soc-ph

    Learning to Perform Physics Experiments via Deep Reinforcement Learning

    Authors: Misha Denil, Pulkit Agrawal, Tejas D Kulkarni, Tom Erez, Peter Battaglia, Nando de Freitas

    Abstract: When encountering novel objects, humans are able to infer a wide range of physical properties such as mass, friction and deformability by interacting with them in a goal driven way. This process of active interaction is in the same spirit as a scientist performing experiments to discover hidden facts. Recent advances in artificial intelligence have yielded machines that can achieve superhuman perf…

    Submitted 17 August, 2017; v1 submitted 6 November, 2016; originally announced November 2016.

  15. arXiv:1510.09142

    cs.LG cs.NE

    Learning Continuous Control Policies by Stochastic Value Gradients

    Authors: Nicolas Heess, Greg Wayne, David Silver, Timothy Lillicrap, Yuval Tassa, Tom Erez

    Abstract: We present a unified framework for learning continuous control policies using backpropagation. It supports stochastic control by treating stochasticity in the Bellman equation as a deterministic function of exogenous noise. The product is a spectrum of general policy gradient algorithms that range from model-free methods with value functions to model-based methods without value functions. We use l…

    Submitted 30 October, 2015; originally announced October 2015.

    Comments: 13 pages, NIPS 2015

  16. arXiv:1509.02971

    cs.LG stat.ML

    Continuous control with deep reinforcement learning

    Authors: Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra

    Abstract: We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic pr…

    Submitted 5 July, 2019; v1 submitted 9 September, 2015; originally announced September 2015.

    Comments: 10 pages + supplementary

  17. arXiv:1203.3477

    cs.AI

    A Scalable Method for Solving High-Dimensional Continuous POMDPs Using Local Approximation

    Authors: Tom Erez, William D. Smart

    Abstract: Partially-Observable Markov Decision Processes (POMDPs) are typically solved by finding an approximate global solution to a corresponding belief-MDP. In this paper, we offer a new planning algorithm for POMDPs with continuous state, action and observation spaces. Since such domains have an inherent notion of locality, we can find an approximate solution using local optimization methods. We paramet…

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-160-167