Showing 1–15 of 15 results for author: Coumans, E

  1. Robotic Table Tennis: A Case Study into a High Speed Learning System

    Authors: David B. D'Ambrosio, Jonathan Abelian, Saminda Abeyruwan, Michael Ahn, Alex Bewley, Justin Boyd, Krzysztof Choromanski, Omar Cortes, Erwin Coumans, Tianli Ding, Wenbo Gao, Laura Graesser, Atil Iscen, Navdeep Jaitly, Deepali Jain, Juhana Kangaspunta, Satoshi Kataoka, Gus Kouretas, Yuheng Kuang, Nevena Lazic, Corey Lynch, Reza Mahjourian, Sherry Q. Moore, Thinh Nguyen, Ken Oslund, et al. (10 additional authors not shown)

    Abstract: We present a deep-dive into a real-world robotic learning system that, in previous work, was shown to be capable of hundreds of table tennis rallies with a human and has the ability to precisely return the ball to desired targets. This system puts together a highly optimized perception subsystem, a high-speed low-latency robot controller, a simulation paradigm that can prevent damage in the real w…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Published and presented at Robotics: Science and Systems (RSS2023)

  2. arXiv:2305.14654  [pdf, other]

    cs.RO cs.AI

    Barkour: Benchmarking Animal-level Agility with Quadruped Robots

    Authors: Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee, et al. (19 additional authors not shown)

    Abstract: Animals have evolved various agile locomotion strategies, such as sprinting, leaping, and jumping. There is a growing interest in developing legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agili…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 19 figures

  3. arXiv:2205.12256  [pdf, other]

    cs.CV

    Differentiable Dynamics for Articulated 3d Human Motion Reconstruction

    Authors: Erik Gärtner, Mykhaylo Andriluka, Erwin Coumans, Cristian Sminchisescu

    Abstract: We introduce DiffPhy, a differentiable physics-based model for articulated 3d human motion reconstruction from video. Applications of physics-based reasoning in human motion analysis have so far been limited, both by the complexity of constructing adequate physical models of articulated human motion, and by the formidable challenges of performing stable and efficient inference with physics in the…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted to CVPR 2022

  4. arXiv:2203.10488  [pdf, other]

    cs.RO cs.AI cs.CV

    Inferring Articulated Rigid Body Dynamics from RGBD Video

    Authors: Eric Heiden, Ziang Liu, Vibhav Vineet, Erwin Coumans, Gaurav S. Sukhatme

    Abstract: Being able to reproduce physical phenomena ranging from light interaction to contact mechanics, simulators are becoming increasingly useful in more and more application domains where real-world interaction or labeled data are difficult to obtain. Despite recent progress, significant human effort is needed to configure simulators to accurately reproduce real-world behavior. We introduce a pipeline…

    Submitted 11 September, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: IROS 2022 camera-ready

  5. arXiv:2110.04686  [pdf, other]

    cs.LG cs.AI

    Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization

    Authors: Shixiang Shane Gu, Manfred Diaz, Daniel C. Freeman, Hiroki Furuta, Seyed Kamyar Seyed Ghasemipour, Anton Raichuk, Byron David, Erik Frey, Erwin Coumans, Olivier Bachem

    Abstract: The goal of continuous control is to synthesize desired behaviors. In reinforcement learning (RL)-driven approaches, this is often accomplished through careful task reward engineering for efficient exploration and running an off-the-shelf RL algorithm. While reward maximization is at the core of RL, reward engineering is not the only -- sometimes nor the easiest -- way for specifying complex behav…

    Submitted 9 October, 2021; originally announced October 2021.

  6. arXiv:2109.07578  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    Multi-Task Learning with Sequence-Conditioned Transporter Networks

    Authors: Michael H. Lim, Andy Zeng, Brian Ichter, Maryam Bandari, Erwin Coumans, Claire Tomlin, Stefan Schaal, Aleksandra Faust

    Abstract: Enabling robots to solve multiple manipulation tasks has a wide range of industrial applications. While learning-based approaches enjoy flexibility and generalizability, scaling these approaches to solve such compositional tasks remains a challenge. In this work, we aim to solve multi-task learning through the lens of sequence-conditioning and weighted sampling. First, we propose a new suite of be…

    Submitted 15 September, 2021; originally announced September 2021.

  7. arXiv:2104.04644  [pdf, other]

    cs.RO cs.LG

    Fast and Efficient Locomotion via Learned Gait Transitions

    Authors: Yuxiang Yang, Tingnan Zhang, Erwin Coumans, Jie Tan, Byron Boots

    Abstract: We focus on the problem of developing energy efficient controllers for quadrupedal robots. Animals can actively switch gaits at different speeds to lower their energy consumption. In this paper, we devise a hierarchical learning framework, in which distinctive locomotion gaits and natural gait transitions emerge automatically with a simple reward of energy minimization. We use evolutionary strateg…

    Submitted 22 November, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: Published in CoRL 2021. Website: https://sites.google.com/view/fast-and-efficient Code: https://github.com/yxyang/fast_and_efficient

  8. arXiv:2012.03385  [pdf, other]

    cs.RO cs.LG

    Learning to Rearrange Deformable Cables, Fabrics, and Bags with Goal-Conditioned Transporter Networks

    Authors: Daniel Seita, Pete Florence, Jonathan Tompson, Erwin Coumans, Vikas Sindhwani, Ken Goldberg, Andy Zeng

    Abstract: Rearranging and manipulating deformable objects such as cables, fabrics, and bags is a long-standing challenge in robotic manipulation. The complex dynamics and high-dimensional configuration spaces of deformables, compared to rigid objects, make manipulation difficult not only for multi-step planning, but even for goal specification. Goals cannot be as easily specified as rigid object poses, and…

    Submitted 18 June, 2023; v1 submitted 6 December, 2020; originally announced December 2020.

    Comments: See https://berkeleyautomation.github.io/bags/ for project website and code; v3 is ICRA 2021 version and v4 adds physical experiments and improves simulation results

  9. arXiv:2011.04217  [pdf, other]

    cs.RO

    NeuralSim: Augmenting Differentiable Simulators with Neural Networks

    Authors: Eric Heiden, David Millard, Erwin Coumans, Yizhou Sheng, Gaurav S. Sukhatme

    Abstract: Differentiable simulators provide an avenue for closing the sim-to-real gap by enabling the use of efficient, gradient-based optimization algorithms to find the simulation parameters that best fit the observed sensor readings. Nonetheless, these analytical models can only predict the dynamical behavior of systems for which they have been designed. In this work, we study the augmentation of a novel…

    Submitted 19 May, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted at IEEE International Conference on Robotics and Automation (ICRA) 2021

  10. arXiv:2007.06045  [pdf, other]

    cs.RO cs.LG eess.SY

    Augmenting Differentiable Simulators with Neural Networks to Close the Sim2Real Gap

    Authors: Eric Heiden, David Millard, Erwin Coumans, Gaurav S. Sukhatme

    Abstract: We present a differentiable simulation architecture for articulated rigid-body dynamics that enables the augmentation of analytical models with neural networks at any point of the computation. Through gradient-based optimization, identification of the simulation parameters and network weights is performed efficiently in preliminary experiments on a real-world dataset and in sim2sim transfer applic…

    Submitted 12 July, 2020; originally announced July 2020.

  11. arXiv:2004.00784  [pdf, other]

    cs.RO cs.LG

    Learning Agile Robotic Locomotion Skills by Imitating Animals

    Authors: Xue Bin Peng, Erwin Coumans, Tingnan Zhang, Tsang-Wei Lee, Jie Tan, Sergey Levine

    Abstract: Reproducing the diverse and agile locomotion skills of animals has been a longstanding challenge in robotics. While manually-designed controllers have been able to emulate many complex behaviors, building such controllers involves a time-consuming and difficult development process, often requiring substantial expertise of the nuances of each skill. Reinforcement learning provides an appealing alte…

    Submitted 20 July, 2020; v1 submitted 1 April, 2020; originally announced April 2020.

  12. arXiv:1910.02812  [pdf, other]

    cs.RO cs.AI cs.LG

    Policies Modulating Trajectory Generators

    Authors: Atil Iscen, Ken Caluwaerts, Jie Tan, Tingnan Zhang, Erwin Coumans, Vikas Sindhwani, Vincent Vanhoucke

    Abstract: We propose an architecture for learning complex controllable behaviors by having simple Policies Modulate Trajectory Generators (PMTG), a powerful combination that can provide both memory and prior knowledge to the controller. The result is a flexible architecture that is applicable to a class of problems with periodic motion for which one has an insight into the class of trajectories that might l…

    Submitted 7 October, 2019; originally announced October 2019.

    Journal ref: In Proceedings of The 2nd Conference on Robot Learning, volume 87 of Proceedings of Machine Learning Research, pages 916-926. PMLR, 29-31 Oct 2018

  13. arXiv:1909.12995  [pdf, other]

    cs.RO cs.LG

    Learning Fast Adaptation with Meta Strategy Optimization

    Authors: Wenhao Yu, Jie Tan, Yunfei Bai, Erwin Coumans, Sehoon Ha

    Abstract: The ability to walk in new scenarios is a key milestone on the path toward real-world applications of legged robots. In this work, we introduce Meta Strategy Optimization, a meta-learning algorithm for training policies with latent variable inputs that can quickly adapt to new scenarios with a handful of trials in the target environment. The key idea behind MSO is to expose the same adaptation pro…

    Submitted 15 February, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

  14. arXiv:1805.07831  [pdf, other]

    cs.RO

    Optimizing Simulations with Noise-Tolerant Structured Exploration

    Authors: Krzysztof Choromanski, Atil Iscen, Vikas Sindhwani, Jie Tan, Erwin Coumans

    Abstract: We propose a simple drop-in noise-tolerant replacement for the standard finite difference procedure used ubiquitously in blackbox optimization. In our approach, parameter perturbation directions are defined by a family of structured orthogonal matrices. We show that at the small cost of computing a Fast Walsh-Hadamard/Fourier Transform (FWHT/FFT), such structured finite differences consistently gi…

    Submitted 20 May, 2018; originally announced May 2018.

  15. arXiv:1804.10332  [pdf, other]

    cs.RO cs.AI

    Sim-to-Real: Learning Agile Locomotion For Quadruped Robots

    Authors: Jie Tan, Tingnan Zhang, Erwin Coumans, Atil Iscen, Yunfei Bai, Danijar Hafner, Steven Bohez, Vincent Vanhoucke

    Abstract: Designing agile locomotion for quadruped robots often requires extensive expertise and tedious manual tuning. In this paper, we present a system to automate this process by leveraging deep reinforcement learning techniques. Our system can learn quadruped locomotion from scratch using simple reward signals. In addition, users can provide an open loop reference to guide the learning process when mor…

    Submitted 16 May, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

    Comments: Accompanying video: https://www.youtube.com/watch?v=lUZUr7jxoqM