Skip to main content

Showing 1–33 of 33 results for author: Balakrishna, A

  1. arXiv:2406.09246  [pdf, other

    cs.RO cs.LG

    OpenVLA: An Open-Source Vision-Language-Action Model

    Authors: Moo Jin Kim, Karl Pertsch, Siddharth Karamcheti, Ted Xiao, Ashwin Balakrishna, Suraj Nair, Rafael Rafailov, Ethan Foster, Grace Lam, Pannag Sanketi, Quan Vuong, Thomas Kollar, Benjamin Burchfiel, Russ Tedrake, Dorsa Sadigh, Sergey Levine, Percy Liang, Chelsea Finn

    Abstract: Large policies pretrained on a combination of Internet-scale vision-language data and diverse robot demonstrations have the potential to change how we teach robots new skills: rather than training new behaviors from scratch, we can fine-tune such vision-language-action (VLA) models to obtain robust, generalizable policies for visuomotor control. Yet, widespread adoption of VLAs for robotics has be… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Website: https://openvla.github.io/

  2. arXiv:2406.08878  [pdf, other

    cs.LG

    CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving

    Authors: Jonathan Booher, Khashayar Rohanimanesh, Junhong Xu, Vladislav Isenbaev, Ashwin Balakrishna, Ishan Gupta, Wei Liu, Aleksandr Petiushko

    Abstract: Modern approaches to autonomous driving rely heavily on learned components trained with large amounts of human driving data via imitation learning. However, these methods require large amounts of expensive data collection and even then face challenges with safely handling long-tail scenarios and compounding errors over time. At the same time, pure Reinforcement Learning (RL) methods can fail to le… ▽ More

    Submitted 26 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  4. arXiv:2402.07865  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models

    Authors: Siddharth Karamcheti, Suraj Nair, Ashwin Balakrishna, Percy Liang, Thomas Kollar, Dorsa Sadigh

    Abstract: Visually-conditioned language models (VLMs) have seen growing adoption in applications such as visual dialogue, scene understanding, and robotic task planning; adoption that has fueled a wealth of new models such as LLaVa, InstructBLIP, and PaLI-3. Despite the volume of new releases, key design decisions around image preprocessing, architecture, and optimization are under-explored, making it chall… ▽ More

    Submitted 30 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024. 22 pages, 11 figures. Training code and models: https://github.com/TRI-ML/prismatic-vlms. Evaluation code: https://github.com/TRI-ML/vlm-evaluation

  5. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  6. arXiv:2210.07432  [pdf, other

    cs.LG cs.AI

    Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations

    Authors: Albert Wilcox, Ashwin Balakrishna, Jules Dedieu, Wyame Benslimane, Daniel S. Brown, Ken Goldberg

    Abstract: Providing densely shaped reward functions for RL algorithms is often exceedingly challenging, motivating the development of RL algorithms that can learn from easier-to-specify sparse reward functions. This sparsity poses new exploration challenges. One common way to address this problem is using demonstrations to provide initial signal about regions of the state space with high rewards. However, p… ▽ More

    Submitted 20 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: To be published in the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). 19 pages. 11 figures

  7. arXiv:2207.00911  [pdf, other

    cs.RO

    Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies

    Authors: Satvik Sharma, Ellen Novoseller, Vainavi Viswanath, Zaynah Javed, Rishi Parikh, Ryan Hoque, Ashwin Balakrishna, Daniel S. Brown, Ken Goldberg

    Abstract: Simulation-to-reality transfer has emerged as a popular and highly successful method to train robotic control policies for a wide variety of tasks. However, it is often challenging to determine when policies trained in simulation are ready to be transferred to the physical world. Deploying policies that have been trained with very little simulation data can result in unreliable and dangerous behav… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: CASE 2022. The first two authors contributed equally. 9 pages; 5 figures; 1 table

  8. arXiv:2201.10081  [pdf, ps, other

    cs.LG cs.AI

    Dynamics-Aware Comparison of Learned Reward Functions

    Authors: Blake Wulfe, Ashwin Balakrishna, Logan Ellis, Jean Mercat, Rowan McAllister, Adrien Gaidon

    Abstract: The ability to learn reward functions plays an important role in enabling the deployment of intelligent agents in the real world. However, comparing reward functions, for example as a means of evaluating reward learning methods, presents a challenge. Reward functions are typically compared by considering the behavior of optimized policies, but this approach conflates deficiencies in the reward fun… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

  9. arXiv:2112.03575  [pdf, other

    cs.LG cs.AI cs.RO

    MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance

    Authors: Michael Luo, Ashwin Balakrishna, Brijen Thananjeyan, Suraj Nair, Julian Ibarz, Jie Tan, Chelsea Finn, Ion Stoica, Ken Goldberg

    Abstract: Safe exploration is critical for using reinforcement learning (RL) in risk-sensitive environments. Recent work learns risk measures which measure the probability of violating constraints, which can then be used to enable safety. However, learning such risk measures requires significant interaction with the environment, resulting in excessive constraint violations during learning. Furthermore, thes… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Journal ref: Workshop on Safe and Robust Control of Uncertain Systems at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Online

  10. arXiv:2111.15002  [pdf, other

    cs.RO

    LEGS: Learning Efficient Grasp Sets for Exploratory Grasping

    Authors: Letian Fu, Michael Danielczuk, Ashwin Balakrishna, Daniel S. Brown, Jeffrey Ichnowski, Eugen Solowjow, Ken Goldberg

    Abstract: While deep learning has enabled significant progress in designing general purpose robot grasping systems, there remain objects which still pose challenges for these systems. Recent work on Exploratory Grasping has formalized the problem of systematically exploring grasps on these adversarial objects and explored a multi-armed bandit model for identifying high-quality grasps on each object stable p… ▽ More

    Submitted 1 March, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Proceedings of 2022 IEEE International Conference on Robotics and Automation. Philadelphia, PA. May, 2022

  11. arXiv:2109.08273  [pdf, other

    cs.RO cs.AI

    ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning

    Authors: Ryan Hoque, Ashwin Balakrishna, Ellen Novoseller, Albert Wilcox, Daniel S. Brown, Ken Goldberg

    Abstract: Effective robot learning often requires online human feedback and interventions that can cost significant human time, giving rise to the central challenge in interactive imitation learning: is it possible to control the timing and length of interventions to both facilitate learning and limit burden on the human supervisor? This paper presents ThriftyDAgger, an algorithm for actively querying a hum… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: CoRL 2021 Oral

  12. arXiv:2107.08942  [pdf, other

    cs.RO cs.AI cs.LG

    Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies

    Authors: Priya Sundaresan, Jennifer Grannen, Brijen Thananjeyan, Ashwin Balakrishna, Jeffrey Ichnowski, Ellen Novoseller, Minho Hwang, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Robot manipulation for untangling 1D deformable structures such as ropes, cables, and wires is challenging due to their infinite dimensional configuration space, complex dynamics, and tendency to self-occlude. Analytical controllers often fail in the presence of dense configurations, due to the difficulty of grasping between adjacent cable segments. We present two algorithms that enhance robust ca… ▽ More

    Submitted 29 June, 2021; originally announced July 2021.

  13. arXiv:2107.05789  [pdf, other

    cs.RO cs.AI cs.CV

    Kit-Net: Self-Supervised Learning to Kit Novel 3D Objects into Novel 3D Cavities

    Authors: Shivin Devgon, Jeffrey Ichnowski, Michael Danielczuk, Daniel S. Brown, Ashwin Balakrishna, Shirin Joshi, Eduardo M. C. Rocha, Eugen Solowjow, Ken Goldberg

    Abstract: In industrial part kitting, 3D objects are inserted into cavities for transportation or subsequent assembly. Kitting is a critical step as it can decrease downstream processing and handling times and enable lower storage and shipping costs. We present Kit-Net, a framework for kitting previously unseen 3D objects into cavities given depth images of both the target cavity and an object held by a gri… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Journal ref: Conference on Automation Science and Engineering (CASE) 2021

  14. arXiv:2107.04775  [pdf, other

    cs.LG cs.AI cs.RO

    LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks

    Authors: Albert Wilcox, Ashwin Balakrishna, Brijen Thananjeyan, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Reinforcement learning (RL) has shown impressive success in exploring high-dimensional environments to learn complex tasks, but can often exhibit unsafe behaviors and require extensive environment interaction when exploration is unconstrained. A promising strategy for learning in dynamically uncertain environments is requiring that the agent can robustly return to learned safe sets, where task suc… ▽ More

    Submitted 20 September, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: Conference on Robot Learning (CoRL) 2021. First two authors contributed equally

    Journal ref: Conference on Robot Learning (CoRL) 2021

  15. arXiv:2106.06499  [pdf, other

    cs.LG cs.AI

    Policy Gradient Bayesian Robust Optimization for Imitation Learning

    Authors: Zaynah Javed, Daniel S. Brown, Satvik Sharma, Jerry Zhu, Ashwin Balakrishna, Marek Petrik, Anca D. Dragan, Ken Goldberg

    Abstract: The difficulty in specifying rewards for many real-world problems has led to an increased focus on learning rewards from human feedback, such as demonstrations. However, there are often many different reward functions that explain the human feedback, leaving agents with uncertainty over what the true reward function is. While most policy optimization approaches handle this uncertainty by optimizin… ▽ More

    Submitted 21 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: In proceedings of the International Conference on Machine Learning (ICML) 2021

  16. arXiv:2106.02252  [pdf, other

    cs.RO cs.AI cs.LG

    Disentangling Dense Multi-Cable Knots

    Authors: Vainavi Viswanath, Jennifer Grannen, Priya Sundaresan, Brijen Thananjeyan, Ashwin Balakrishna, Ellen Novoseller, Jeffrey Ichnowski, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Disentangling two or more cables requires many steps to remove crossings between and within cables. We formalize the problem of disentangling multiple cables and present an algorithm, Iterative Reduction Of Non-planar Multiple cAble kNots (IRON-MAN), that outputs robot actions to remove crossings from multi-cable knotted structures. We instantiate this algorithm with a learned perception system, i… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: First three authors contributed equally

  17. arXiv:2105.14246  [pdf, other

    cs.RO cs.CV

    Orienting Novel 3D Objects Using Self-Supervised Learning of Rotation Transforms

    Authors: Shivin Devgon, Jeffrey Ichnowski, Ashwin Balakrishna, Harry Zhang, Ken Goldberg

    Abstract: Orienting objects is a critical component in the automation of many packing and assembly tasks. We present an algorithm to orient novel objects given a depth image of the object in its current and desired orientation. We formulate a self-supervised objective for this problem and train a deep neural network to estimate the 3D rotation as parameterized by a quaternion, between these current and desi… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

    Journal ref: Conference on Automation Science and Engineering (CASE) 2020

  18. arXiv:2104.00053  [pdf, other

    cs.RO cs.AI

    LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

    Authors: Ryan Hoque, Ashwin Balakrishna, Carl Putterman, Michael Luo, Daniel S. Brown, Daniel Seita, Brijen Thananjeyan, Ellen Novoseller, Ken Goldberg

    Abstract: Corrective interventions while a robot is learning to automate a task provide an intuitive method for a human supervisor to assist the robot and convey information about desired behavior. However, these interventions can impose significant burden on a human supervisor, as each intervention interrupts other work the human is doing, incurs latency with each context switch between supervisor and auto… ▽ More

    Submitted 20 July, 2021; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: IEEE CASE 2021

  19. arXiv:2102.09754  [pdf, other

    cs.RO cs.AI cs.CV

    VisuoSpatial Foresight for Physical Sequential Fabric Manipulation

    Authors: Ryan Hoque, Daniel Seita, Ashwin Balakrishna, Aditya Ganapathi, Ajay Kumar Tanwani, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation has applications in home robotics, textiles, senior care and surgery. Existing fabric manipulation techniques, however, are designed for specific tasks, making it difficult to generalize across different but related tasks. We build upon the Visual Foresight framework to learn fabric dynamics that can be efficiently reused to accomplish different sequential fabric manipu… ▽ More

    Submitted 20 July, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: Journal extension of prior work on VSF to appear in Autonomous Robots S.I. 207. arXiv admin note: text overlap with arXiv:2003.09044

  20. arXiv:2011.05661  [pdf, other

    cs.RO cs.AI cs.LG

    Accelerating Grasp Exploration by Leveraging Learned Priors

    Authors: Han Yu Li, Michael Danielczuk, Ashwin Balakrishna, Vishal Satish, Ken Goldberg

    Abstract: The ability of robots to grasp novel objects has industry applications in e-commerce order fulfillment and home service. Data-driven grasping policies have achieved success in learning general strategies for grasping arbitrary objects. However, these approaches can fail to grasp objects which have complex geometry or are significantly outside of the training distribution. We present a Thompson sam… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Conference on Automation Science and Engineering (CASE) 2020. First three authors contributed equally

  21. arXiv:2011.05632  [pdf, other

    cs.RO cs.AI cs.LG

    Exploratory Grasping: Asymptotically Optimal Algorithms for Grasping Challenging Polyhedral Objects

    Authors: Michael Danielczuk, Ashwin Balakrishna, Daniel S. Brown, Shivin Devgon, Ken Goldberg

    Abstract: There has been significant recent work on data-driven algorithms for learning general-purpose grasping policies. However, these policies can consistently fail to grasp challenging objects which are significantly out of the distribution of objects in the training data or which have very few high quality grasps. Motivated by such objects, we propose a novel problem setting, Exploratory Grasping, for… ▽ More

    Submitted 11 November, 2020; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: Conference on Robot Learning (CoRL) 2020. First two authors contributed equally

  22. arXiv:2011.04999  [pdf, other

    cs.RO cs.AI cs.LG

    Untangling Dense Knots by Learning Task-Relevant Keypoints

    Authors: Jennifer Grannen, Priya Sundaresan, Brijen Thananjeyan, Jeffrey Ichnowski, Ashwin Balakrishna, Minho Hwang, Vainavi Viswanath, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Untangling ropes, wires, and cables is a challenging task for robots due to the high-dimensional configuration space, visual homogeneity, self-occlusions, and complex dynamics. We consider dense (tight) knots that lack space between self-intersections and present an iterative approach that uses learned geometric structure in configurations. We instantiate this into an algorithm, HULK: Hierarchical… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Conference on Robot Learning (CoRL) 2020 Oral. First two authors contributed equally

    Journal ref: 4th Conference on Robot Learning (CoRL 2020)

  23. arXiv:2010.15920  [pdf, other

    cs.LG cs.AI cs.RO

    Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones

    Authors: Brijen Thananjeyan, Ashwin Balakrishna, Suraj Nair, Michael Luo, Krishnan Srinivasan, Minho Hwang, Joseph E. Gonzalez, Julian Ibarz, Chelsea Finn, Ken Goldberg

    Abstract: Safety remains a central obstacle preventing widespread use of RL in the real world: learning new tasks in uncertain environments requires extensive exploration, but safety requires limiting exploration. We propose Recovery RL, an algorithm which navigates this tradeoff by (1) leveraging offline data to learn about constraint violating zones before policy learning and (2) separating the goals of i… ▽ More

    Submitted 17 May, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: RA-L and ICRA 2021. First two authors contributed equally

    Journal ref: Robotics and Automation Letters (RA-L) and International Conference on Robotics and Automation (ICRA) 2021

  24. arXiv:2010.04339  [pdf, other

    cs.CV cs.RO

    MMGSD: Multi-Modal Gaussian Shape Descriptors for Correspondence Matching in 1D and 2D Deformable Objects

    Authors: Aditya Ganapathi, Priya Sundaresan, Brijen Thananjeyan, Ashwin Balakrishna, Daniel Seita, Ryan Hoque, Joseph E. Gonzalez, Ken Goldberg

    Abstract: We explore learning pixelwise correspondences between images of deformable objects in different configurations. Traditional correspondence matching approaches such as SIFT, SURF, and ORB can fail to provide sufficient contextual information for fine-grained manipulation. We propose Multi-Modal Gaussian Shape Descriptor (MMGSD), a new visual representation of deformable objects which extends ideas… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: IROS 2020 Workshop on Managing Deformation: A Step Towards Higher Robot Autonomy

  25. arXiv:2003.12698  [pdf, other

    cs.RO cs.CV cs.LG

    Learning Dense Visual Correspondences in Simulation to Smooth and Fold Real Fabrics

    Authors: Aditya Ganapathi, Priya Sundaresan, Brijen Thananjeyan, Ashwin Balakrishna, Daniel Seita, Jennifer Grannen, Minho Hwang, Ryan Hoque, Joseph E. Gonzalez, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation is challenging due to the infinite dimensional configuration space, self-occlusion, and complex dynamics of fabrics. There has been significant prior work on learning policies for specific deformable manipulation tasks, but comparatively less focus on algorithms which can efficiently learn many different tasks. In this paper, we learn visual correspondences for deformab… ▽ More

    Submitted 11 November, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

  26. arXiv:2003.09044  [pdf, other

    cs.RO cs.AI cs.CV

    VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation

    Authors: Ryan Hoque, Daniel Seita, Ashwin Balakrishna, Aditya Ganapathi, Ajay Kumar Tanwani, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation has applications in home robotics, textiles, senior care and surgery. Existing fabric manipulation techniques, however, are designed for specific tasks, making it difficult to generalize across different but related tasks. We extend the Visual Foresight framework to learn fabric dynamics that can be efficiently reused to accomplish different fabric manipulation tasks wi… ▽ More

    Submitted 18 February, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: Robotics: Science and Systems (RSS) 2020

  27. arXiv:2003.01835  [pdf, other

    cs.RO cs.CV cs.LG

    Learning Rope Manipulation Policies Using Dense Object Descriptors Trained on Synthetic Depth Data

    Authors: Priya Sundaresan, Jennifer Grannen, Brijen Thananjeyan, Ashwin Balakrishna, Michael Laskey, Kevin Stone, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Robotic manipulation of deformable 1D objects such as ropes, cables, and hoses is challenging due to the lack of high-fidelity analytic models and large configuration spaces. Furthermore, learning end-to-end manipulation policies directly from images and physical interaction requires significant time on a robot and can fail to generalize across tasks. We address these challenges using interpretabl… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Journal ref: 2020 International Conference on Robotics and Automation

  28. arXiv:2003.01410  [pdf, other

    eess.SY cs.LG cs.RO

    ABC-LMPC: Safe Sample-Based Learning MPC for Stochastic Nonlinear Dynamical Systems with Adjustable Boundary Conditions

    Authors: Brijen Thananjeyan, Ashwin Balakrishna, Ugo Rosolia, Joseph E. Gonzalez, Aaron Ames, Ken Goldberg

    Abstract: Sample-based learning model predictive control (LMPC) strategies have recently attracted attention due to their desirable theoretical properties and their good empirical performance on robotic tasks. However, prior analysis of LMPC controllers for stochastic systems has mainly focused on linear systems in the iterative learning control setting. We present a novel LMPC algorithm, Adjustable Boundar… ▽ More

    Submitted 15 May, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: Workshop on the Algorithmic Foundations of Robotics (WAFR) 2020. First two authors contributed equally

    Journal ref: 14th International Workshop on the Algorithmic Foundations of Robotics (WAFR) 2020

  29. arXiv:1910.04854  [pdf, other

    cs.RO cs.AI cs.CV

    Deep Imitation Learning of Sequential Fabric Smoothing From an Algorithmic Supervisor

    Authors: Daniel Seita, Aditya Ganapathi, Ryan Hoque, Minho Hwang, Edward Cen, Ajay Kumar Tanwani, Ashwin Balakrishna, Brijen Thananjeyan, Jeffrey Ichnowski, Nawid Jamali, Katsu Yamane, Soshi Iba, John Canny, Ken Goldberg

    Abstract: Sequential pulling policies to flatten and smooth fabrics have applications from surgery to manufacturing to home tasks such as bed making and folding clothes. Due to the complexity of fabric states and dynamics, we apply deep imitation learning to learn policies that, given color (RGB), depth (D), or combined color-depth (RGBD) images of a rectangular fabric sample, estimate pick points and pull… ▽ More

    Submitted 2 March, 2020; v1 submitted 23 September, 2019; originally announced October 2019.

    Comments: Supplementary material is available at https://sites.google.com/view/fabric-smoothing ; Version 2 has significant improvements with new results and figures

  30. arXiv:1907.03423  [pdf, other

    cs.LG cs.AI cs.RO

    On-Policy Robot Imitation Learning from a Converging Supervisor

    Authors: Ashwin Balakrishna, Brijen Thananjeyan, Jonathan Lee, Felix Li, Arsh Zahed, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Existing on-policy imitation learning algorithms, such as DAgger, assume access to a fixed supervisor. However, there are many settings where the supervisor may evolve during policy learning, such as a human performing a novel task or an improving algorithmic controller. We formalize imitation learning from a "converging supervisor" and provide sublinear static and dynamic regret guarantees agains… ▽ More

    Submitted 15 May, 2020; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: Conference on Robot Learning (CoRL) 2019 Oral. First two authors contributed equally

    Journal ref: 3rd Conference on Robot Learning (CoRL 2019)

  31. arXiv:1905.13402  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Safety Augmented Value Estimation from Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks

    Authors: Brijen Thananjeyan, Ashwin Balakrishna, Ugo Rosolia, Felix Li, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine, Francesco Borrelli, Ken Goldberg

    Abstract: Reinforcement learning (RL) for robotics is challenging due to the difficulty in hand-engineering a dense cost function, which can lead to unintended behavior, and dynamical uncertainty, which makes exploration and constraint satisfaction challenging. We address these issues with a new model-based reinforcement learning algorithm, Safety Augmented Value Estimation from Demonstrations (SAVED), whic… ▽ More

    Submitted 15 May, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Robotics and Automation Letters and International Conference on Robotics and Automation 2020. First two authors contributed equally

    Journal ref: Robotics and Automation Letters 2020

  32. Mechanical Search: Multi-Step Retrieval of a Target Object Occluded by Clutter

    Authors: Michael Danielczuk, Andrey Kurenkov, Ashwin Balakrishna, Matthew Matl, David Wang, Roberto Martín-Martín, Animesh Garg, Silvio Savarese, Ken Goldberg

    Abstract: When operating in unstructured environments such as warehouses, homes, and retail centers, robots are frequently required to interactively search for and retrieve specific objects from cluttered bins, shelves, or tables. Mechanical Search describes the class of tasks where the goal is to locate and extract a known target object. In this paper, we formalize Mechanical Search and study a version whe… ▽ More

    Submitted 4 March, 2019; originally announced March 2019.

    Comments: To appear in IEEE International Conference on Robotics and Automation (ICRA), 2019. 9 pages with 4 figures

  33. arXiv:1804.00714  [pdf, other

    cs.LG stat.ML

    Predicting Electric Vehicle Charging Station Usage: Using Machine Learning to Estimate Individual Station Statistics from Physical Configurations of Charging Station Networks

    Authors: Anshul Ramachandran, Ashwin Balakrishna, Peter Kundzicz, Anirudh Neti

    Abstract: Electric vehicles (EVs) have been gaining popularity due to their environmental friendliness and efficiency. EV charging station networks are scalable solutions for supporting increasing numbers of EVs within modern electric grid constraints, yet few tools exist to aid the physical configuration design of new networks. We use neural networks to predict individual charging station usage statistics… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

    Comments: 8 pages, 9 figures