Skip to main content

Showing 1–50 of 140 results for author: Goldberg, K

  1. arXiv:2405.09581  [pdf, other

    cs.RO

    Self-Supervised Learning of Dynamic Planar Manipulation of Free-End Cables

    Authors: Jonathan Wang, Huang Huang, Vincent Lim, Harry Zhang, Jeffrey Ichnowski, Daniel Seita, Yunliang Chen, Ken Goldberg

    Abstract: Dynamic manipulation of free-end cables has applications for cable management in homes, warehouses and manufacturing plants. We present a supervised learning approach for dynamic manipulation of free-end cables, focusing on the problem of getting the cable endpoint to a designated target position, which may lie outside the reachable workspace of the robot end effector. We present a simulator, tune… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  2. arXiv:2405.05226  [pdf, other

    cs.RO

    SuFIA: Language-Guided Augmented Dexterity for Robotic Surgical Assistants

    Authors: Masoud Moghani, Lars Doorenbos, William Chung-Ho Panitch, Sean Huver, Mahdi Azizian, Ken Goldberg, Animesh Garg

    Abstract: In this work, we present SuFIA, the first framework for natural language-guided augmented dexterity for robotic surgical assistants. SuFIA incorporates the strong reasoning capabilities of large language models (LLMs) with perception modules to implement high-level planning and low-level control of a robot for surgical sub-task execution. This enables a learning-free approach to surgical augmented… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2405.01472  [pdf, other

    cs.RO cs.AI

    IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning

    Authors: Ryan Hoque, Ajay Mandlekar, Caelan Garrett, Ken Goldberg, Dieter Fox

    Abstract: Imitation learning is a promising paradigm for training robot control policies, but these policies can suffer from distribution shift, where the conditions at evaluation time differ from those in the training data. A popular approach for increasing policy robustness to distribution shift is interactive imitation learning (i.e., DAgger and variants), where a human operator provides corrective inter… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2404.16027  [pdf, other

    cs.RO

    ORBIT-Surgical: An Open-Simulation Framework for Learning Surgical Augmented Dexterity

    Authors: Qinxi Yu, Masoud Moghani, Karthik Dharmarajan, Vincent Schorp, William Chung-Ho Panitch, Jingzhou Liu, Kush Hari, Huang Huang, Mayank Mittal, Ken Goldberg, Animesh Garg

    Abstract: Physics-based simulations have accelerated progress in robot learning for driving, manipulation, and locomotion. Yet, a fast, accurate, and robust surgical simulation environment remains a challenge. In this paper, we present ORBIT-Surgical, a physics-based surgical robot simulation framework with photorealistic rendering in NVIDIA Omniverse. We provide 14 benchmark surgical tasks for the da Vinci… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  5. arXiv:2404.05151  [pdf, other

    cs.RO

    STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs

    Authors: Kush Hari, Hansoul Kim, Will Panitch, Kishore Srinivas, Vincent Schorp, Karthik Dharmarajan, Shreya Ganti, Tara Sadjadpour, Ken Goldberg

    Abstract: We present STITCH: an augmented dexterity pipeline that performs Suture Throws Including Thread Coordination and Handoffs. STITCH iteratively performs needle insertion, thread sweeping, needle extraction, suture cinching, needle handover, and needle pose correction with failure recovery policies. We introduce a novel visual 6D needle pose estimation framework using a stereo camera pair and new sut… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  6. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  7. arXiv:2403.10494  [pdf, other

    cs.RO

    Lifelong LERF: Local 3D Semantic Inventory Monitoring Using FogROS2

    Authors: Adam Rashid, Chung Min Kim, Justin Kerr, Letian Fu, Kush Hari, Ayah Ahmad, Kaiyuan Chen, Huang Huang, Marcus Gualtieri, Michael Wang, Christian Juette, Nan Tian, Liu Ren, Ken Goldberg

    Abstract: Inventory monitoring in homes, factories, and retail stores relies on maintaining data despite objects being swapped, added, removed, or moved. We introduce Lifelong LERF, a method that allows a mobile robot with minimal compute to jointly optimize a dense language and geometric representation of its surroundings. Lifelong LERF maintains this representation over time by detecting semantic changes… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: See project webpage at: https://sites.google.com/berkeley.edu/lifelonglerf/home

  8. arXiv:2402.19249  [pdf, other

    cs.RO

    Mirage: Cross-Embodiment Zero-Shot Policy Transfer with Cross-Painting

    Authors: Lawrence Yunliang Chen, Kush Hari, Karthik Dharmarajan, Chenfeng Xu, Quan Vuong, Ken Goldberg

    Abstract: The ability to reuse collected data and transfer trained policies between robots could alleviate the burden of additional data collection and training. While existing approaches such as pretraining plus finetuning and co-training show promise, they do not generalize to robots unseen in training. Focusing on common robot arms with similar workspaces and 2-jaw grippers, we investigate the feasibilit… ▽ More

    Submitted 16 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: RSS 2024. Project page: https://robot-mirage.github.io/

  9. arXiv:2402.13232  [pdf, other

    cs.CV cs.RO

    A Touch, Vision, and Language Dataset for Multimodal Alignment

    Authors: Letian Fu, Gaurav Datta, Huang Huang, William Chung-Ho Panitch, Jaimyn Drake, Joseph Ortiz, Mustafa Mukadam, Mike Lambeta, Roberto Calandra, Ken Goldberg

    Abstract: Touch is an important sensing modality for humans, but it has not yet been incorporated into a multimodal generative language model. This is partially due to the difficulty of obtaining natural language labels for tactile data and the complexity of aligning tactile readings with both visual observations and language descriptions. As a step towards bridging that gap, this work introduces a new data… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  10. arXiv:2401.14391  [pdf, other

    cs.CV

    Rethinking Patch Dependence for Masked Autoencoders

    Authors: Letian Fu, Long Lian, Renhao Wang, Baifeng Shi, Xudong Wang, Adam Yala, Trevor Darrell, Alexei A. Efros, Ken Goldberg

    Abstract: In this work, we re-examine inter-patch dependencies in the decoding mechanism of masked autoencoders (MAE). We decompose this decoding mechanism for masked patch reconstruction in MAE into self-attention and cross-attention. Our investigations suggest that self-attention between mask patches is not essential for learning good representations. To this end, we propose a novel pretraining framework:… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  11. arXiv:2401.09419  [pdf, other

    cs.CV cs.GR

    GARField: Group Anything with Radiance Fields

    Authors: Chung Min Kim, Mingxuan Wu, Justin Kerr, Ken Goldberg, Matthew Tancik, Angjoo Kanazawa

    Abstract: Grouping is inherently ambiguous due to the multiple levels of granularity in which one can decompose a scene -- should the wheels of an excavator be considered separate or part of the whole? We present Group Anything with Radiance Fields (GARField), an approach for decomposing 3D scenes into a hierarchy of semantically meaningful groups from posed image inputs. To do this we embrace group ambigui… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Project site: https://www.garfield.studio/ First three authors contributed equally

  12. arXiv:2311.05600  [pdf, other

    cs.RO eess.SY

    FogROS2-Config: Optimizing Latency and Cost for Multi-Cloud Robot Applications

    Authors: Kaiyuan Chen, Kush Hari, Rohil Khare, Charlotte Le, Trinity Chung, Jaimyn Drake, Jeffrey Ichnowski, John Kubiatowicz, Ken Goldberg

    Abstract: Cloud service providers provide over 50,000 distinct and dynamically changing set of cloud server options. To help roboticists make cost-effective decisions, we present FogROS2-Config, an open toolkit that takes ROS2 nodes as input and automatically runs relevant benchmarks to quickly return a menu of cloud compute services that tradeoff latency and cost. Because it is infeasible to try every hard… ▽ More

    Submitted 13 May, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Published 2024 IEEE International Conference on Robotics and Automation (ICRA), Former name: FogROS2-Sky

  13. arXiv:2311.01457  [pdf, other

    cs.RO cs.AI

    Conformal Policy Learning for Sensorimotor Control Under Distribution Shifts

    Authors: Huang Huang, Satvik Sharma, Antonio Loquercio, Anastasios Angelopoulos, Ken Goldberg, Jitendra Malik

    Abstract: This paper focuses on the problem of detecting and reacting to changes in the distribution of a sensorimotor controller's observables. The key idea is the design of switching policies that can take conformal quantiles as input, which we define as conformal policy learning, that allows robots to detect distribution shifts with formal statistical guarantees. We show how to design such policies by us… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Conformal Policy Learning

  14. arXiv:2310.16951  [pdf, other

    cs.RO

    The Teenager's Problem: Efficient Garment Decluttering With Grasp Optimization

    Authors: Aviv Adler, Ayah Ahmad, Shengyin Wang, Wisdom C. Agboh, Edith Llontop, Tianshuang Qiu, Jeffrey Ichnowski, Mehmet Dogar, Thomas Kollar, Richard Cheng, Ken Goldberg

    Abstract: This paper addresses the ''Teenager's Problem'': efficiently removing scattered garments from a planar surface. As grasping and transporting individual garments is highly inefficient, we propose analytical policies to select grasp locations for multiple garments using an overhead camera. Two classes of methods are considered: depth-based, which use overhead depth data to find efficient grasps, and… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  15. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  16. arXiv:2309.07970  [pdf, other

    cs.RO cs.CV

    Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping

    Authors: Adam Rashid, Satvik Sharma, Chung Min Kim, Justin Kerr, Lawrence Chen, Angjoo Kanazawa, Ken Goldberg

    Abstract: Grasping objects by a specific part is often crucial for safety and for executing downstream tasks. Yet, learning-based grasp planners lack this behavior unless they are trained on specific object part data, making it a significant challenge to scale object diversity. Instead, we propose LERF-TOGO, Language Embedded Radiance Fields for Task-Oriented Grasping of Objects, which uses vision-language… ▽ More

    Submitted 18 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: See the project website at: lerftogo.github.io

  17. arXiv:2308.02669  [pdf, other

    cs.CV

    ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints

    Authors: Elad Richardson, Kfir Goldberg, Yuval Alaluf, Daniel Cohen-Or

    Abstract: Recent text-to-image generative models have enabled us to transform our words into vibrant, captivating imagery. The surge of personalization techniques that has followed has also allowed us to imagine unique concepts in new scenes. However, an intriguing question remains: How can we generate a new, imaginary concept that has never been seen before? In this paper, we present the task of creative t… ▽ More

    Submitted 17 December, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Project page: https://kfirgoldberg.github.io/ConceptLab/

  18. arXiv:2307.06845  [pdf, other

    cs.RO cs.AI

    Self-Supervised Learning for Interactive Perception of Surgical Thread for Autonomous Suture Tail-Shortening

    Authors: Vincent Schorp, Will Panitch, Kaushik Shivakumar, Vainavi Viswanath, Justin Kerr, Yahav Avigal, Danyal M Fer, Lionel Ott, Ken Goldberg

    Abstract: Accurate 3D sensing of suturing thread is a challenging problem in automated surgical suturing because of the high state-space complexity, thinness and deformability of the thread, and possibility of occlusion by the grippers and tissue. In this work we present a method for tracking surgical thread in 3D which is robust to occlusions and complex thread configurations, and apply it to autonomously… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: International Conference on Automation Science and Engineering (CASE) 2023, 7 pages

  19. arXiv:2307.03882  [pdf, other

    cs.RO

    The Busboy Problem: Efficient Tableware Decluttering Using Consolidation and Multi-Object Grasps

    Authors: Kishore Srinivas, Shreya Ganti, Rishi Parikh, Ayah Ahmad, Wisdom Agboh, Mehmet Dogar, Ken Goldberg

    Abstract: We present the "Busboy Problem": automating an efficient decluttering of cups, bowls, and silverware from a planar surface. As grasping and transporting individual items is highly inefficient, we propose policies to generate grasps for multiple items. We introduce the metric of Objects per Trip (OpT) carried by the robot to the collection bin to analyze the improvement seen as a result of our poli… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  20. arXiv:2306.17162  [pdf, other

    cs.RO

    Can Machines Garden? Systematically Comparing the AlphaGarden vs. Professional Horticulturalists

    Authors: Simeon Adebola, Rishi Parikh, Mark Presten, Satvik Sharma, Shrey Aeron, Ananth Rao, Sandeep Mukherjee, Tomson Qu, Christina Wistrom, Eugen Solowjow, Ken Goldberg

    Abstract: The AlphaGarden is an automated testbed for indoor polyculture farming which combines a first-order plant simulator, a gantry robot, a seed planting algorithm, plant phenotyping and tracking algorithms, irrigation sensors and algorithms, and custom pruning tools and algorithms. In this paper, we systematically compare the performance of the AlphaGarden to professional horticulturalists on the staf… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: International Conference on Robotics and Automation(ICRA) 2023 Oral

  21. arXiv:2306.17157  [pdf, other

    cs.RO

    FogROS2-SGC: A ROS2 Cloud Robotics Platform for Secure Global Connectivity

    Authors: Kaiyuan Chen, Ryan Hoque, Karthik Dharmarajan, Edith LLontop, Simeon Adebola, Jeffrey Ichnowski, John Kubiatowicz, Ken Goldberg

    Abstract: The Robot Operating System (ROS2) is the most widely used software platform for building robotics applications. FogROS2 extends ROS2 to allow robots to access cloud computing on demand. However, ROS2 and FogROS2 assume that all robots are locally connected and that each robot has full access and control of the other robots. With applications like distributed multi-robot systems, remote robot contr… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 9 pages, 8 figures

  22. arXiv:2306.15228  [pdf, other

    cs.RO cs.AI

    IIFL: Implicit Interactive Fleet Learning from Heterogeneous Human Supervisors

    Authors: Gaurav Datta, Ryan Hoque, Anrui Gu, Eugen Solowjow, Ken Goldberg

    Abstract: Imitation learning has been applied to a range of robotic tasks, but can struggle when robots encounter edge cases that are not represented in the training data (i.e., distribution shift). Interactive fleet learning (IFL) mitigates distribution shift by allowing robots to access remote human supervisors during task execution and learn from them over time, but different supervisors may demonstrate… ▽ More

    Submitted 20 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: CoRL 2023

  23. arXiv:2306.14021  [pdf, other

    cs.RO

    Push-MOG: Efficient Pushing to Consolidate Polygonal Objects for Multi-Object Grasping

    Authors: Shrey Aeron, Edith LLontop, Aviv Adler, Wisdom C. Agboh, Mehmet R Dogar, Ken Goldberg

    Abstract: Recently, robots have seen rapidly increasing use in homes and warehouses to declutter by collecting objects from a planar surface and placing them into a container. While current techniques grasp objects individually, Multi-Object Grasping (MOG) can improve efficiency by increasing the average number of objects grasped per trip (OpT). However, grasping multiple objects requires the objects to be… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 6 pages, 4 figures, CASE 2023

  24. arXiv:2306.10007  [pdf, other

    cs.RO cs.CV cs.LG

    Robot Learning with Sensorimotor Pre-training

    Authors: Ilija Radosavovic, Baifeng Shi, Letian Fu, Ken Goldberg, Trevor Darrell, Jitendra Malik

    Abstract: We present a self-supervised sensorimotor pre-training approach for robotics. Our model, called RPT, is a Transformer that operates on sequences of sensorimotor tokens. Given a sequence of camera images, proprioceptive robot states, and actions, we encode the sequence into tokens, mask out a subset, and train a model to predict the missing content from the rest. We hypothesize that if a robot can… ▽ More

    Submitted 14 December, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: CoRL 2023; Project page: https://robotic-pretrained-transformer.github.io

  25. arXiv:2305.14343  [pdf, other

    cs.LG cs.AI cs.CV

    Video Prediction Models as Rewards for Reinforcement Learning

    Authors: Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel

    Abstract: Specifying reward signals that allow agents to learn complex behaviors is a long-standing challenge in reinforcement learning. A promising approach is to extract preferences for behaviors from unlabeled videos, which are widely available on the internet. We present Video Prediction Rewards (VIPER), an algorithm that leverages pretrained video prediction models as action-free reward signals for rei… ▽ More

    Submitted 30 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 22 pages, 18 figures, 4 tables. under review

  26. arXiv:2305.01648  [pdf, other

    cs.RO

    More Than an Arm: Using a Manipulator as a Tail for Enhanced Stability in Legged Locomotion

    Authors: Huang Huang, Antonio Loquercio, Ashish Kumar, Neerja Thakkar, Ken Goldberg, Jitendra Malik

    Abstract: Is a manipulator on a legged robot a liability or an asset for locomotion? Prior works mainly designed specific controllers to account for the added payload and inertia from a manipulator. In contrast, biological systems typically benefit from additional limbs, which can simplify postural control. For instance, cats use their tails to enhance the stability of their bodies and prevent falls under d… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  27. arXiv:2303.16898  [pdf, other

    cs.RO

    Bagging by Learning to Singulate Layers Using Interactive Perception

    Authors: Lawrence Yunliang Chen, Baiyu Shi, Roy Lin, Daniel Seita, Ayah Ahmad, Richard Cheng, Thomas Kollar, David Held, Ken Goldberg

    Abstract: Many fabric handling and 2D deformable material tasks in homes and industry require singulating layers of material such as opening a bag or arranging garments for sewing. In contrast to methods requiring specialized sensing or end effectors, we use only visual observations with ordinary parallel jaw grippers. We propose SLIP: Singulating Layers using Interactive Perception, and apply SLIP to the t… ▽ More

    Submitted 1 September, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: IROS 2023

  28. arXiv:2303.09553  [pdf, other

    cs.CV cs.GR

    LERF: Language Embedded Radiance Fields

    Authors: Justin Kerr, Chung Min Kim, Ken Goldberg, Angjoo Kanazawa, Matthew Tancik

    Abstract: Humans describe the physical world using natural language to refer to specific 3D locations based on a vast range of properties: visual appearance, semantics, abstract associations, or actionable affordances. In this work we propose Language Embedded Radiance Fields (LERFs), a method for grounding language embeddings from off-the-shelf models like CLIP into NeRF, which enable these types of open-e… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Project website can be found at https://lerf.io

  29. arXiv:2303.08975  [pdf, other

    cs.RO

    HANDLOOM: Learned Tracing of One-Dimensional Objects for Inspection and Manipulation

    Authors: Vainavi Viswanath, Kaushik Shivakumar, Jainil Ajmera, Mallika Parulekar, Justin Kerr, Jeffrey Ichnowski, Richard Cheng, Thomas Kollar, Ken Goldberg

    Abstract: Tracing - estimating the spatial state of - long deformable linear objects such as cables, threads, hoses, or ropes, is useful for a broad range of tasks in homes, retail, factories, construction, transportation, and healthcare. For long deformable linear objects (DLOs or simply cables) with many (over 25) crossings, we present HANDLOOM (Heterogeneous Autoregressive Learned Deformable Linear Objec… ▽ More

    Submitted 28 October, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  30. arXiv:2302.12915  [pdf, other

    cs.RO

    Semantic Mechanical Search with Large Vision and Language Models

    Authors: Satvik Sharma, Huang Huang, Kaushik Shivakumar, Lawrence Yunliang Chen, Ryan Hoque, Brian Ichter, Ken Goldberg

    Abstract: Moving objects to find a fully-occluded target object, known as mechanical search, is a challenging problem in robotics. As objects are often organized semantically, we conjecture that semantic information about object relationships can facilitate mechanical search and reduce search time. Large pretrained vision and language models (VLMs and LLMs) have shown promise in generalizing to uncommon obj… ▽ More

    Submitted 30 October, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

  31. arXiv:2211.02293  [pdf, other

    cs.RO

    Automating Vascular Shunt Insertion with the dVRK Surgical Robot

    Authors: Karthik Dharmarajan, Will Panitch, Muyan Jiang, Kishore Srinivas, Baiyu Shi, Yahav Avigal, Huang Huang, Thomas Low, Danyal Fer, Ken Goldberg

    Abstract: Vascular shunt insertion is a fundamental surgical procedure used to temporarily restore blood flow to tissues. It is often performed in the field after major trauma. We formulate a problem of automated vascular shunt insertion and propose a pipeline to perform Automated Vascular Shunt Insertion (AVSI) using a da Vinci Research Kit. The pipeline uses a learned visual model to estimate the locus of… ▽ More

    Submitted 8 March, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Published in: IEEE International Conference on Robotics and Automation (ICRA) 2023

  32. arXiv:2210.17217  [pdf, other

    cs.RO

    AutoBag: Learning to Open Plastic Bags and Insert Objects

    Authors: Lawrence Yunliang Chen, Baiyu Shi, Daniel Seita, Richard Cheng, Thomas Kollar, David Held, Ken Goldberg

    Abstract: Thin plastic bags are ubiquitous in retail stores, healthcare, food handling, recycling, homes, and school lunchrooms. They are challenging both for perception (due to specularities and occlusions) and for manipulation (due to the dynamics of their 3D deformable structure). We formulate the task of "bagging:" manipulating common plastic shopping bags with two handles from an unstructured initial s… ▽ More

    Submitted 19 March, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: ICRA 2023

  33. arXiv:2210.11691  [pdf, other

    cs.RO

    FogROS G: Enabling Secure, Connected and Mobile Fog Robotics with Global Addressability

    Authors: Kaiyuan Chen, Jiachen Yuan, Nikhil Jha, Jeffrey Ichnowski, John Kubiatowicz, Ken Goldberg

    Abstract: Fog Robotics renders networked robots with greater mobility, on-demand compute capabilities and better energy efficiency by offloading heavy robotics workloads to nearby Edge and distant Cloud data centers. However, as the de-facto standard for implementing fog robotics applications, Robot Operating System (ROS) and its successor ROS2 fail to provide fog robots with a mobile-friendly and secure co… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 5 pages, 5 figures. Published at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022 Cloud Robotics Workshop

  34. arXiv:2210.07432  [pdf, other

    cs.LG cs.AI

    Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations

    Authors: Albert Wilcox, Ashwin Balakrishna, Jules Dedieu, Wyame Benslimane, Daniel S. Brown, Ken Goldberg

    Abstract: Providing densely shaped reward functions for RL algorithms is often exceedingly challenging, motivating the development of RL algorithms that can learn from easier-to-specify sparse reward functions. This sparsity poses new exploration challenges. One common way to address this problem is using demonstrations to provide initial signal about regions of the state space with high rewards. However, p… ▽ More

    Submitted 20 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: To be published in the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). 19 pages. 11 figures

  35. arXiv:2210.07420  [pdf, other

    cs.RO cs.AI cs.LG

    Learning to Efficiently Plan Robust Frictional Multi-Object Grasps

    Authors: Wisdom C. Agboh, Satvik Sharma, Kishore Srinivas, Mallika Parulekar, Gaurav Datta, Tianshuang Qiu, Jeffrey Ichnowski, Eugen Solowjow, Mehmet Dogar, Ken Goldberg

    Abstract: We consider a decluttering problem where multiple rigid convex polygonal objects rest in randomly placed positions and orientations on a planar surface and must be efficiently transported to a packing box using both single and multi-object grasps. Prior work considered frictionless multi-object grasping. In this paper, we introduce friction to increase the number of potential grasps for a given gr… ▽ More

    Submitted 2 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: IEEE IROS 2023

  36. arXiv:2210.01340  [pdf, other

    cs.RO

    Safe Self-Supervised Learning in Real of Visuo-Tactile Feedback Policies for Industrial Insertion

    Authors: Letian Fu, Huang Huang, Lars Berscheid, Hui Li, Ken Goldberg, Sachin Chitta

    Abstract: Industrial insertion tasks are often performed repetitively with parts that are subject to tight tolerances and prone to breakage. Learning an industrial insertion policy in real is challenging as the collision between the parts and the environment can cause slippage or breakage of the part. In this paper, we present a safe self-supervised method to learn a visuo-tactile insertion policy that is r… ▽ More

    Submitted 21 March, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  37. arXiv:2209.13706  [pdf, other

    cs.RO cs.AI cs.LG

    SGTM 2.0: Autonomously Untangling Long Cables using Interactive Perception

    Authors: Kaushik Shivakumar, Vainavi Viswanath, Anrui Gu, Yahav Avigal, Justin Kerr, Jeffrey Ichnowski, Richard Cheng, Thomas Kollar, Ken Goldberg

    Abstract: Cables are commonplace in homes, hospitals, and industrial warehouses and are prone to tangling. This paper extends prior work on autonomously untangling long cables by introducing novel uncertainty quantification metrics and actions that interact with the cable to reduce perception uncertainty. We present Sliding and Grasping for Tangle Manipulation 2.0 (SGTM 2.0), a system that autonomously unta… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  38. arXiv:2209.13042  [pdf, other

    cs.RO

    Self-Supervised Visuo-Tactile Pretraining to Locate and Follow Garment Features

    Authors: Justin Kerr, Huang Huang, Albert Wilcox, Ryan Hoque, Jeffrey Ichnowski, Roberto Calandra, Ken Goldberg

    Abstract: Humans make extensive use of vision and touch as complementary senses, with vision providing global information about the scene and touch measuring local information during manipulation without suffering from occlusions. While prior work demonstrates the efficacy of tactile sensing for precise manipulation of deformables, they typically rely on supervised, human-labeled datasets. We propose Self-S… ▽ More

    Submitted 31 July, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: RSS 2023, site: https://sites.google.com/berkeley.edu/ssvtp

  39. arXiv:2208.10552  [pdf, other

    cs.RO cs.AI

    SpeedFolding: Learning Efficient Bimanual Folding of Garments

    Authors: Yahav Avigal, Lars Berscheid, Tamim Asfour, Torsten Kröger, Ken Goldberg

    Abstract: Folding garments reliably and efficiently is a long standing challenge in robotic manipulation due to the complex dynamics and high dimensional configuration space of garments. An intuitive approach is to initially manipulate the garment to a canonical smooth configuration before folding. In this work, we develop SpeedFolding, a reliable and efficient bimanual system, which given user-defined inst… ▽ More

    Submitted 9 September, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

  40. arXiv:2208.10472  [pdf, other

    cs.RO cs.AI

    Automated Pruning of Polyculture Plants

    Authors: Mark Presten, Rishi Parikh, Shrey Aeron, Sandeep Mukherjee, Simeon Adebola, Satvik Sharma, Mark Theis, Walter Teitelbaum, Ken Goldberg

    Abstract: Polyculture farming has environmental advantages but requires substantially more pruning than monoculture farming. We present novel hardware and algorithms for automated pruning. Using an overhead camera to collect data from a physical scale garden testbed, the autonomous system utilizes a learned Plant Phenotyping convolutional neural network and a Bounding Disk Tracking algorithm to evaluate the… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: CASE 2022, 8 pages. arXiv admin note: substantial text overlap with arXiv:2111.06014

  41. arXiv:2207.07813  [pdf, other

    cs.RO cs.AI

    Autonomously Untangling Long Cables

    Authors: Vainavi Viswanath, Kaushik Shivakumar, Justin Kerr, Brijen Thananjeyan, Ellen Novoseller, Jeffrey Ichnowski, Alejandro Escontrela, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Cables are ubiquitous in many settings and it is often useful to untangle them. However, cables are prone to self-occlusions and knots, making them difficult to perceive and manipulate. The challenge increases with cable length: long cables require more complex slack management to facilitate observability and reachability. In this paper, we focus on autonomously untangling cables up to 3 meters in… ▽ More

    Submitted 31 July, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

  42. arXiv:2207.02347  [pdf, other

    cs.RO

    Mechanical Search on Shelves with Efficient Stacking and Destacking of Objects

    Authors: Huang Huang, Letian Fu, Michael Danielczuk, Chung Min Kim, Zachary Tam, Jeffrey Ichnowski, Anelia Angelova, Brian Ichter, Ken Goldberg

    Abstract: Stacking increases storage efficiency in shelves, but the lack of visibility and accessibility makes the mechanical search problem of revealing and extracting target objects difficult for robots. In this paper, we extend the lateral-access mechanical search problem to shelves with stacked items and introduce two novel policies -- Distribution Area Reduction for Stacked Scenes (DARSS) and Monte Car… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  43. arXiv:2207.00911  [pdf, other

    cs.RO

    Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies

    Authors: Satvik Sharma, Ellen Novoseller, Vainavi Viswanath, Zaynah Javed, Rishi Parikh, Ryan Hoque, Ashwin Balakrishna, Daniel S. Brown, Ken Goldberg

    Abstract: Simulation-to-reality transfer has emerged as a popular and highly successful method to train robotic control policies for a wide variety of tasks. However, it is often challenging to determine when policies trained in simulation are ready to be transferred to the physical world. Deploying policies that have been trained with very little simulation data can result in unreliable and dangerous behav… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: CASE 2022. The first two authors contributed equally. 9 pages; 5 figures; 1 table

  44. arXiv:2206.14349  [pdf, other

    cs.RO cs.AI

    Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision

    Authors: Ryan Hoque, Lawrence Yunliang Chen, Satvik Sharma, Karthik Dharmarajan, Brijen Thananjeyan, Pieter Abbeel, Ken Goldberg

    Abstract: Commercial and industrial deployments of robot fleets at Amazon, Nimble, Plus One, Waymo, and Zoox query remote human teleoperators when robots are at risk or unable to make task progress. With continual learning, interventions from the remote pool of humans can also be used to improve the robot fleet control policy over time. A central question is how to effectively allocate limited human attenti… ▽ More

    Submitted 16 November, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: CoRL 2022 Oral

  45. arXiv:2206.14176  [pdf, other

    cs.RO cs.AI cs.LG

    DayDreamer: World Models for Physical Robot Learning

    Authors: Philipp Wu, Alejandro Escontrela, Danijar Hafner, Ken Goldberg, Pieter Abbeel

    Abstract: To solve tasks in complex environments, robots need to learn from experience. Deep reinforcement learning is a common approach to robot learning but requires a large amount of trial and error to learn, limiting its deployment in the physical world. As a consequence, many advances in robot learning rely on simulators. On the other hand, learning inside of simulators fails to capture the complexity… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Website: https://danijar.com/daydreamer

  46. arXiv:2206.08921  [pdf, other

    cs.RO

    Efficiently Learning Single-Arm Fling Motions to Smooth Garments

    Authors: Lawrence Yunliang Chen, Huang Huang, Ellen Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, Thomas Kollar, Ken Goldberg

    Abstract: Recent work has shown that 2-arm "fling" motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment g… ▽ More

    Submitted 24 September, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to 2022 International Symposium on Robotics Research (ISRR)

  47. arXiv:2206.08607  [pdf, other

    cs.RO

    Optimal Shelf Arrangement to Minimize Robot Retrieval Time

    Authors: Lawrence Yunliang Chen, Huang Huang, Michael Danielczuk, Jeffrey Ichnowski, Ken Goldberg

    Abstract: Shelves are commonly used to store objects in homes, stores, and warehouses. We formulate the problem of Optimal Shelf Arrangement (OSA), where the goal is to optimize the arrangement of objects on a shelf for access time given an access frequency and movement cost for each object. We propose OSA-MIP, a mixed-integer program (MIP), show that it finds an optimal solution for OSA under certain condi… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE)

  48. arXiv:2206.00229  [pdf, other

    cs.RO cs.AI

    Multi-Object Grasping in the Plane

    Authors: Wisdom C. Agboh, Jeffrey Ichnowski, Ken Goldberg, Mehmet R. Dogar

    Abstract: We consider a novel problem where multiple rigid convex polygonal objects rest in randomly placed positions and orientations on a planar surface visible from an overhead camera. The objective is to efficiently grasp and transport all objects into a bin using multi-object push-grasps, where multiple objects are pushed together to facilitate multi-object grasping. We provide necessary conditions for… ▽ More

    Submitted 21 September, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Accepted to the International Symposium on Robotics Research (ISRR), 2022

  49. arXiv:2205.09778  [pdf, other

    cs.RO

    FogROS2: An Adaptive Platform for Cloud and Fog Robotics Using ROS 2

    Authors: Jeffrey Ichnowski, Kaiyuan Chen, Karthik Dharmarajan, Simeon Adebola, Michael Danielczuk, Vıctor Mayoral-Vilches, Nikhil Jha, Hugo Zhan, Edith LLontop, Derek Xu, Camilo Buscaron, John Kubiatowicz, Ion Stoica, Joseph Gonzalez, Ken Goldberg

    Abstract: Mobility, power, and price points often dictate that robots do not have sufficient computing power on board to run contemporary robot algorithms at desired rates. Cloud computing providers such as AWS, GCP, and Azure offer immense computing power and increasingly low latency on demand, but tapping into that power from a robot is non-trivial. We present FogROS2, an open-source platform to facilitat… ▽ More

    Submitted 24 April, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

  50. arXiv:2205.07147  [pdf

    cs.DC

    The Sky Above The Clouds

    Authors: Sarah Chasins, Alvin Cheung, Natacha Crooks, Ali Ghodsi, Ken Goldberg, Joseph E. Gonzalez, Joseph M. Hellerstein, Michael I. Jordan, Anthony D. Joseph, Michael W. Mahoney, Aditya Parameswaran, David Patterson, Raluca Ada Popa, Koushik Sen, Scott Shenker, Dawn Song, Ion Stoica

    Abstract: Technology ecosystems often undergo significant transformations as they mature. For example, telephony, the Internet, and PCs all started with a single provider, but in the United States each is now served by a competitive market that uses comprehensive and universal technology standards to provide compatibility. This white paper presents our view on how the cloud ecosystem, barely over fifteen ye… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: 35 pages