Showing 1–9 of 9 results for author: Agia, C

  1. arXiv:2407.08735  [pdf, other]

    cs.RO cs.AI eess.SY

    Real-Time Anomaly Detection and Reactive Planning with Large Language Models

    Authors: Rohan Sinha, Amine Elhafsi, Christopher Agia, Matthew Foutter, Edward Schmerling, Marco Pavone

    Abstract: Foundation models, e.g., large language models (LLMs), trained on internet-scale data possess zero-shot generalization capabilities that make them a promising technology towards detecting and mitigating out-of-distribution failure modes of robotic systems. Fully realizing this promise, however, poses two challenges: (i) mitigating the considerable computational expense of these models such that th…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted to Robotics: Science and Systems (RSS) 2024

  2. arXiv:2403.12945  [pdf, other]

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  3. arXiv:2401.11371  [pdf, other]

    cs.RO eess.SY

    Modeling Considerations for Developing Deep Space Autonomous Spacecraft and Simulators

    Authors: Christopher Agia, Guillem Casadesus Vila, Saptarshi Bandyopadhyay, David S. Bayard, Kar-Ming Cheung, Charles H. Lee, Eric Wood, Ian Aenishanslin, Steven Ardito, Lorraine Fesq, Marco Pavone, Issa A. D. Nesnas

    Abstract: To extend the limited scope of autonomy used in prior missions for operation in distant and complex environments, there is a need to further develop and mature autonomy that jointly reasons over multiple subsystems, which we term system-level autonomy. System-level autonomy establishes situational awareness that resolves conflicting information across subsystems, which may necessitate the refineme…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Project page: https://sites.google.com/stanford.edu/spacecraft-models. 20 pages, 8 figures. Accepted to the IEEE Conference on Aerospace (AeroConf) 2024

    ACM Class: I.2.8; I.2.9; I.6.1; I.6.3; I.6.4; I.6.6; J.2

  4. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  5. arXiv:2305.11307  [pdf, other]

    cs.RO

    Semantic Anomaly Detection with Large Language Models

    Authors: Amine Elhafsi, Rohan Sinha, Christopher Agia, Edward Schmerling, Issa Nesnas, Marco Pavone

    Abstract: As robots acquire increasingly sophisticated skills and see increasingly complex and varied environments, the threat of an edge case or anomalous failure is ever present. For example, Tesla cars have seen interesting failure modes ranging from autopilot disengagements due to inactive traffic lights carried by trucks to phantom braking caused by images of stop signs on roadside billboards. These sy…

    Submitted 11 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  6. Text2Motion: From Natural Language Instructions to Feasible Plans

    Authors: Kevin Lin, Christopher Agia, Toki Migimatsu, Marco Pavone, Jeannette Bohg

    Abstract: We propose Text2Motion, a language-based planning framework enabling robots to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural language instruction, our framework constructs both a task- and motion-level plan that is verified to reach inferred symbolic goals. Text2Motion uses feasibility heuristics encoded in Q-functions of a library of skills to guide task…

    Submitted 26 November, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Published in Autonomous Robots, Special Issue: Large Language Models in Robotics 2023. Project page: https://sites.google.com/stanford.edu/text2motion. First two authors contributed equally

  7. STAP: Sequencing Task-Agnostic Policies

    Authors: Christopher Agia, Toki Migimatsu, Jiajun Wu, Jeannette Bohg

    Abstract: Advances in robotic skill acquisition have made it possible to build general-purpose libraries of learned skills for downstream manipulation tasks. However, naively executing these skills one after the other is unlikely to succeed without accounting for dependencies between actions prevalent in long-horizon plans. We present Sequencing Task-Agnostic Policies (STAP), a scalable framework for traini…

    Submitted 31 May, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Video: https://drive.google.com/file/d/1zp3qFeZLACNPsGLLP7p6q9X1tuA_PGEo/view. Project page: https://sites.google.com/stanford.edu/stap. 12 pages, 7 figures. In proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2023. The first two authors contributed equally

  8. arXiv:2207.05006  [pdf, other]

    cs.RO cs.AI cs.LG

    TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs

    Authors: Christopher Agia, Krishna Murthy Jatavallabhula, Mohamed Khodeir, Ondrej Miksik, Vibhav Vineet, Mustafa Mukadam, Liam Paull, Florian Shkurti

    Abstract: 3D scene graphs (3DSGs) are an emerging description; unifying symbolic, topological, and metric scene representations. However, typical 3DSGs contain hundreds of objects and symbols even for small environments; rendering task planning on the full graph impractical. We construct TASKOGRAPHY, the first large-scale robotic task planning benchmark over 3DSGs. While most benchmarking efforts in this ar…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Video: https://www.youtube.com/watch?v=mM4v5hP4LdA&ab_channel=KrishnaMurthy . Project page: https://taskography.github.io/ . 18 pages, 7 figures. In proceedings of Conference on Robot Learning (CoRL) 2021. The first two authors contributed equally

    ACM Class: I.2.8; I.2.9; I.2.10; I.2.6

    Journal ref: PMLR 164 (2022) 46-58

  9. arXiv:2012.09242  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds

    Authors: Ran Cheng, Christopher Agia, Yuan Ren, Xinhai Li, Liu Bingbing

    Abstract: With the increasing reliance of self-driving and similar robotic systems on robust 3D vision, the processing of LiDAR scans with deep convolutional neural networks has become a trend in academia and industry alike. Prior attempts on the challenging Semantic Scene Completion task - which entails the inference of dense 3D structure and associated semantic labels from "sparse" representations - have…

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: 14 pages

    Journal ref: PMLR 155 (2021) 2148-2161