Showing 1–43 of 43 results for author: Xie, A

  1. arXiv:2406.18559  [pdf, other]

    cs.HC cs.AI cs.CV cs.LG

    Revision Matters: Generative Design Guided by Revision Edits

    Authors: Tao Li, Chin-Yi Cheng, Amber Xie, Gang Li, Yang Li

    Abstract: Layout design, such as user interface or graphical layout in general, is fundamentally an iterative revision process. Through revising a design repeatedly, the designer converges on an ideal layout. In this paper, we investigate how revision edits from human designer can benefit a multimodal generative model. To do so, we curate an expert dataset that traces how human designers iteratively edit an…

    Submitted 27 May, 2024; originally announced June 2024.

  2. arXiv:2406.16838  [pdf, other]

    cs.CL cs.LG

    From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

    Authors: Sean Welleck, Amanda Bertsch, Matthew Finlayson, Hailey Schoelkopf, Alex Xie, Graham Neubig, Ilia Kulikov, Zaid Harchaoui

    Abstract: One of the most striking findings in modern research on large language models (LLMs) is that scaling up compute during training leads to better results. However, less attention has been given to the benefits of scaling compute during inference. This survey focuses on these inference-time approaches. We explore three areas under a unified mathematical formalism: token-level generation algorithms, m…

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.15313  [pdf, other]

    cs.IR cs.CL

    STARD: A Chinese Statute Retrieval Dataset with Real Queries Issued by Non-professionals

    Authors: Weihang Su, Yiran Hu, Anzhe Xie, Qingyao Ai, Zibing Que, Ning Zheng, Yun Liu, Weixing Shen, Yiqun Liu

    Abstract: Statute retrieval aims to find relevant statutory articles for specific queries. This process is the basis of a wide range of legal applications such as legal advice, automated judicial decisions, legal document drafting, etc. Existing statute retrieval benchmarks focus on formal and professional queries from sources like bar exams and legal case documents, thereby neglecting non-professional quer…

    Submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2406.11698  [pdf, other]

    cs.CL

    Meta Reasoning for Large Language Models

    Authors: Peizhong Gao, Ao Xie, Shaoguang Mao, Wenshan Wu, Yan Xia, Haipeng Mi, Furu Wei

    Abstract: We introduce Meta-Reasoning Prompting (MRP), a novel and efficient system prompting method for large language models (LLMs) inspired by human meta-reasoning. Traditional in-context learning-based reasoning techniques, such as Tree-of-Thoughts, show promise but lack consistent state-of-the-art performance across diverse tasks due to their specialized nature. MRP addresses this limitation by guiding…

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2405.13026  [pdf, other]

    cs.CL cs.AI

    Leveraging Human Revisions for Improving Text-to-Layout Models

    Authors: Amber Xie, Chin-Yi Cheng, Forrest Huang, Yang Li

    Abstract: Learning from human feedback has shown success in aligning large, pretrained models with human values. Prior works have mostly focused on learning from high-level labels, such as preferences between pairs of model outputs. On the other hand, many domains could benefit from more involved, detailed feedback, such as revisions, explanations, and reasoning of human users. Our work proposes using nuanc…

    Submitted 15 May, 2024; originally announced May 2024.

  6. arXiv:2404.00566  [pdf, other]

    cs.SE cs.CL

    CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks

    Authors: Yiqing Xie, Alex Xie, Divyanshu Sheth, Pengfei Liu, Daniel Fried, Carolyn Rose

    Abstract: To facilitate evaluation of code generation systems across diverse scenarios, we present CodeBenchGen, a framework to create scalable execution-based benchmarks that only requires light guidance from humans. Specifically, we leverage a large language model (LLM) to convert an arbitrary piece of code into an evaluation example, including test cases for execution-based evaluation. We illustrate the…

    Submitted 7 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  7. arXiv:2403.12945  [pdf, other]

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  8. arXiv:2403.05110  [pdf, other]

    cs.RO cs.AI cs.LG

    Efficient Data Collection for Robotic Manipulation via Compositional Generalization

    Authors: Jensen Gao, Annie Xie, Ted Xiao, Chelsea Finn, Dorsa Sadigh

    Abstract: Data collection has become an increasingly important problem in robotic manipulation, yet there still lacks much understanding of how to effectively collect data to facilitate broad generalization. Recent works on large-scale robotic data collection typically vary many environmental factors of variation (e.g., object types, table textures) during data collection, to cover a diverse range of scenar…

    Submitted 21 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: RSS 2024

  9. arXiv:2403.02882  [pdf, other]

    eess.SY cs.LG cs.RO

    Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization

    Authors: Yuan Lin, Antai Xie, Xiao Liu

    Abstract: Most of the current studies on autonomous vehicle decision-making and control tasks based on reinforcement learning are conducted in simulated environments. The training and testing of these studies are carried out under rule-based microscopic traffic flow, with little consideration of migrating them to real or near-real environments to test their performance. It may lead to a degradation in perfo…

    Submitted 19 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  10. arXiv:2402.07872  [pdf, other]

    cs.RO cs.CL cs.CV cs.LG

    PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

    Authors: Soroush Nasiriany, Fei Xia, Wenhao Yu, Ted Xiao, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee, Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter

    Abstract: Vision language models (VLMs) have shown impressive capabilities across a variety of tasks, from logical reasoning to visual understanding. This opens the door to richer interaction with the world, for example robotic control. However, VLMs produce only textual outputs, while robotic control and other spatial tasks require outputting continuous coordinates, actions, or trajectories. How can we ena…

    Submitted 12 February, 2024; originally announced February 2024.

  11. arXiv:2402.03761  [pdf]

    eess.IV cs.LG q-bio.TO

    Deep Learning-Based Correction and Unmixing of Hyperspectral Images for Brain Tumor Surgery

    Authors: David Black, Jaidev Gill, Andrew Xie, Benoit Liquet, Antonio Di leva, Walter Stummer, Eric Suero Molina

    Abstract: Hyperspectral Imaging (HSI) for fluorescence-guided brain tumor resection enables visualization of differences between tissues that are not distinguishable to humans. This augmentation can maximize brain tumor resection, improving patient outcomes. However, much of the processing in HSI uses simplified linear methods that are unable to capture the non-linear, wavelength-dependent phenomena that mu…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 20 pages, 8 figures, 3 tables - Under Review

  12. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  13. arXiv:2310.03294  [pdf, other]

    cs.LG cs.AI cs.DC

    DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training

    Authors: Dacheng Li, Rulin Shao, Anze Xie, Eric P. Xing, Xuezhe Ma, Ion Stoica, Joseph E. Gonzalez, Hao Zhang

    Abstract: FlashAttention (Dao, 2023) effectively reduces the quadratic peak memory usage to linear in training transformer-based large language models (LLMs) on a single GPU. In this paper, we introduce DISTFLASHATTN, a distributed memory-efficient attention mechanism optimized for long-context LLMs training. We propose three key techniques: token-level workload balancing, overlapping key-value communicatio…

    Submitted 31 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  14. arXiv:2310.01387  [pdf, other]

    cs.CL

    It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk

    Authors: Amanda Bertsch, Alex Xie, Graham Neubig, Matthew R. Gormley

    Abstract: Minimum Bayes Risk (MBR) decoding is a method for choosing the outputs of a machine learning system based not on the output with the highest probability, but the output with the lowest risk (expected error) among multiple candidates. It is a simple but powerful method: for an additional cost at inference time, MBR provides reliable several-point improvements across metrics for a wide variety of ta…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Under submission

  15. arXiv:2309.11206  [pdf, other]

    cs.CL cs.AI

    Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering

    Authors: Yike Wu, Nan Hu, Sheng Bi, Guilin Qi, Jie Ren, Anhuan Xie, Wei Song

    Abstract: Despite their competitive performance on knowledge-intensive tasks, large language models (LLMs) still have limitations in memorizing all world knowledge especially long tail knowledge. In this paper, we study the KG-augmented language model approach for solving the knowledge graph question answering (KGQA) task that requires rich world knowledge. Existing work has shown that retrieving KG knowled…

    Submitted 21 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  16. arXiv:2308.16893  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Language-Conditioned Path Planning

    Authors: Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James

    Abstract: Contact is at the core of robotic manipulation. At times, it is desired (e.g. manipulation and grasping), and at times, it is harmful (e.g. when avoiding obstacles). However, traditional path planning algorithms focus solely on collision-free paths, limiting their applicability in contact-rich tasks. To address this limitation, we propose the domain of Language-Conditioned Path Planning, where con…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Conference on Robot Learning, 2023

  17. arXiv:2308.12270  [pdf, other]

    cs.LG cs.AI

    Language Reward Modulation for Pretraining Reinforcement Learning

    Authors: Ademi Adeniji, Amber Xie, Carmelo Sferrazza, Younggyo Seo, Stephen James, Pieter Abbeel

    Abstract: Using learned reward functions (LRFs) as a means to solve sparse-reward reinforcement learning (RL) tasks has yielded some steady progress in task-complexity through the years. In this work, we question whether today's LRFs are best-suited as a direct replacement for task rewards. Instead, we propose leveraging the capabilities of LRFs as a pretraining signal for RL. Concretely, we propose…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Code available at https://github.com/ademiadeniji/lamp

  18. arXiv:2307.03659  [pdf, other]

    cs.RO cs.AI

    Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation

    Authors: Annie Xie, Lisa Lee, Ted Xiao, Chelsea Finn

    Abstract: What makes generalization hard for imitation learning in visual robotic manipulation? This question is difficult to approach at face value, but the environment from the perspective of a robot can often be decomposed into enumerable factors of variation, such as the lighting conditions or the placement of the camera. Empirically, generalization to some of these factors have presented a greater obst…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Project webpage at https://sites.google.com/view/generalization-gap

  19. arXiv:2306.14892  [pdf, other]

    cs.LG cs.AI

    Supervised Pretraining Can Learn In-Context Reinforcement Learning

    Authors: Jonathan N. Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill

    Abstract: Large transformer models trained on diverse datasets have shown a remarkable ability to learn in-context, achieving high few-shot performance on tasks they were not explicitly trained to solve. In this paper, we study the in-context learning capabilities of transformers in decision-making problems, i.e., reinforcement learning (RL) for bandits and Markov decision processes. To do so, we introduce…

    Submitted 26 June, 2023; originally announced June 2023.

  20. arXiv:2303.06902  [pdf, other]

    q-bio.BM cs.LG

    Molecular Property Prediction by Semantic-invariant Contrastive Learning

    Authors: Ziqiao Zhang, Ailin Xie, Jihong Guan, Shuigeng Zhou

    Abstract: Contrastive learning have been widely used as pretext tasks for self-supervised pre-trained molecular representation learning models in AI-aided drug design and discovery. However, exiting methods that generate molecular views by noise-adding operations for contrastive learning may face the semantic inconsistency problem, which leads to false positive pairs and consequently poor prediction perform…

    Submitted 13 March, 2023; originally announced March 2023.

  21. arXiv:2302.07541  [pdf, other]

    q-bio.BM cs.LG

    Activity Cliff Prediction: Dataset and Benchmark

    Authors: Ziqiao Zhang, Bangyi Zhao, Ailin Xie, Yatao Bian, Shuigeng Zhou

    Abstract: Activity cliffs (ACs), which are generally defined as pairs of structurally similar molecules that are active against the same bio-target but significantly different in the binding potency, are of great importance to drug discovery. Up to date, the AC prediction problem, i.e., to predict whether a pair of molecules exhibit the AC relationship, has not yet been fully explored. In this paper, we fir…

    Submitted 15 February, 2023; originally announced February 2023.

  22. arXiv:2211.11319  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

    Authors: Ajay Jain, Amber Xie, Pieter Abbeel

    Abstract: Diffusion models have shown impressive results in text-to-image synthesis. Using massive datasets of captioned images, diffusion models learn to generate raster images of highly diverse objects and scenes. However, designers frequently use vector representations of images like Scalable Vector Graphics (SVGs) for digital icons or art. Vector graphics can be scaled to any size, and are compact. We s…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Project webpage: https://ajayj.com/vectorfusion

  23. arXiv:2210.14721  [pdf, other]

    cs.LG cs.AI

    Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data

    Authors: John So, Amber Xie, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali Agha-mohammadi, Pieter Abbeel, Stephen James

    Abstract: Autonomous driving is complex, requiring sophisticated 3D scene understanding, localization, mapping, and control. Rather than explicitly modelling and fusing each of these components, we instead consider an end-to-end approach via reinforcement learning (RL). However, collecting exploration driving data in the real world is impractical and dangerous. While training in simulation and deploying vis…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: CoRL 2022 Paper

  24. arXiv:2210.13446  [pdf, other]

    cs.RO

    Flying Trot Control Method for Quadruped Robot Based on Trajectory Planning

    Authors: Hongge Wang, Hui Chai, Bin Chen, Aizhen Xie, Rui Song, Bo Su

    Abstract: An intuitive control method for the flying trot, which combines offline trajectory planning with real-time balance control, is presented. The motion features of running animals in the vertical direction were analysed using the spring-load-inverted-pendulum (SLIP) model, and the foot trajectory of the robot was planned, so the robot could run similar to an animal capable of vertical flight, accordi…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 30 pages, 20 figures, journal

  25. arXiv:2210.10765  [pdf, other]

    cs.LG

    When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning

    Authors: Annie Xie, Fahim Tajwar, Archit Sharma, Chelsea Finn

    Abstract: A long-term goal of reinforcement learning is to design agents that can autonomously interact and learn in the world. A critical challenge to such autonomy is the presence of irreversible states which require external assistance to recover from, such as when a robot arm has pushed an object off of a table. While standard agents require constant monitoring to decide when to intervene, we aim to des…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  26. arXiv:2210.07426  [pdf, other]

    cs.LG cs.AI cs.RO

    Skill-Based Reinforcement Learning with Intrinsic Reward Matching

    Authors: Ademi Adeniji, Amber Xie, Pieter Abbeel

    Abstract: While unsupervised skill discovery has shown promise in autonomously acquiring behavioral primitives, there is still a large methodological disconnect between task-agnostic skill pretraining and downstream, task-aware finetuning. We present Intrinsic Reward Matching (IRM), which unifies these two phases of learning via the $\textit{skill discriminator}$, a pretraining model component often discard…

    Submitted 25 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 16 pages

  27. arXiv:2209.07423  [pdf, other]

    q-bio.BM cs.LG

    Can Pre-trained Models Really Learn Better Molecular Representations for AI-aided Drug Discovery?

    Authors: Ziqiao Zhang, Yatao Bian, Ailin Xie, Pengju Han, Long-Kai Huang, Shuigeng Zhou

    Abstract: Self-supervised pre-training is gaining increasingly more popularity in AI-aided drug discovery, leading to more and more pre-trained models with the promise that they can extract better feature representations for molecules. Yet, the quality of learned representations have not been fully explored. In this work, inspired by the two phenomena of Activity Cliffs (ACs) and Scaffold Hopping (SH) in tr…

    Submitted 21 August, 2022; originally announced September 2022.

  28. arXiv:2207.03037  [pdf]

    cs.CL

    Sensitivity Analysis on Transferred Neural Architectures of BERT and GPT-2 for Financial Sentiment Analysis

    Authors: Tracy Qian, Andy Xie, Camille Bruckmann

    Abstract: The explosion in novel NLP word embedding and deep learning techniques has induced significant endeavors into potential applications. One of these directions is in the financial sector. Although there is a lot of work done in state-of-the-art models like GPT and BERT, there are relatively few works on how well these methods perform through fine-tuning after being pre-trained, as well as info on ho…

    Submitted 6 July, 2022; originally announced July 2022.

  29. arXiv:2202.07013  [pdf, other]

    cs.LG cs.AI cs.RO

    Robust Policy Learning over Multiple Uncertainty Sets

    Authors: Annie Xie, Shagun Sodhani, Chelsea Finn, Joelle Pineau, Amy Zhang

    Abstract: Reinforcement learning (RL) agents need to be robust to variations in safety-critical environments. While system identification methods provide a way to infer the variation from online experience, they can fail in settings where fast identification is not possible. Another dominant approach is robust RL which produces a policy that can handle worst-case scenarios, but these methods are generally d…

    Submitted 4 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Project webpage at https://sites.google.com/view/sirsa-public/home

  30. arXiv:2111.14059  [pdf, other]

    cs.CV cs.CY cs.LG

    NoFADE: Analyzing Diminishing Returns on CO2 Investment

    Authors: Andre Fu, Justin Tran, Andy Xie, Jonathan Spraggett, Elisa Ding, Chang-Won Lee, Kanav Singla, Mahdi S. Hosseini, Konstantinos N. Plataniotis

    Abstract: Climate change continues to be a pressing issue that currently affects society at-large. It is important that we as a society, including the Computer Vision (CV) community take steps to limit our impact on the environment. In this paper, we (a) analyze the effect of diminishing returns on CV methods, and (b) propose a \textit{``NoFADE''}: a novel entropy-based metric to quantify model--dataset--co…

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: Climate Change with Machine Learning workshop at 35th Conference on Neural Information Processing Systems (NeurIPS2021-CCAI)

  31. arXiv:2110.08229  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    Influencing Towards Stable Multi-Agent Interactions

    Authors: Woodrow Z. Wang, Andy Shih, Annie Xie, Dorsa Sadigh

    Abstract: Learning in multi-agent environments is difficult due to the non-stationarity introduced by an opponent's or partner's changing behaviors. Instead of reactively adapting to the other agent's (opponent or partner) behavior, we propose an algorithm to proactively influence the other agent's strategy to stabilize -- which can restrain the non-stationarity caused by the other agent. We learn a low-dim…

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: 15 pages, 5 figures, Published as an Oral at Conference on Robot Learning (CoRL) 2021

  32. arXiv:2109.09180  [pdf, other]

    cs.LG cs.AI cs.RO

    Lifelong Robotic Reinforcement Learning by Retaining Experiences

    Authors: Annie Xie, Chelsea Finn

    Abstract: Multi-task learning ideally allows robots to acquire a diverse repertoire of useful skills. However, many multi-task reinforcement learning efforts assume the robot can collect data from all tasks at all times. In reality, the tasks that the robot learns arrive sequentially, depending on the user and the robot's current environment. In this work, we study a practical sequential multi-task RL probl…

    Submitted 6 April, 2022; v1 submitted 19 September, 2021; originally announced September 2021.

    Comments: Supplementary website at https://sites.google.com/view/retain-experience/

  33. arXiv:2109.00115  [pdf, other]

    eess.IV cs.CV cs.LG

    Uncertainty Quantified Deep Learning for Predicting Dice Coefficient of Digital Histopathology Image Segmentation

    Authors: Sambuddha Ghosal, Audrey Xie, Pratik Shah

    Abstract: Deep learning models (DLMs) can achieve state of the art performance in medical image segmentation and classification tasks. However, DLMs that do not provide feedback for their predictions such as Dice coefficients (Dice) have limited deployment potential in real world clinical settings. Uncertainty estimates can increase the trust of these automated systems by identifying predictions that need f…

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: Submitted to the 2022 IEEE International Symposium on Biomedical Imaging (ISBI) scientific conference

    MSC Class: 68T07; 54H30 ACM Class: I.2.1; G.3

  34. arXiv:2011.06619  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning Latent Representations to Influence Multi-Agent Interaction

    Authors: Annie Xie, Dylan P. Losey, Ryan Tolsma, Chelsea Finn, Dorsa Sadigh

    Abstract: Seamlessly interacting with humans or robots is hard because these agents are non-stationary. They update their policy in response to the ego agent's behavior, and the ego agent must anticipate these changes to co-adapt. Inspired by humans, we recognize that robots do not need to explicitly model every low-level action another agent will make; instead, we can capture the latent strategy of other a…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Conference on Robot Learning (CoRL) 2020. Supplementary website at https://sites.google.com/view/latent-strategies/

  35. Efficient Learning of Control Policies for Robust Quadruped Bounding using Pretrained Neural Networks

    Authors: Zhicheng Wang, Anqiao Li, Yixiao Zheng, Anhuan Xie, Zhibin Li, Jun Wu, Qiuguo Zhu

    Abstract: Bounding is one of the important gaits in quadrupedal locomotion for negotiating obstacles. The authors proposed an effective approach that can learn robust bounding gaits more efficiently despite its large variation in dynamic body movements. The authors first pretrained the neural network (NN) based on data from a robot operated by conventional model based controllers, and then further optimised…

    Submitted 29 October, 2023; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: 12 pages

    Journal ref: IET Cyber-Systems and Robotics 2022 4(4):331-338

  36. arXiv:2006.10701  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Deep Reinforcement Learning amidst Lifelong Non-Stationarity

    Authors: Annie Xie, James Harrison, Chelsea Finn

    Abstract: As humans, our goals and our environment are persistently changing throughout our lifetime based on our experiences, actions, and internal and external drives. In contrast, typical reinforcement learning problem set-ups consider decision processes that are stationary across episodes. Can we develop reinforcement learning algorithms that can cope with the persistent change in the former, more reali…

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: supplementary website at https://sites.google.com/stanford.edu/lilac/

  37. arXiv:1912.12773  [pdf, other]

    cs.LG cs.RO stat.ML

    Learning Predictive Models From Observation and Interaction

    Authors: Karl Schmeckpeper, Annie Xie, Oleh Rybkin, Stephen Tian, Kostas Daniilidis, Sergey Levine, Chelsea Finn

    Abstract: Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes. However, learning a model that captures the dynamics of complex skills represents a major challenge: if the agent needs a good model to perform these skills, it migh…

    Submitted 29 December, 2019; originally announced December 2019.

  38. arXiv:1909.13371  [pdf, other]

    cs.LG stat.ML

    Gradient Descent: The Ultimate Optimizer

    Authors: Kartik Chandra, Audrey Xie, Jonathan Ragan-Kelley, Erik Meijer

    Abstract: Working with any gradient-based machine learning algorithm involves the tedious task of tuning the optimizer's hyperparameters, such as its step size. Recent work has shown how the step size can itself be optimized alongside the model parameters by manually deriving expressions for "hypergradients" ahead of time. We show how to automatically compute hypergradients with a simple and elegant modif…

    Submitted 14 October, 2022; v1 submitted 29 September, 2019; originally announced September 2019.

  39. arXiv:1904.05538  [pdf, other]

    cs.RO cs.AI cs.LG

    Improvisation through Physical Understanding: Using Novel Objects as Tools with Visual Foresight

    Authors: Annie Xie, Frederik Ebert, Sergey Levine, Chelsea Finn

    Abstract: Machine learning techniques have enabled robots to learn narrow, yet complex tasks and also perform broad, yet simple skills with a wide variety of objects. However, learning a model that can both perform complex tasks and generalize to previously unseen objects and goals remains a significant challenge. We study this challenge in the context of "improvisational" tool use: a robot is presented wit…

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: Videos available at https://sites.google.com/view/gvf-tool

  40. arXiv:1812.00568  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control

    Authors: Frederik Ebert, Chelsea Finn, Sudeep Dasari, Annie Xie, Alex Lee, Sergey Levine

    Abstract: Deep reinforcement learning (RL) algorithms can learn complex robotic skills from raw sensory inputs, but have yet to achieve the kind of broad generalization and applicability demonstrated by deep learning methods in supervised domains. We present a deep RL method that is practical for real-world robotics tasks, such as robotic manipulation, and generalizes effectively to never-before-seen tasks…

    Submitted 3 December, 2018; originally announced December 2018.

  41. arXiv:1810.00482  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Few-Shot Goal Inference for Visuomotor Learning and Planning

    Authors: Annie Xie, Avi Singh, Sergey Levine, Chelsea Finn

    Abstract: Reinforcement learning and planning methods require an objective or reward function that encodes the desired behavior. Yet, in practice, there is a wide range of scenarios where an objective is difficult to provide programmatically, such as tasks with visual observations involving unknown object positions or deformable objects. In these cases, prior methods use engineered problem-specific solution…

    Submitted 30 September, 2018; originally announced October 2018.

    Comments: Videos available at https://sites.google.com/view/few-shot-goals

  42. arXiv:1802.01557  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning

    Authors: Tianhe Yu, Chelsea Finn, Annie Xie, Sudeep Dasari, Tianhao Zhang, Pieter Abbeel, Sergey Levine

    Abstract: Humans and animals are capable of learning a new behavior by observing others perform the skill just once. We consider the problem of allowing a robot to do the same -- learning from a raw video pixels of a human, even when there is substantial domain shift in the perspective, environment, and embodiment between the robot and the observed human. Prior approaches to this problem have hand-specified…

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: First two authors contributed equally. Video available at https://sites.google.com/view/daml

  43. arXiv:1602.06686  [pdf, ps, other]

    cs.NI cs.DC

    Designing a Disaster-resilient Network with Software Defined Networking

    Authors: An Xie, Xiaoliang Wang, Guido Maier, Sanglu Lu

    Abstract: With the wide deployment of network facilities and the increasing requirement of network reliability, the disruptive event like natural disaster, power outage or malicious attack has become a non-negligible threat to the current communication network. Such disruptive event can simultaneously destroy all devices in a specific geographical area and affect many network based applications for a long t…

    Submitted 23 February, 2016; v1 submitted 22 February, 2016; originally announced February 2016.