Showing 1–50 of 54 results for author: Jayaraman, D

  1. arXiv:2406.01967  [pdf, other]

    cs.RO cs.AI cs.LG

    DrEureka: Language Model Guided Sim-To-Real Transfer

    Authors: Yecheng Jason Ma, William Liang, Hung-Ju Wang, Sam Wang, Yuke Zhu, Linxi Fan, Osbert Bastani, Dinesh Jayaraman

    Abstract: Transferring policies learned in simulation to the real world is a promising strategy for acquiring robot skills at scale. However, sim-to-real approaches typically rely on manual design and tuning of the task reward function as well as the simulation physics parameters, rendering the process slow and human-labor intensive. In this paper, we investigate using Large Language Models (LLMs) to automa…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Robotics: Science and Systems (RSS) 2024. Project website and open-source code: https://eureka-research.github.io/dr-eureka/

  2. arXiv:2405.15916  [pdf, other]

    cs.CV cs.RO

    Recasting Generic Pretrained Vision Transformers As Object-Centric Scene Encoders For Manipulation Policies

    Authors: Jianing Qian, Anastasios Panagopoulos, Dinesh Jayaraman

    Abstract: Generic re-usable pre-trained image representation encoders have become a standard component of methods for many computer vision tasks. As visual representations for robots however, their utility has been limited, leading to a recent wave of efforts to pre-train robotics-specific image encoders that are better suited to robotic tasks than their generic counterparts. We propose Scene Objects From T…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted to International Conference on Robotics and Automation (ICRA) 2024

  3. arXiv:2405.14853  [pdf, other]

    cs.LG cs.AI cs.RO

    Privileged Sensing Scaffolds Reinforcement Learning

    Authors: Edward S. Hu, James Springer, Oleh Rybkin, Dinesh Jayaraman

    Abstract: We need to look at our shoelaces as we first learn to tie them but having mastered this skill, can do it from touch alone. We call this phenomenon "sensory scaffolding": observation streams that are not needed by a master might yet aid a novice learner. We consider such sensory scaffolding setups for training artificial agents. For example, a robot arm may need to be deployed with just a low-cost,…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: ICLR 2024 Spotlight version

  4. arXiv:2404.13474  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Composing Pre-Trained Object-Centric Representations for Robotics From "What" and "Where" Foundation Models

    Authors: Junyao Shi, Jianing Qian, Yecheng Jason Ma, Dinesh Jayaraman

    Abstract: There have recently been large advances both in pre-training visual representations for robotic control and segmenting unknown category objects in general images. To leverage these for improved robot learning, we propose $\textbf{POCR}$, a new framework for building pre-trained object-centric representations for robotic control. Building on theories of "what-where" representations in psychology an…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: ICRA 2024. Project website: https://sites.google.com/view/pocr

  5. arXiv:2403.12945  [pdf, other]

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  6. arXiv:2403.00729  [pdf, other]

    cs.CV cs.RO

    Can Transformers Capture Spatial Relations between Objects?

    Authors: Chuan Wen, Dinesh Jayaraman, Yang Gao

    Abstract: Spatial relationships between objects represent key scene information for humans to understand and interact with the world. To study the capability of current computer vision systems to recognize physically grounded spatial relations, we start by proposing precise relation definitions that permit consistently annotating a benchmark dataset. Despite the apparent simplicity of this task relative to…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 21 pages, 8 figures, ICLR 2024

  7. arXiv:2312.04036  [pdf, other]

    cs.CV cs.LG

    DiffusionPhase: Motion Diffusion in Frequency Domain

    Authors: Weilin Wan, Yiming Huang, Shutong Wu, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu

    Abstract: In this study, we introduce a learning-based method for generating high-quality human motion sequences from text descriptions (e.g., "A person walks forward"). Existing techniques struggle with motion diversity and smooth transitions in generating arbitrary-length motion sequences, due to limited text-to-motion datasets and the pose representations used that often lack expressiveness or compactne…

    Submitted 6 December, 2023; originally announced December 2023.

  8. arXiv:2311.17135  [pdf, other]

    cs.CV cs.GR

    TLControl: Trajectory and Language Control for Human Motion Synthesis

    Authors: Weilin Wan, Zhiyang Dou, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu

    Abstract: Controllable human motion synthesis is essential for applications in AR/VR, gaming, movies, and embodied AI. Existing methods often focus solely on either language or full trajectory control, lacking precision in synthesizing motions aligned with user-specified trajectories, especially for multi-joint control. To address these issues, we present TLControl, a new method for realistic human motion s…

    Submitted 12 December, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

  9. arXiv:2310.12931  [pdf, other]

    cs.RO cs.AI cs.LG

    Eureka: Human-Level Reward Design via Coding Large Language Models

    Authors: Yecheng Jason Ma, William Liang, Guanzhi Wang, De-An Huang, Osbert Bastani, Dinesh Jayaraman, Yuke Zhu, Linxi Fan, Anima Anandkumar

    Abstract: Large Language Models (LLMs) have excelled as high-level semantic planners for sequential decision-making tasks. However, harnessing them to learn complex low-level manipulation tasks, such as dexterous pen spinning, remains an open problem. We bridge this fundamental gap and present Eureka, a human-level reward design algorithm powered by LLMs. Eureka exploits the remarkable zero-shot generation,…

    Submitted 30 April, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. Project website and open-source code: https://eureka-research.github.io/

  10. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  11. arXiv:2310.08581  [pdf, other]

    cs.RO cs.CV

    Universal Visual Decomposer: Long-Horizon Manipulation Made Easy

    Authors: Zichen Zhang, Yunshuang Li, Osbert Bastani, Abhishek Gupta, Dinesh Jayaraman, Yecheng Jason Ma, Luca Weihs

    Abstract: Real-world robotic tasks stretch over extended horizons and encompass multiple stages. Learning long-horizon manipulation tasks, however, is a long-standing challenge, and demands decomposing the overarching task into several manageable subtasks to facilitate policy learning and generalization to unseen tasks. Prior task decomposition methods require task-specific knowledge, are computationally in…

    Submitted 12 October, 2023; originally announced October 2023.

  12. arXiv:2310.06171  [pdf, other]

    cs.LG cs.AI cs.RO

    Memory-Consistent Neural Networks for Imitation Learning

    Authors: Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman, James Weimer, Insup Lee

    Abstract: Imitation learning considerably simplifies policy synthesis compared to alternative approaches by exploiting access to expert demonstrations. For such imitation policies, errors away from the training samples are particularly critical. Even rare slip-ups in the policy action outputs can compound quickly over time, since they lead to unfamiliar future states where the policy is still more likely to…

    Submitted 16 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. 26 pages (9 main pages)

  13. arXiv:2306.00958  [pdf, other]

    cs.RO cs.AI cs.LG

    LIV: Language-Image Representations and Rewards for Robotic Control

    Authors: Yecheng Jason Ma, William Liang, Vaidehi Som, Vikash Kumar, Amy Zhang, Osbert Bastani, Dinesh Jayaraman

    Abstract: We present Language-Image Value learning (LIV), a unified objective for vision-language representation and reward learning from action-free videos with text annotations. Exploiting a novel connection between dual reinforcement learning and mutual information contrastive learning, the LIV objective trains a multi-modal representation that implicitly encodes a universal value function for tasks spec…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Extended version of ICML 2023 camera-ready; Project website: https://penn-pal-lab.github.io/LIV/

  14. arXiv:2305.12663  [pdf, other]

    cs.LG cs.AI

    TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching

    Authors: Yecheng Jason Ma, Kausik Sivakumar, Jason Yan, Osbert Bastani, Dinesh Jayaraman

    Abstract: Standard model-based reinforcement learning (MBRL) approaches fit a transition model of the environment to all past experience, but this wastes model capacity on data that is irrelevant for policy improvement. We instead propose a new "transition occupancy matching" (TOM) objective for MBRL model learning: a model is good to the extent that the current policy experiences the same distribution of t…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: L4DC 2023; Project website: https://penn-pal-lab.github.io/TOM/

  15. arXiv:2305.10424  [pdf, other]

    cs.CV cs.LG

    ZeroFlow: Scalable Scene Flow via Distillation

    Authors: Kyle Vedder, Neehar Peri, Nathaniel Chodosh, Ishan Khatri, Eric Eaton, Dinesh Jayaraman, Yang Liu, Deva Ramanan, James Hays

    Abstract: Scene flow estimation is the task of describing the 3D motion field between temporally successive point clouds. State-of-the-art methods use strong priors and test-time optimization techniques, but require on the order of tens of seconds to process full-size point clouds, making them unusable as computer vision primitives for real-time applications such as open world object detection. Feedforward…

    Submitted 14 March, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to ICLR 2024. 9 pages, 4 pages of citations, 6 pages of Supplemental. Project page with data releases is at http://vedder.io/zeroflow.html

  16. arXiv:2303.13002  [pdf, other]

    cs.LG cs.AI cs.RO

    Planning Goals for Exploration

    Authors: Edward S. Hu, Richard Chang, Oleh Rybkin, Dinesh Jayaraman

    Abstract: Dropped into an unknown environment, what should an agent do to quickly learn about the environment and how to accomplish diverse tasks within it? We address this question within the goal-conditioned reinforcement learning paradigm, by identifying how the agent should set its goals at training time to maximize exploration. We propose "Planning Exploratory Goals" (PEG), a method that sets goals for…

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Camera-ready version for ICLR 2023 Spotlight

  17. arXiv:2302.08463  [pdf, other]

    cs.RO

    Dynamic Grasping with a Learned Meta-Controller

    Authors: Yinsen Jia, Jingxi Xu, Dinesh Jayaraman, Shuran Song

    Abstract: Grasping moving objects is a challenging task that requires multiple submodules such as object pose predictor, arm motion planner, etc. Each submodule operates under its own set of meta-parameters. For example, how far the pose predictor should look into the future (i.e., look-ahead time) and the maximum amount of time the motion planner can spend planning a motion (i.e., time budget). Many previo…

    Submitted 26 March, 2024; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: 9 pages

  18. arXiv:2212.08961  [pdf, other]

    cs.LG cs.RO

    Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

    Authors: Kun Huang, Edward S. Hu, Dinesh Jayaraman

    Abstract: Physical interactions can often help reveal information that is not readily apparent. For example, we may tug at a table leg to evaluate whether it is built well, or turn a water bottle upside down to check that it is watertight. We propose to train robots to acquire such interactive behaviors automatically, for the purpose of evaluating the result of an attempted robotic skill execution. These ev…

    Submitted 17 December, 2022; originally announced December 2022.

    Comments: CoRL 2022

  19. arXiv:2210.15908  [pdf, other]

    cs.CV cs.RO

    Long-HOT: A Modular Hierarchical Approach for Long-Horizon Object Transport

    Authors: Sriram Narayanan, Dinesh Jayaraman, Manmohan Chandraker

    Abstract: We address key challenges in long-horizon embodied exploration and navigation by proposing a new object transport task and a novel modular framework for temporally extended navigation. Our first contribution is the design of a novel Long-HOT environment focused on deep exploration and long-horizon planning where the agent is required to efficiently find and pick up target objects to be carried and…

    Submitted 28 October, 2022; originally announced October 2022.

  20. arXiv:2210.00030  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

    Authors: Yecheng Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Osbert Bastani, Vikash Kumar, Amy Zhang

    Abstract: Reward and representation learning are two long-standing challenges for learning an expanding set of robot manipulation skills from sensory observations. Given the inherent cost and scarcity of in-domain, task-specific robot data, learning from large, diverse, offline human videos has emerged as a promising path towards acquiring a generally useful visual representation for control; however, how t…

    Submitted 6 March, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: ICLR 2023, Notable-Top-25% (Spotlight). Project website: https://sites.google.com/view/vip-rl

  21. arXiv:2209.12136  [pdf, other]

    cs.CV cs.RO

    Vision-based Perimeter Defense via Multiview Pose Estimation

    Authors: Elijah S. Lee, Giuseppe Loianno, Dinesh Jayaraman, Vijay Kumar

    Abstract: Previous studies in the perimeter defense game have largely focused on the fully observable setting where the true player states are known to all players. However, this is unrealistic for practical implementation since defenders may have to perceive the intruders and estimate their states. In this work, we study the perimeter defense game in a photo-realistic simulator and the real world, requirin…

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: 7 pages, 10 figures

  22. arXiv:2206.10816  [pdf, other]

    cs.LG cs.CV cs.RO

    Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming

    Authors: Chuan Wen, Jianing Qian, Jierui Lin, Jiaye Teng, Dinesh Jayaraman, Yang Gao

    Abstract: Across applications spanning supervised classification and sequential control, deep learning has been reported to find "shortcut" solutions that fail catastrophically under minor changes in the data distribution. In this paper, we show empirically that DNNs can be coaxed to avoid poor shortcuts by providing an additional "priming" feature computed from key input features, usually a coarse output e…

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 28 pages, 13 figures, ICML 2022

  23. arXiv:2206.03023  [pdf, other]

    cs.LG cs.AI

    How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via $f$-Advantage Regression

    Authors: Yecheng Jason Ma, Jason Yan, Dinesh Jayaraman, Osbert Bastani

    Abstract: Offline goal-conditioned reinforcement learning (GCRL) promises general-purpose skill learning in the form of reaching diverse goals from purely offline datasets. We propose $\textbf{Go}$al-conditioned $f$-$\textbf{A}$dvantage $\textbf{R}$egression (GoFAR), a novel regression-based offline GCRL algorithm derived from a state-occupancy matching perspective; the key intuition is that the goal-reachi…

    Submitted 10 November, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022. Project website: https://jasonma2016.github.io/GoFAR/

  24. arXiv:2202.02433  [pdf, other]

    cs.LG cs.AI

    Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching

    Authors: Yecheng Jason Ma, Andrew Shen, Dinesh Jayaraman, Osbert Bastani

    Abstract: We propose State Matching Offline DIstribution Correction Estimation (SMODICE), a novel and versatile regression-based offline imitation learning (IL) algorithm derived via state-occupancy matching. We show that the SMODICE objective admits a simple optimization procedure through an application of Fenchel duality and an analytic solution in tabular MDPs. Without requiring access to expert actions,…

    Submitted 18 June, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: ICML 2022. Project website: https://sites.google.com/view/smodice/home

  25. arXiv:2201.07372  [pdf, other]

    cs.LG cs.AI

    Prospective Learning: Principled Extrapolation to the Future

    Authors: Ashwin De Silva, Rahul Ramesh, Lyle Ungar, Marshall Hussain Shuler, Noah J. Cowan, Michael Platt, Chen Li, Leyla Isik, Seung-Eon Roh, Adam Charles, Archana Venkataraman, Brian Caffo, Javier J. How, Justus M Kebschull, John W. Krakauer, Maxim Bichuch, Kaleab Alemayehu Kinfu, Eva Yezerets, Dinesh Jayaraman, Jong M. Shin, Soledad Villar, Ian Phillips, Carey E. Priebe, Thomas Hartung, Michael I. Miller, et al. (18 additional authors not shown)

    Abstract: Learning is a process which can update decision rules, based on past experience, such that future performance improves. Traditionally, machine learning is often evaluated under the assumption that the future will be identical to the past in distribution or change adversarially. But these assumptions can be either too optimistic or pessimistic for many problems in the real world. Real world scenari…

    Submitted 13 July, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted at the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

  26. arXiv:2112.07701  [pdf, other]

    cs.LG

    Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

    Authors: Yecheng Jason Ma, Andrew Shen, Osbert Bastani, Dinesh Jayaraman

    Abstract: Reinforcement Learning (RL) agents in the real world must satisfy safety constraints in addition to maximizing a reward objective. Model-based RL algorithms hold promise for reducing unsafe real-world actions: they may synthesize policies that obey all constraints using simulated samples from a learned model. However, imperfect models can result in real-world constraint violations even for actions…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: AAAI 2022

  27. arXiv:2108.11944  [pdf, other]

    cs.CV

    Probabilistic Modeling for Human Mesh Recovery

    Authors: Nikos Kolotouros, Georgios Pavlakos, Dinesh Jayaraman, Kostas Daniilidis

    Abstract: This paper focuses on the problem of 3D human reconstruction from 2D evidence. Although this is an inherently ambiguous problem, the majority of recent works avoid the uncertainty modeling and typically regress a single estimate for a given input. In contrast to that, in this work, we propose to embrace the reconstruction ambiguity and we recast the problem as learning a mapping from the input to…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: ICCV 2021. Project page: https://www.seas.upenn.edu/~nkolot/projects/prohmr

  28. arXiv:2107.09047  [pdf, other]

    cs.LG cs.CV cs.RO

    Know Thyself: Transferable Visual Control Policies Through Robot-Awareness

    Authors: Edward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman

    Abstract: Training visual control policies from scratch on a new robot typically requires generating large amounts of robot-specific data. How might we leverage data previously collected on another robot to reduce or even completely remove this need for robot-specific data? We propose a "robot-aware control" paradigm that achieves this by exploiting readily available knowledge about the robot. We then insta…

    Submitted 17 October, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Updated to ICLR 2022 version

  29. arXiv:2107.06106  [pdf, other]

    cs.LG

    Conservative Offline Distributional Reinforcement Learning

    Authors: Yecheng Jason Ma, Dinesh Jayaraman, Osbert Bastani

    Abstract: Many reinforcement learning (RL) problems in practice are offline, learning purely from observational data. A key challenge is how to ensure the learned policy is safe, which requires quantifying the risk associated with different actions. In the online setting, distributional RL algorithms do so by learning the distribution over returns (i.e., cumulative rewards) instead of the expected return; b…

    Submitted 26 October, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021

  30. arXiv:2106.06452  [pdf, other]

    cs.LG cs.RO

    Keyframe-Focused Visual Imitation Learning

    Authors: Chuan Wen, Jierui Lin, Jianing Qian, Yang Gao, Dinesh Jayaraman

    Abstract: Imitation learning trains control policies by mimicking pre-recorded expert demonstrations. In partially observable settings, imitation policies must rely on observation histories, but many seemingly paradoxical results show better performance for policies that only access the most recent observation. Recent solutions ranging from causal graph learning to deep information bottlenecks have shown pr…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: 14 pages, 7 figures, ICML 2021

  31. arXiv:2104.00827  [pdf, other]

    cs.LG cs.CV cs.RO

    How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control?

    Authors: Jingxi Xu, Bruce Lee, Nikolai Matni, Dinesh Jayaraman

    Abstract: The difficulty of optimal control problems has classically been characterized in terms of system properties such as minimum eigenvalues of controllability/observability gramians. We revisit these characterizations in the context of the increasing popularity of data-driven techniques like reinforcement learning (RL), and in control settings where input observations are high-dimensional images and t…

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted to L4DC 2021

  32. arXiv:2011.15084  [pdf, other]

    cs.CV cs.LG

    Likelihood-Based Diverse Sampling for Trajectory Forecasting

    Authors: Yecheng Jason Ma, Jeevana Priya Inala, Dinesh Jayaraman, Osbert Bastani

    Abstract: Forecasting complex vehicle and pedestrian multi-modal distributions requires powerful probabilistic approaches. Normalizing flows (NF) have recently emerged as an attractive tool to model such distributions. However, a key drawback is that independent samples drawn from a flow model often do not adequately capture all the modes in the underlying distribution. We propose Likelihood-Based Diverse S…

    Submitted 14 September, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: ICCV 2021

  33. arXiv:2010.14876  [pdf, other]

    cs.LG cs.RO

    Fighting Copycat Agents in Behavioral Cloning from Observation Histories

    Authors: Chuan Wen, Jierui Lin, Trevor Darrell, Dinesh Jayaraman, Yang Gao

    Abstract: Imitation learning trains policies to map from input observations to the actions that an expert would choose. In this setting, distribution shift frequently exacerbates the effect of misattributing expert actions to nuisance correlates among the observed variables. We observe that a common instance of this causal confusion occurs in partially observed settings when expert actions are strongly corr…

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Published at NeurIPS 2020. 9 pages (excluding references and appendices)

  34. arXiv:2010.09034  [pdf, other]

    cs.RO cs.LG

    Model-Based Inverse Reinforcement Learning from Visual Demonstrations

    Authors: Neha Das, Sarah Bechtle, Todor Davchev, Dinesh Jayaraman, Akshara Rai, Franziska Meier

    Abstract: Scaling model-based inverse reinforcement learning (IRL) to real robotic manipulation tasks with unknown dynamics remains an open problem. The key challenges lie in learning good dynamics models, developing algorithms that scale to high-dimensional state-spaces and being able to learn from both visual and proprioceptive demonstrations. In this work, we present a gradient-based inverse reinforcemen…

    Submitted 6 January, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: Accepted at the 4th Conference on Robot Learning (CoRL 2020), Cambridge MA, USA

    Report number: PMLR 155:1930-1942

    Journal ref: Proceedings of the 2020 Conference on Robot Learning, PMLR 155:1930-1942, 2021

  35. arXiv:2008.06622  [pdf, other]

    cs.LG stat.ML

    Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

    Authors: Jesse Zhang, Brian Cheung, Chelsea Finn, Sergey Levine, Dinesh Jayaraman

    Abstract: Reinforcement learning (RL) in real-world safety-critical target settings like urban driving is hazardous, imperiling the RL agent, other agents, and the environment. To overcome this difficulty, we propose a "safety-critical adaptation" task setting: an agent first trains in non-safety-critical "source" environments such as in a simulator, before it adapts to the target environment where failures…

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: 15 pages, 8 figures, ICML 2020. Website with code: https://sites.google.com/berkeley.edu/carl

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, PMLR 119:11055-11065, 2020

  36. arXiv:2006.13205  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors

    Authors: Karl Pertsch, Oleh Rybkin, Frederik Ebert, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

    Abstract: The ability to predict and plan into the future is fundamental for agents acting in the world. To reach a faraway goal, we predict trajectories at multiple timescales, first devising a coarse plan towards the goal and then gradually filling in details. In contrast, current learning approaches for visual prediction and planning fail on long-horizon tasks as they generate predictions (1) without con…

    Submitted 27 November, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Project page: orybkin.github.io/video-gcp. KP and OR contributed equally

  37. arXiv:2005.14679  [pdf, other]

    cs.RO cs.LG eess.SY stat.ML

    DIGIT: A Novel Design for a Low-Cost Compact High-Resolution Tactile Sensor with Application to In-Hand Manipulation

    Authors: Mike Lambeta, Po-Wei Chou, Stephen Tian, Brian Yang, Benjamin Maloon, Victoria Rose Most, Dave Stroud, Raymond Santos, Ahmad Byagowi, Gregg Kammerer, Dinesh Jayaraman, Roberto Calandra

    Abstract: Despite decades of research, general purpose in-hand manipulation remains one of the unsolved challenges of robotics. One of the contributing factors that limit current robotic manipulation systems is the difficulty of precisely sensing contact forces -- sensing and reasoning about contact forces are crucial to accurately control interactions with the environment. As a step towards enabling better…

    Submitted 29 May, 2020; originally announced May 2020.

    Comments: 8 pages, published in the IEEE Robotics and Automation Letters (RA-L)

  38. arXiv:2001.02192  [pdf, other]

    cs.CV cs.AI

    An Exploration of Embodied Visual Exploration

    Authors: Santhosh K. Ramakrishnan, Dinesh Jayaraman, Kristen Grauman

    Abstract: Embodied computer vision considers perception for robots in novel, unstructured environments. Of particular importance is the embodied visual exploration problem: how might a robot equipped with a camera scope out a new environment? Despite the progress thus far, many basic questions pertinent to this problem remain unanswered: (i) What does it mean for an agent to explore its environment well? (i…

    Submitted 20 August, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: 30 main + 21 appendix pages, 23 figures

  39. arXiv:1912.13360  [pdf, other]

    cs.RO cs.CV

    Morphology-Agnostic Visual Robotic Control

    Authors: Brian Yang, Dinesh Jayaraman, Glen Berseth, Alexei Efros, Sergey Levine

    Abstract: Existing approaches for visuomotor robotic control typically require characterizing the robot in advance by calibrating the camera or performing system identification. We propose MAVRIC, an approach that works with minimal prior knowledge of the robot's morphology, and requires only a camera view containing the robot and its environment and an unknown control interface. MAVRIC revolves around a mu…

    Submitted 31 December, 2019; originally announced December 2019.

  40. arXiv:1912.05510  [pdf, other]

    cs.LG cs.AI stat.ML

    SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

    Authors: Glen Berseth, Daniel Geng, Coline Devin, Nicholas Rhinehart, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

    Abstract: Every living organism struggles against disruptive environmental forces to carve out and maintain an orderly niche. We propose that such a struggle to achieve and preserve order might offer a principle for the emergence of useful behaviors in artificial agents. We formalize this idea into an unsupervised reinforcement learning method called surprise minimizing reinforcement learning (SMiRL). SMiRL…

    Submitted 7 February, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: ICLR 2021

    ACM Class: G.3

  41. Emergence of Exploratory Look-Around Behaviors through Active Observation Completion

    Authors: Santhosh K. Ramakrishnan, Dinesh Jayaraman, Kristen Grauman

    Abstract: Standard computer vision systems assume access to intelligently captured inputs (e.g., photos from a human photographer), yet autonomously capturing good observations is a major challenge in itself. We address the problem of learning to look around: how can an agent learn to acquire informative visual observations? We propose a reinforcement learning solution, where the agent is rewarded for reduc…

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: Main paper: 7 figures; supplementary: 6 figures. Published in Science Robotics 2019

  42. arXiv:1905.11979  [pdf, other]

    cs.LG stat.ML

    Causal Confusion in Imitation Learning

    Authors: Pim de Haan, Dinesh Jayaraman, Sergey Levine

    Abstract: Behavioral cloning reduces policy learning to supervised learning by training a discriminative model to predict expert actions given observations. Such discriminative models are non-causal: the training procedure is unaware of the causal structure of the interaction between the expert and the environment. We point out that ignoring causality is particularly damaging because of the distributional s…

    Submitted 4 November, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Published at NeurIPS 2019. 9 pages, plus references and appendices

  43. arXiv:1905.07447  [pdf, other]

    cs.RO cs.CV cs.LG

    REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning

    Authors: Brian Yang, Jesse Zhang, Vitchyr Pong, Sergey Levine, Dinesh Jayaraman

    Abstract: Standardized evaluation measures have aided in the progress of machine learning approaches in disciplines such as computer vision and machine translation. In this paper, we make the case that robotic learning would also benefit from benchmarking, and present the "REPLAB" platform for benchmarking vision-based manipulation tasks. REPLAB is a reproducible and self-contained hardware stack (robot arm…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: Extended version of paper accepted to ICRA 2019

  44. arXiv:1903.04128  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Manipulation by Feel: Touch-Based Control with Deep Predictive Models

    Authors: Stephen Tian, Frederik Ebert, Dinesh Jayaraman, Mayur Mudigonda, Chelsea Finn, Roberto Calandra, Sergey Levine

    Abstract: Touch sensing is widely acknowledged to be important for dexterous robotic manipulation, but exploiting tactile sensing for continuous, non-prehensile manipulation is challenging. General purpose control techniques that are able to effectively leverage tactile sensing as well as accurate physics models of contacts and forces remain largely elusive, and it is unclear how to even specify a desired b…

    Submitted 11 March, 2019; originally announced March 2019.

    Comments: Accepted to ICRA 2019

  45. arXiv:1808.07784  [pdf, other]

    cs.CV cs.LG stat.ML

    Time-Agnostic Prediction: Predicting Predictable Video Frames

    Authors: Dinesh Jayaraman, Frederik Ebert, Alexei A. Efros, Sergey Levine

    Abstract: Prediction is arguably one of the most basic functions of an intelligent system. In general, the problem of predicting events in the future or between two waypoints is exceedingly difficult. However, most phenomena naturally pass through relatively predictable bottlenecks---while we cannot predict the precise trajectory of a robot arm between being at rest and holding an object up, we can be certa…

    Submitted 23 October, 2018; v1 submitted 23 August, 2018; originally announced August 2018.

    Comments: 8 pages, plus appendices

  46. arXiv:1805.11085  [pdf, other]

    cs.RO cs.LG stat.ML

    More Than a Feeling: Learning to Grasp and Regrasp using Vision and Touch

    Authors: Roberto Calandra, Andrew Owens, Dinesh Jayaraman, Justin Lin, Wenzhen Yuan, Jitendra Malik, Edward H. Adelson, Sergey Levine

    Abstract: For humans, the process of grasping an object relies heavily on rich tactile feedback. Most recent robotic grasping work, however, has been based only on visual input, and thus cannot easily benefit from feedback after initiating contact. In this paper, we investigate how a robot can learn to use tactile information to iteratively and efficiently adjust its grasp. To this end, we propose an end-to…

    Submitted 26 July, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: 8 pages. Published in IEEE Robotics and Automation Letters (RA-L). Website: https://sites.google.com/view/more-than-a-feeling

  47. arXiv:1709.00507  [pdf, other]

    cs.CV

    Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks

    Authors: Dinesh Jayaraman, Kristen Grauman

    Abstract: It is common to implicitly assume access to intelligently captured inputs (e.g., photos from a human photographer), yet autonomously capturing good observations is itself a major challenge. We address the problem of learning to look around: if a visual agent has the ability to voluntarily acquire new views to observe its environment, how can it learn efficient exploratory behaviors to acquire info…

    Submitted 21 December, 2017; v1 submitted 1 September, 2017; originally announced September 2017.

  48. arXiv:1709.00505  [pdf, other]

    cs.CV

    ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

    Authors: Dinesh Jayaraman, Ruohan Gao, Kristen Grauman

    Abstract: We introduce an unsupervised feature learning approach that embeds 3D shape information into a single-view image representation. The main idea is a self-supervised training objective that, given only a single 2D image, requires all unseen views of the object to be predictable from learned features. We implement this idea as an encoder-decoder convolutional neural network. The network maps an input…

    Submitted 30 July, 2018; v1 submitted 1 September, 2017; originally announced September 2017.

    Comments: To appear at ECCV 2018

  49. arXiv:1612.02335  [pdf, other]

    cs.CV

    Pano2Vid: Automatic Cinematography for Watching 360$^{\circ}$ Videos

    Authors: Yu-Chuan Su, Dinesh Jayaraman, Kristen Grauman

    Abstract: We introduce the novel task of Pano2Vid $-$ automatic cinematography in panoramic 360$^{\circ}$ videos. Given a 360$^{\circ}$ video, the goal is to direct an imaginary camera to virtually capture natural-looking normal field-of-view (NFOV) video. By selecting "where to look" within the panorama at each time step, Pano2Vid aims to free both the videographer and the end viewer from the task of deter…

    Submitted 7 December, 2016; originally announced December 2016.

  50. arXiv:1612.00500  [pdf, other]

    cs.CV

    Object-Centric Representation Learning from Unlabeled Videos

    Authors: Ruohan Gao, Dinesh Jayaraman, Kristen Grauman

    Abstract: Supervised (pre-)training currently yields state-of-the-art performance for representation learning for visual recognition, yet it comes at the cost of (1) intensive manual annotations and (2) an inherent restriction in the scope of data relevant for learning. In this work, we explore unsupervised feature learning from unlabeled video. We introduce a novel object-centric approach to temporal coher…

    Submitted 1 December, 2016; originally announced December 2016.

    Comments: In Proceedings of the Asian Conference on Computer Vision (ACCV), 2016