Showing 1–22 of 22 results for author: Bauza, M

  1. arXiv:2402.11450  [pdf, other]

    cs.RO

    Learning to Learn Faster from Human Feedback with Language Model Predictive Control

    Authors: Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil Joshi, Ben Jyenis, Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore, et al. (25 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to perform new tasks. However, these capabilities (driven by in-context learning) are limited to short-term interactions, where users' feedback remains relevant for o…

    Submitted 31 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  2. arXiv:2307.13133  [pdf, other]

    cs.RO cs.CV cs.LG

    simPLE: a visuotactile method learned in simulation to precisely pick, localize, regrasp, and place objects

    Authors: Maria Bauza, Antonia Bronars, Yifan Hou, Ian Taylor, Nikhil Chavan-Dafle, Alberto Rodriguez

    Abstract: Existing robotic systems have a clear tension between generality and precision. Deployed solutions for robotic manipulation tend to fall into the paradigm of one robot solving a single task, lacking precise generalization, i.e., the ability to solve many tasks without compromising on precision. This paper explores solutions for precise and general pick-and-place. In precise pick-and-place, i.e. ki…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 33 pages, 6 figures, 2 tables, submitted to Science Robotics

  3. arXiv:2306.11706  [pdf, other]

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de…

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  4. arXiv:2303.07997  [pdf, other]

    cs.RO cs.AI cs.CV

    FingerSLAM: Closed-loop Unknown Object Localization and Reconstruction from Visuo-tactile Feedback

    Authors: Jialiang Zhao, Maria Bauza, Edward H. Adelson

    Abstract: In this paper, we address the problem of using visuo-tactile feedback for 6-DoF localization and 3D reconstruction of unknown in-hand objects. We propose FingerSLAM, a closed-loop factor graph-based pose estimator that combines local tactile sensing at finger-tip and global vision sensing from a wrist-mount camera. FingerSLAM is constructed with two constituent pose estimators: a multi-pass refine…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Submitted and accepted to 2023 IEEE International Conference on Robotics and Automation (ICRA 2023)

  5. Tac2Pose: Tactile Object Pose Estimation from the First Touch

    Authors: Maria Bauza, Antonia Bronars, Alberto Rodriguez

    Abstract: In this paper, we present Tac2Pose, an object-specific approach to tactile pose estimation from the first touch for known objects. Given the object geometry, we learn a tailored perception model in simulation that estimates a probability distribution over possible object poses given a tactile observation. To do so, we simulate the contact shapes that a dense set of object poses would produce on th…

    Submitted 14 September, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Submitted to IJRR, 22 pages + Appendix, 11 figures

  6. arXiv:2012.05205  [pdf, other]

    cs.RO cs.CV cs.LG

    Tactile Object Pose Estimation from the First Touch with Geometric Contact Rendering

    Authors: Maria Bauza, Eric Valls, Bryan Lim, Theo Sechopoulos, Alberto Rodriguez

    Abstract: In this paper, we present an approach to tactile pose estimation from the first touch for known objects. First, we create an object-agnostic map from real tactile observations to contact shapes. Next, for a new object with known geometry, we learn a tailored perception model completely in simulation. To do so, we simulate the contact shapes that a dense set of object poses would produce on the sen…

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: CoRL 2020, 5 figures + 2 in appendix. Video: https://youtu.be/2ygtSJTmo08

  7. arXiv:2011.07044  [pdf, other]

    cs.RO

    Tactile SLAM: Real-time inference of shape and pose from planar pushing

    Authors: Sudharshan Suresh, Maria Bauza, Kuan-Ting Yu, Joshua G. Mangelson, Alberto Rodriguez, Michael Kaess

    Abstract: Tactile perception is central to robot manipulation in unstructured environments. However, it requires contact, and a mature implementation must infer object models while also accounting for the motion induced by the interaction. In this work, we present a method to estimate both object shape and pose in real-time from a stream of tactile measurements. This is applied towards tactile exploration o…

    Submitted 26 March, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: Camera-ready version to be presented at the 2021 IEEE International Conference on Robotics and Automation (ICRA 2021). For associated video file, see https://youtu.be/wdyagx5MM40

  8. arXiv:2009.10623  [pdf, other]

    cs.LG cs.CV stat.ML

    Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time

    Authors: Ferran Alet, Maria Bauza, Kenji Kawaguchi, Nurullah Giray Kuru, Tomas Lozano-Perez, Leslie Pack Kaelbling

    Abstract: From CNNs to attention mechanisms, encoding inductive biases into neural networks has been a fruitful source of improvement in machine learning. Adding auxiliary losses to the main objective function is a general way of encoding biases that can help networks learn better representations. However, since auxiliary losses are minimized only on training data, they suffer from the same generalization g…

    Submitted 6 September, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: NeurIPS 2020 workshops on Interpretable Inductive Biases and Meta-learning

  9. arXiv:1911.05071  [pdf, other]

    cs.CV cs.LG cs.RO

    Experience-Embedded Visual Foresight

    Authors: Lin Yen-Chen, Maria Bauza, Phillip Isola

    Abstract: Visual foresight gives an agent a window into the future, which it can use to anticipate events before they happen and plan strategic behavior. Although impressive results have been achieved on video prediction in constrained settings, these models fail to generalize when confronted with unfamiliar real-world objects. In this paper, we tackle the generalization problem via fast adaptation, where w…

    Submitted 17 November, 2019; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: CoRL 2019. Project website: http://yenchenlin.me/evf/

  10. arXiv:1911.03112  [pdf, other]

    cs.RO cs.CV cs.LG

    Accurate Vision-based Manipulation through Contact Reasoning

    Authors: Alina Kloss, Maria Bauza, Jiajun Wu, Joshua B. Tenenbaum, Alberto Rodriguez, Jeannette Bohg

    Abstract: Planning contact interactions is one of the core challenges of many robotic tasks. Optimizing contact locations while taking dynamics into account is computationally costly and, in environments that are only partially observable, executing contact-based tasks often suffers from low accuracy. We present an approach that addresses these two challenges for the problem of vision-based manipulation. Fi…

    Submitted 17 April, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: accepted at ICRA 2020

  11. arXiv:1910.00618  [pdf, other]

    cs.RO cs.CV cs.LG eess.SY

    Omnipush: accurate, diverse, real-world dataset of pushing dynamics with RGB-D video

    Authors: Maria Bauza, Ferran Alet, Yen-Chen Lin, Tomas Lozano-Perez, Leslie P. Kaelbling, Phillip Isola, Alberto Rodriguez

    Abstract: Pushing is a fundamental robotic skill. Existing work has shown how to exploit models of pushing to achieve a variety of tasks, including grasping under uncertainty, in-hand manipulation and clearing clutter. Such models, however, are approximate, which limits their applicability. Learning-based methods can reason directly from raw sensory data with accuracy, and have the potential to generalize t…

    Submitted 19 August, 2021; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: IROS 2019, 8 pages, 7 figures

  12. arXiv:1904.10944  [pdf, other]

    cs.RO cs.LG

    Tactile Mapping and Localization from High-Resolution Tactile Imprints

    Authors: Maria Bauza, Oleguer Canal, Alberto Rodriguez

    Abstract: This work studies the problem of shape reconstruction and object localization using a vision-based tactile sensor, GelSlim. The main contributions are the recovery of local shapes from contact, an approach to reconstruct the tactile shape of objects from tactile imprints, and an accurate method for object localization of previously reconstructed objects. The algorithms can be applied to a large va…

    Submitted 11 July, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: ICRA 2019, 7 pages, 7 figures. Website: http://web.mit.edu/mcube/research/tactile_localization.html Video: https://youtu.be/uMkspjmDbqs

  13. arXiv:1904.09019  [pdf, other]

    cs.LG stat.ML

    Graph Element Networks: adaptive, structured computation and memory

    Authors: Ferran Alet, Adarsh K. Jeewajee, Maria Bauza, Alberto Rodriguez, Tomas Lozano-Perez, Leslie Pack Kaelbling

    Abstract: We explore the use of graph neural networks (GNNs) to model spatial processes in which there is no a priori graphical structure. Similar to finite element analysis, we assign nodes of a GNN to spatial locations and use a computational process defined on the graph to model the relationship between an initial function defined over a space and a resulting function in the same space. We use GNNs as a…

    Submitted 17 November, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: Accepted to ICML 2019

  14. arXiv:1904.06580  [pdf, other]

    cs.RO cs.LG

    Combining Physical Simulators and Object-Based Networks for Control

    Authors: Anurag Ajay, Maria Bauza, Jiajun Wu, Nima Fazeli, Joshua B. Tenenbaum, Alberto Rodriguez, Leslie P. Kaelbling

    Abstract: Physics engines play an important role in robot planning and control; however, many real-world control problems involve complex contact dynamics that cannot be characterized analytically. Most physics engines therefore employ approximations that lead to a loss in precision. In this paper, we propose a hybrid dynamics model, simulator-augmented interaction networks (SAIN), combining a physics eng…

    Submitted 13 April, 2019; originally announced April 2019.

    Comments: ICRA 2019; Project page: http://sain.csail.mit.edu

  15. arXiv:1812.07768  [pdf, other]

    cs.LG stat.ML

    Modular meta-learning in abstract graph networks for combinatorial generalization

    Authors: Ferran Alet, Maria Bauza, Alberto Rodriguez, Tomas Lozano-Perez, Leslie P. Kaelbling

    Abstract: Modular meta-learning is a new framework that generalizes to unseen datasets by combining a small set of neural modules in different ways. In this work we propose abstract graph networks: using graphs as abstractions of a system's subparts without a fixed assignment of nodes to system subparts, for which we would need supervision. We combine this idea with modular meta-learning to get a flexible f…

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: Presented at NeurIPS meta-learning workshop 2018

  16. arXiv:1808.03246  [pdf, other]

    cs.RO cs.LG

    Augmenting Physical Simulators with Stochastic Neural Networks: Case Study of Planar Pushing and Bouncing

    Authors: Anurag Ajay, Jiajun Wu, Nima Fazeli, Maria Bauza, Leslie P. Kaelbling, Joshua B. Tenenbaum, Alberto Rodriguez

    Abstract: An efficient, generalizable physical simulator with universal uncertainty estimates has wide applications in robot state estimation, planning, and control. In this paper, we build such a simulator for two scenarios, planar pushing and ball bouncing, by augmenting an analytical rigid-body simulator with a neural network that learns to model uncertainty as residuals. Combining symbolic, deterministi…

    Submitted 9 August, 2018; originally announced August 2018.

    Comments: IROS 2018

  17. arXiv:1807.09904  [pdf, other]

    cs.RO cs.LG eess.SY

    A Data-Efficient Approach to Precise and Controlled Pushing

    Authors: Maria Bauza, Francois R. Hogan, Alberto Rodriguez

    Abstract: Decades of research in control theory have shown that simple controllers, when provided with timely feedback, can control complex systems. Pushing is an example of a complex mechanical system that is difficult to model accurately due to unknown system parameters such as coefficients of friction and pressure distributions. In this paper, we explore the data-complexity required for controlling, rath…

    Submitted 9 October, 2018; v1 submitted 25 July, 2018; originally announced July 2018.

    Comments: Maria Bauza and Francois R. Hogan contributed equally to this work. 10 pages, 5 figures

    Journal ref: CoRL 2018

  18. arXiv:1803.01940  [pdf, other]

    cs.RO eess.SY

    Tactile Regrasp: Grasp Adjustments via Simulated Tactile Transformations

    Authors: Francois R. Hogan, Maria Bauza, Oleguer Canal, Elliott Donlon, Alberto Rodriguez

    Abstract: This paper presents a novel regrasp control policy that makes use of tactile sensing to plan local grasp adjustments. Our approach determines regrasp actions by virtually searching for local transformations of tactile measurements that improve the quality of the grasp. First, we construct a tactile-based grasp quality metric using a deep convolutional neural network trained on over 2800 grasps. Th…

    Submitted 9 October, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: Francois R. Hogan and Maria Bauza contributed equally to this work. 8 pages, 7 figures

    Journal ref: IROS 2018

  19. arXiv:1710.01330  [pdf, other]

    cs.RO cs.CV

    Robotic Pick-and-Place of Novel Objects in Clutter with Multi-Affordance Grasping and Cross-Domain Image Matching

    Authors: Andy Zeng, Shuran Song, Kuan-Ting Yu, Elliott Donlon, Francois R. Hogan, Maria Bauza, Daolin Ma, Orion Taylor, Melody Liu, Eudald Romo, Nima Fazeli, Ferran Alet, Nikhil Chavan Dafle, Rachel Holladay, Isabella Morona, Prem Qu Nair, Druck Green, Ian Taylor, Weber Liu, Thomas Funkhouser, Alberto Rodriguez

    Abstract: This paper presents a robotic pick-and-place system that is capable of grasping and recognizing both known and novel objects in cluttered environments. The key new feature of the system is that it handles a wide range of object categories without needing any task-specific training data for novel objects. To achieve this, it first uses a category-agnostic affordance prediction algorithm to select a…

    Submitted 30 May, 2020; v1 submitted 3 October, 2017; originally announced October 2017.

    Comments: Project webpage: http://arc.cs.princeton.edu Summary video: https://youtu.be/6fG7zwGfIkI

  20. arXiv:1709.08120  [pdf, other]

    cs.RO cs.LG stat.ML

    GP-SUM. Gaussian Processes Filtering of non-Gaussian Beliefs

    Authors: Maria Bauza, Alberto Rodriguez

    Abstract: This work studies the problem of stochastic dynamic filtering and state propagation with complex beliefs. The main contribution is GP-SUM, a filtering algorithm tailored to dynamic systems and observation models expressed as Gaussian Processes (GP), and to states represented as a weighted sum of Gaussians. The key attribute of GP-SUM is that it does not rely on linearizations of the dynamic or obs…

    Submitted 30 January, 2019; v1 submitted 23 September, 2017; originally announced September 2017.

    Comments: WAFR 2018, 16 pages, 7 figures

  21. arXiv:1704.03033  [pdf, other]

    cs.RO cs.LG stat.ML

    A probabilistic data-driven model for planar pushing

    Authors: Maria Bauza, Alberto Rodriguez

    Abstract: This paper presents a data-driven approach to model planar pushing interaction to predict both the most likely outcome of a push and its expected variability. The learned models rely on a variation of Gaussian processes with input-dependent noise called Variational Heteroscedastic Gaussian processes (VHGP) that capture the mean and variance of a stochastic function. We show that we can learn accur…

    Submitted 23 September, 2017; v1 submitted 10 April, 2017; originally announced April 2017.

    Comments: 8 pages, 11 figures, ICRA 2017

  22. arXiv:1604.04038  [pdf, other]

    cs.RO

    More than a Million Ways to Be Pushed: A High-Fidelity Experimental Dataset of Planar Pushing

    Authors: Kuan-Ting Yu, Maria Bauza, Nima Fazeli, Alberto Rodriguez

    Abstract: Pushing is a motion primitive useful to handle objects that are too large, too heavy, or too cluttered to be grasped. It is at the core of much of robotic manipulation, in particular when physical interaction is involved. It seems reasonable then to wish for robots to understand how pushed objects move. In reality, however, robots often rely on approximations which yield models that are computab…

    Submitted 3 August, 2016; v1 submitted 14 April, 2016; originally announced April 2016.

    Comments: 8 pages, 10 figures

    Journal ref: IROS 2016