Skip to main content

Showing 1–50 of 140 results for author: Majumdar, A

  1. arXiv:2406.06534  [pdf, other

    cs.CV eess.IV physics.optics

    Compressed Meta-Optical Encoder for Image Classification

    Authors: Anna Wirth-Singh, Jinlin Xiang, Minho Choi, Johannes E. Fröch, Luocheng Huang, Shane Colburn, Eli Shlizerman, Arka Majumdar

    Abstract: Optical and hybrid convolutional neural networks (CNNs) recently have become of increasing interest to achieve low-latency, low-power image classification and computer vision tasks. However, implementing optical nonlinearity is challenging, and omitting the nonlinear layers in a standard CNN comes at a significant reduction in accuracy. In this work, we use knowledge distillation to compress modif… ▽ More

    Submitted 14 June, 2024; v1 submitted 22 April, 2024; originally announced June 2024.

  2. arXiv:2403.15959  [pdf, other

    cs.RO eess.SY math.OC

    Risk-Calibrated Human-Robot Interaction via Set-Valued Intent Prediction

    Authors: Justin Lidard, Hang Pham, Ariel Bachman, Bryan Boateng, Anirudha Majumdar

    Abstract: Tasks where robots must anticipate human intent, such as navigating around a cluttered home or sorting everyday items, are challenging because they exhibit a wide range of valid actions that lead to similar outcomes. Moreover, zero-shot cooperation between human-robot partners is an especially challenging problem because it requires the robot to infer and adapt on the fly to a latent human intent,… ▽ More

    Submitted 23 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Website with additional information, videos, and code: https://risk-calibrated-planning.github.io/

  3. arXiv:2403.15941  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Explore until Confident: Efficient Exploration for Embodied Question Answering

    Authors: Allen Z. Ren, Jaden Clark, Anushri Dixit, Masha Itkina, Anirudha Majumdar, Dorsa Sadigh

    Abstract: We consider the problem of Embodied Question Answering (EQA), which refers to settings where an embodied agent such as a robot needs to actively explore an environment to gather information until it is confident about the answer to a question. In this work, we leverage the strong semantic reasoning capabilities of large vision-language models (VLMs) to efficiently explore and answer such questions… ▽ More

    Submitted 7 July, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Robotics: Science and Systems (RSS) 2024

  4. arXiv:2403.08185  [pdf, other

    cs.RO eess.SY

    Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception

    Authors: Anushri Dixit, Zhiting Mei, Meghan Booker, Mariko Storey-Matsutani, Allen Z. Ren, Anirudha Majumdar

    Abstract: Rapid advances in perception have enabled large pre-trained models to be used out of the box for transforming high-dimensional, noisy, and partial observations of the world into rich occupancy representations. However, the reliability of these models and consequently their safe integration onto robots remains unknown when deployed in environments unseen during training. In this work, we address th… ▽ More

    Submitted 8 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Videos and code can be found at https://perceive-with-confidence.github.io

  5. arXiv:2402.17130  [pdf, other

    cs.RO

    Privacy-Preserving Map-Free Exploration for Confirming the Absence of a Radioactive Source

    Authors: Eric Lepowsky, David Snyder, Alexander Glaser, Anirudha Majumdar

    Abstract: Performing an inspection task while maintaining the privacy of the inspected site is a challenging balancing act. In this work, we are motivated by the future of nuclear arms control verification, which requires both a high level of privacy and guaranteed correctness. For scenarios with limitations on sensors and stored information due to the potentially secret nature of observable features, we pr… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 figures, in submission

  6. arXiv:2402.11400  [pdf

    cs.HC

    From Text to Map: A System Dynamics Bot for Constructing Causal Loop Diagrams

    Authors: Niyousha Hosseinichimeh, Aritra Majumdar, Ross Williams, Navid Ghaffarzadegan

    Abstract: We introduce and test the System Dynamics Bot, a computer program leveraging a large language model to automate the creation of causal loop diagrams from textual data. To evaluate its performance, we ensembled two distinct databases. The first dataset includes 20 causal loop diagrams and associated texts sourced from the system dynamics literature. The second dataset comprises responses from 30 pa… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 23 pages, 4 figures, 3 tables

  7. Static and Dynamic Synthesis of Bengali and Devanagari Signatures

    Authors: Miguel A. Ferrer, Sukalpa Chanda, Moises Diaz, Chayan Kr. Banerjee, Anirban Majumdar, Cristina Carmona-Duarte, Parikshit Acharya, Umapada Pal

    Abstract: Developing an automatic signature verification system is challenging and demands a large number of training samples. This is why synthetic handwriting generation is an emerging topic in document image analysis. Some handwriting synthesizers use the motor equivalence model, the well-established hypothesis from neuroscience, which analyses how a human being accomplishes movement. Specifically, a mot… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted version. Published on IEEE Transactions on Cybernetics [ISSN 2168-2267], v. 48(10), p. 2896-2907

    Journal ref: IEEE Transactions on Cybernetics, v. 48(10), p. 2896-2907, 2018

  8. arXiv:2312.13279  [pdf, other

    cs.RO

    Stretch with Stretch: Physical Therapy Exercise Games Led by a Mobile Manipulator

    Authors: Matthew Lamsey, You Liang Tan, Meredith D. Wells, Madeline Beatty, Zexuan Liu, Arjun Majumdar, Kendra Washington, Jerry Feldman, Naveen Kuppuswamy, Elizabeth Nguyen, Arielle Wallenstein, Madeleine E. Hackney, Charles C. Kemp

    Abstract: Physical therapy (PT) is a key component of many rehabilitation regimens, such as treatments for Parkinson's disease (PD). However, there are shortages of physical therapists and adherence to self-guided PT is low. Robots have the potential to support physical therapists and increase adherence to self-guided PT, but prior robotic systems have been large and immobile, which can be a barrier to use… ▽ More

    Submitted 21 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2312.07843  [pdf, ps, other

    cs.RO

    Foundation Models in Robotics: Applications, Challenges, and the Future

    Authors: Roya Firoozi, Johnathan Tucker, Stephen Tian, Anirudha Majumdar, Jiankai Sun, Weiyu Liu, Yuke Zhu, Shuran Song, Ashish Kapoor, Karol Hausman, Brian Ichter, Danny Driess, Jiajun Wu, Cewu Lu, Mac Schwager

    Abstract: We survey applications of pretrained foundation models in robotics. Traditional deep learning models in robotics are trained on small datasets tailored for specific tasks, which limits their adaptability across diverse applications. In contrast, foundation models pretrained on internet-scale data appear to have superior generalization capabilities, and in some instances display an emergent ability… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  10. arXiv:2312.04658  [pdf, other

    cs.LG stat.ML

    PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction

    Authors: Apoorva Sharma, Sushant Veer, Asher Hancock, Heng Yang, Marco Pavone, Anirudha Majumdar

    Abstract: Inductive Conformal Prediction (ICP) provides a practical and effective approach for equipping deep learning models with uncertainty estimates in the form of set-valued predictions which are guaranteed to contain the ground truth with high probability. Despite the appeal of this coverage guarantee, these sets may not be efficient: the size and contents of the prediction sets are not directly contr… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  11. arXiv:2311.14731  [pdf, ps, other

    q-fin.ST cs.LG stat.AP

    Deep State-Space Model for Predicting Cryptocurrency Price

    Authors: Shalini Sharma, Angshul Majumdar, Emilie Chouzenoux, Victor Elvira

    Abstract: Our work presents two fundamental contributions. On the application side, we tackle the challenging problem of predicting day-ahead crypto-currency prices. On the methodological side, a new dynamical modeling approach is proposed. Our approach keeps the probabilistic formulation of the state-space model, which provides uncertainty quantification on the estimates, and the function approximation abi… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  12. arXiv:2311.14100  [pdf, other

    cs.RO

    MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction

    Authors: Nathaniel Simon, Anirudha Majumdar

    Abstract: A major challenge in deploying the smallest of Micro Aerial Vehicle (MAV) platforms (< 100 g) is their inability to carry sensors that provide high-resolution metric depth information (e.g., LiDAR or stereo cameras). Current systems rely on end-to-end learning or heuristic approaches that directly map images to control inputs, and struggle to fly fast in unknown environments. In this work, we ask… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: International Symposium on Experimental Robotics (ISER) 2023

  13. arXiv:2311.11929  [pdf, other

    physics.app-ph cond-mat.mtrl-sci cs.ET

    Novel implementations for reservoir computing -- from spin to charge

    Authors: Karin Everschor-Sitte, Atreya Majumdar, Katharina Wolk, Dennis Meier

    Abstract: Topological textures in magnetic and electric materials are considered to be promising candidates for next-generation information technology and unconventional computing. Here, we discuss how the physical properties of topological nanoscale systems, such as skyrmions and domain walls, can be leveraged for reservoir computing, translating non-linear problems into linearly solvable ones. In addition… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  14. arXiv:2310.02219  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?

    Authors: Sneha Silwal, Karmesh Yadav, Tingfan Wu, Jay Vakil, Arjun Majumdar, Sergio Arnaud, Claire Chen, Vincent-Pierre Berges, Dhruv Batra, Aravind Rajeswaran, Mrinal Kalakrishnan, Franziska Meier, Oleksandr Maksymets

    Abstract: We present a large empirical investigation on the use of pre-trained visual representations (PVRs) for training downstream policies that execute real-world tasks. Our study involves five different PVRs, each trained for five distinct manipulation or indoor navigation tasks. We performed this evaluation using three different robots and two different policy learning paradigms. From this effort, we c… ▽ More

    Submitted 13 July, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Project website https://pvrs-sim2real.github.io/

    MSC Class: 68T45 (Primary) 68T40; 68T05(Secondary) ACM Class: I.2.9; I.2.6; I.4.8; I.5.4

  15. arXiv:2309.12428  [pdf, other

    cs.CV

    Synthetic Image Detection: Highlights from the IEEE Video and Image Processing Cup 2022 Student Competition

    Authors: Davide Cozzolino, Koki Nagano, Lucas Thomaz, Angshul Majumdar, Luisa Verdoliva

    Abstract: The Video and Image Processing (VIP) Cup is a student competition that takes place each year at the IEEE International Conference on Image Processing. The 2022 IEEE VIP Cup asked undergraduate students to develop a system capable of distinguishing pristine images from generated ones. The interest in this topic stems from the incredible advances in the AI-based generation of visual data, with tools… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  16. arXiv:2309.11456  [pdf

    cs.AI cs.LG cs.MA nlin.AO physics.soc-ph

    Generative Agent-Based Modeling: Unveiling Social System Dynamics through Coupling Mechanistic Models with Generative Artificial Intelligence

    Authors: Navid Ghaffarzadegan, Aritra Majumdar, Ross Williams, Niyousha Hosseinichimeh

    Abstract: We discuss the emerging new opportunity for building feedback-rich computational models of social systems using generative artificial intelligence. Referred to as Generative Agent-Based Models (GABMs), such individual-level models utilize large language models such as ChatGPT to represent human decision-making in social settings. We provide a GABM case in which human behavior can be incorporated i… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: System Dynamics Review (2024)

  17. arXiv:2309.02561  [pdf, other

    cs.RO cs.AI cs.CV

    Physically Grounded Vision-Language Models for Robotic Manipulation

    Authors: Jensen Gao, Bidipta Sarkar, Fei Xia, Ted Xiao, Jiajun Wu, Brian Ichter, Anirudha Majumdar, Dorsa Sadigh

    Abstract: Recent advances in vision-language models (VLMs) have led to improved performance on tasks such as visual question answering and image captioning. Consequently, these models are now well-positioned to reason about the physical world, particularly within domains such as robotic manipulation. However, current VLMs are limited in their understanding of the physical concepts (e.g., material, fragility… ▽ More

    Submitted 3 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Updated version for ICRA 2024

  18. arXiv:2308.03407  [pdf, other

    cs.CV

    Spatially Varying Nanophotonic Neural Networks

    Authors: Kaixuan Wei, Xiao Li, Johannes Froech, Praneeth Chakravarthula, James Whitehead, Ethan Tseng, Arka Majumdar, Felix Heide

    Abstract: The explosive growth of computation and energy cost of artificial intelligence has spurred strong interests in new computing modalities as potential alternatives to conventional electronic processors. Photonic processors that execute operations using photons instead of electrons, have promised to enable optical neural networks with ultra-low latency and power consumption. However, existing optical… ▽ More

    Submitted 30 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  19. arXiv:2308.02797  [pdf, other

    physics.optics cs.CV

    Thin On-Sensor Nanophotonic Array Cameras

    Authors: Praneeth Chakravarthula, Jipeng Sun, Xiao Li, Chenyang Lei, Gene Chou, Mario Bijelic, Johannes Froesch, Arka Majumdar, Felix Heide

    Abstract: Today's commodity camera systems rely on compound optics to map light originating from the scene to positions on the sensor where it gets recorded as an image. To record images without optical aberrations, i.e., deviations from Gauss' linear model of optics, typical lens systems introduce increasingly complex stacks of optical elements which are responsible for the height of existing commodity cam… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: 18 pages, 12 figures, to be published in ACM Transactions on Graphics

    ACM Class: I.4.0

  20. arXiv:2307.10790  [pdf, other

    cs.CV cs.RO

    Behavioral Analysis of Vision-and-Language Navigation Agents

    Authors: Zijiao Yang, Arjun Majumdar, Stefan Lee

    Abstract: To be successful, Vision-and-Language Navigation (VLN) agents must be able to ground instructions to actions based on their surroundings. In this work, we develop a methodology to study agent behavior on a skill-specific basis -- examining how well existing agents ground instructions about stopping, turning, and moving towards specified objects or rooms. Our approach is based on generating skill-s… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: accepted to CVPR2023

    ACM Class: I.2.9

    Journal ref: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2574-2582. 2023

  21. arXiv:2307.04986  [pdf

    cs.AI cs.MA econ.GN nlin.AO physics.soc-ph

    Epidemic Modeling with Generative Agents

    Authors: Ross Williams, Niyousha Hosseinichimeh, Aritra Majumdar, Navid Ghaffarzadegan

    Abstract: This study offers a new paradigm of individual-level modeling to address the grand challenge of incorporating human behavior in epidemic models. Using generative artificial intelligence in an agent-based epidemic model, each agent is empowered to make its own reasonings and decisions via connecting to a large language model such as ChatGPT. Through various simulation experiments, we present compel… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  22. arXiv:2307.01928  [pdf, other

    cs.RO cs.AI stat.AP

    Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

    Authors: Allen Z. Ren, Anushri Dixit, Alexandra Bodrova, Sumeet Singh, Stephen Tu, Noah Brown, Peng Xu, Leila Takayama, Fei Xia, Jake Varley, Zhenjia Xu, Dorsa Sadigh, Andy Zeng, Anirudha Majumdar

    Abstract: Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, which is a framework for measuring and aligning the uncertainty of LLM-based planners such that they know when they don't know and ask for… ▽ More

    Submitted 4 September, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: Conference on Robot Learning (CoRL) 2023, Oral Presentation

  23. arXiv:2306.17248  [pdf, other

    cs.LG physics.ao-ph stat.ML

    TemperatureGAN: Generative Modeling of Regional Atmospheric Temperatures

    Authors: Emmanuel Balogun, Ram Rajagopal, Arun Majumdar

    Abstract: Stochastic generators are useful for estimating climate impacts on various sectors. Projecting climate risk in various sectors, e.g. energy systems, requires generators that are accurate (statistical resemblance to ground-truth), reliable (do not produce erroneous examples), and efficient. Leveraging data from the North American Land Data Assimilation System, we introduce TemperatureGAN, a Generat… ▽ More

    Submitted 19 January, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

  24. arXiv:2306.08776  [pdf, other

    cs.RO

    Online Learning for Obstacle Avoidance

    Authors: David Snyder, Meghan Booker, Nathaniel Simon, Wenhan Xia, Daniel Suo, Elad Hazan, Anirudha Majumdar

    Abstract: We approach the fundamental problem of obstacle avoidance for robotic systems via the lens of online learning. In contrast to prior work that either assumes worst-case realizations of uncertainty in the environment or a stationary stochastic model of uncertainty, we propose a method that is efficient to implement and provably grants instance-optimality with respect to perturbations of trajectories… ▽ More

    Submitted 5 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 8 + 21 pages, 2 + 11 figures, Accepted to CoRL 2023 [Poster]

  25. arXiv:2305.12875  [pdf, other

    cs.ET

    Powering AI at the Edge: A Robust, Memristor-based Binarized Neural Network with Near-Memory Computing and Miniaturized Solar Cell

    Authors: Fadi Jebali, Atreya Majumdar, Clément Turck, Kamel-Eddine Harabi, Mathieu-Coumba Faye, Eloi Muhr, Jean-Pierre Walder, Oleksandr Bilousov, Amadeo Michaud, Elisa Vianello, Tifenn Hirtzlin, François Andrieu, Marc Bocquet, Stéphane Collin, Damien Querlioz, Jean-Michel Portal

    Abstract: Memristor-based neural networks provide an exceptional energy-efficient platform for artificial intelligence (AI), presenting the possibility of self-powered operation when paired with energy harvesters. However, most memristor-based networks rely on analog in-memory computing, necessitating a stable and precise power supply, which is incompatible with the inherently unstable and unreliable energy… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  26. arXiv:2305.09634  [pdf, other

    cs.GT

    Bi-Objective Lexicographic Optimization in Markov Decision Processes with Related Objectives

    Authors: Damien Busatto-Gaston, Debraj Chakraborty, Anirban Majumdar, Sayan Mukherjee, Guillermo A. Pérez, Jean-François Raskin

    Abstract: We consider lexicographic bi-objective problems on Markov Decision Processes (MDPs), where we optimize one objective while guaranteeing optimality of another. We propose a two-stage technique for solving such problems when the objectives are related (in a way that we formalize). We instantiate our technique for two natural pairs of objectives: minimizing the (conditional) expected number of steps… ▽ More

    Submitted 15 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  27. arXiv:2305.02968  [pdf, other

    cs.LG cs.AI

    Masked Trajectory Models for Prediction, Representation, and Control

    Authors: Philipp Wu, Arjun Majumdar, Kevin Stone, Yixin Lin, Igor Mordatch, Pieter Abbeel, Aravind Rajeswaran

    Abstract: We introduce Masked Trajectory Models (MTM) as a generic abstraction for sequential decision making. MTM takes a trajectory, such as a state-action sequence, and aims to reconstruct the trajectory conditioned on random subsets of the same trajectory. By training with a highly randomized masking pattern, MTM learns versatile networks that can take on different roles or capabilities, by simply choos… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted for publication at ICML 2023. Project webpage: https://wuphilipp.github.io/mtm/

  28. arXiv:2305.01743  [pdf

    physics.optics cs.CV

    Photonic Advantage of Optical Encoders

    Authors: Luocheng Huang, Quentin A. A. Tanguy, Johannes E. Froch, Saswata Mukherjee, Karl F. Bohringer, Arka Majumdar

    Abstract: Light's ability to perform massive linear operations parallelly has recently inspired numerous demonstrations of optics-assisted artificial neural networks (ANN). However, a clear advantage of optics over purely digital ANN in a system-level has not yet been established. While linear operations can indeed be optically performed very efficiently, the lack of nonlinearity and signal regeneration req… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  29. arXiv:2304.13479  [pdf, other

    cs.LG cs.AI cs.IT

    Fundamental Tradeoffs in Learning with Prior Information

    Authors: Anirudha Majumdar

    Abstract: We seek to understand fundamental tradeoffs between the accuracy of prior information that a learner has on a given problem and its learning performance. We introduce the notion of prioritized risk, which differs from traditional notions of minimax and Bayes risk by allowing us to study such fundamental tradeoffs in settings where reality does not necessarily conform to the learner's prior. We pre… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023

  30. arXiv:2303.18240  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?

    Authors: Arjun Majumdar, Karmesh Yadav, Sergio Arnaud, Yecheng Jason Ma, Claire Chen, Sneha Silwal, Aryan Jain, Vincent-Pierre Berges, Pieter Abbeel, Jitendra Malik, Dhruv Batra, Yixin Lin, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier

    Abstract: We present the largest and most comprehensive empirical study of pre-trained visual representations (PVRs) or visual 'foundation models' for Embodied AI. First, we curate CortexBench, consisting of 17 different tasks spanning locomotion, navigation, dexterous, and mobile manipulation. Next, we systematically evaluate existing PVRs and find that none are universally dominant. To study the effect of… ▽ More

    Submitted 1 February, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Project website: https://eai-vc.github.io

  31. arXiv:2303.07798  [pdf, other

    cs.CV cs.AI

    OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav

    Authors: Karmesh Yadav, Arjun Majumdar, Ram Ramrakhya, Naoki Yokoyama, Alexei Baevski, Zsolt Kira, Oleksandr Maksymets, Dhruv Batra

    Abstract: We present a single neural network architecture composed of task-agnostic components (ViTs, convolutions, and LSTMs) that achieves state-of-art results on both the ImageNav ("go to location in <this picture>") and ObjectNav ("find a chair") tasks without any task-specific modules like object detection, segmentation, mapping, or planning modules. Such general-purpose methods offer advantages of sim… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 15 pages, 7 figures, 9 tables

  32. arXiv:2302.04903  [pdf, other

    cs.RO cs.LG eess.SY

    AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer

    Authors: Allen Z. Ren, Hongkai Dai, Benjamin Burchfiel, Anirudha Majumdar

    Abstract: Simulation parameter settings such as contact models and object geometry approximations are critical to training robust robotic policies capable of transferring from simulation to real-world deployment. Previous approaches typically handcraft distributions over such parameters (domain randomization), or identify parameters that best match the dynamics of the real environment (system identification… ▽ More

    Submitted 30 September, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Conference on Robot Learning (CoRL), 2023

  33. arXiv:2212.06345  [pdf, other

    physics.optics cs.CV

    Foveated Thermal Computational Imaging in the Wild Using All-Silicon Meta-Optics

    Authors: Vishwanath Saragadam, Zheyi Han, Vivek Boominathan, Luocheng Huang, Shiyu Tan, Johannes E. Fröch, Karl F. Böhringer, Richard G. Baraniuk, Arka Majumdar, Ashok Veeraraghavan

    Abstract: Foveated imaging provides a better tradeoff between situational awareness (field of view) and resolution and is critical in long-wavelength infrared regimes because of the size, weight, power, and cost of thermal sensors. We demonstrate computational foveated imaging by exploiting the ability of a meta-optical frontend to discriminate between different polarization states and a computational backe… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

  34. arXiv:2211.05865  [pdf, other

    cs.RO

    Switching Attention in Time-Varying Environments via Bayesian Inference of Abstractions

    Authors: Meghan Booker, Anirudha Majumdar

    Abstract: Motivated by the goal of endowing robots with a means for focusing attention in order to operate reliably in complex, uncertain, and time-varying environments, we consider how a robot can (i) determine which portions of its environment to pay attention to at any given point in time, (ii) infer changes in context (e.g., task or environment dynamics), and (iii) switch its attention accordingly. In t… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 8 pages, 5 figures

  35. arXiv:2210.10784  [pdf, other

    q-bio.QM cs.AI cs.LG

    Graph Regularized Probabilistic Matrix Factorization for Drug-Drug Interactions Prediction

    Authors: Stuti Jain, Emilie Chouzenoux, Kriti Kumar, Angshul Majumdar

    Abstract: Co-administration of two or more drugs simultaneously can result in adverse drug reactions. Identifying drug-drug interactions (DDIs) is necessary, especially for drug development and for repurposing old drugs. DDI prediction can be viewed as a matrix completion task, for which matrix factorization (MF) appears as a suitable solution. This paper presents a novel Graph Regularized Probabilistic Mat… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  36. arXiv:2210.05857  [pdf, other

    cs.RO

    FlowDrone: Wind Estimation and Gust Rejection on UAVs Using Fast-Response Hot-Wire Flow Sensors

    Authors: Nathaniel Simon, Allen Z. Ren, Alexander Piqué, David Snyder, Daphne Barretto, Marcus Hultmark, Anirudha Majumdar

    Abstract: Unmanned aerial vehicles (UAVs) are finding use in applications that place increasing emphasis on robustness to external disturbances including extreme wind. However, traditional multirotor UAV platforms do not directly sense wind; conventional flow sensors are too slow, insensitive, or bulky for widespread integration on UAVs. Instead, drones typically observe the effects of wind indirectly throu… ▽ More

    Submitted 24 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Submitted to ICRA 2023. See supplementary video at https://youtu.be/KWqkH9Z-338

  37. arXiv:2206.13074  [pdf, other

    cs.RO cs.AI cs.LG

    Leveraging Language for Accelerated Learning of Tool Manipulation

    Authors: Allen Z. Ren, Bharat Govil, Tsung-Yen Yang, Karthik Narasimhan, Anirudha Majumdar

    Abstract: Robust and generalized tool manipulation requires an understanding of the properties and affordances of different tools. We investigate whether linguistic information about a tool (e.g., its geometry, common uses) can help control policies adapt faster to new tools for a given task. We obtain diverse descriptions of various tools in natural language and use pre-trained language models to generate… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  38. arXiv:2206.12403  [pdf, other

    cs.CV cs.LG cs.RO

    ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings

    Authors: Arjun Majumdar, Gunjan Aggarwal, Bhavika Devnani, Judy Hoffman, Dhruv Batra

    Abstract: We present a scalable approach for learning open-world object-goal navigation (ObjectNav) -- the task of asking a virtual robot (agent) to find any instance of an object in an unexplored environment (e.g., "find a sink"). Our approach is entirely zero-shot -- i.e., it does not require ObjectNav rewards or demonstrations of any kind. Instead, we train on the image-goal navigation (ImageNav) task, i… ▽ More

    Submitted 12 October, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: code: https://github.com/gunagg/zson

  39. arXiv:2205.04790  [pdf, other

    stat.ML cs.AI cs.HC cs.LG

    Don't Throw it Away! The Utility of Unlabeled Data in Fair Decision Making

    Authors: Miriam Rateike, Ayan Majumdar, Olga Mineeva, Krishna P. Gummadi, Isabel Valera

    Abstract: Decision making algorithms, in practice, are often trained on data that exhibits a variety of biases. Decision-makers often aim to take decisions based on some ground-truth target that is assumed or expected to be unbiased, i.e., equally distributed across socially salient groups. In many practical settings, the ground-truth cannot be directly observed, and instead, we have to rely on a biased pro… ▽ More

    Submitted 4 July, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

  40. arXiv:2204.13520  [pdf

    physics.optics cs.CV

    Inverse-Designed Meta-Optics with Spectral-Spatial Engineered Response to Mimic Color Perception

    Authors: Chris Munley, Wenchao Ma, Johannes E. Fröch, Quentin A. A. Tanguy, Elyas Bayati, Karl F. Böhringer, Zin Lin, Raphaël Pestourie, Steven G. Johnson, Arka Majumdar

    Abstract: Meta-optics have rapidly become a major research field within the optics and photonics community, strongly driven by the seemingly limitless opportunities made possible by controlling optical wavefronts through interaction with arrays of sub-wavelength scatterers. As more and more modalities are explored, the design strategies to achieve desired functionalities become increasingly demanding, neces… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

  41. arXiv:2204.13226  [pdf, other

    cs.CV cs.LG

    Offline Visual Representation Learning for Embodied Navigation

    Authors: Karmesh Yadav, Ram Ramrakhya, Arjun Majumdar, Vincent-Pierre Berges, Sachit Kuhar, Dhruv Batra, Alexei Baevski, Oleksandr Maksymets

    Abstract: How should we learn visual representations for embodied agents that must see and move? The status quo is tabula rasa in vivo, i.e. learning visual representations from scratch while also learning to move, potentially augmented with auxiliary tasks (e.g. predicting the action taken between two successive observations). In this paper, we show that an alternative 2-stage strategy is far more effectiv… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: 15 pages, 4 figures, 7 tables and supplementary

  42. arXiv:2202.09892  [pdf, other

    cs.RO cs.CC

    Towards a Framework for Comparing the Complexity of Robotic Tasks

    Authors: Michelle Ho, Alec Farid, Anirudha Majumdar

    Abstract: We are motivated by the problem of comparing the complexity of one robotic task relative to another. To this end, we define a notion of reduction that formalizes the following intuition: Task 1 reduces to Task 2 if we can efficiently transform any policy that solves Task 2 into a policy that solves Task 1. We further define a quantitative measure of the relative complexity between any two tasks fo… ▽ More

    Submitted 24 June, 2022; v1 submitted 20 February, 2022; originally announced February 2022.

  43. arXiv:2202.05894  [pdf, other

    cs.RO

    Failure Prediction with Statistical Guarantees for Vision-Based Robot Control

    Authors: Alec Farid, David Snyder, Allen Z. Ren, Anirudha Majumdar

    Abstract: We are motivated by the problem of performing failure prediction for safety-critical robotic systems with high-dimensional sensor observations (e.g., vision). Given access to a black-box control policy (e.g., in the form of a neural network) and a dataset of training environments, we present an approach for synthesizing a failure predictor with guaranteed bounds on false-positive and false-negativ… ▽ More

    Submitted 5 May, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  44. arXiv:2202.00129  [pdf, other

    cs.RO cs.AI cs.IT cs.LG math.OC

    Fundamental Limits for Sensor-Based Robot Control

    Authors: Anirudha Majumdar, Zhiting Mei, Vincent Pacelli

    Abstract: Our goal is to develop theory and algorithms for establishing fundamental limits on performance imposed by a robot's sensors for a given task. In order to achieve this, we define a quantity that captures the amount of task-relevant information provided by a sensor. Using a novel version of the generalized Fano inequality from information theory, we demonstrate that this quantity provides an upper… ▽ More

    Submitted 11 July, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Extended version of paper presented at the 2022 Robotics: Science and Systems (RSS) conference

  45. Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees

    Authors: Kai-Chieh Hsu, Allen Z. Ren, Duy Phuong Nguyen, Anirudha Majumdar, Jaime F. Fisac

    Abstract: Safety is a critical component of autonomous systems and remains a challenge for learning-based policies to be utilized in the real world. In particular, policies learned using reinforcement learning often fail to generalize to novel environments due to unsafe behavior. In this paper, we propose Sim-to-Lab-to-Real to bridge the reality gap with a probabilistically guaranteed safety-aware policy di… ▽ More

    Submitted 1 April, 2023; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted to Special Issue on Risk-aware Autonomous Systems: Theory and Practice, Artificial Intelligence

  46. arXiv:2111.14726  [pdf, other

    cs.CV cs.AI cs.LG

    Do Invariances in Deep Neural Networks Align with Human Perception?

    Authors: Vedant Nanda, Ayan Majumdar, Camila Kolling, John P. Dickerson, Krishna P. Gummadi, Bradley C. Love, Adrian Weller

    Abstract: An evaluation criterion for safe and trustworthy deep learning is how well the invariances captured by representations of deep neural networks (DNNs) are shared with humans. We identify challenges in measuring these invariances. Prior works used gradient-based methods to generate identically represented inputs (IRIs), ie, inputs which have identical representations (on a given layer) of a neural n… ▽ More

    Submitted 2 December, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: AAAI 2023

  47. arXiv:2111.13921  [pdf

    cs.LG stat.ML

    Transformed K-means Clustering

    Authors: Anurag Goel, Angshul Majumdar

    Abstract: In this work we propose a clustering framework based on the paradigm of transform learning. In simple terms the representation from transform learning is used for K-means clustering; however, the problem is not solved in such a naïve piecemeal fashion. The K-means clustering loss is embedded into the transform learning framework and the joint problem is solved using the alternating direction metho… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: EUSIPCO 2021

  48. arXiv:2111.13920  [pdf

    cs.CV cs.LG eess.IV

    Sparse Subspace Clustering Friendly Deep Dictionary Learning for Hyperspectral Image Classification

    Authors: Anurag Goel, Angshul Majumdar

    Abstract: Subspace clustering techniques have shown promise in hyperspectral image segmentation. The fundamental assumption in subspace clustering is that the samples belonging to different clusters/segments lie in separable subspaces. What if this condition does not hold? We surmise that even if the condition does not hold in the original space, the data may be nonlinearly transformed to a space where it w… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: IEEE Geoscience And Remote Sensing Letters

  49. arXiv:2111.08761  [pdf, other

    cs.RO cs.LG eess.SY

    Stronger Generalization Guarantees for Robot Learning by Combining Generative Models and Real-World Data

    Authors: Abhinav Agarwal, Sushant Veer, Allen Z. Ren, Anirudha Majumdar

    Abstract: We are motivated by the problem of learning policies for robotic systems with rich sensory inputs (e.g., vision) in a manner that allows us to guarantee generalization to environments unseen during training. We provide a framework for providing such generalization guarantees by leveraging a finite dataset of real-world environments in combination with a (potentially inaccurate) generative model of… ▽ More

    Submitted 22 July, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  50. arXiv:2111.08733  [pdf, other

    cs.RO cs.LG eess.SY

    Learning Provably Robust Motion Planners Using Funnel Libraries

    Authors: Ali Ekin Gurgen, Anirudha Majumdar, Sushant Veer

    Abstract: This paper presents an approach for learning motion planners that are accompanied with probabilistic guarantees of success on new environments that hold uniformly for any disturbance to the robot's dynamics within an admissible set. We achieve this by bringing together tools from generalization theory and robust control. First, we curate a library of motion primitives where the robustness of each… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.