Showing 1–22 of 22 results for author: Wijmans, E

  1. arXiv:2301.13261

    cs.AI cs.CV cs.LG cs.RO

    Emergence of Maps in the Memories of Blind Navigation Agents

    Authors: Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra

    Abstract: Animal navigation research posits that organisms build and maintain internal spatial representations, or maps, of their environment. We ask if machines -- specifically, artificial intelligence (AI) navigation agents -- also build implicit (or 'mental') maps. A positive answer to this question would (a) explain the surprising phenomenon in recent literature of ostensibly map-free neural-networks ac…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to ICLR 2023

  2. arXiv:2301.07302

    cs.LG cs.AI cs.RO

    PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav

    Authors: Ram Ramrakhya, Dhruv Batra, Erik Wijmans, Abhishek Das

    Abstract: We study ObjectGoal Navigation -- where a virtual robot situated in a new environment is asked to navigate to an object. Prior work has shown that imitation learning (IL) using behavior cloning (BC) on a dataset of human demonstrations achieves promising results. However, this has limitations -- 1) BC policies generalize poorly to new states, since the training mimics actions not their consequence…

    Submitted 26 March, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 8 pages + supplement

  3. arXiv:2210.05064

    cs.LG cs.AI cs.CV cs.RO

    VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement

    Authors: Erik Wijmans, Irfan Essa, Dhruv Batra

    Abstract: We present Variable Experience Rollout (VER), a technique for efficiently scaling batched on-policy reinforcement learning in heterogenous environments (where different environments take vastly different times to generate rollouts) to many GPUs residing on, potentially, many machines. VER combines the strengths of and blurs the line between synchronous and asynchronous on-policy RL methods (SyncOn…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  4. arXiv:2206.00997

    cs.CV

    Is Mapping Necessary for Realistic PointGoal Navigation?

    Authors: Ruslan Partsey, Erik Wijmans, Naoki Yokoyama, Oles Dobosevych, Dhruv Batra, Oleksandr Maksymets

    Abstract: Can an autonomous agent navigate in a new environment without building an explicit map? For the task of PointGoal navigation ('Go to $Δx$, $Δy$') under idealized settings (no RGB-D and actuation noise, perfect GPS+Compass), the answer is a clear 'yes' - map-less neural models composed of task-agnostic components (CNNs and RNNs) trained with large-scale reinforcement learning achieve 100% Success…

    Submitted 7 June, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: Corrected typos in the Abstract

  5. arXiv:2109.08677

    cs.CV cs.LG cs.RO

    Realistic PointGoal Navigation via Auxiliary Losses and Information Bottleneck

    Authors: Guillermo Grande, Dhruv Batra, Erik Wijmans

    Abstract: We propose a novel architecture and training paradigm for training realistic PointGoal Navigation -- navigating to a target coordinate in an unseen environment under actuation and sensor noise without access to ground-truth localization. Specifically, we find that the primary challenge under this setting is learning localization -- when stripped of idealized localization, agents fail to stop preci…

    Submitted 17 September, 2021; originally announced September 2021.

  6. arXiv:2109.08238

    cs.CV cs.AI

    Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI

    Authors: Santhosh K. Ramakrishnan, Aaron Gokaslan, Erik Wijmans, Oleksandr Maksymets, Alex Clegg, John Turner, Eric Undersander, Wojciech Galuba, Andrew Westbury, Angel X. Chang, Manolis Savva, Yili Zhao, Dhruv Batra

    Abstract: We present the Habitat-Matterport 3D (HM3D) dataset. HM3D is a large-scale dataset of 1,000 building-scale 3D reconstructions from a diverse set of real-world locations. Each scene in the dataset consists of a textured 3D mesh reconstruction of interiors such as multi-floor residences, stores, and other private indoor spaces. HM3D surpasses existing datasets available for academic research in te…

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: 21 pages, 14 figures

  7. arXiv:2107.08170

    cs.LG cs.AI

    Megaverse: Simulating Embodied Agents at One Million Experiences per Second

    Authors: Aleksei Petrenko, Erik Wijmans, Brennan Shacklett, Vladlen Koltun

    Abstract: We present Megaverse, a new 3D simulation platform for reinforcement learning and embodied AI research. The efficient design of our engine enables physics-based simulation with high-dimensional egocentric observations at more than 1,000,000 actions per second on a single 8-GPU node. Megaverse is up to 70x faster than DeepMind Lab in fully-shaded 3D scenes with interactive objects. We achieve this…

    Submitted 20 July, 2021; v1 submitted 16 July, 2021; originally announced July 2021.

    Comments: Paper published in ICML2021

  8. arXiv:2106.14405

    cs.LG cs.RO

    Habitat 2.0: Training Home Assistants to Rearrange their Habitat

    Authors: Andrew Szot, Alex Clegg, Eric Undersander, Erik Wijmans, Yili Zhao, John Turner, Noah Maestre, Mustafa Mukadam, Devendra Chaplot, Oleksandr Maksymets, Aaron Gokaslan, Vladimir Vondrus, Sameer Dharur, Franziska Meier, Wojciech Galuba, Angel Chang, Zsolt Kira, Vladlen Koltun, Jitendra Malik, Manolis Savva, Dhruv Batra

    Abstract: We introduce Habitat 2.0 (H2.0), a simulation platform for training virtual robots in interactive 3D environments and complex physics-enabled scenarios. We make comprehensive contributions to all levels of the embodied AI stack - data, simulation, and benchmark tasks. Specifically, we present: (i) ReplicaCAD: an artist-authored, annotated, reconfigurable 3D dataset of apartments (matching real spa…

    Submitted 1 July, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

  9. arXiv:2104.04112

    cs.CV cs.RO

    Auxiliary Tasks and Exploration Enable ObjectNav

    Authors: Joel Ye, Dhruv Batra, Abhishek Das, Erik Wijmans

    Abstract: ObjectGoal Navigation (ObjectNav) is an embodied task wherein agents are to navigate to an object instance in an unseen environment. Prior works have shown that end-to-end ObjectNav agents that use vanilla visual and recurrent modules, e.g. a CNN+RNN, perform poorly due to overfitting and sample inefficiency. This has motivated current state-of-the-art methods to mix analytic and learned component…

    Submitted 2 August, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

  10. arXiv:2103.07013

    cs.LG cs.AI cs.CV cs.GR

    Large Batch Simulation for Deep Reinforcement Learning

    Authors: Brennan Shacklett, Erik Wijmans, Aleksei Petrenko, Manolis Savva, Dhruv Batra, Vladlen Koltun, Kayvon Fatahalian

    Abstract: We accelerate deep reinforcement learning-based training in visually complex 3D environments by two orders of magnitude over prior work, realizing end-to-end training speeds of over 19,000 frames of experience per second on a single GPU and up to 72,000 frames per second on a single eight-GPU machine. The key idea of our approach is to design a 3D renderer and embodied navigation simulator around…

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Published as a conference paper at ICLR 2021

  11. arXiv:2012.06117

    cs.CV cs.AI cs.LG cs.RO

    How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget

    Authors: Erik Wijmans, Irfan Essa, Dhruv Batra

    Abstract: PointGoal navigation has seen significant recent interest and progress, spurred on by the Habitat platform and associated challenge. In this paper, we study PointGoal navigation under both a sample budget (75 million frames) and a compute budget (1 GPU for 1 day). We conduct an extensive set of experiments, cumulatively totaling over 50,000 GPU-hours, that let us identify and discuss a number of o…

    Submitted 10 December, 2020; originally announced December 2020.

  12. arXiv:2007.09841

    cs.CV cs.RO

    Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation

    Authors: Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh

    Abstract: We introduce a learning-based approach for room navigation using semantic maps. Our proposed architecture learns to predict top-down belief maps of regions that lie beyond the agent's field of view while modeling architectural and stylistic regularities in houses. First, we train a model to generate amodal semantic top-down maps indicating beliefs of location, size, and shape of rooms by learning…

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: Published at the European Conference on Computer Vision, 2020

  13. arXiv:2007.04561

    cs.CV cs.LG cs.RO

    Auxiliary Tasks Speed Up Learning PointGoal Navigation

    Authors: Joel Ye, Dhruv Batra, Erik Wijmans, Abhishek Das

    Abstract: PointGoal Navigation is an embodied task that requires agents to navigate to a specified point in an unseen environment. Wijmans et al. showed that this task is solvable but their method is computationally prohibitive, requiring 2.5 billion frames and 180 GPU-days. In this work, we develop a method to significantly increase sample and time efficiency in learning PointNav using self-supervised auxi…

    Submitted 4 November, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: 8 pages. Accepted to CoRL 2020

  14. arXiv:2006.13171

    cs.CV cs.RO

    ObjectNav Revisited: On Evaluation of Embodied Agents Navigating to Objects

    Authors: Dhruv Batra, Aaron Gokaslan, Aniruddha Kembhavi, Oleksandr Maksymets, Roozbeh Mottaghi, Manolis Savva, Alexander Toshev, Erik Wijmans

    Abstract: We revisit the problem of Object-Goal Navigation (ObjectNav). In its simplest form, ObjectNav is defined as the task of navigating to an object, specified by its label, in an unexplored environment. In particular, the agent is initialized at a random location and pose in an environment and asked to find an instance of an object category, e.g., find a chair, by navigating to it. As the community…

    Submitted 30 August, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

  15. arXiv:2004.02857

    cs.CV cs.CL cs.RO

    Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments

    Authors: Jacob Krantz, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee

    Abstract: We develop a language-guided navigation task set in a continuous 3D environment where agents must execute low-level actions to follow natural language navigation directions. By being situated in continuous environments, this setting lifts a number of assumptions implicit in prior work that represents environments as a sparse graph of panoramas with edges corresponding to navigability. Specifically…

    Submitted 1 May, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

  16. arXiv:2003.05993

    cs.CV cs.AI cs.LG

    Analyzing Visual Representations in Embodied Navigation Tasks

    Authors: Erik Wijmans, Julian Straub, Dhruv Batra, Irfan Essa, Judy Hoffman, Ari Morcos

    Abstract: Recent advances in deep reinforcement learning require a large amount of training data and generally result in representations that are often over specialized to the target task. In this work, we present a methodology to study the underlying potential causes for this specialization. We use the recently proposed projection weighted Canonical Correlation Analysis (PWCCA) to measure the similarity of…

    Submitted 12 March, 2020; originally announced March 2020.

  17. arXiv:1912.06321

    cs.CV cs.AI cs.LG cs.RO

    Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?

    Authors: Abhishek Kadian, Joanne Truong, Aaron Gokaslan, Alexander Clegg, Erik Wijmans, Stefan Lee, Manolis Savva, Sonia Chernova, Dhruv Batra

    Abstract: Does progress in simulation translate to progress on robots? If one method outperforms another in simulation, how likely is that trend to hold in reality on a robot? We examine this question for embodied PointGoal navigation, developing engineering tools and a research paradigm for evaluating a simulator by its sim2real predictivity. First, we develop Habitat-PyRobot Bridge (HaPy), a library for s…

    Submitted 16 August, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Journal ref: IEEE Robotics and Automation Letters (RA-L) 2020

  18. arXiv:1911.00357

    cs.CV cs.AI cs.LG

    DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames

    Authors: Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra

    Abstract: We present Decentralized Distributed Proximal Policy Optimization (DD-PPO), a method for distributed reinforcement learning in resource-intensive simulated environments. DD-PPO is distributed (uses multiple machines), decentralized (lacks a centralized server), and synchronous (no computation is ever stale), making it conceptually simple and easy to implement. In our experiments on training virtua…

    Submitted 19 January, 2020; v1 submitted 1 November, 2019; originally announced November 2019.

  19. arXiv:1906.05797

    cs.CV cs.GR eess.IV

    The Replica Dataset: A Digital Replica of Indoor Spaces

    Authors: Julian Straub, Thomas Whelan, Lingni Ma, Yufan Chen, Erik Wijmans, Simon Green, Jakob J. Engel, Raul Mur-Artal, Carl Ren, Shobhit Verma, Anton Clarkson, Mingfei Yan, Brian Budge, Yajie Yan, Xiaqing Pan, June Yon, Yuyang Zou, Kimberly Leon, Nigel Carter, Jesus Briales, Tyler Gillingham, Elias Mueggler, Luis Pesqueira, Manolis Savva, Dhruv Batra, et al. (5 additional authors not shown)

    Abstract: We introduce Replica, a dataset of 18 highly photo-realistic 3D indoor scene reconstructions at room and building scale. Each scene consists of a dense mesh, high-resolution high-dynamic-range (HDR) textures, per-primitive semantic class and instance information, and planar mirror and glass reflectors. The goal of Replica is to enable machine learning (ML) research that relies on visually, geometr…

    Submitted 13 June, 2019; originally announced June 2019.

  20. arXiv:1904.03461

    cs.CV cs.AI cs.CL cs.LG

    Embodied Question Answering in Photorealistic Environments with Point Cloud Perception

    Authors: Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra

    Abstract: To help bridge the gap between internet vision-style problems and the goal of vision for embodied perception we instantiate a large-scale navigation task -- Embodied Question Answering [1] in photo-realistic environments (Matterport 3D). We thoroughly study navigation policies that utilize 3D point clouds, RGB images, or their combination. Our analysis of these models reveals several key findings.…

    Submitted 6 April, 2019; originally announced April 2019.

  21. arXiv:1904.01201

    cs.CV cs.AI cs.CL cs.LG cs.RO

    Habitat: A Platform for Embodied AI Research

    Authors: Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra

    Abstract: We present Habitat, a platform for research in embodied artificial intelligence (AI). Habitat enables training embodied agents (virtual robots) in highly efficient photorealistic 3D simulation. Specifically, Habitat consists of: (i) Habitat-Sim: a flexible, high-performance 3D simulator with configurable agents, sensors, and generic 3D dataset handling. Habitat-Sim is fast -- when rendering a scen…

    Submitted 24 November, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: ICCV 2019

  22. Exploiting 2D Floorplan for Building-scale Panorama RGBD Alignment

    Authors: Erik Wijmans, Yasutaka Furukawa

    Abstract: This paper presents a novel algorithm that utilizes a 2D floorplan to align panorama RGBD scans. While effective panorama RGBD alignment techniques exist, such a system requires extremely dense RGBD image sampling. Our approach can significantly reduce the number of necessary scans with the aid of a floorplan image. We formulate a novel Markov Random Field inference problem as a scan placement ove…

    Submitted 8 December, 2016; originally announced December 2016.