Showing 1–37 of 37 results for author: Handa, A

  1. arXiv:2407.08028  [pdf, other]

    cs.RO

    AutoMate: Specialist and Generalist Assembly Policies over Diverse Geometries

    Authors: Bingjie Tang, Iretiayo Akinola, Jie Xu, Bowen Wen, Ankur Handa, Karl Van Wyk, Dieter Fox, Gaurav S. Sukhatme, Fabio Ramos, Yashraj Narang

    Abstract: Robotic assembly for high-mixture settings requires adaptivity to diverse parts and poses, which is an open challenge. Meanwhile, in other areas of robotics, large models and sim-to-real have led to tremendous progress. Inspired by such work, we present AutoMate, a learning framework and system that consists of 4 parts: 1) a dataset of 100 assemblies compatible with simulation and the real world,…

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2407.02274  [pdf, other]

    cs.RO

    DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics

    Authors: Tyler Ga Wei Lum, Martin Matak, Viktor Makoviychuk, Ankur Handa, Arthur Allshire, Tucker Hermans, Nathan D. Ratliff, Karl Van Wyk

    Abstract: A pivotal challenge in robotics is achieving fast, safe, and robust dexterous grasping across a diverse range of objects, an important goal within industrial applications. However, existing methods often have very limited speed, dexterity, and generality, along with limited or no hardware safety guarantees. In this work, we introduce DextrAH-G, a depth-based dexterous grasping policy trained entir…

    Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2405.02250  [pdf, other]

    cs.RO

    Geometric Fabrics: a Safe Guiding Medium for Policy Learning

    Authors: Karl Van Wyk, Ankur Handa, Viktor Makoviychuk, Yijie Guo, Arthur Allshire, Nathan D. Ratliff

    Abstract: Robotics policies are always subjected to complex, second order dynamics that entangle their actions with resulting states. In reinforcement learning (RL) contexts, policies have the burden of deciphering these complicated interactions over massive amounts of experience and complex reward functions to learn how to accomplish tasks. Moreover, policies typically issue actions directly to controllers…

    Submitted 3 May, 2024; originally announced May 2024.

  4. arXiv:2404.03336  [pdf, other]

    cs.RO

    Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation

    Authors: Asad Ali Shahid, Yashraj Narang, Vincenzo Petrone, Enrico Ferrentino, Ankur Handa, Dieter Fox, Marco Pavone, Loris Roveda

    Abstract: In recent years, deep reinforcement learning (RL) has shown its effectiveness in solving complex continuous control tasks like locomotion and dexterous manipulation. However, this comes at the cost of an enormous amount of experience required for training, exacerbated by the sensitivity of learning efficiency and the policy performance to hyperparameter selection, which often requires numerous tri…

    Submitted 24 June, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Submitted for publication to IEEE-RAS 23rd International Conference on Humanoid Robots

  5. arXiv:2310.17274  [pdf, other]

    cs.RO cs.AR cs.DC

    cuRobo: Parallelized Collision-Free Minimum-Jerk Robot Motion Generation

    Authors: Balakumar Sundaralingam, Siva Kumar Sastry Hari, Adam Fishman, Caelan Garrett, Karl Van Wyk, Valts Blukis, Alexander Millane, Helen Oleynikova, Ankur Handa, Fabio Ramos, Nathan Ratliff, Dieter Fox

    Abstract: This paper explores the problem of collision-free motion generation for manipulators by formulating it as a global motion optimization problem. We develop a parallel optimization technique to solve this problem and demonstrate its effectiveness on massively parallel GPUs. We show that combining simple optimization techniques with many parallel seeds leads to solving difficult motion generation pro…

    Submitted 3 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: revised technical report, 62 pages, Website: https://curobo.org

  6. arXiv:2305.17110  [pdf, other]

    cs.RO

    IndustReal: Transferring Contact-Rich Assembly Tasks from Simulation to Reality

    Authors: Bingjie Tang, Michael A. Lin, Iretiayo Akinola, Ankur Handa, Gaurav S. Sukhatme, Fabio Ramos, Dieter Fox, Yashraj Narang

    Abstract: Robotic assembly is a longstanding challenge, requiring contact-rich interaction and high precision and accuracy. Many applications also require adaptivity to diverse parts, poses, and environments, as well as low cycle times. In other areas of robotics, simulation is a powerful tool to develop algorithms, generate datasets, and train agents. However, simulation has had a more limited impact on as…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to Robotics: Science and Systems (RSS) 2023

  7. arXiv:2305.16309  [pdf, other]

    cs.RO cs.CV cs.LG

    Imitating Task and Motion Planning with Visuomotor Transformers

    Authors: Murtaza Dalal, Ajay Mandlekar, Caelan Garrett, Ankur Handa, Ruslan Salakhutdinov, Dieter Fox

    Abstract: Imitation learning is a powerful tool for training robot manipulation policies, allowing them to learn from expert demonstrations without manual programming or trial-and-error. However, common methods of data collection, such as human supervision, scale poorly, as they are time-consuming and labor-intensive. In contrast, Task and Motion Planning (TAMP) can autonomously generate large-scale dataset…

    Submitted 17 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Conference on Robot Learning (CoRL) 2023. 8 pages, 5 figures, 2 tables; 11 pages appendix (10 additional figures)

  8. arXiv:2305.12127  [pdf, other]

    cs.RO cs.AI

    DexPBT: Scaling up Dexterous Manipulation for Hand-Arm Systems with Population Based Training

    Authors: Aleksei Petrenko, Arthur Allshire, Gavriel State, Ankur Handa, Viktor Makoviychuk

    Abstract: In this work, we propose algorithms and methods that enable learning dexterous object manipulation using simulated one- or two-armed robots equipped with multi-fingered hand end-effectors. Using a parallel GPU-accelerated physics simulator (Isaac Gym), we implement challenging tasks for these robots, including regrasping, grasp-and-throw, and object reorientation. To solve these problems we introd…

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Published in RSS2023

  9. arXiv:2210.13702  [pdf, other]

    cs.RO cs.LG

    DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to Reality

    Authors: Ankur Handa, Arthur Allshire, Viktor Makoviychuk, Aleksei Petrenko, Ritvik Singh, Jingzhou Liu, Denys Makoviichuk, Karl Van Wyk, Alexander Zhurkevich, Balakumar Sundaralingam, Yashraj Narang, Jean-Francois Lafleche, Dieter Fox, Gavriel State

    Abstract: Recent work has demonstrated the ability of deep reinforcement learning (RL) algorithms to learn complex robotic behaviours in simulation, including in the domain of multi-fingered manipulation. However, such models can be challenging to transfer to the real world due to the gap between simulation and reality. In this paper, we present our techniques to train a) a policy that can perform robust de…

    Submitted 2 January, 2024; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: 28 pages. A smaller version of this paper is accepted to ICRA 2023

  10. arXiv:2205.03532  [pdf, other]

    cs.RO cs.GR cs.LG

    Factory: Fast Contact for Robotic Assembly

    Authors: Yashraj Narang, Kier Storey, Iretiayo Akinola, Miles Macklin, Philipp Reist, Lukasz Wawrzyniak, Yunrong Guo, Adam Moravanszky, Gavriel State, Michelle Lu, Ankur Handa, Dieter Fox

    Abstract: Robotic assembly is one of the oldest and most challenging applications of robotics. In other areas of robotics, such as perception and grasping, simulation has rapidly accelerated research progress, particularly when combined with modern deep learning. However, accurately, efficiently, and robustly simulating the range of contact-rich interactions in assembly remains a longstanding challenge. In…

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted to Robotics: Science and Systems (RSS) 2022

  11. arXiv:2112.05129  [pdf, other]

    cs.RO

    Assistive Tele-op: Leveraging Transformers to Collect Robotic Task Demonstrations

    Authors: Henry M. Clever, Ankur Handa, Hammad Mazhar, Kevin Parker, Omer Shapira, Qian Wan, Yashraj Narang, Iretiayo Akinola, Maya Cakmak, Dieter Fox

    Abstract: Sharing autonomy between robots and human operators could facilitate data collection of robotic task demonstrations to continuously improve learned models. Yet, the means to communicate intent and reason about the future are disparate between humans and robots. We present Assistive Tele-op, a virtual reality (VR) system for collecting robot task demonstrations that displays an autonomous trajector…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: 9 pages, 4 figures, 1 table. NeurIPS 2021 Workshop on Robot Learning: Self-Supervised and Lifelong Learning, Virtual

  12. arXiv:2108.10470  [pdf, other]

    cs.RO cs.LG

    Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning

    Authors: Viktor Makoviychuk, Lukasz Wawrzyniak, Yunrong Guo, Michelle Lu, Kier Storey, Miles Macklin, David Hoeller, Nikita Rudin, Arthur Allshire, Ankur Handa, Gavriel State

    Abstract: Isaac Gym offers a high performance learning platform to train policies for a wide variety of robotics tasks directly on GPU. Both physics simulation and the neural network policy training reside on GPU and communicate by directly passing data from physics buffers to PyTorch tensors without ever going through any CPU bottlenecks. This leads to blazing fast training times for complex robotics tasks o…

    Submitted 25 August, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: tech report on isaac-gym

  13. Transferring Dexterous Manipulation from GPU Simulation to a Remote Real-World TriFinger

    Authors: Arthur Allshire, Mayank Mittal, Varun Lodaya, Viktor Makoviychuk, Denys Makoviichuk, Felix Widmaier, Manuel Wüthrich, Stefan Bauer, Ankur Handa, Animesh Garg

    Abstract: We present a system for learning a challenging dexterous manipulation task involving moving a cube to an arbitrary 6-DoF pose with only 3 fingers, trained with NVIDIA's IsaacGym simulator. We show empirical benefits, both in simulation and sim-to-real transfer, of using keypoints as opposed to position+quaternion representations for the object pose in 6-DoF for policy observations and in reward cal…

    Submitted 20 October, 2022; v1 submitted 22 August, 2021; originally announced August 2021.

    Comments: International Conference on Intelligent Robots and Systems (IROS 2022)

  14. arXiv:2104.04631  [pdf, other]

    cs.CV

    DexYCB: A Benchmark for Capturing Hand Grasping of Objects

    Authors: Yu-Wei Chao, Wei Yang, Yu Xiang, Pavlo Molchanov, Ankur Handa, Jonathan Tremblay, Yashraj S. Narang, Karl Van Wyk, Umar Iqbal, Stan Birchfield, Jan Kautz, Dieter Fox

    Abstract: We introduce DexYCB, a new dataset for capturing hand grasping of objects. We first compare DexYCB with a related one through cross-dataset evaluation. We then present a thorough benchmark of state-of-the-art approaches on three relevant tasks: 2D object and keypoint detection, 6D object pose estimation, and 3D hand pose estimation. Finally, we evaluate a new robotics-relevant task: generating saf…

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  15. arXiv:2012.03806  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop

    Authors: Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White

    Abstract: This report presents the debates, posters, and discussions of the Sim2Real workshop held in conjunction with the 2020 edition of the "Robotics: Science and Systems" conference. Twelve leaders of the field took competing debate positions on the definition, viability, and importance of transferring skills from simulation to the real world in the context of robotics problems. The debaters also joined…

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: Summary of the "2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics" held in conjunction with "Robotics: Science and Systems 2020". Website: https://sim2real.github.io/

  16. arXiv:2011.08985  [pdf, other]

    cs.LG cs.RO

    A User's Guide to Calibrating Robotics Simulators

    Authors: Bhairav Mehta, Ankur Handa, Dieter Fox, Fabio Ramos

    Abstract: Simulators are a critical component of modern robotics research. Strategies for both perception and decision making can be studied in simulation first before being deployed to real-world systems, saving time and costs. Despite significant progress on the development of sim-to-real algorithms, the analysis of different methods is still conducted in an ad-hoc manner, without a consistent set of tests a…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Accepted at Conference on Robot Learning 2020

  17. Model-Based Generalization Under Parameter Uncertainty Using Path Integral Control

    Authors: Ian Abraham, Ankur Handa, Nathan Ratliff, Kendall Lowrey, Todd D. Murphey, Dieter Fox

    Abstract: This work addresses the problem of robot interaction in complex environments where online control and adaptation is necessary. By expanding the sample space in the free energy formulation of path integral control, we derive a natural extension to the path integral control that embeds uncertainty into action and provides robustness for model-based robot planning. Our algorithm is applied to a diver…

    Submitted 4 June, 2020; originally announced June 2020.

    Journal ref: IEEE Robotics and Automation Letters (Volume: 5, Issue: 2, April 2020)

  18. arXiv:2003.01223  [pdf, other]

    eess.IV cs.CV physics.med-ph

    A Deep learning Approach to Generate Contrast-Enhanced Computerised Tomography Angiography without the Use of Intravenous Contrast Agents

    Authors: Anirudh Chandrashekar, Ashok Handa, Natesh Shivakumar, Pierfrancesco Lapolla, Vicente Grau, Regent Lee

    Abstract: Contrast-enhanced computed tomography angiograms (CTAs) are widely used in cardiovascular imaging to obtain a non-invasive view of arterial structures. However, contrast agents are associated with complications at the injection site as well as renal toxicity leading to contrast-induced nephropathy (CIN) and renal failure. We hypothesised that the raw data acquired from a non-contrast CT contains s…

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 7 Pages, 6 Figures

  19. arXiv:2002.12160  [pdf, other]

    cs.RO

    In-Hand Object Pose Tracking via Contact Feedback and GPU-Accelerated Robotic Simulation

    Authors: Jacky Liang, Ankur Handa, Karl Van Wyk, Viktor Makoviychuk, Oliver Kroemer, Dieter Fox

    Abstract: Tracking the pose of an object while it is being held and manipulated by a robot hand is difficult for vision-based methods due to significant occlusions. Prior works have explored using contact feedback and particle filters to localize in-hand objects. However, they have mostly focused on the static grasp setting and not when the object is in motion, as doing so requires modeling of complex conta…

    Submitted 5 November, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: Accepted to the International Conference on Robotics and Automation (ICRA) 2020

  20. arXiv:2002.03463  [pdf]

    eess.IV cs.CV physics.med-ph

    A Deep Learning Approach to Automate High-Resolution Blood Vessel Reconstruction on Computerized Tomography Images With or Without the Use of Contrast Agent

    Authors: Anirudh Chandrashekar, Ashok Handa, Natesh Shivakumar, Pierfrancesco Lapolla, Vicente Grau, Regent Lee

    Abstract: Existing methods to reconstruct vascular structures from a computed tomography (CT) angiogram rely on injection of intravenous contrast to enhance the radio-density within the vessel lumen. However, pathological changes can be present in the blood lumen, vessel wall or a combination of both that prevent accurate reconstruction. In the example of aortic aneurysmal disease, a blood clot or thrombus…

    Submitted 9 February, 2020; originally announced February 2020.

    Comments: 18 pages, 10 figures, 7 tables

  21. arXiv:2001.02153  [pdf, other]

    cs.LG cs.RO stat.ML

    Information Theoretic Model Predictive Q-Learning

    Authors: Mohak Bhardwaj, Ankur Handa, Dieter Fox, Byron Boots

    Abstract: Model-free Reinforcement Learning (RL) works well when experience can be collected cheaply and model-based RL is effective when system dynamics can be modeled accurately. However, both assumptions can be violated in real world problems such as robotics, where querying the system can be expensive and real-world dynamics can be difficult to model. In contrast to RL, Model Predictive Control (MPC) al…

    Submitted 5 May, 2020; v1 submitted 30 December, 2019; originally announced January 2020.

    Comments: Extended version (15 pages) of paper accepted at the 2nd Learning for Dynamics and Control (L4DC) Conference, 2020

  22. arXiv:1910.03135  [pdf, other]

    cs.CV cs.LG cs.RO

    DexPilot: Vision Based Teleoperation of Dexterous Robotic Hand-Arm System

    Authors: Ankur Handa, Karl Van Wyk, Wei Yang, Jacky Liang, Yu-Wei Chao, Qian Wan, Stan Birchfield, Nathan Ratliff, Dieter Fox

    Abstract: Teleoperation offers the possibility of imparting robotic systems with sophisticated reasoning skills, intuition, and creativity to perform tasks. However, current teleoperation solutions for high degree-of-actuation (DoA), multi-fingered robots are generally cost-prohibitive, while low-cost offerings usually provide reduced degrees of control. Herein, a low-cost, vision based teleoperation system…

    Submitted 14 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: 17 pages, first version of DexPilot

  23. arXiv:1904.03754  [pdf, other]

    cs.RO cs.CV

    ContactGrasp: Functional Multi-finger Grasp Synthesis from Contact

    Authors: Samarth Brahmbhatt, Ankur Handa, James Hays, Dieter Fox

    Abstract: Grasping and manipulating objects is an important human skill. Since most objects are designed to be manipulated by human hands, anthropomorphic hands can enable richer human-robot interaction. Desirable grasps are not only stable, but also functional: they enable post-grasp actions with the object. However, functional grasp synthesis for high degree-of-freedom anthropomorphic hands from object sh…

    Submitted 25 July, 2019; v1 submitted 7 April, 2019; originally announced April 2019.

    Comments: IROS 2019 camera ready version

  24. Learning Latent Space Dynamics for Tactile Servoing

    Authors: Giovanni Sutanto, Nathan Ratliff, Balakumar Sundaralingam, Yevgen Chebotar, Zhe Su, Ankur Handa, Dieter Fox

    Abstract: To achieve dexterous robotic manipulation, we need to endow our robot with tactile feedback capability, i.e. the ability to drive action based on tactile sensing. In this paper, we specifically address the challenge of tactile servoing, i.e. given the current tactile sensing and a target/goal tactile sensing --memorized from a successful task execution in the past-- what is the action that will…

    Submitted 15 April, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted to be published at the International Conference on Robotics and Automation (ICRA) 2019. The final version for publication at ICRA 2019 is 7 pages (i.e. 6 pages of technical content (including text, figures, tables, acknowledgement, etc.) and 1 page of the Bibliography/References), while this arXiv version is 8 pages (added Appendix and some extra details)

  25. arXiv:1810.06187  [pdf, other]

    cs.RO

    Robust Learning of Tactile Force Estimation through Robot Interaction

    Authors: Balakumar Sundaralingam, Alexander Lambert, Ankur Handa, Byron Boots, Tucker Hermans, Stan Birchfield, Nathan Ratliff, Dieter Fox

    Abstract: Current methods for estimating force from tactile sensor signals are either inaccurate analytic models or task-specific learned models. In this paper, we explore learning a robust model that maps tactile sensor signals to force. We specifically explore learning a mapping for the SynTouch BioTac sensor via neural networks. We propose a voxelized input feature layer for spatial signals and leverage…

    Submitted 5 March, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: accepted to ICRA 2019 (camera ready version)

  26. arXiv:1810.05762  [pdf, other]

    cs.RO

    GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning

    Authors: Jacky Liang, Viktor Makoviychuk, Ankur Handa, Nuttapong Chentanez, Miles Macklin, Dieter Fox

    Abstract: Most Deep Reinforcement Learning (Deep RL) algorithms require a prohibitively large number of training samples for learning complex tasks. Many recent works on speeding up Deep RL have focused on distributed training and simulation. While distributed training is often done on the GPU, simulation is not. In this work, we propose using GPU-accelerated RL simulations as an alternative to CPU ones. Us…

    Submitted 24 October, 2018; v1 submitted 12 October, 2018; originally announced October 2018.

    Comments: Accepted and to appear at the Conference on Robot Learning (CoRL) 2018

  27. arXiv:1810.05687  [pdf, other]

    cs.RO cs.LG

    Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience

    Authors: Yevgen Chebotar, Ankur Handa, Viktor Makoviychuk, Miles Macklin, Jan Issac, Nathan Ratliff, Dieter Fox

    Abstract: We consider the problem of transferring policies to the real world by training on a distribution of simulated scenarios. Rather than manually tuning the randomization of simulations, we adapt the simulation parameter distribution using a few real world roll-outs interleaved with policy training. In doing so, we are able to change the distribution of simulations to improve the policy transfer by ma…

    Submitted 5 March, 2019; v1 submitted 12 October, 2018; originally announced October 2018.

  28. arXiv:1710.06425  [pdf, other]

    cs.RO cs.LG

    Domain Randomization and Generative Models for Robotic Grasping

    Authors: Joshua Tobin, Lukas Biewald, Rocky Duan, Marcin Andrychowicz, Ankur Handa, Vikash Kumar, Bob McGrew, Jonas Schneider, Peter Welinder, Wojciech Zaremba, Pieter Abbeel

    Abstract: Deep learning-based robotic grasping has made significant progress thanks to algorithmic improvements and increased data availability. However, state-of-the-art models are often trained on as few as hundreds or thousands of unique object instances, and as a result generalization can be a challenge. In this work, we explore a novel data generation pipeline for training a deep neural network to pe…

    Submitted 3 April, 2018; v1 submitted 17 October, 2017; originally announced October 2017.

    Comments: 8 pages, 11 figures. Submitted to 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018)

  29. arXiv:1705.08260  [pdf]

    cs.CV cs.RO

    Self-Supervised Siamese Learning on Stereo Image Pairs for Depth Estimation in Robotic Surgery

    Authors: Menglong Ye, Edward Johns, Ankur Handa, Lin Zhang, Philip Pratt, Guang-Zhong Yang

    Abstract: Robotic surgery has become a powerful tool for performing minimally invasive procedures, providing advantages in dexterity, precision, and 3D vision, over traditional surgery. One popular robotic system is the da Vinci surgical platform, which allows preoperative information to be incorporated into live procedures using Augmented Reality (AR). Scene depth estimation is a prerequisite for AR, as ac…

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: A two-page short report to be presented at the Hamlyn Symposium on Medical Robotics 2017. An extension of this work is in progress

  30. arXiv:1612.05079  [pdf, other]

    cs.CV

    SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth

    Authors: John McCormac, Ankur Handa, Stefan Leutenegger, Andrew J. Davison

    Abstract: We introduce SceneNet RGB-D, expanding the previous work of SceneNet to enable large scale photorealistic rendering of indoor scene trajectories. It provides pixel-perfect ground truth for scene understanding problems such as semantic segmentation, instance segmentation, and object detection, and also for geometric computer vision problems such as optical flow, depth estimation, camera pose estima…

    Submitted 30 January, 2017; v1 submitted 15 December, 2016; originally announced December 2016.

  31. arXiv:1609.05130  [pdf, other]

    cs.CV

    SemanticFusion: Dense 3D Semantic Mapping with Convolutional Neural Networks

    Authors: John McCormac, Ankur Handa, Andrew Davison, Stefan Leutenegger

    Abstract: Ever more robust, accurate and detailed mapping using visual sensing has proven to be an enabling factor for mobile robots across a wide variety of applications. For the next level of robot intelligence and intuitive user interaction, maps need to extend beyond geometry and appearance - they need to contain semantics. We address this challenge by combining Convolutional Neural Networks (CNNs) and a s…

    Submitted 28 September, 2016; v1 submitted 16 September, 2016; originally announced September 2016.

  32. arXiv:1607.07405  [pdf, other]

    cs.CV cs.LG

    gvnn: Neural Network Library for Geometric Computer Vision

    Authors: Ankur Handa, Michael Bloesch, Viorica Patraucean, Simon Stent, John McCormac, Andrew Davison

    Abstract: We introduce gvnn, a neural network library in Torch aimed towards bridging the gap between classic geometric computer vision and deep learning. Inspired by the recent success of Spatial Transformer Networks, we propose several new layers which are often used as parametric transformations on the data in geometric computer vision. These layers can be inserted within a neural network much in the spi…

    Submitted 12 August, 2016; v1 submitted 25 July, 2016; originally announced July 2016.

    Comments: Submitted to ECCV Workshop on Deep Geometry

  33. arXiv:1604.00895  [pdf, other]

    cs.CV

    HDRFusion: HDR SLAM using a low-cost auto-exposure RGB-D sensor

    Authors: Shuda Li, Ankur Handa, Yang Zhang, Andrew Calway

    Abstract: We describe a new method for comparing frame appearance in a frame-to-model 3-D mapping and tracking system using a low dynamic range (LDR) RGB-D camera which is robust to brightness changes caused by auto exposure. It is based on a normalised radiance measure which is invariant to exposure changes and not only robustifies the tracking under changing lighting conditions, but also enables the foll…

    Submitted 4 April, 2016; originally announced April 2016.

    Comments: 14 pages

  34. arXiv:1511.07041  [pdf, other]

    cs.CV

    SceneNet: Understanding Real World Indoor Scenes With Synthetic Data

    Authors: Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla

    Abstract: Scene understanding is a prerequisite to many high level tasks for any automated intelligent machine operating in real world environments. Recent attempts with supervised learning have shown promise in this direction but also highlighted the need for an enormous quantity of supervised data --- performance increases in proportion to the amount of data used. However, this quickly becomes prohibitive wh…

    Submitted 26 November, 2015; v1 submitted 22 November, 2015; originally announced November 2015.

  35. arXiv:1511.06309  [pdf, other]

    cs.LG cs.CV

    Spatio-temporal video autoencoder with differentiable memory

    Authors: Viorica Patraucean, Ankur Handa, Roberto Cipolla

    Abstract: We describe a new spatio-temporal video autoencoder, based on a classic spatial image autoencoder and a novel nested temporal autoencoder. The temporal encoder is represented by a differentiable visual memory composed of convolutional long short-term memory (LSTM) cells that integrate changes over time. Here we target motion changes and use as temporal decoder a robust optical flow prediction modu…

    Submitted 1 September, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: The experiments section has been extended and a direct application to weakly-supervised video segmentation through label propagation has been included

  36. arXiv:1505.07293  [pdf, other]

    cs.CV

    SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling

    Authors: Vijay Badrinarayanan, Ankur Handa, Roberto Cipolla

    Abstract: We propose a novel deep architecture, SegNet, for semantic pixel-wise image labelling. SegNet has several attractive properties: (i) it only requires forward evaluation of a fully learnt function to obtain smooth label predictions, (ii) with increasing depth, a larger context is considered for pixel labelling which improves accuracy, and (iii) it is easy to visualise the effect of feature activati…

    Submitted 27 May, 2015; originally announced May 2015.

    Comments: This version was first submitted to CVPR' 15 on November 14, 2014 with paper Id 1468. A similar architecture was proposed more recently on May 17, 2015, see http://arxiv.org/pdf/1505.04366.pdf

  37. arXiv:1505.00171  [pdf, other]

    cs.CV

    SynthCam3D: Semantic Understanding With Synthetic Indoor Scenes

    Authors: Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla

    Abstract: We are interested in automatic scene understanding from geometric cues. To this end, we aim to bring semantic segmentation in the loop of real-time reconstruction. Our semantic segmentation is built on a deep autoencoder stack trained exclusively on synthetic depth data generated from our novel 3D scene library, SynthCam3D. Importantly, our network is able to segment real world scenes without any…

    Submitted 1 May, 2015; originally announced May 2015.