Skip to main content

Showing 1–50 of 69 results for author: Posner, I

  1. arXiv:2407.05560  [pdf, other

    cs.RO

    A Review of Differentiable Simulators

    Authors: Rhys Newbury, Jack Collins, Kerry He, Jiahe Pan, Ingmar Posner, David Howard, Akansel Cosgun

    Abstract: Differentiable simulators continue to push the state of the art across a range of domains including computational physics, robotics, and machine learning. Their main value is the ability to compute gradients of physical processes, which allows differentiable simulators to be readily integrated into commonly employed gradient-based optimization schemes. To achieve this, a number of design decisions… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted to IEEE Access

  2. arXiv:2405.19452  [pdf, other

    cs.RO cs.LG

    Gaitor: Learning a Unified Representation Across Gaits for Real-World Quadruped Locomotion

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Aristotelis Papatheodorou, Ioannis Havoutis, Ingmar Posner

    Abstract: The current state-of-the-art in quadruped locomotion is able to produce robust motion for terrain traversal but requires the segmentation of a desired robot trajectory into a discrete set of locomotion skills such as trot and crawl. In contrast, in this work we demonstrate the feasibility of learning a single, unified representation for quadruped locomotion enabling continuous blending between gai… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures, 2 tables

  3. arXiv:2404.15109  [pdf, other

    cs.LG

    Compete and Compose: Learning Independent Mechanisms for Modular World Models

    Authors: Anson Lei, Frederik Nolte, Bernhard Schölkopf, Ingmar Posner

    Abstract: We present COmpetitive Mechanisms for Efficient Transfer (COMET), a modular world model which leverages reusable, independent mechanisms across different environments. COMET is trained on multiple environments with varying dynamics via a two-step process: competition and composition. This enables the model to recognise and learn transferable mechanisms. Specifically, in the competition phase, COME… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2403.12861  [pdf, other

    cs.RO cs.LG

    D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation

    Authors: Jun Yamada, Shaohong Zhong, Jack Collins, Ingmar Posner

    Abstract: Mastering dexterous robotic manipulation of deformable objects is vital for overcoming the limitations of parallel grippers in real-world applications. Current trajectory optimisation approaches often struggle to solve such tasks due to the large search space and the limited task information available from a cost function. In this work, we propose D-Cubed, a novel trajectory optimisation method us… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: https://applied-ai-lab.github.io/D-cubed/

  5. arXiv:2402.16308  [pdf, other

    cs.RO

    DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer

    Authors: Yizhe Wu, Haitz Sáez de Ocáriz Borde, Jack Collins, Oiwi Parker Jones, Ingmar Posner

    Abstract: 3D scene understanding for robotic applications exhibits a unique set of requirements including real-time inference, object-centric latent representation learning, accurate 6D pose estimation and 3D reconstruction of objects. Current methods for scene understanding typically rely on a combination of trained models paired with either an explicit or learnt volumetric representation, all of which hav… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2312.08533  [pdf, other

    cs.LG cs.AI

    World Models via Policy-Guided Trajectory Diffusion

    Authors: Marc Rigter, Jun Yamada, Ingmar Posner

    Abstract: World models are a powerful tool for developing intelligent agents. By predicting the outcome of a sequence of actions, world models enable policies to be optimised via on-policy reinforcement learning (RL) using synthetic data, i.e. in "in imagination". Existing world models are autoregressive in that they interleave predicting the next state with sampling the next action from the policy. Predict… ▽ More

    Submitted 27 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Published in TMLR, March 2024

  7. arXiv:2311.03622  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    TWIST: Teacher-Student World Model Distillation for Efficient Sim-to-Real Transfer

    Authors: Jun Yamada, Marc Rigter, Jack Collins, Ingmar Posner

    Abstract: Model-based RL is a promising approach for real-world robotics due to its improved sample efficiency and generalization capabilities compared to model-free RL. However, effective model-based RL solutions for vision-based real-world applications require bridging the sim-to-real gap for any world model learnt. Due to its significant computational cost, standard domain randomisation does not provide… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 7 pages, 6 figures

  8. arXiv:2309.05678  [pdf, other

    cs.LG

    Gromov-Hausdorff Distances for Comparing Product Manifolds of Model Spaces

    Authors: Haitz Saez de Ocariz Borde, Alvaro Arroyo, Ismael Morales, Ingmar Posner, Xiaowen Dong

    Abstract: Recent studies propose enhancing machine learning models by aligning the geometric characteristics of the latent space with the underlying data structure. Instead of relying solely on Euclidean space, researchers have suggested using hyperbolic and spherical spaces with constant curvature, or their combinations (known as product manifolds), to improve model performance. However, there exists no pr… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.04810

  9. arXiv:2309.04810  [pdf, other

    cs.LG stat.ML

    Neural Latent Geometry Search: Product Manifold Inference via Gromov-Hausdorff-Informed Bayesian Optimization

    Authors: Haitz Saez de Ocariz Borde, Alvaro Arroyo, Ismael Morales, Ingmar Posner, Xiaowen Dong

    Abstract: Recent research indicates that the performance of machine learning models can be improved by aligning the geometry of the latent space with the underlying data structure. Rather than relying solely on Euclidean space, researchers have proposed using hyperbolic and spherical spaces with constant curvature, or combinations thereof, to better model the latent space and enhance model performance. Howe… ▽ More

    Submitted 27 October, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

  10. arXiv:2306.15410  [pdf, other

    cs.CV

    AutoGraph: Predicting Lane Graphs from Traffic Observations

    Authors: Jannik Zürn, Ingmar Posner, Wolfram Burgard

    Abstract: Lane graph estimation is a long-standing problem in the context of autonomous driving. Previous works aimed at solving this problem by relying on large-scale, hand-annotated lane graphs, introducing a data bottleneck for training models to solve this task. To overcome this limitation, we propose to use the motion patterns of traffic participants as lane graph annotations. In our AutoGraph approach… ▽ More

    Submitted 10 November, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 8 pages, 6 figures

  11. arXiv:2306.09205  [pdf, other

    cs.LG

    Reward-Free Curricula for Training Robust World Models

    Authors: Marc Rigter, Minqi Jiang, Ingmar Posner

    Abstract: There has been a recent surge of interest in developing generally-capable agents that can adapt to new tasks without additional training in the environment. Learning world models from reward-free exploration is a promising approach, and enables policies to be trained using imagined experience for new tasks. However, achieving a general agent requires robustness across different environments. In th… ▽ More

    Submitted 24 January, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ICLR 2024

  12. arXiv:2305.12626  [pdf, other

    cs.RO cs.CV

    You Only Look at One: Category-Level Object Representations for Pose Estimation From a Single Example

    Authors: Walter Goodwin, Ioannis Havoutis, Ingmar Posner

    Abstract: In order to meaningfully interact with the world, robot manipulators must be able to interpret objects they encounter. A critical aspect of this interpretation is pose estimation: inferring quantities that describe the position and orientation of an object in 3D space. Most existing approaches to pose estimation make limiting assumptions, often working only for specific, known object instances, or… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 16 pages, 6 figures, CoRL 2022

  13. RAMP: A Benchmark for Evaluating Robotic Assembly Manipulation and Planning

    Authors: Jack Collins, Mark Robson, Jun Yamada, Mohan Sridharan, Karol Janik, Ingmar Posner

    Abstract: We introduce RAMP, an open-source robotics benchmark inspired by real-world industrial assembly tasks. RAMP consists of beams that a robot must assemble into specified goal configurations using pegs as fasteners. As such, it assesses planning and execution capabilities, and poses challenges in perception, reasoning, manipulation, diagnostics, fault recovery, and goal parsing. RAMP has been designe… ▽ More

    Submitted 8 November, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Project website: https://sites.google.com/oxfordrobotics.institute/ramp

  14. arXiv:2303.11754  [pdf, ps, other

    cs.LG

    Projections of Model Spaces for Latent Graph Inference

    Authors: Haitz Sáez de Ocáriz Borde, Álvaro Arroyo, Ingmar Posner

    Abstract: Graph Neural Networks leverage the connectivity structure of graphs as an inductive bias. Latent graph inference focuses on learning an adequate graph structure to diffuse information on and improve the downstream performance of the model. In this work we employ stereographic projections of the hyperbolic and spherical model spaces, as well as products of Riemannian manifolds, for the purpose of l… ▽ More

    Submitted 12 April, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted at the ICLR 2023 Workshop on Physics for Machine Learning

  15. arXiv:2303.03365  [pdf, other

    cs.RO cs.LG

    Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments

    Authors: Jun Yamada, Jack Collins, Ingmar Posner

    Abstract: Data efficiency in robotic skill acquisition is crucial for operating robots in varied small-batch assembly settings. To operate in such environments, robots must have robust obstacle avoidance and versatile goal conditioning acquired from only a few simple demonstrations. Existing approaches, however, fall short of these requirements. Deep reinforcement learning (RL) enables a robot to learn comp… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 8 pages, 5 figures

  16. arXiv:2303.03364  [pdf, other

    cs.RO cs.CV cs.LG

    Leveraging Scene Embeddings for Gradient-Based Motion Planning in Latent Space

    Authors: Jun Yamada, Chia-Man Hung, Jack Collins, Ioannis Havoutis, Ingmar Posner

    Abstract: Motion planning framed as optimisation in structured latent spaces has recently emerged as competitive with traditional methods in terms of planning success while significantly outperforming them in terms of computational speed. However, the real-world applicability of recent work in this domain remains limited by the need to express obstacle information directly in state-space, involving simple g… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Project website: https://amp-ls.github.io/

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2023

  17. arXiv:2302.03086  [pdf, other

    cs.LG cs.AI

    DITTO: Offline Imitation Learning with World Models

    Authors: Branton DeMoss, Paul Duckworth, Nick Hawes, Ingmar Posner

    Abstract: We propose DITTO, an offline imitation learning algorithm which uses world models and on-policy reinforcement learning to addresses the problem of covariate shift, without access to an oracle or any additional online interactions. We discuss how world models enable offline, on-policy imitation learning, and propose a simple intrinsic reward defined in the world model latent space that induces imit… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  18. Reaching Through Latent Space: From Joint Statistics to Path Planning in Manipulation

    Authors: Chia-Man Hung, Shaohong Zhong, Walter Goodwin, Oiwi Parker Jones, Martin Engelcke, Ioannis Havoutis, Ingmar Posner

    Abstract: We present a novel approach to path planning for robotic manipulators, in which paths are produced via iterative optimisation in the latent space of a generative model of robot poses. Constraints are incorporated through the use of constraint satisfaction classifiers operating on the same space. Optimisation leverages gradients through our learned models that provide a simple way to combine goal r… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures, 4 tables

    ACM Class: I.2.6; I.2.9; I.2.10

    Journal ref: IEEE Robotics and Automation Letters 7.2 (2022): 5334-5341

  19. arXiv:2206.11131  [pdf, other

    cs.LG stat.ME

    Variational Causal Dynamics: Discovering Modular World Models from Interventions

    Authors: Anson Lei, Bernhard Schölkopf, Ingmar Posner

    Abstract: Latent world models allow agents to reason about complex environments with high-dimensional observations. However, adapting to new environments and effectively leveraging previous knowledge remain significant challenges. We present variational causal dynamics (VCD), a structured world model that exploits the invariance of causal mechanisms across environments to achieve fast and modular adaptation… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  20. arXiv:2206.03591  [pdf, other

    cs.CV cs.AI

    ObPose: Leveraging Pose for Object-Centric Scene Inference and Generation in 3D

    Authors: Yizhe Wu, Oiwi Parker Jones, Ingmar Posner

    Abstract: We present ObPose, an unsupervised object-centric inference and generation model which learns 3D-structured latent representations from RGB-D scenes. Inspired by prior art in 2D representation learning, ObPose considers a factorised latent space, separately encoding object location (where) and appearance (what). ObPose further leverages an object's pose (i.e. location and orientation), defined via… ▽ More

    Submitted 9 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 14 pages, 4 figures

    MSC Class: 68T07

  21. arXiv:2205.01179  [pdf, other

    cs.RO cs.LG

    VAE-Loco: Versatile Quadruped Locomotion by Learning a Disentangled Gait Representation

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots are able to realise highly dynamic manoeuvres. However, current planners are unable to vary key gait parameters of the in-swing feet midair. In this work we address this limitation and show that it is pivotal in increasing controller robustness by learning a latent space capturing the key stance phases constituting a particular gait… ▽ More

    Submitted 12 July, 2023; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: 16 pages, 13 figures, 1 table, accepted by IEEE Transactions on Robotics (T-RO) as an extended paper. arXiv admin note: substantial text overlap with arXiv:2112.04809

  22. arXiv:2204.03635  [pdf, other

    cs.CV cs.RO

    Zero-Shot Category-Level Object Pose Estimation

    Authors: Walter Goodwin, Sagar Vaze, Ioannis Havoutis, Ingmar Posner

    Abstract: Object pose estimation is an important component of most vision pipelines for embodied agents, as well as in 3D vision more generally. In this paper we tackle the problem of estimating the pose of novel object categories in a zero-shot manner. This extends much of the existing literature by removing the need for pose-labelled datasets or category-specific CAD models for training or inference. Spec… ▽ More

    Submitted 2 October, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: 28 pages, 6 figures

    Journal ref: ECCV 2022

  23. arXiv:2203.00459  [pdf, other

    cs.RO

    Fast-MbyM: Leveraging Translational Invariance of the Fourier Transform for Efficient and Accurate Radar Odometry

    Authors: Robert Weston, Matthew Gadd, Daniele De Martini, Paul Newman, Ingmar Posner

    Abstract: Masking By Moving (MByM), provides robust and accurate radar odometry measurements through an exhaustive correlative search across discretised pose candidates. However, this dense search creates a significant computational bottleneck which hinders real-time performance when high-end GPUs are not available. Utilising the translational invariance of the Fourier Transform, in our approach, f-MByM, we… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: 7 pages

  24. arXiv:2201.08115  [pdf, other

    cs.AI cs.LG cs.RO stat.ML

    Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning

    Authors: Sasha Salter, Kristian Hartikainen, Walter Goodwin, Ingmar Posner

    Abstract: The ability to discover behaviours from past experience and transfer them to new tasks is a hallmark of intelligent agents acting sample-efficiently in the real world. Equipping embodied reinforcement learners with the same ability may be crucial for their successful deployment in robotics. While hierarchical and KL-regularized reinforcement learning individually hold promise here, arguably a hybr… ▽ More

    Submitted 24 April, 2023; v1 submitted 20 January, 2022; originally announced January 2022.

    Journal ref: Published at the International Conference on Learning Representations, 2023

  25. arXiv:2112.04809  [pdf, other

    cs.RO cs.LG

    Next Steps: Learning a Disentangled Gait Representation for Versatile Quadruped Locomotion

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots now routinely traverse a variety of unstructured terrains. However, while gaits can be varied typically by selecting from a range of pre-computed styles, current planners are unable to vary key gait parameters continuously while the robot is in motion. The synthesis, on-the-fly, of gaits with unexpected operational characteristics o… ▽ More

    Submitted 29 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 7 pages, 4 figures, accepted at IEEE International Conference on Robotics and Automation (ICRA), 2022

  26. arXiv:2111.07975  [pdf, other

    cs.RO cs.CV

    Semantically Grounded Object Matching for Robust Robotic Scene Rearrangement

    Authors: Walter Goodwin, Sagar Vaze, Ioannis Havoutis, Ingmar Posner

    Abstract: Object rearrangement has recently emerged as a key competency in robot manipulation, with practical solutions generally involving object detection, recognition, grasping and high-level planning. Goal-images describing a desired scene configuration are a promising and increasingly used mode of instruction. A key outstanding challenge is the accurate inference of matches between objects in front of… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 8 pages, 5 figures

  27. arXiv:2110.15245  [pdf, ps, other

    cs.RO cs.LG

    From Machine Learning to Robotics: Challenges and Opportunities for Embodied Intelligence

    Authors: Nicholas Roy, Ingmar Posner, Tim Barfoot, Philippe Beaudoin, Yoshua Bengio, Jeannette Bohg, Oliver Brock, Isabelle Depatie, Dieter Fox, Dan Koditschek, Tomas Lozano-Perez, Vikash Mansinghka, Christopher Pal, Blake Richards, Dorsa Sadigh, Stefan Schaal, Gaurav Sukhatme, Denis Therien, Marc Toussaint, Michiel Van de Panne

    Abstract: Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains. Consequently, the notion of applying learning methods to a particular problem set has become an established and valuable modus operandi to advance a particular field. In this article we argue that such an approach does not straightforwardly extended to robotics -- or to… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  28. arXiv:2107.01959  [pdf, other

    cs.LG stat.ML

    Universal Approximation of Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Michael A. Osborne, Ingmar Posner

    Abstract: Modelling functions of sets, or equivalently, permutation-invariant functions, is a long-standing challenge in machine learning. Deep Sets is a popular method which is known to be a universal approximator for continuous set functions. We provide a theoretical analysis of Deep Sets which shows that this universal approximation property is only guaranteed if the model's latent space is sufficiently… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 54 pages, 13 figures

  29. arXiv:2105.14895  [pdf, other

    cs.RO

    APEX: Unsupervised, Object-Centric Scene Segmentation and Tracking for Robot Manipulation

    Authors: Yizhe Wu, Oiwi Parker Jones, Martin Engelcke, Ingmar Posner

    Abstract: Recent advances in unsupervised learning for object detection, segmentation, and tracking hold significant promise for applications in robotics. A common approach is to frame these tasks as inference in probabilistic latent-variable models. In this paper, however, we show that the current state-of-the-art struggles with visually complex scenes such as typically encountered in robot manipulation ta… ▽ More

    Submitted 12 September, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: 8 pages, 5 figures

    MSC Class: I.2.9

  30. arXiv:2105.09016  [pdf, other

    cs.LG physics.chem-ph stat.ML

    E(n) Equivariant Normalizing Flows

    Authors: Victor Garcia Satorras, Emiel Hoogeboom, Fabian B. Fuchs, Ingmar Posner, Max Welling

    Abstract: This paper introduces a generative model equivariant to Euclidean symmetries: E(n) Equivariant Normalizing Flows (E-NFs). To construct E-NFs, we take the discriminative E(n) graph neural networks and integrate them as a differential equation to obtain an invertible equivariant function: a continuous-time normalizing flow. We demonstrate that E-NFs considerably outperform baselines and existing met… ▽ More

    Submitted 14 January, 2022; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2021)

  31. arXiv:2104.09958  [pdf, other

    cs.CV cs.LG stat.ML

    GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement

    Authors: Martin Engelcke, Oiwi Parker Jones, Ingmar Posner

    Abstract: Advances in unsupervised learning of object-representations have culminated in the development of a broad range of methods for unsupervised object segmentation and interpretable object-centric scene generation. These methods, however, are limited to simulated and real-world datasets with limited visual complexity. Moreover, object representations are often inferred using RNNs which do not scale we… ▽ More

    Submitted 25 January, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: NeurIPS 2021 camera-ready version; 26 pages, 19 figures

  32. arXiv:2103.11881  [pdf, other

    cs.RO cs.LG

    Introspective Visuomotor Control: Exploiting Uncertainty in Deep Visuomotor Control for Failure Recovery

    Authors: Chia-Man Hung, Li Sun, Yizhe Wu, Ioannis Havoutis, Ingmar Posner

    Abstract: End-to-end visuomotor control is emerging as a compelling solution for robot manipulation tasks. However, imitation learning-based visuomotor control approaches tend to suffer from a common limitation, lacking the ability to recover from an out-of-distribution state caused by compounding errors. In this paper, instead of using tactile feedback or explicitly detecting the failure through vision, we… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 7 pages, 5 figures, 1 table

    ACM Class: I.2.9; I.2.10

  33. arXiv:2102.13419  [pdf, other

    cs.LG stat.ML

    Iterative SE(3)-Transformers

    Authors: Fabian B. Fuchs, Edward Wagstaff, Justas Dauparas, Ingmar Posner

    Abstract: When manipulating three-dimensional data, it is possible to ensure that rotational and translational symmetries are respected by applying so-called SE(3)-equivariant models. Protein structure prediction is a prominent example of a task which displays these symmetries. Recent work in this area has successfully made use of an SE(3)-equivariant model, applying an iterative SE(3)-equivariant attention… ▽ More

    Submitted 16 March, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

  34. arXiv:2011.14389  [pdf, other

    cs.RO cs.CV cs.LG eess.SP

    There and Back Again: Learning to Simulate Radar Data for Real-World Applications

    Authors: Rob Weston, Oiwi Parker Jones, Ingmar Posner

    Abstract: Simulating realistic radar data has the potential to significantly accelerate the development of data-driven approaches to radar processing. However, it is fraught with difficulty due to the notoriously complex image formation process. Here we propose to learn a radar sensor model capable of synthesising faithful radar observations based on simulated elevation maps. In particular, we adopt an adve… ▽ More

    Submitted 29 November, 2020; originally announced November 2020.

    Comments: 6 pages + 2 references

  35. arXiv:2007.06245  [pdf, other

    cs.LG stat.ML

    Reconstruction Bottlenecks in Object-Centric Generative Models

    Authors: Martin Engelcke, Oiwi Parker Jones, Ingmar Posner

    Abstract: A range of methods with suitable inductive biases exist to learn interpretable object-centric representations of images without supervision. However, these are largely restricted to visually simple images; robust object discovery in real-world sensory datasets remains elusive. To increase the understanding of such inductive biases, we empirically investigate the role of "reconstruction bottlenecks… ▽ More

    Submitted 24 November, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 10 pages, 7 Figures, Workshop on Object-Oriented Learning at ICML 2020

  36. arXiv:2007.01520  [pdf, other

    cs.RO cs.LG

    First Steps: Latent-Space Control with Semantic Constraints for Quadruped Locomotion

    Authors: Alexander L. Mitchell, Martin Engelcke, Oiwi Parker Jones, David Surovik, Siddhant Gangapurwala, Oliwier Melon, Ioannis Havoutis, Ingmar Posner

    Abstract: Traditional approaches to quadruped control frequently employ simplified, hand-derived models. This significantly reduces the capability of the robot since its effective kinematic range is curtailed. In addition, kinodynamic constraints are often non-differentiable and difficult to implement in an optimisation approach. In this work, these challenges are addressed by framing quadruped control as o… ▽ More

    Submitted 20 November, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 8 pages, 7 figures, accepted at IROS 2020

  37. arXiv:2007.01272  [pdf, other

    cs.CV

    RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces

    Authors: Sebastien Ehrhardt, Oliver Groth, Aron Monszpart, Martin Engelcke, Ingmar Posner, Niloy Mitra, Andrea Vedaldi

    Abstract: We present RELATE, a model that learns to generate physically plausible scenes and videos of multiple interacting objects. Similar to other generative approaches, RELATE is trained end-to-end on raw, unlabeled data. RELATE combines an object-centric GAN formulation with a model that explicitly accounts for correlations between individual objects. This allows the model to generate realistic scenes… ▽ More

    Submitted 9 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  38. arXiv:2003.08854  [pdf, other

    cs.RO cs.CV cs.LG

    Goal-Conditioned End-to-End Visuomotor Control for Versatile Skill Primitives

    Authors: Oliver Groth, Chia-Man Hung, Andrea Vedaldi, Ingmar Posner

    Abstract: Visuomotor control (VMC) is an effective means of achieving basic manipulation tasks such as pushing or pick-and-place from raw images. Conditioning VMC on desired goal states is a promising way of achieving versatile skill primitives. However, common conditioning schemes either rely on task-specific fine tuning - e.g. using one-shot imitation learning (IL) - or on sampling approaches using a forw… ▽ More

    Submitted 24 September, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: revised manuscript with additional baselines and generalisation experiments; 11 pages, 8 figures, 7 tables

    ACM Class: I.2.9; I.2.10

  39. arXiv:2003.01875  [pdf, other

    cs.RO cs.CV cs.LG

    Localising Faster: Efficient and precise lidar-based robot localisation in large-scale environments

    Authors: Li Sun, Daniel Adolfsson, Martin Magnusson, Henrik Andreasson, Ingmar Posner, Tom Duckett

    Abstract: This paper proposes a novel approach for global localisation of mobile robots in large-scale environments. Our method leverages learning-based localisation and filtering-based localisation, to localise the robot efficiently and precisely through seeding Monte Carlo Localisation (MCL) with a deep-learned distribution. In particular, a fast localisation system rapidly estimates the 6-DOF pose throug… ▽ More

    Submitted 15 July, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 7 pages, 5 pages. Accepted by IEEE International Conference on Robotics and Automation (ICRA) 2020

  40. arXiv:2001.10789  [pdf, other

    cs.CV cs.LG cs.RO

    Under the Radar: Learning to Predict Robust Keypoints for Odometry Estimation and Metric Localisation in Radar

    Authors: Dan Barnes, Ingmar Posner

    Abstract: This paper presents a self-supervised framework for learning to detect robust keypoints for odometry estimation and metric localisation in radar. By embedding a differentiable point-based motion estimator inside our architecture, we learn keypoint locations, scores and descriptors from localisation error alone. This approach avoids imposing any assumption on what makes a robust keypoint and crucia… ▽ More

    Submitted 24 February, 2020; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Video summary: https://youtu.be/L-PO7nxWpJU

  41. arXiv:1911.08363  [pdf, other

    cs.AI cs.LG

    Attention-Privileged Reinforcement Learning

    Authors: Sasha Salter, Dushyant Rao, Markus Wulfmeier, Raia Hadsell, Ingmar Posner

    Abstract: Image-based Reinforcement Learning is known to suffer from poor sample efficiency and generalisation to unseen visuals such as distractors (task-independent aspects of the observation space). Visual domain randomisation encourages transfer by training over visual factors of variation that may be encountered in the target domain. This increases learning complexity, can negatively impact learning ra… ▽ More

    Submitted 11 January, 2021; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Published at Conference on Robot Learning (CoRL) 2020

  42. arXiv:1909.13561  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Imagine That! Leveraging Emergent Affordances for 3D Tool Synthesis

    Authors: Yizhe Wu, Sudhanshu Kasewa, Oliver Groth, Sasha Salter, Li Sun, Oiwi Parker Jones, Ingmar Posner

    Abstract: In this paper we explore the richness of information captured by the latent space of a vision-based generative model. The model combines unsupervised generative learning with a task-based performance predictor to learn and to exploit task-relevant object affordances given visual observations from a reaching task, involving a scenario and a stick-like tool. While the learned embedding of the genera… ▽ More

    Submitted 7 October, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: 12 pages, 6 figures

    ACM Class: I.2.10; I.2.6

  43. arXiv:1909.03752  [pdf, other

    cs.CV cs.LG cs.RO

    Masking by Moving: Learning Distraction-Free Radar Odometry from Pose Information

    Authors: Dan Barnes, Rob Weston, Ingmar Posner

    Abstract: This paper presents an end-to-end radar odometry system which delivers robust, real-time pose estimates based on a learned embedding space free of sensing artefacts and distractor objects. The system deploys a fully differentiable, correlation-based radar matching approach. This provides the same level of interpretability as established scan-matching methods and allows for a principled derivation… ▽ More

    Submitted 17 January, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Conference on Robot Learning (CoRL), 2019. Video summary: https://youtu.be/eG4Q-j3_6dk

  44. arXiv:1909.01300  [pdf, other

    cs.RO eess.SP

    The Oxford Radar RobotCar Dataset: A Radar Extension to the Oxford RobotCar Dataset

    Authors: Dan Barnes, Matthew Gadd, Paul Murcutt, Paul Newman, Ingmar Posner

    Abstract: In this paper we present The Oxford Radar RobotCar Dataset, a new dataset for researching scene understanding using Millimetre-Wave FMCW scanning radar data. The target application is autonomous vehicles where this modality is robust to environmental conditions such as fog, rain, snow, or lens flare, which typically challenge other sensor modalities such as vision and LIDAR. The data were gather… ▽ More

    Submitted 26 February, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: The Oxford Radar RobotCar Dataset Website: http://ori.ox.ac.uk/datasets/radar-robotcar-dataset

  45. arXiv:1907.13052  [pdf, other

    cs.LG cs.CV cs.NE cs.RO stat.ML

    GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations

    Authors: Martin Engelcke, Adam R. Kosiorek, Oiwi Parker Jones, Ingmar Posner

    Abstract: Generative latent-variable models are emerging as promising tools in robotics and reinforcement learning. Yet, even though tasks in these domains typically involve distinct objects, most state-of-the-art generative models do not explicitly capture the compositional nature of visual scenes. Two recent exceptions, MONet and IODINE, decompose scenes into objects in an unsupervised fashion. Their unde… ▽ More

    Submitted 23 November, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: Published at the International Conference on Learning Representations (ICLR) 2020

  46. arXiv:1907.12887  [pdf, other

    cs.CV cs.AI cs.LG cs.RO stat.ML

    End-to-end Recurrent Multi-Object Tracking and Trajectory Prediction with Relational Reasoning

    Authors: Fabian B. Fuchs, Adam R. Kosiorek, Li Sun, Oiwi Parker Jones, Ingmar Posner

    Abstract: The majority of contemporary object-tracking approaches do not model interactions between objects. This contrasts with the fact that objects' paths are not independent: a cyclist might abruptly deviate from a previously planned trajectory in order to avoid colliding with a car. Building upon HART, a neural class-agnostic single-object tracker, we introduce a multi-object tracking method MOHART cap… ▽ More

    Submitted 28 September, 2020; v1 submitted 12 July, 2019; originally announced July 2019.

  47. arXiv:1901.09006  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    On the Limitations of Representing Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Ingmar Posner, Michael Osborne

    Abstract: Recent work on the representation of functions on sets has considered the use of summation in a latent space to enforce permutation invariance. In particular, it has been conjectured that the dimension of this latent space may remain fixed as the cardinality of the sets under consideration increases. However, we demonstrate that the analysis leading to this conjecture requires mappings which are h… ▽ More

    Submitted 7 October, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Published at the International Conference on Machine Learning (2019)

  48. arXiv:1810.08151  [pdf, other

    cs.RO

    Probably Unknown: Deep Inverse Sensor Modelling In Radar

    Authors: Rob Weston, Sarah Cen, Paul Newman, Ingmar Posner

    Abstract: Radar presents a promising alternative to lidar and vision in autonomous vehicle applications, able to detect objects at long range under a variety of weather conditions. However, distinguishing between occupied and free space from raw radar power returns is challenging due to complex interactions between sensor noise and occlusion. To counter this we propose to learn an Inverse Sensor Model (IS… ▽ More

    Submitted 10 May, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

    Comments: 6 full pages, 1 page of references

  49. arXiv:1809.10562  [pdf, other

    cs.CV

    Dropout Distillation for Efficiently Estimating Model Confidence

    Authors: Corina Gurau, Alex Bewley, Ingmar Posner

    Abstract: We propose an efficient way to output better calibrated uncertainty scores from neural networks. The Distilled Dropout Network (DDN) makes standard (non-Bayesian) neural networks more introspective by adding a new training loss which prevents them from being overconfident. Our method is more efficient than Bayesian neural networks or model ensembles which, despite providing more reliable uncertain… ▽ More

    Submitted 27 September, 2018; originally announced September 2018.

  50. arXiv:1806.05502  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Scrutinizing and De-Biasing Intuitive Physics with Neural Stethoscopes

    Authors: Fabian B. Fuchs, Oliver Groth, Adam R. Kosiorek, Alex Bewley, Markus Wulfmeier, Andrea Vedaldi, Ingmar Posner

    Abstract: Visually predicting the stability of block towers is a popular task in the domain of intuitive physics. While previous work focusses on prediction accuracy, a one-dimensional performance measure, we provide a broader analysis of the learned physical understanding of the final model and how the learning process can be guided. To this end, we introduce neural stethoscopes as a general purpose framew… ▽ More

    Submitted 6 September, 2019; v1 submitted 14 June, 2018; originally announced June 2018.