Skip to main content

Showing 1–17 of 17 results for author: Lioutikov, R

  1. arXiv:2407.05996  [pdf, other

    cs.RO

    Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals

    Authors: Moritz Reuss, Ömer Erdinç Yağmurlu, Fabian Wenzel, Rudolf Lioutikov

    Abstract: This work introduces the Multimodal Diffusion Transformer (MDT), a novel diffusion policy framework, that excels at learning versatile behavior from multimodal goal specifications with few language annotations. MDT leverages a diffusion-based multimodal transformer backbone and two self-supervised auxiliary objectives to master long-horizon manipulation tasks based on multimodal goals. The vast ma… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: RSS 2024

  2. arXiv:2406.12538  [pdf, other

    cs.LG cs.AI cs.RO

    Variational Distillation of Diffusion Policies into Mixture of Experts

    Authors: Hongyi Zhou, Denis Blessing, Ge Li, Onur Celik, Xiaogang Jia, Gerhard Neumann, Rudolf Lioutikov

    Abstract: This work introduces Variational Diffusion Distillation (VDD), a novel method that distills denoising diffusion policies into Mixtures of Experts (MoE) through variational inference. Diffusion Models are the current state-of-the-art in generative modeling due to their exceptional ability to accurately learn and represent complex, multi-modal distributions. This ability allows Diffusion Models to r… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.08234  [pdf, other

    cs.LG cs.RO

    MaIL: Improving Imitation Learning with Mamba

    Authors: Xiaogang Jia, Qian Wang, Atalay Donat, Bowen Xing, Ge Li, Hongyi Zhou, Onur Celik, Denis Blessing, Rudolf Lioutikov, Gerhard Neumann

    Abstract: This work introduces Mamba Imitation Learning (MaIL), a novel imitation learning (IL) architecture that offers a computationally efficient alternative to state-of-the-art (SoTA) Transformer policies. Transformer-based policies have achieved remarkable results due to their ability in handling human-recorded data with inherently non-Markovian behavior. However, their high performance comes with the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2402.14606  [pdf, other

    cs.RO

    Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

    Authors: Xiaogang Jia, Denis Blessing, Xinkai Jiang, Moritz Reuss, Atalay Donat, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Imitation learning with human data has demonstrated remarkable success in teaching robots in a wide range of skills. However, the inherent diversity in human behavior leads to the emergence of multi-modal data distributions, thereby presenting a formidable challenge for existing imitation learning algorithms. Quantifying a model's capacity to capture and replicate this diversity effectively is sti… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  5. arXiv:2401.11437  [pdf, other

    cs.LG cs.RO

    Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning

    Authors: Ge Li, Hongyi Zhou, Dominik Roth, Serge Thilges, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Current advancements in reinforcement learning (RL) have predominantly focused on learning step-based policies that generate actions for each perceived state. While these methods efficiently leverage step information from environmental interaction, they often ignore the temporal correlation between actions, resulting in inefficient exploration and unsmooth trajectories that are challenging to impl… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Codebase, see: https://github.com/BruceGeLi/TCE_RL

  6. arXiv:2312.10008  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects

    Authors: Paul Maria Scheikl, Nicolas Schreiber, Christoph Haas, Niklas Freymuth, Gerhard Neumann, Rudolf Lioutikov, Franziska Mathis-Ullrich

    Abstract: Policy learning in robot-assisted surgery (RAS) lacks data efficient and versatile methods that exhibit the desired motion quality for delicate surgical interventions. To this end, we introduce Movement Primitive Diffusion (MPD), a novel method for imitation learning (IL) in RAS that focuses on gentle manipulation of deformable objects. The approach combines the versatility of diffusion-based imit… ▽ More

    Submitted 10 June, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Journal ref: IEEE Robotics and Automation Letters 9 (2024) 5338-5345

  7. arXiv:2306.12729  [pdf, other

    cs.LG cs.AI cs.RO

    MP3: Movement Primitive-Based (Re-)Planning Policy

    Authors: Fabian Otto, Hongyi Zhou, Onur Celik, Ge Li, Rudolf Lioutikov, Gerhard Neumann

    Abstract: We introduce a novel deep reinforcement learning (RL) approach called Movement Primitive-based Planning Policy (MP3). By integrating movement primitives (MPs) into the deep RL framework, MP3 enables the generation of smooth trajectories throughout the whole learning process while effectively learning from sparse and non-Markovian rewards. Additionally, MP3 maintains the capability to adapt to chan… ▽ More

    Submitted 2 July, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: The video demonstration can be accessed at https://intuitive-robots.github.io/mp3_website/. arXiv admin note: text overlap with arXiv:2210.09622

  8. Curriculum-Based Imitation of Versatile Skills

    Authors: Maximilian Xiling Li, Onur Celik, Philipp Becker, Denis Blessing, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Learning skills by imitation is a promising concept for the intuitive teaching of robots. A common way to learn such skills is to learn a parametric model by maximizing the likelihood given the demonstrations. Yet, human demonstrations are often multi-modal, i.e., the same task is solved in multiple ways which is a major challenge for most imitation learning methods that are based on such a maximu… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  9. arXiv:2304.02532  [pdf, other

    cs.LG cs.RO

    Goal-Conditioned Imitation Learning using Score-based Diffusion Policies

    Authors: Moritz Reuss, Maximilian Li, Xiaogang Jia, Rudolf Lioutikov

    Abstract: We propose a new policy representation based on score-based diffusion models (SDMs). We apply our new policy representation in the domain of Goal-Conditioned Imitation Learning (GCIL) to learn general-purpose goal-specified policies from large uncurated datasets without rewards. Our new goal-conditioned policy architecture "$\textbf{BE}$havior generation with $\textbf{S}$c$\textbf{O}$re-based Diff… ▽ More

    Submitted 1 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted at RSS 2023

  10. arXiv:2303.15349  [pdf, other

    cs.LG

    Information Maximizing Curriculum: A Curriculum-Based Approach for Imitating Diverse Skills

    Authors: Denis Blessing, Onur Celik, Xiaogang Jia, Moritz Reuss, Maximilian Xiling Li, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Imitation learning uses data for training policies to solve complex tasks. However, when the training data is collected from human demonstrators, it often leads to multimodal distributions because of the variability in human actions. Most imitation learning methods rely on a maximum likelihood (ML) objective to learn a parameterized policy, but this can result in suboptimal or unsafe behavior due… ▽ More

    Submitted 31 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  11. arXiv:2211.00352  [pdf, other

    cs.RO

    Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

    Authors: Akanksha Saran, Kush Desai, Mai Lee Chang, Rudolf Lioutikov, Andrea Thomaz, Scott Niekum

    Abstract: Humans use audio signals in the form of spoken language or verbal reactions effectively when teaching new skills or tasks to other humans. While demonstrations allow humans to teach robots in a natural way, learning from trajectories alone does not leverage other available modalities including audio from human teachers. To effectively utilize audio cues accompanying human demonstrations, first it… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: IROS 2022

  12. arXiv:2210.01531  [pdf, other

    cs.RO cs.LG

    ProDMPs: A Unified Perspective on Dynamic and Probabilistic Movement Primitives

    Authors: Ge Li, Zeqi Jin, Michael Volpp, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Movement Primitives (MPs) are a well-known concept to represent and generate modular trajectories. MPs can be broadly categorized into two types: (a) dynamics-based approaches that generate smooth trajectories from any initial state, e. g., Dynamic Movement Primitives (DMPs), and (b) probabilistic approaches that capture higher-order statistics of the motion, e. g., Probabilistic Movement Primitiv… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: 12 pages, 13 figures

  13. arXiv:2108.05875  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Distributional Depth-Based Estimation of Object Articulation Models

    Authors: Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, Scott Niekum

    Abstract: We propose a method that efficiently learns distributions over articulation model parameters directly from depth images without the need to know articulation model categories a priori. By contrast, existing methods that learn articulation models from raw observations typically only predict point estimates of the model parameters, which are insufficient to guarantee the safe manipulation of articul… ▽ More

    Submitted 25 October, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: In the proceedings of the 5th Annual Conference on Robot Learning (CoRL), 2021. Project webpage: https://pearl-utexas.github.io/DUST-net/ . 18 pages, 10 figures, 4 tables

  14. arXiv:2103.04529  [pdf, other

    cs.LG cs.RO

    Self-Supervised Online Reward Shaping in Sparse-Reward Environments

    Authors: Farzan Memarian, Wonjoon Goo, Rudolf Lioutikov, Scott Niekum, Ufuk Topcu

    Abstract: We introduce Self-supervised Online Reward Shaping (SORS), which aims to improve the sample efficiency of any RL algorithm in sparse-reward environments by automatically densifying rewards. The proposed framework alternates between classification-based reward inference and policy update steps -- the original sparse reward provides a self-supervisory signal for reward inference by ranking trajector… ▽ More

    Submitted 25 July, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in IROS 2021

  15. arXiv:2008.10518  [pdf, other

    cs.RO cs.AI

    ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory

    Authors: Ajinkya Jain, Rudolf Lioutikov, Caleb Chuck, Scott Niekum

    Abstract: Robots in human environments will need to interact with a wide variety of articulated objects such as cabinets, drawers, and dishwashers while assisting humans in performing day-to-day tasks. Existing methods either require objects to be textured or need to know the articulation model category a priori for estimating the model parameters for an articulated object. We propose ScrewNet, a novel appr… ▽ More

    Submitted 19 July, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: Presented at ICRA'21. Project webpage: https://pearl-utexas.github.io/ScrewNet/

  16. arXiv:1806.06063  [pdf, other

    stat.ML cs.LG cs.RO

    Probabilistic Trajectory Segmentation by Means of Hierarchical Dirichlet Process Switching Linear Dynamical Systems

    Authors: Maximilian Sieb, Matthias Schultheis, Sebastian Szelag, Rudolf Lioutikov, Jan Peters

    Abstract: Using movement primitive libraries is an effective means to enable robots to solve more complex tasks. In order to build these movement libraries, current algorithms require a prior segmentation of the demonstration trajectories. A promising approach is to model the trajectory as being generated by a set of Switching Linear Dynamical Systems and inferring a meaningful segmentation by inspecting th… ▽ More

    Submitted 1 March, 2020; v1 submitted 29 May, 2018; originally announced June 2018.

  17. arXiv:1510.03253  [pdf, other

    cs.RO

    Low-cost Sensor Glove with Force Feedback for Learning from Demonstrations using Probabilistic Trajectory Representations

    Authors: Elmar Rueckert, Rudolf Lioutikov, Roberto Calandra, Marius Schmidt, Philipp Beckerle, Jan Peters

    Abstract: Sensor gloves are popular input devices for a large variety of applications including health monitoring, control of music instruments, learning sign language, dexterous computer interfaces, and tele-operating robot hands. Many commercial products as well as low-cost open source projects have been developed. We discuss here how low-cost (approx. 250 EUROs) sensor gloves with force feedback can be b… ▽ More

    Submitted 12 October, 2015; originally announced October 2015.

    Comments: 3 pages, 3 figures. Workshop paper of the International Conference on Robotics and Automation (ICRA 2015)