Showing 1–6 of 6 results for author: Schultheis, M

  1. arXiv:2303.16698  [pdf, other]

    cs.LG eess.SY math.OC q-bio.NC stat.ML

    Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs

    Authors: Dominik Straub, Matthias Schultheis, Heinz Koeppl, Constantin A. Rothkopf

    Abstract: Inverse optimal control can be used to characterize behavior in sequential decision-making tasks. Most existing work, however, is limited to fully observable or linear systems, or requires the action signals to be known. Here, we introduce a probabilistic approach to inverse optimal control for partially observable stochastic non-linear systems with unobserved action signals, which unifies previou…

    Submitted 30 October, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

  2. arXiv:2209.13413  [pdf, other]

    cs.LG eess.SY q-bio.NC stat.ML

    Reinforcement Learning with Non-Exponential Discounting

    Authors: Matthias Schultheis, Constantin A. Rothkopf, Heinz Koeppl

    Abstract: Commonly in reinforcement learning (RL), rewards are discounted over time using an exponential function to model time preference, thereby bounding the expected long-term reward. In contrast, in economics and psychology, it has been shown that humans often adopt a hyperbolic discounting scheme, which is optimal when a specific task termination time distribution is assumed. In this work, we propose…

    Submitted 7 December, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: 22 pages, 3 figures, published at 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  3. arXiv:2110.11130  [pdf, other]

    cs.LG eess.SY q-bio.NC stat.ML

    Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System

    Authors: Matthias Schultheis, Dominik Straub, Constantin A. Rothkopf

    Abstract: Computational level explanations based on optimal feedback control with signal-dependent noise have been able to account for a vast array of phenomena in human sensorimotor behavior. However, commonly a cost function needs to be assumed for a task and the optimality of human behavior is evaluated by comparing observed and predicted trajectories. Here, we introduce inverse optimal control with sign…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 24 pages, 11 figures, to be published at NeurIPS 2021

  4. arXiv:2010.01014  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    POMDPs in Continuous Time and Discrete Spaces

    Authors: Bastian Alt, Matthias Schultheis, Heinz Koeppl

    Abstract: Many processes, such as discrete event systems in engineering or population dynamics in biology, evolve in discrete space and continuous time. We consider the problem of optimal decision making in such discrete state and action space systems under partial observability. This places our work at the intersection of optimal filtering and optimal control. At the current state of research, a mathematic…

    Submitted 26 October, 2020; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Published at Conference on Neural Information Processing Systems (NeurIPS) 2020

  5. arXiv:1910.03620  [pdf, ps, other]

    cs.LG cs.RO stat.ML

    Receding Horizon Curiosity

    Authors: Matthias Schultheis, Boris Belousov, Hany Abdulsamad, Jan Peters

    Abstract: Sample-efficient exploration is crucial not only for discovering rewarding experiences but also for adapting to environment changes in a task-agnostic fashion. A principled treatment of the problem of optimal input synthesis for system identification is provided within the framework of sequential Bayesian experimental design. In this paper, we present an effective trajectory-optimization-based app…

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Published at Conference on Robot Learning (CoRL 2019)

  6. arXiv:1806.06063  [pdf, other]

    stat.ML cs.LG cs.RO

    Probabilistic Trajectory Segmentation by Means of Hierarchical Dirichlet Process Switching Linear Dynamical Systems

    Authors: Maximilian Sieb, Matthias Schultheis, Sebastian Szelag, Rudolf Lioutikov, Jan Peters

    Abstract: Using movement primitive libraries is an effective means to enable robots to solve more complex tasks. In order to build these movement libraries, current algorithms require a prior segmentation of the demonstration trajectories. A promising approach is to model the trajectory as being generated by a set of Switching Linear Dynamical Systems and inferring a meaningful segmentation by inspecting th…

    Submitted 1 March, 2020; v1 submitted 29 May, 2018; originally announced June 2018.