Skip to main content

Showing 1–50 of 53 results for author: van der Smagt, P

  1. arXiv:2407.02904  [pdf, other

    cs.RO cs.AI cs.LG

    The Shortcomings of Force-from-Motion in Robot Learning

    Authors: Elie Aljalbout, Felix Frank, Patrick van der Smagt, Alexandros Paraschos

    Abstract: Robotic manipulation requires accurate motion and physical interaction control. However, current robot learning approaches focus on motion-centric action spaces that do not explicitly give the policy control over the interaction. In this paper, we discuss the repercussions of this choice and argue for more interaction-explicit action spaces in robot learning.

    Submitted 3 July, 2024; originally announced July 2024.

    ACM Class: I.2.6; I.2.8; I.2.9

  2. arXiv:2405.13191  [pdf, other

    cs.LG

    Pragmatic auditing: a pilot-driven approach for auditing Machine Learning systems

    Authors: Djalel Benbouzid, Christiane Plociennik, Laura Lucaj, Mihai Maftei, Iris Merget, Aljoscha Burchardt, Marc P. Hauer, Abdeldjallil Naceri, Patrick van der Smagt

    Abstract: The growing adoption and deployment of Machine Learning (ML) systems came with its share of ethical incidents and societal concerns. It also unveiled the necessity to properly audit these systems in light of ethical principles. For such a novel type of algorithmic auditing to become standard practice, two main prerequisites need to be available: A lifecycle model that is tailored towards transpare… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2404.18896  [pdf, other

    cs.LG

    Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models

    Authors: Xingyuan Zhang, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl

    Abstract: Incorporating the successful paradigm of pretraining and finetuning from Computer Vision and Natural Language Processing into decision-making has become increasingly popular in recent years. In this paper, we study Imitation Learning from Observation with pretrained models and find existing approaches such as BCO and AIME face knowledge barriers, specifically the Embodiment Knowledge Barrier (EKB)… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures

  4. arXiv:2404.03253  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    A dataset of primary nasopharyngeal carcinoma MRI with multi-modalities segmentation

    Authors: Yin Li, Qi Chen, Kai Wang, Meige Li, Liping Si, Yingwei Guo, Yu Xiong, Qixing Wang, Yang Qin, Ling Xu, Patrick van der Smagt, Jun Tang, Nutan Chen

    Abstract: Multi-modality magnetic resonance imaging data with various sequences facilitate the early diagnosis, tumor segmentation, and disease staging in the management of nasopharyngeal carcinoma (NPC). The lack of publicly available, comprehensive datasets limits advancements in diagnosis, treatment planning, and the development of machine learning algorithms for NPC. Addressing this critical need, we in… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  5. arXiv:2403.15239  [pdf, other

    cs.RO cs.LG

    Guided Decoding for Robot Motion Generation and Adaption

    Authors: Nutan Chen, Elie Aljalbout, Botond Cseke, Patrick van der Smagt

    Abstract: We address motion generation for high-DoF robot arms in complex settings with obstacles, via points, etc. A significant advancement in this domain is achieved by integrating Learning from Demonstration (LfD) into the motion generation process. This integration facilitates rapid adaptation to new tasks and optimizes the utilization of accumulated expertise by allowing robots to learn and generalize… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 7 pages

  6. arXiv:2401.11447  [pdf, other

    cs.LG q-bio.QM

    Sequential Model for Predicting Patient Adherence in Subcutaneous Immunotherapy for Allergic Rhinitis

    Authors: Yin Li, Yu Xiong, Wenxin Fan, Kai Wang, Qingqing Yu, Liping Si, Patrick van der Smagt, Jun Tang, Nutan Chen

    Abstract: Objective: Subcutaneous Immunotherapy (SCIT) is the long-lasting causal treatment of allergic rhinitis (AR). How to enhance the adherence of patients to maximize the benefit of allergen immunotherapy (AIT) plays a crucial role in the management of AIT. This study aims to leverage novel machine learning models to precisely predict the risk of non-adherence of AR patients and related local symptom s… ▽ More

    Submitted 28 June, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: Frontiers in Pharmacology, research topic: Methods and Metrics to Measure Medication Adherence

  7. On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer

    Authors: Elie Aljalbout, Felix Frank, Maximilian Karl, Patrick van der Smagt

    Abstract: We study the choice of action space in robot manipulation learning and sim-to-real transfer. We define metrics that assess the performance, and examine the emerging properties in the different action spaces. We train over 250 reinforcement learning~(RL) agents in simulated reaching and pushing tasks, using 13 different control spaces. The choice of spaces spans combinations of common action space… ▽ More

    Submitted 29 April, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  8. arXiv:2312.02019  [pdf, other

    cs.LG cs.AI

    Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models

    Authors: Xingyuan Zhang, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl

    Abstract: Unlike most reinforcement learning agents which require an unrealistic amount of environment interactions to learn a new behaviour, humans excel at learning quickly by merely observing and imitating others. This ability highly depends on the fact that humans have a model of their own embodiment that allows them to infer the most likely actions that led to the observed behaviour. In this paper, we… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  9. arXiv:2304.10246  [pdf, other

    cs.LG cs.RO eess.SY

    Filter-Aware Model-Predictive Control

    Authors: Baris Kayalibay, Atanas Mirchev, Ahmed Agha, Patrick van der Smagt, Justin Bayer

    Abstract: Partially-observable problems pose a trade-off between reducing costs and gathering information. They can be solved optimally by planning in belief space, but that is often prohibitively expensive. Model-predictive control (MPC) takes the alternative approach of using a state estimator to form a belief over the state, and then plan in state space. This ignores potential future observations during… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  10. arXiv:2212.02988  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    PRISM: Probabilistic Real-Time Inference in Spatial World Models

    Authors: Atanas Mirchev, Baris Kayalibay, Ahmed Agha, Patrick van der Smagt, Daniel Cremers, Justin Bayer

    Abstract: We introduce PRISM, a method for real-time filtering in a probabilistic generative model of agent motion and visual perception. Previous approaches either lack uncertainty estimates for the map and agent state, do not run in real-time, do not have a dense scene representation or do not model agent dynamics. Our solution reconciles all of these aspects. We start from a predefined state-space model… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Will appear in PMLR, CoRL 2022

  11. arXiv:2211.15824  [pdf, other

    cs.RO cs.GT cs.LG cs.MA cs.NE

    CLAS: Coordinating Multi-Robot Manipulation with Central Latent Action Spaces

    Authors: Elie Aljalbout, Maximilian Karl, Patrick van der Smagt

    Abstract: Multi-robot manipulation tasks involve various control entities that can be separated into dynamically independent parts. A typical example of such real-world tasks is dual-arm manipulation. Learning to naively solve such tasks with reinforcement learning is often unfeasible due to the sample complexity and exploration requirements growing with the dimensionality of the action and state spaces. In… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    ACM Class: I.2.6; I.2.8; I.2.9

  12. arXiv:2209.09453  [pdf, other

    cs.LG astro-ph.IM

    Probabilistic Dalek -- Emulator framework with probabilistic prediction for supernova tomography

    Authors: Wolfgang Kerzendorf, Nutan Chen, Jack O'Brien, Johannes Buchner, Patrick van der Smagt

    Abstract: Supernova spectral time series can be used to reconstruct a spatially resolved explosion model known as supernova tomography. In addition to an observed spectral time series, a supernova tomography requires a radiative transfer model to perform the inverse problem with uncertainty quantification for a reconstruction. The smallest parametrizations of supernova tomography models are roughly a dozen… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 7 pages, accepted at ICML 2022 Workshop on Machine Learning for Astrophysics

  13. arXiv:2206.05909  [pdf, other

    cs.LG

    Local Distance Preserving Auto-encoders using Continuous k-Nearest Neighbours Graphs

    Authors: Nutan Chen, Patrick van der Smagt, Botond Cseke

    Abstract: Auto-encoder models that preserve similarities in the data are a popular tool in representation learning. In this paper we introduce several auto-encoder models that preserve local distances when mapping from the data space to the latent space. We use a local distance preserving loss that is based on the continuous k-nearest neighbours graph which is known to capture topological features at all sc… ▽ More

    Submitted 30 September, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  14. arXiv:2202.12243  [pdf, other

    cs.SD cs.LG eess.AS

    Flat Latent Manifolds for Human-machine Co-creation of Music

    Authors: Nutan Chen, Djalel Benbouzid, Francesco Ferroni, Mathis Nitschke, Luciano Pinna, Patrick van der Smagt

    Abstract: The use of machine learning in artistic music generation leads to controversial discussions of the quality of art, for which objective quantification is nonsensical. We therefore consider a music-generating algorithm as a counterpart to a human musician, in a setting where reciprocal interplay is to lead to new experiences, both for the musician and the audience. To obtain this behaviour, we resor… ▽ More

    Submitted 10 August, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: 3rd Conference on AI Music Creativity (AIMC 2022)

  15. arXiv:2201.10335  [pdf, other

    cs.LG

    Tracking and Planning with Spatial World Models

    Authors: Baris Kayalibay, Atanas Mirchev, Patrick van der Smagt, Justin Bayer

    Abstract: We introduce a method for real-time navigation and tracking with differentiably rendered world models. Learning models for control has led to impressive results in robotics and computer games, but this success has yet to be extended to vision-based navigation. To address this, we transfer advances in the emergent field of differentiable rendering to model-based control. We do this by planning in a… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  16. Constrained Probabilistic Movement Primitives for Robot Trajectory Adaptation

    Authors: Felix Frank, Alexandros Paraschos, Patrick van der Smagt, Botond Cseke

    Abstract: Placing robots outside controlled conditions requires versatile movement representations that allow robots to learn new tasks and adapt them to environmental changes. The introduction of obstacles or the placement of additional robots in the workspace, the modification of the joint range due to faults or range-of-motion constraints are typical cases where the adaptation capabilities play a key rol… ▽ More

    Submitted 5 January, 2022; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: There is a supplementary video accompanying the paper. It can be found at https://youtu.be/7UI6QX-eZ3I

  17. arXiv:2101.07046  [pdf, other

    cs.LG stat.ML

    Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models

    Authors: Justin Bayer, Maximilian Soelch, Atanas Mirchev, Baris Kayalibay, Patrick van der Smagt

    Abstract: Amortised inference enables scalable learning of sequential latent-variable models (LVMs) with the evidence lower bound (ELBO). In this setting, variational posteriors are often only partially conditioned. While the true posteriors depend, e.g., on the entire sequence of observations, approximate posteriors are only informed by past observations. This mimics the Bayesian filter -- a mixture of smo… ▽ More

    Submitted 17 March, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Published as a conference paper at ICLR 2021 (Poster)

  18. arXiv:2007.01868  [pdf, other

    astro-ph.IM cs.LG

    Dalek -- a deep-learning emulator for TARDIS

    Authors: Wolfgang E. Kerzendorf, Christian Vogl, Johannes Buchner, Gabriella Contardo, Marc Williamson, Patrick van der Smagt

    Abstract: Supernova spectral time series contain a wealth of information about the progenitor and explosion process of these energetic events. The modeling of these data requires the exploration of very high dimensional posterior probabilities with expensive radiative transfer codes. Even modest parametrizations of supernovae contain more than ten parameters and a detailed exploration demands at least sever… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: 6 pages;5 figures submitted to AAS Journals. Constructive Criticism invited

  19. Layerwise learning for quantum neural networks

    Authors: Andrea Skolik, Jarrod R. McClean, Masoud Mohseni, Patrick van der Smagt, Martin Leib

    Abstract: With the increased focus on quantum circuit learning for near-term applications on quantum devices, in conjunction with unique challenges presented by cost function landscapes of parametrized quantum circuits, strategies for effective training are becoming increasingly important. In order to ameliorate some of these challenges, we investigate a layerwise learning strategy for parametrized quantum… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: 11 pages, 7 figures

    Journal ref: Quantum Machine Intelligence Vol. 3, No. 5 (2021)

  20. arXiv:2006.10178  [pdf, other

    stat.ML cs.CV cs.LG

    Variational State-Space Models for Localisation and Dense 3D Mapping in 6 DoF

    Authors: Atanas Mirchev, Baris Kayalibay, Patrick van der Smagt, Justin Bayer

    Abstract: We solve the problem of 6-DoF localisation and 3D dense reconstruction in spatial environments as approximate Bayesian inference in a deep state-space model. Our approach leverages both learning and domain knowledge from multiple-view geometry and rigid-body dynamics. This results in an expressive predictive model of the world, often missing in current state-of-the-art visual SLAM solutions. The c… ▽ More

    Submitted 15 March, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Update for ICLR2021

  21. arXiv:2003.08876  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Learning to Fly via Deep Model-Based Reinforcement Learning

    Authors: Philip Becker-Ehmck, Maximilian Karl, Jan Peters, Patrick van der Smagt

    Abstract: Learning to control robots without requiring engineered models has been a long-term goal, promising diverse and novel applications. Yet, reinforcement learning has only achieved limited impact on real-time robot control due to its high demand of real-world interactions. In this work, by leveraging a learnt probabilistic model of drone dynamics, we learn a thrust-attitude controller for a quadrotor… ▽ More

    Submitted 4 August, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

  22. arXiv:2002.04881  [pdf, other

    stat.ML cs.LG

    Learning Flat Latent Manifolds with VAEs

    Authors: Nutan Chen, Alexej Klushyn, Francesco Ferroni, Justin Bayer, Patrick van der Smagt

    Abstract: Measuring the similarity between data points often requires domain knowledge, which can in parts be compensated by relying on unsupervised methods such as latent-variable models, where similarity/distance is estimated in a more compact latent space. Prevalent is the use of the Euclidean metric, which has the drawback of ignoring information about similarity of data stored in the decoder, as captur… ▽ More

    Submitted 12 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Thirty-seventh International Conference on Machine Learning (ICML) 2020

    Journal ref: International Conference on Machine Learning 2020

  23. arXiv:1911.00756  [pdf, other

    cs.LG stat.ML

    Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations

    Authors: Neha Das, Maximilian Karl, Philip Becker-Ehmck, Patrick van der Smagt

    Abstract: Learning a model of dynamics from high-dimensional images can be a core ingredient for success in many applications across different domains, especially in sequential decision making. However, currently prevailing methods based on latent-variable models are limited to working with low resolution images only. In this work, we show that some of the issues with using high-dimensional observations ari… ▽ More

    Submitted 2 November, 2019; originally announced November 2019.

  24. arXiv:1910.06205  [pdf, other

    stat.ML cs.CV cs.LG

    Variational Tracking and Prediction with Generative Disentangled State-Space Models

    Authors: Adnan Akhundov, Maximilian Soelch, Justin Bayer, Patrick van der Smagt

    Abstract: We address tracking and prediction of multiple moving objects in visual data streams as inference and sampling in a disentangled latent state-space model. By encoding objects separately and including explicit position information in the latent state space, we perform tracking via amortized variational Bayesian inference of the respective latent positions. Inference is implemented in a modular neur… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

  25. arXiv:1909.05659  [pdf, other

    cs.RO cs.CV cs.LG stat.ML

    Estimating Fingertip Forces, Torques, and Local Curvatures from Fingernail Images

    Authors: Nutan Chen, Göran Westling, Benoni B. Edin, Patrick van der Smagt

    Abstract: The study of dexterous manipulation has provided important insights in humans sensorimotor control as well as inspiration for manipulation strategies in robotic hands. Previous work focused on experimental environment with restrictions. Here we describe a method using the deformation and color distribution of the fingernail and its surrounding skin, to estimate the fingertip forces, torques and co… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Robotica

  26. arXiv:1908.08750  [pdf, other

    stat.ML cs.LG

    Increasing the Generalisation Capacity of Conditional VAEs

    Authors: Alexej Klushyn, Nutan Chen, Botond Cseke, Justin Bayer, Patrick van der Smagt

    Abstract: We address the problem of one-to-many mappings in supervised learning, where a single instance has many different solutions of possibly equal cost. The framework of conditional variational autoencoders describes a class of methods to tackle such structured-prediction tasks by means of latent variables. We propose to incentivise informative latent representations for increasing the generalisation c… ▽ More

    Submitted 10 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

  27. arXiv:1905.12434  [pdf, other

    stat.ML cs.LG

    Switching Linear Dynamics for Variational Bayes Filtering

    Authors: Philip Becker-Ehmck, Jan Peters, Patrick van der Smagt

    Abstract: System identification of complex and nonlinear systems is a central problem for model predictive control and model-based reinforcement learning. Despite their complexity, such systems can often be approximated well by a set of linear dynamical systems if broken into appropriate subsequences. This mechanism not only helps us find good approximations of dynamics, but also gives us deeper insight int… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: Appears in Proceedings of the 36th International Conference on Machine Learning (ICML)

  28. arXiv:1905.04982  [pdf, other

    stat.ML cs.LG

    Learning Hierarchical Priors in VAEs

    Authors: Alexej Klushyn, Nutan Chen, Richard Kurle, Botond Cseke, Patrick van der Smagt

    Abstract: We propose to learn a hierarchical prior in the context of variational autoencoders to avoid the over-regularisation resulting from a standard normal prior distribution. To incentivise an informative latent representation of the data, we formulate the learning problem as a constrained optimisation problem by extending the Taming VAEs framework to two-level hierarchical models. We introduce a graph… ▽ More

    Submitted 5 October, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

    Comments: Published at NeurIPS 2019 (spotlight)

  29. On Deep Set Learning and the Choice of Aggregations

    Authors: Maximilian Soelch, Adnan Akhundov, Patrick van der Smagt, Justin Bayer

    Abstract: Recently, it has been shown that many functions on sets can be represented by sum decompositions. These decompositons easily lend themselves to neural approximations, extending the applicability of neural nets to set-valued inputs---Deep Set learning. This work investigates a core component of Deep Set architecture: aggregation functions. We suggest and examine alternatives to commonly used aggreg… ▽ More

    Submitted 8 April, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

  30. arXiv:1901.04436  [pdf, other

    stat.ML cs.LG

    Bayesian Learning of Neural Network Architectures

    Authors: Georgi Dikov, Patrick van der Smagt, Justin Bayer

    Abstract: In this paper we propose a Bayesian method for estimating architectural parameters of neural networks, namely layer size and network depth. We do this by learning concrete distributions over these parameters. Our results show that regular networks with a learnt structure can generalise better on small datasets, while fully stochastic networks can be more robust to parameter initialisation. The pro… ▽ More

    Submitted 27 January, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

    Comments: The 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

  31. arXiv:1812.08284  [pdf, other

    stat.ML cs.LG

    Fast Approximate Geodesics for Deep Generative Models

    Authors: Nutan Chen, Francesco Ferroni, Alexej Klushyn, Alexandros Paraschos, Justin Bayer, Patrick van der Smagt

    Abstract: The length of the geodesic between two data points along a Riemannian manifold, induced by a deep generative model, yields a principled measure of similarity. Current approaches are limited to low-dimensional latent spaces, due to the computational complexity of solving a non-convex optimisation problem. We propose finding shortest paths in a finite graph of samples from the aggregate approximate… ▽ More

    Submitted 23 May, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 28th International Conference on Artificial Neural Networks, 2019

    Journal ref: 28th International Conference on Artificial Neural Networks, 2019

  32. arXiv:1811.04451  [pdf, other

    stat.ML cs.LG

    Multi-Source Neural Variational Inference

    Authors: Richard Kurle, Stephan Günnemann, Patrick van der Smagt

    Abstract: Learning from multiple sources of information is an important problem in machine-learning research. The key challenges are learning representations and formulating inference methods that take into account the complementarity and redundancy of various information sources. In this paper we formulate a variational autoencoder based multi-source learning framework in which each encoder is conditioned… ▽ More

    Submitted 17 November, 2018; v1 submitted 11 November, 2018; originally announced November 2018.

    Comments: AAAI 2019, Association for the Advancement of Artificial Intelligence (AAAI) 2019

  33. arXiv:1808.02026  [pdf, other

    stat.ML cs.LG

    Active Learning based on Data Uncertainty and Model Sensitivity

    Authors: Nutan Chen, Alexej Klushyn, Alexandros Paraschos, Djalel Benbouzid, Patrick van der Smagt

    Abstract: Robots can rapidly acquire new skills from demonstrations. However, during generalisation of skills or transitioning across fundamentally different skills, it is unclear whether the robot has the necessary knowledge to perform the task. Failing to detect missing information often leads to abrupt movements or to collisions with the environment. Active learning can quantify the uncertainty of perfor… ▽ More

    Submitted 6 August, 2018; originally announced August 2018.

    Comments: Published on 2018 IEEE/RSJ International Conference on Intelligent Robots and System

  34. arXiv:1805.07206  [pdf, other

    stat.ML cs.LG

    Approximate Bayesian inference in spatial environments

    Authors: Atanas Mirchev, Baris Kayalibay, Maximilian Soelch, Patrick van der Smagt, Justin Bayer

    Abstract: Model-based approaches bear great promise for decision making of agents interacting with the physical world. In the context of spatial environments, different types of problems such as localisation, mapping, navigation or autonomous exploration are typically adressed with specialised methods, often relying on detailed knowledge of the system at hand. We express these tasks as probabilistic inferen… ▽ More

    Submitted 20 June, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: Preprint of publication at RSS 2019

  35. arXiv:1711.11059  [pdf, other

    stat.ML cs.LG cs.NE

    Gaussian Process Neurons Learn Stochastic Activation Functions

    Authors: Sebastian Urban, Marcus Basalla, Patrick van der Smagt

    Abstract: We propose stochastic, non-parametric activation functions that are fully learnable and individual to each neuron. Complexity and the risk of overfitting are controlled by placing a Gaussian process prior over these functions. The result is the Gaussian process neuron, a probabilistic unit that can be used as the basic building block for probabilistic graphical models that resemble the structure o… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

  36. arXiv:1711.01348  [pdf, ps, other

    cs.SC stat.ML

    Automatic Differentiation for Tensor Algebras

    Authors: Sebastian Urban, Patrick van der Smagt

    Abstract: Kjolstad et. al. proposed a tensor algebra compiler. It takes expressions that define a tensor element-wise, such as $f_{ij}(a,b,c,d) = \exp\left[-\sum_{k=0}^4 \left((a_{ik}+b_{jk})^2\, c_{ii} + d_{i+k}^3 \right) \right]$, and generates the corresponding compute kernel code. For machine learning, especially deep learning, it is often necessary to compute the gradient of a loss function… ▽ More

    Submitted 3 November, 2017; originally announced November 2017.

    Comments: Technical Report

  37. arXiv:1711.01204  [pdf, other

    stat.ML cs.LG

    Metrics for Deep Generative Models

    Authors: Nutan Chen, Alexej Klushyn, Richard Kurle, Xueyan Jiang, Justin Bayer, Patrick van der Smagt

    Abstract: Neural samplers such as variational autoencoders (VAEs) or generative adversarial networks (GANs) approximate distributions by transforming samples from a simple random source---the latent space---to samples from a more complex distribution represented by a dataset. While the manifold hypothesis implies that the density induced by a dataset contains large regions of low density, the training crite… ▽ More

    Submitted 8 February, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Published on the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 2018

    Journal ref: The 21st International Conference on Artificial Intelligence and Statistics, 2018

  38. Two-Stream RNN/CNN for Action Recognition in 3D Videos

    Authors: Rui Zhao, Haider Ali, Patrick van der Smagt

    Abstract: The recognition of actions from video sequences has many applications in health monitoring, assisted living, surveillance, and smart homes. Despite advances in sensing, in particular related to 3D video, the methodologies to process the data are still subject to research. We demonstrate superior results by a system which combines recurrent neural networks with convolutional neural networks in a vo… ▽ More

    Submitted 2 October, 2018; v1 submitted 22 March, 2017; originally announced March 2017.

    Comments: Published in 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  39. arXiv:1701.03056  [pdf, other

    cs.CV

    CNN-based Segmentation of Medical Imaging Data

    Authors: Baris Kayalibay, Grady Jensen, Patrick van der Smagt

    Abstract: Convolutional neural networks have been applied to a wide variety of computer vision tasks. Recent advances in semantic segmentation have enabled their application to medical image segmentation. While most CNNs use two-dimensional kernels, recent CNN-based publications on medical image segmentation featured three-dimensional kernels, allowing full access to the three-dimensional structure of medic… ▽ More

    Submitted 25 July, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

    Comments: 24 pages, Code available on https://github.com/BRML/CNNbasedMedicalSegmentation

  40. arXiv:1606.07312  [pdf, other

    cs.RO cs.LG stat.ML

    Unsupervised preprocessing for Tactile Data

    Authors: Maximilian Karl, Justin Bayer, Patrick van der Smagt

    Abstract: Tactile information is important for gripping, stable grasp, and in-hand manipulation, yet the complexity of tactile data prevents widespread use of such sensors. We make use of an unsupervised learning algorithm that transforms the complex tactile data into a compact, latent representation without the need to record ground truth reference data. These compact representations can either be used dir… ▽ More

    Submitted 23 June, 2016; originally announced June 2016.

  41. arXiv:1606.06588  [pdf, other

    cs.RO cs.LG

    ML-based tactile sensor calibration: A universal approach

    Authors: Maximilian Karl, Artur Lohrer, Dhananjay Shah, Frederik Diehl, Max Fiedler, Saahil Ognawala, Justin Bayer, Patrick van der Smagt

    Abstract: We study the responses of two tactile sensors, the fingertip sensor from the iCub and the BioTac under different external stimuli. The question of interest is to which degree both sensors i) allow the estimation of force exerted on the sensor and ii) enable the recognition of differing degrees of curvature. Making use of a force controlled linear motor affecting the tactile sensors we acquire seve… ▽ More

    Submitted 21 June, 2016; originally announced June 2016.

  42. arXiv:1605.06432  [pdf, other

    stat.ML cs.LG eess.SY

    Deep Variational Bayes Filters: Unsupervised Learning of State Space Models from Raw Data

    Authors: Maximilian Karl, Maximilian Soelch, Justin Bayer, Patrick van der Smagt

    Abstract: We introduce Deep Variational Bayes Filters (DVBF), a new method for unsupervised learning and identification of latent Markovian state space models. Leveraging recent advances in Stochastic Gradient Variational Bayes, DVBF can overcome intractable inference distributions via variational inference. Thus, it can handle highly nonlinear input data with temporal and spatial dependencies such as image… ▽ More

    Submitted 3 March, 2017; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: Published as a conference paper at ICLR 2017

  43. arXiv:1604.03736  [pdf, other

    cs.LG stat.ML

    A Differentiable Transition Between Additive and Multiplicative Neurons

    Authors: Wiebke Köpp, Patrick van der Smagt, Sebastian Urban

    Abstract: Existing approaches to combine both additive and multiplicative neural units either use a fixed assignment of operations or require discrete optimization to determine what function a neuron should perform. However, this leads to an extensive increase in the computational complexity of the training procedure. We present a novel, parameterizable transfer function based on the mathematical concept… ▽ More

    Submitted 13 April, 2016; originally announced April 2016.

    Comments: ICLR 2016 extended abstract

  44. arXiv:1602.07109  [pdf, other

    stat.ML cs.LG

    Variational Inference for On-line Anomaly Detection in High-Dimensional Time Series

    Authors: Maximilian Soelch, Justin Bayer, Marvin Ludersdorfer, Patrick van der Smagt

    Abstract: Approximate variational inference has shown to be a powerful tool for modeling unknown complex probability distributions. Recent advances in the field allow us to learn probabilistic models of sequences that actively exploit spatial and temporal structure. We apply a Stochastic Recurrent Network (STORN) to learn robot time series data. Our evaluation demonstrates that we can robustly detect anomal… ▽ More

    Submitted 14 June, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

    Comments: Accepted as workshop paper at ICLR 2016; accepted as workshop paper for anomaly detection workshop at ICML 2016

  45. arXiv:1601.04862  [pdf, other

    cs.RO cs.DC cs.NE eess.SY

    Scalability in Neural Control of Musculoskeletal Robots

    Authors: Christoph Richter, Sören Jentzsch, Rafael Hostettler, Jesús A. Garrido, Eduardo Ros, Alois C. Knoll, Florian Röhrbein, Patrick van der Smagt, Jörg Conradt

    Abstract: Anthropomimetic robots are robots that sense, behave, interact and feel like humans. By this definition, anthropomimetic robots require human-like physical hardware and actuation, but also brain-like control and sensing. The most self-evident realization to meet those requirements would be a human-like musculoskeletal robot with a brain-like neural controller. While both musculoskeletal robotic ha… ▽ More

    Submitted 19 January, 2016; originally announced January 2016.

    Comments: Accepted at IEEE Robotics and Automation Magazine on 2015-12-31

  46. arXiv:1509.08455  [pdf, other

    stat.ML cs.LG

    Efficient Empowerment

    Authors: Maximilian Karl, Justin Bayer, Patrick van der Smagt

    Abstract: Empowerment quantifies the influence an agent has on its environment. This is formally achieved by the maximum of the expected KL-divergence between the distribution of the successor state conditioned on a specific action and a distribution where the actions are marginalised out. This is a natural candidate for an intrinsic reward signal in the context of reinforcement learning: the agent will pla… ▽ More

    Submitted 28 September, 2015; originally announced September 2015.

  47. arXiv:1507.05331  [pdf, ps, other

    stat.ML cs.LG

    Fast Adaptive Weight Noise

    Authors: Justin Bayer, Maximilian Karl, Daniela Korhammer, Patrick van der Smagt

    Abstract: Marginalising out uncertain quantities within the internal representations or parameters of neural networks is of central importance for a wide range of learning techniques, such as empirical, variational or full Bayesian methods. We set out to generalise fast dropout (Wang & Manning, 2013) to cover a wider variety of noise processes in neural networks. This leads to an efficient calculation of th… ▽ More

    Submitted 19 July, 2015; originally announced July 2015.

  48. arXiv:1504.06852  [pdf, other

    cs.CV cs.LG

    FlowNet: Learning Optical Flow with Convolutional Networks

    Authors: Philipp Fischer, Alexey Dosovitskiy, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox

    Abstract: Convolutional neural networks (CNNs) have recently been very successful in a variety of computer vision tasks, especially on those linked to recognition. Optical flow estimation has not been among the tasks where CNNs were successful. In this paper we construct appropriate CNNs which are capable of solving the optical flow estimation problem as a supervised learning task. We propose and compare tw… ▽ More

    Submitted 4 May, 2015; v1 submitted 26 April, 2015; originally announced April 2015.

    Comments: Added supplementary material

    ACM Class: I.2.6; I.4.8

  49. arXiv:1503.05724  [pdf, other

    stat.ML cs.LG cs.NE

    A Neural Transfer Function for a Smooth and Differentiable Transition Between Additive and Multiplicative Interactions

    Authors: Sebastian Urban, Patrick van der Smagt

    Abstract: Existing approaches to combine both additive and multiplicative neural units either use a fixed assignment of operations or require discrete optimization to determine what function a neuron should perform. This leads either to an inefficient distribution of computational resources or an extensive increase in the computational complexity of the training procedure. We present a novel, parameteriza… ▽ More

    Submitted 29 March, 2016; v1 submitted 19 March, 2015; originally announced March 2015.

  50. arXiv:1311.0701  [pdf, other

    stat.ML cs.LG cs.NE

    On Fast Dropout and its Applicability to Recurrent Networks

    Authors: Justin Bayer, Christian Osendorfer, Daniela Korhammer, Nutan Chen, Sebastian Urban, Patrick van der Smagt

    Abstract: Recurrent Neural Networks (RNNs) are rich models for the processing of sequential data. Recent work on advancing the state of the art has been focused on the optimization or modelling of RNNs, mostly motivated by adressing the problems of the vanishing and exploding gradients. The control of overfitting has seen considerably less attention. This paper contributes to that by analyzing fast dropout,… ▽ More

    Submitted 5 March, 2014; v1 submitted 4 November, 2013; originally announced November 2013.

    Comments: The experiments for the Penn Treebank corpus were erroneous and have been stripped from this version