Skip to main content

Showing 1–50 of 62 results for author: Köhler, J

  1. arXiv:2406.11506  [pdf, other

    cs.RO

    Embedded Hierarchical MPC for Autonomous Navigation

    Authors: Dennis Benders, Johannes Köhler, Thijs Niesten, Robert Babuška, Javier Alonso-Mora, Laura Ferranti

    Abstract: To efficiently deploy robotic systems in society, mobile robots need to autonomously and safely move through complex environments. Nonlinear model predictive control (MPC) methods provide a natural way to find a dynamically feasible trajectory through the environment without colliding with nearby obstacles. However, the limited computation power available on typical embedded robotic systems, such… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 18 pages, 14 figures (excluding biography entries)

  2. arXiv:2405.19243  [pdf

    cs.AI physics.ed-ph

    Challenge-Device-Synthesis: A multi-disciplinary approach for the development of social innovation competences for students of Artificial Intelligence

    Authors: Matías Bilkis, Joan Moya Kohler, Fernando Vilariño

    Abstract: The advent of Artificial Intelligence is expected to imply profound changes in the short-term. It is therefore imperative for Academia, and particularly for the Computer Science scope, to develop cross-disciplinary tools that bond AI developments to their social dimension. To this aim, we introduce the Challenge-Device-Synthesis methodology (CDS), in which a specific challenge is presented to the… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: accepted as contribution for EDULEARN24 - 16th annual International Conference on Education and New Learning Technologies

  3. arXiv:2405.05224  [pdf, other

    cs.CV

    Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation

    Authors: Jonas Kohler, Albert Pumarola, Edgar Schönfeld, Artsiom Sanakoyeu, Roshan Sumbaly, Peter Vajda, Ali Thabet

    Abstract: Diffusion models are a powerful generative framework, but come with expensive inference. Existing acceleration methods often compromise image quality or fail under complex conditioning when operating in an extremely low-step regime. In this work, we propose a novel distillation framework tailored to enable high-fidelity, diverse sample generation using just one to three steps. Our approach compris… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  4. arXiv:2405.03243  [pdf, other

    cs.CV

    Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data

    Authors: Leonhard Hennicke, Christian Medeiros Adriano, Holger Giese, Jan Mathias Koehler, Lukas Schott

    Abstract: Generative foundation models like Stable Diffusion comprise a diverse spectrum of knowledge in computer vision with the potential for transfer learning, e.g., via generating data to train student models for downstream tasks. This could circumvent the necessity of collecting labeled real-world data, thereby presenting a form of data-free knowledge distillation. However, the resultant student models… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  5. arXiv:2404.19020  [pdf

    cs.IR

    Information literacy development and assessment at school level: a systematic review of the literature

    Authors: Luz Chourio-Acevedo, Jacqueline Köhler, Carla Coscarelli, Daniel Gacitúa, Verónica Proaño-Ríos, Roberto González-Ibáñez

    Abstract: Information literacy (IL) involves a group of competences and fundamental skills in the 21st century. Today, society operates around information, which is challenging considering the vast amount of content available online. People must be capable of searching, critically assessing, making sense of, and communicating information. This set of competences must be properly developed since childhood, e… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  6. Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector

    Authors: Bach Ha, Birgit Schalter, Laura White, Joachim Koehler

    Abstract: Maintaining sewer systems in large cities is important, but also time and effort consuming, because visual inspections are currently done manually. To reduce the amount of aforementioned manual work, defects within sewer pipes should be located and classified automatically. In the past, multiple works have attempted solving this problem using classical image processing, machine learning, or a comb… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Journal ref: (2023) In Proceedings of the 3rd International Conference on Image Processing and Vision Engineering - IMPROVE; ISBN 978-989-758-642-2; ISSN 2795-4943, SciTePress, pages 188-198

  7. arXiv:2404.01550  [pdf, other

    cs.RO eess.SY math.OC

    Perfecting Periodic Trajectory Tracking: Model Predictive Control with a Periodic Observer ($Π$-MPC)

    Authors: Luis Pabon, Johannes Köhler, John Irvin Alora, Patrick Benito Eberhard, Andrea Carron, Melanie N. Zeilinger, Marco Pavone

    Abstract: In Model Predictive Control (MPC), discrepancies between the actual system and the predictive model can lead to substantial tracking errors and significantly degrade performance and reliability. While such discrepancies can be alleviated with more complex models, this often complicates controller design and implementation. By leveraging the fact that many trajectories of interest are periodic, we… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, Submitted to the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  8. arXiv:2402.06562  [pdf, other

    eess.SY cs.LG cs.RO math.OC

    Safe Guaranteed Exploration for Non-linear Systems

    Authors: Manish Prajapat, Johannes Köhler, Matteo Turchetta, Andreas Krause, Melanie N. Zeilinger

    Abstract: Safely exploring environments with a-priori unknown constraints is a fundamental challenge that restricts the autonomy of robots. While safety is paramount, guarantees on sufficient exploration are also crucial for ensuring autonomous task completion. To address these challenges, we propose a novel safe guaranteed exploration framework using optimal control, which achieves first-of-its-kind result… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  9. arXiv:2312.16109  [pdf, other

    cs.CV cs.LG

    fMPI: Fast Novel View Synthesis in the Wild with Layered Scene Representations

    Authors: Jonas Kohler, Nicolas Griffiths Sanchez, Luca Cavalli, Catherine Herold, Albert Pumarola, Alberto Garcia Garcia, Ali Thabet

    Abstract: In this study, we propose two novel input processing paradigms for novel view synthesis (NVS) methods based on layered scene representations that significantly improve their runtime without compromising quality. Our approach identifies and mitigates the two most time-consuming aspects of traditional pipelines: building and processing the so-called plane sweep volume (PSV), which is a high-dimensio… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  10. arXiv:2312.12487  [pdf, other

    cs.LG cs.AI

    Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

    Authors: Angela Castillo, Jonas Kohler, Juan C. Pérez, Juan Pablo Pérez, Albert Pumarola, Bernard Ghanem, Pablo Arbeláez, Ali Thabet

    Abstract: This paper presents a comprehensive study on the role of Classifier-Free Guidance (CFG) in text-conditioned diffusion models from the perspective of inference efficiency. In particular, we relax the default choice of applying CFG in all diffusion steps and instead search for efficient guidance policies. We formulate the discovery of such policies in the differentiable Neural Architecture Search fr… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  11. arXiv:2312.10199  [pdf, other

    eess.SY cs.LG math.OC

    Automatic nonlinear MPC approximation with closed-loop guarantees

    Authors: Abdullah Tokmak, Christian Fiedler, Melanie N. Zeilinger, Sebastian Trimpe, Johannes Köhler

    Abstract: Safety guarantees are vital in many control applications, such as robotics. Model predictive control (MPC) provides a constructive framework for controlling safety-critical systems, but is limited by its computational complexity. We address this problem by presenting a novel algorithm that automatically computes an explicit approximation to nonlinear MPC schemes while retaining closed-loop guarant… ▽ More

    Submitted 11 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Submitted to IEEE Transactions on Automatic Control. Compared to the previously uploaded version, this version contains an additional numerical example

  12. arXiv:2312.03209  [pdf, other

    cs.CV

    Cache Me if You Can: Accelerating Diffusion Models through Block Caching

    Authors: Felix Wimbauer, Bichen Wu, Edgar Schoenfeld, Xiaoliang Dai, Ji Hou, Zijian He, Artsiom Sanakoyeu, Peizhao Zhang, Sam Tsai, Jonas Kohler, Christian Rupprecht, Daniel Cremers, Peter Vajda, Jialiang Wang

    Abstract: Diffusion models have recently revolutionized the field of image synthesis due to their ability to generate photorealistic images. However, one of the major drawbacks of diffusion models is that the image generation process is costly. A large image-to-image network has to be applied many times to iteratively refine an image from random noise. While many recent works propose techniques to reduce th… ▽ More

    Submitted 12 January, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Project page: https://fwmb.github.io/blockcaching/

  13. arXiv:2311.04698  [pdf, other

    cs.LG cs.AI cs.CV

    Examining Common Paradigms in Multi-Task Learning

    Authors: Cathrin Elich, Lukas Kirchdorfer, Jan M. Köhler, Lukas Schott

    Abstract: While multi-task learning (MTL) has gained significant attention in recent years, its underlying mechanisms remain poorly understood. Recent methods did not yield consistent performance improvements over single task learning (STL) baselines, underscoring the importance of gaining more profound insights about challenges specific to MTL. In our study, we investigate paradigms in MTL in the context o… ▽ More

    Submitted 27 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: -

  14. arXiv:2311.00604  [pdf, other

    cs.DS

    A Systematic Review of Approximability Results for Traveling Salesman Problems leveraging the TSP-T3CO Definition Scheme

    Authors: Sophia Saller, Jana Koehler, Andreas Karrenbauer

    Abstract: The traveling salesman (or salesperson) problem, short TSP, is a problem of strong interest to many researchers from mathematics, economics, and computer science. Manifold TSP variants occur in nearly every scientific field and application domain: engineering, physics, biology, life sciences, and manufacturing just to name a few. Several thousand papers are published on theoretical research or app… ▽ More

    Submitted 27 January, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  15. arXiv:2309.05746  [pdf, other

    eess.SY cs.RO math.OC

    Robust Nonlinear Reduced-Order Model Predictive Control

    Authors: John Irvin Alora, Luis A. Pabon, Johannes Köhler, Mattia Cenedese, Ed Schmerling, Melanie N. Zeilinger, George Haller, Marco Pavone

    Abstract: Real-world systems are often characterized by high-dimensional nonlinear dynamics, making them challenging to control in real time. While reduced-order models (ROMs) are frequently employed in model-based control schemes, dimensionality reduction introduces model uncertainty which can potentially compromise the stability and safety of the original high-dimensional system. In this work, we propose… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 9 pages, 3 figures, To be presented at Conference for Decision and Control 2023

  16. arXiv:2304.09575  [pdf, other

    eess.SY cs.LG math.OC

    Approximate non-linear model predictive control with safety-augmented neural networks

    Authors: Henrik Hose, Johannes Köhler, Melanie N. Zeilinger, Sebastian Trimpe

    Abstract: Model predictive control (MPC) achieves stability and constraint satisfaction for general nonlinear systems, but requires computationally expensive online optimization. This paper studies approximations of such MPC controllers via neural networks (NNs) to achieve fast online evaluation. We propose safety augmentation that yields deterministic guarantees for convergence and constraint satisfaction… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  17. arXiv:2301.11355  [pdf, other

    cs.LG physics.chem-ph physics.comp-ph stat.ML

    Rigid Body Flows for Sampling Molecular Crystal Structures

    Authors: Jonas Köhler, Michele Invernizzi, Pim de Haan, Frank Noé

    Abstract: Normalizing flows (NF) are a class of powerful generative models that have gained popularity in recent years due to their ability to model complex distributions with high flexibility and expressiveness. In this work, we introduce a new type of normalizing flow that is tailored for modeling positions and orientations of multiple objects in three-dimensional space, such as molecules in a crystal. Ou… ▽ More

    Submitted 7 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: International Conference on Machine Learning, 2023

  18. Motion Planning using Reactive Circular Fields: A 2D Analysis of Collision Avoidance and Goal Convergence

    Authors: Marvin Becker, Johannes Köhler, Sami Haddadin, Matthias A. Müller

    Abstract: Recently, many reactive trajectory planning approaches were suggested in the literature because of their inherent immediate adaption in the ever more demanding cluttered and unpredictable environments of robotic systems. However, typically those approaches are only locally reactive without considering global path planning and no guarantees for simultaneous collision avoidance and goal convergence… ▽ More

    Submitted 3 November, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: Published in IEEE Transactions on Automatic Control (Early Access)

  19. arXiv:2206.02479  [pdf, other

    cs.SE cs.AI

    Easy, adaptable and high-quality Modelling with domain-specific Constraint Patterns

    Authors: Sophia Saller, Jana Koehler

    Abstract: Domain-specific constraint patterns are introduced, which form the counterpart to design patterns in software engineering for the constraint programming setting. These patterns describe the expert knowledge and best-practice solution to recurring problems and include example implementations. We aim to reach a stage where, for common problems, the modelling process consists of simply picking the ap… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 15 pages

    Journal ref: Twentieth International Workshop on Constraint Modelling and Reformulation, ModRef, 2021

  20. arXiv:2203.11167  [pdf, other

    physics.comp-ph cs.LG physics.bio-ph physics.chem-ph

    Flow-matching -- efficient coarse-graining of molecular dynamics without forces

    Authors: Jonas Köhler, Yaoyi Chen, Andreas Krämer, Cecilia Clementi, Frank Noé

    Abstract: Coarse-grained (CG) molecular simulations have become a standard tool to study molecular processes on time- and length-scales inaccessible to all-atom simulations. Parameterizing CG force fields to match all-atom simulations has mainly relied on force-matching or relative entropy minimization, which require many samples from costly simulations with all-atom or CG resolutions, respectively. Here we… ▽ More

    Submitted 5 February, 2023; v1 submitted 21 March, 2022; originally announced March 2022.

    Journal ref: J. Chem. Theory Comput. 2023, XXXX, XXX, XXX-XXX

  21. arXiv:2201.06868  [pdf, other

    eess.AS cs.CL cs.SD

    A Study on the Ambiguity in Human Annotation of German Oral History Interviews for Perceived Emotion Recognition and Sentiment Analysis

    Authors: Michael Gref, Nike Matthiesen, Sreenivasa Hikkal Venugopala, Shalaka Satheesh, Aswinkumar Vijayananth, Duc Bach Ha, Sven Behnke, Joachim Köhler

    Abstract: For research in audiovisual interview archives often it is not only of interest what is said but also how. Sentiment analysis and emotion recognition can help capture, categorize and make these different facets searchable. In particular, for oral history archives, such indexing technologies can be of great interest. These technologies can help understand the role of emotions in historical remember… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: Submitted to LREC 2022

  22. arXiv:2201.06841  [pdf, other

    eess.AS cs.CL cs.SD

    Human and Automatic Speech Recognition Performance on German Oral History Interviews

    Authors: Michael Gref, Nike Matthiesen, Christoph Schmidt, Sven Behnke, Joachim Köhler

    Abstract: Automatic speech recognition systems have accomplished remarkable improvements in transcription accuracy in recent years. On some domains, models now achieve near-human performance. However, transcription performance on oral history has not yet reached human accuracy. In the present work, we investigate how large this gap between human and machine transcription still is. For this purpose, we analy… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: Submitted to LREC 2022

  23. arXiv:2111.01457  [pdf, other

    cs.SD cs.HC cs.LG

    Synthesizing Speech from Intracranial Depth Electrodes using an Encoder-Decoder Framework

    Authors: Jonas Kohler, Maarten C. Ottenhoff, Sophocles Goulis, Miguel Angrick, Albert J. Colon, Louis Wagner, Simon Tousseyn, Pieter L. Kubben, Christian Herff

    Abstract: Speech Neuroprostheses have the potential to enable communication for people with dysarthria or anarthria. Recent advances have demonstrated high-quality text decoding and speech synthesis from electrocorticographic grids placed on the cortical surface. Here, we investigate a less invasive measurement modality in three participants, namely stereotactic EEG (sEEG) that provides sparse sampling from… ▽ More

    Submitted 31 October, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

  24. arXiv:2110.00351  [pdf, other

    stat.ML cs.LG physics.chem-ph

    Smooth Normalizing Flows

    Authors: Jonas Köhler, Andreas Krämer, Frank Noé

    Abstract: Normalizing flows are a promising tool for modeling probability distributions in physical systems. While state-of-the-art flows accurately approximate distributions and energies, applications in physics additionally require smooth energies to compute forces and higher-order derivatives. Furthermore, such densities are often defined on non-trivial topologies. A recent example are Boltzmann Generato… ▽ More

    Submitted 30 November, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: Neural Information Proceessing Systems (NeurIPS) 2021

  25. arXiv:2108.03952  [pdf, other

    cs.LG cs.RO

    Safe Deep Reinforcement Learning for Multi-Agent Systems with Continuous Action Spaces

    Authors: Ziyad Sheebaelhamd, Konstantinos Zisis, Athina Nisioti, Dimitris Gkouletsos, Dario Pavllo, Jonas Kohler

    Abstract: Multi-agent control problems constitute an interesting area of application for deep reinforcement learning models with continuous action spaces. Such real-world applications, however, typically come with critical safety constraints that must not be violated. In order to ensure safety, we enhance the well-known multi-agent deep deterministic policy gradient (MADDPG) framework by adding a safety lay… ▽ More

    Submitted 11 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: ICML 2021 Workshop on Reinforcement Learning for Real Life

  26. arXiv:2107.05007  [pdf, other

    physics.chem-ph cs.LG

    Generating stable molecules using imitation and reinforcement learning

    Authors: Søren Ager Meldgaard, Jonas Köhler, Henrik Lund Mortensen, Mads-Peter V. Christiansen, Frank Noé, Bjørk Hammer

    Abstract: Chemical space is routinely explored by machine learning methods to discover interesting molecules, before time-consuming experimental synthesizing is attempted. However, these methods often rely on a graph representation, ignoring 3D information necessary for determining the stability of the molecules. We propose a reinforcement learning approach for generating molecules in cartesian coordinates… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

  27. arXiv:2106.03763  [pdf, other

    cs.LG

    Vanishing Curvature and the Power of Adaptive Methods in Randomly Initialized Deep Networks

    Authors: Antonio Orvieto, Jonas Kohler, Dario Pavllo, Thomas Hofmann, Aurelien Lucchi

    Abstract: This paper revisits the so-called vanishing gradient phenomenon, which commonly occurs in deep randomly initialized neural networks. Leveraging an in-depth analysis of neural chains, we first show that vanishing gradients cannot be circumvented when the network width scales with less than O(depth), even when initialized with the popular Xavier and He initializations. Second, we extend the analysis… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  28. arXiv:2105.11879  [pdf, other

    cs.CV

    Flexible Table Recognition and Semantic Interpretation System

    Authors: Marcin Namysl, Alexander M. Esser, Sven Behnke, Joachim Köhler

    Abstract: Table extraction is an important but still unsolved problem. In this paper, we introduce a flexible and modular table extraction system. We develop two rule-based algorithms that perform the complete table recognition process, including table detection and segmentation, and support the most frequent table formats. Moreover, to incorporate the extraction of semantic information, we develop a graph-… ▽ More

    Submitted 2 December, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in Proceedings of the 17th International Conference on Computer Vision Theory and Applications (VISAPP 2022)

  29. arXiv:2105.11872  [pdf, other

    cs.CL

    Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling

    Authors: Marcin Namysl, Sven Behnke, Joachim Köhler

    Abstract: Despite recent advances, standard sequence labeling systems often fail when processing noisy user-generated text or consuming the output of an Optical Character Recognition (OCR) process. In this paper, we improve the noise-aware training method by proposing an empirical error generation approach that employs a sequence-to-sequence model trained to perform translation from error-free to erroneous… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: Accepted to appear in Findings of ACL 2021 (camera-ready version)

  30. arXiv:2105.02968  [pdf, other

    cs.CV cs.AI cs.LG

    This Looks Like That... Does it? Shortcomings of Latent Space Prototype Interpretability in Deep Networks

    Authors: Adrian Hoffmann, Claudio Fanconi, Rahul Rade, Jonas Kohler

    Abstract: Deep neural networks that yield human interpretable decisions by architectural design have lately become an increasingly popular alternative to post hoc interpretation of traditional black-box models. Among these networks, the arguably most widespread approach is so-called prototype learning, where similarities to learned latent prototypes serve as the basis of classifying an unseen data point. In… ▽ More

    Submitted 23 June, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Journal ref: ICML 2021 Workshop on Theoretic Foundation, Criticism, and Application Trend of Explainable AI

  31. arXiv:2103.15627  [pdf, other

    cs.CV cs.GR cs.LG

    Learning Generative Models of Textured 3D Meshes from Real-World Images

    Authors: Dario Pavllo, Jonas Kohler, Thomas Hofmann, Aurelien Lucchi

    Abstract: Recent advances in differentiable rendering have sparked an interest in learning generative models of textured 3D meshes from image collections. These models natively disentangle pose and appearance, enable downstream applications in computer graphics, and improve the ability of generative models to understand the concept of image formation. Although there has been prior work on learning such mode… ▽ More

    Submitted 17 August, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: ICCV 2021

  32. arXiv:2011.14006  [pdf, ps, other

    eess.SY cs.LG stat.ML

    Offset-free setpoint tracking using neural network controllers

    Authors: Patricia Pauli, Johannes Köhler, Julian Berberich, Anne Koch, Frank Allgöwer

    Abstract: In this paper, we present a method to analyze local and global stability in offset-free setpoint tracking using neural network controllers and we provide ellipsoidal inner approximations of the corresponding region of attraction. We consider a feedback interconnection of a linear plant in connection with a neural network controller and an integrator, which allows for offset-free tracking of a desi… ▽ More

    Submitted 29 April, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

  33. arXiv:2011.12862  [pdf, other

    cs.AI

    Cable Tree Wiring -- Benchmarking Solvers on a Real-World Scheduling Problem with a Variety of Precedence Constraints

    Authors: Jana Koehler, Joseph Bürgler, Urs Fontana, Etienne Fux, Florian Herzog, Marc Pouly, Sophia Saller, Anastasia Salyaeva, Peter Scheiblechner, Kai Waelti

    Abstract: Cable trees are used in industrial products to transmit energy and information between different product parts. To this date, they are mostly assembled by humans and only few automated manufacturing solutions exist using complex robotic machines. For these machines, the wiring plan has to be translated into a wiring sequence of cable plugging operations to be followed by the machine. In this paper… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

  34. arXiv:2011.00573  [pdf, other

    cs.LG

    Two-Level K-FAC Preconditioning for Deep Learning

    Authors: Nikolaos Tselepidis, Jonas Kohler, Antonio Orvieto

    Abstract: In the context of deep learning, many optimization methods use gradient covariance information in order to accelerate the convergence of Stochastic Gradient Descent. In particular, starting with Adagrad, a seemingly endless line of research advocates the use of diagonal approximations of the so-called empirical Fisher matrix in stochastic gradient-based algorithms, with the most prominent one argu… ▽ More

    Submitted 6 December, 2020; v1 submitted 1 November, 2020; originally announced November 2020.

  35. arXiv:2010.07033  [pdf, other

    stat.ML cs.LG math.OC physics.chem-ph physics.data-an

    Training Invertible Linear Layers through Rank-One Perturbations

    Authors: Andreas Krämer, Jonas Köhler, Frank Noé

    Abstract: Many types of neural network layers rely on matrix properties such as invertibility or orthogonality. Retaining such properties during optimization with gradient-based stochastic optimizers is a challenging task, which is usually addressed by either reparameterization of the affected parameters or by directly optimizing on the manifold. This work presents a novel approach for training invertible l… ▽ More

    Submitted 30 November, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: 17 pages, 10 figures

    MSC Class: 68T07; 82-10

  36. arXiv:2006.02425  [pdf, other

    stat.ML cs.LG physics.chem-ph physics.comp-ph

    Equivariant Flows: Exact Likelihood Generative Learning for Symmetric Densities

    Authors: Jonas Köhler, Leon Klein, Frank Noé

    Abstract: Normalizing flows are exact-likelihood generative neural networks which approximately transform samples from a simple prior distribution to samples of the probability distribution of interest. Recent work showed that such generative models can be utilized in statistical mechanics to sample equilibrium states of many-body systems in physics and chemistry. To scale and generalize these results, it i… ▽ More

    Submitted 26 October, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

  37. arXiv:2005.12562  [pdf, other

    eess.AS cs.CL

    Multi-Staged Cross-Lingual Acoustic Model Adaption for Robust Speech Recognition in Real-World Applications -- A Case Study on German Oral History Interviews

    Authors: Michael Gref, Oliver Walter, Christoph Schmidt, Sven Behnke, Joachim Köhler

    Abstract: While recent automatic speech recognition systems achieve remarkable performance when large amounts of adequate, high quality annotated speech data is used for training, the same systems often only achieve an unsatisfactory result for tasks in domains that greatly deviate from the conditions represented by the training data. For many real-world applications, there is a lack of sufficient data that… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: Published version of the paper can be accessed via https://www.aclweb.org/anthology/2020.lrec-1.780

    Journal ref: 12th International Conference on Language Resources and Evaluation (LREC 2020), pages 6354-6362

  38. arXiv:2005.07162  [pdf, other

    cs.CL

    NAT: Noise-Aware Training for Robust Neural Sequence Labeling

    Authors: Marcin Namysl, Sven Behnke, Joachim Köhler

    Abstract: Sequence labeling systems should perform reliably not only under ideal conditions but also with corrupted inputs - as these systems often process user-generated text or follow an error-prone upstream component. To this end, we formulate the noisy sequence labeling problem, where the input may undergo an unknown noising process and propose two Noise-Aware Training (NAT) objectives that improve robu… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: Accepted to appear at ACL 2020

  39. arXiv:2004.08355  [pdf

    cs.CL cs.AI

    Towards an Interoperable Ecosystem of AI and LT Platforms: A Roadmap for the Implementation of Different Levels of Interoperability

    Authors: Georg Rehm, Dimitrios Galanis, Penny Labropoulou, Stelios Piperidis, Martin Welß, Ricardo Usbeck, Joachim Köhler, Miltos Deligiannis, Katerina Gkirtzou, Johannes Fischer, Christian Chiarcos, Nils Feldhus, Julián Moreno-Schneider, Florian Kintzel, Elena Montiel, Víctor Rodríguez Doncel, John P. McCrae, David Laqua, Irina Patricia Theile, Christian Dittmar, Kalina Bontcheva, Ian Roberts, Andrejs Vasiljevs, Andis Lagzdiņš

    Abstract: With regard to the wider area of AI/LT platform interoperability, we concentrate on two core aspects: (1) cross-platform search and discovery of resources and services; (2) composition of cross-platform service workflows. We devise five different levels (of increasing complexity) of platform interoperability that we suggest to implement in a wider federation of AI/LT platforms. We illustrate the a… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

    Comments: Proceedings of the 1st International Workshop on Language Technology Platforms (IWLTP 2020). To appear

  40. arXiv:2003.13833  [pdf

    cs.CL cs.AI cs.DL

    The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe

    Authors: Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiļjevs, Gerhard Backfried, Christoph Prinz, José Manuel Gómez Pérez, Luc Meertens, Paul Lukowicz, Josef van Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim Köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriūtė, Núria Bel, António Branco, Gerhard Budin , et al. (22 additional authors not shown)

    Abstract: Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality. However, language barriers impacting business, cross-lingual and cross-cultural communication are still omnipresent. Language Technologies (LTs) are a powerful means to break down these barriers. While the last decade has seen various initiatives that created a multitu… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). To appear

  41. arXiv:2003.01652  [pdf, other

    stat.ML cs.LG

    Batch Normalization Provably Avoids Rank Collapse for Randomly Initialised Deep Networks

    Authors: Hadi Daneshmand, Jonas Kohler, Francis Bach, Thomas Hofmann, Aurelien Lucchi

    Abstract: Randomly initialized neural networks are known to become harder to train with increasing depth, unless architectural enhancements like residual connections and batch normalization are used. We here investigate this phenomenon by revisiting the connection between random initialization in deep networks and spectral instabilities in products of random matrices. Given the rich literature on random mat… ▽ More

    Submitted 11 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

  42. arXiv:2002.06707  [pdf, other

    stat.ML cs.LG physics.chem-ph physics.data-an

    Stochastic Normalizing Flows

    Authors: Hao Wu, Jonas Köhler, Frank Noé

    Abstract: The sampling of probability distributions specified up to a normalization constant is an important problem in both machine learning and statistical mechanics. While classical stochastic sampling methods such as Markov Chain Monte Carlo (MCMC) or Langevin Dynamics (LD) can suffer from slow mixing times there is a growing interest in using normalizing flows in order to learn the transformation of a… ▽ More

    Submitted 26 October, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

  43. arXiv:1912.10360  [pdf, other

    cs.RO cs.LG eess.SY

    Safe and Fast Tracking on a Robot Manipulator: Robust MPC and Neural Network Control

    Authors: Julian Nubert, Johannes Köhler, Vincent Berenz, Frank Allgöwer, Sebastian Trimpe

    Abstract: Fast feedback control and safety guarantees are essential in modern robotics. We present an approach that achieves both by combining novel robust model predictive control (MPC) with function approximation via (deep) neural networks (NNs). The result is a new approach for complex tasks with nonlinear, uncertain, and constrained dynamics as are common in robotics. Specifically, we leverage recent re… ▽ More

    Submitted 2 March, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: 8 pages, 4 figures,

    Journal ref: Robotics and Automation Letters, 2020

  44. arXiv:1911.10367  [pdf, other

    math.OC cs.LG

    A Sub-sampled Tensor Method for Non-convex Optimization

    Authors: Aurelien Lucchi, Jonas Kohler

    Abstract: We present a stochastic optimization method that uses a fourth-order regularized model to find local minima of smooth and potentially non-convex objective functions with a finite-sum structure. This algorithm uses sub-sampled derivatives instead of exact quantities. The proposed approach is shown to find an $(ε_1,ε_2,ε_3)$-third-order critical point in at most… ▽ More

    Submitted 15 July, 2023; v1 submitted 23 November, 2019; originally announced November 2019.

    Comments: Initial title: A Stochastic Tensor Method for Non-convex Optimization

  45. arXiv:1910.06924  [pdf, other

    cs.LG stat.ML

    DP-MAC: The Differentially Private Method of Auxiliary Coordinates for Deep Learning

    Authors: Frederik Harder, Jonas Köhler, Max Welling, Mijung Park

    Abstract: Developing a differentially private deep learning algorithm is challenging, due to the difficulty in analyzing the sensitivity of objective functions that are typically used to train deep neural networks. Many existing methods resort to the stochastic gradient descent algorithm and apply a pre-defined sensitivity to the gradients for privatizing weights. However, their slow convergence typically y… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

  46. arXiv:1910.00753  [pdf, other

    stat.ML cs.LG physics.chem-ph physics.comp-ph

    Equivariant Flows: sampling configurations for multi-body systems with symmetric energies

    Authors: Jonas Köhler, Leon Klein, Frank Noé

    Abstract: Flows are exact-likelihood generative neural networks that transform samples from a simple prior distribution to the samples of the probability distribution of interest. Boltzmann Generators (BG) combine flows and statistical mechanics to sample equilibrium states of strongly interacting many-body systems such as proteins with 1000 atoms. In order to scale and generalize these results, it is essen… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  47. arXiv:1908.06709  [pdf, other

    eess.AS cs.CL cs.SD

    Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews

    Authors: Michael Gref, Christoph Schmidt, Sven Behnke, Joachim Köhler

    Abstract: In automatic speech recognition, often little training data is available for specific challenging tasks, but training of state-of-the-art automatic speech recognition systems requires large amounts of annotated speech. To address this issue, we propose a two-staged approach to acoustic modeling that combines noise and reverberation data augmentation with transfer learning to robustly address chall… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: Accepted for IEEE International Conference on Multimedia and Expo (ICME), Shanghai, China, July 2019

    Journal ref: IEEE International Conference on Multimedia and Expo (ICME), Shanghai, China, July 2019

  48. arXiv:1908.02686  [pdf, other

    cs.CV cs.LG

    Interpretable and Fine-Grained Visual Explanations for Convolutional Neural Networks

    Authors: Jörg Wagner, Jan Mathias Köhler, Tobias Gindele, Leon Hetzel, Jakob Thaddäus Wiedemer, Sven Behnke

    Abstract: To verify and validate networks, it is essential to gain insight into their decisions, limitations as well as possible shortcomings of training data. In this work, we propose a post-hoc, optimization based visual explanation method, which highlights the evidence in the input image for a specific prediction. Our approach is based on a novel technique to defend against adversarial evidence (i.e. fau… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, June 2019

  49. arXiv:1907.01678  [pdf, other

    cs.LG math.OC stat.ML

    The Role of Memory in Stochastic Optimization

    Authors: Antonio Orvieto, Jonas Kohler, Aurelien Lucchi

    Abstract: The choice of how to retain information about past gradients dramatically affects the convergence properties of state-of-the-art stochastic optimization methods, such as Heavy-ball, Nesterov's momentum, RMSprop and Adam. Building on this observation, we use stochastic differential equations (SDEs) to explicitly study the role of memory in gradient-based algorithms. We first derive a general contin… ▽ More

    Submitted 11 March, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: Accepted paper at the 35th Conference on Uncertainty in Artificial Intelligence (UAI), Tel Aviv, 2019

  50. arXiv:1906.11876  [pdf, other

    cs.CV cs.LG stat.ML

    Uncertainty Based Detection and Relabeling of Noisy Image Labels

    Authors: Jan M. Köhler, Maximilian Autenrieth, William H. Beluch

    Abstract: Deep neural networks (DNNs) are powerful tools in computer vision tasks. However, in many realistic scenarios label noise is prevalent in the training images, and overfitting to these noisy labels can significantly harm the generalization performance of DNNs. We propose a novel technique to identify data with noisy labels based on the different distributions of the predictive uncertainties from a… ▽ More

    Submitted 29 May, 2019; originally announced June 2019.

    Comments: Uncertainty and Robustness in Deep Visual Learning Workshop at CVPR 2019