-
Learning to Predict Structural Vibrations
Authors:
Jan van Delden,
Julius Schultz,
Christopher Blech,
Sabine C. Langer,
Timo Lüddecke
Abstract:
In mechanical structures like airplanes, cars and houses, noise is generated and transmitted through vibrations. To take measures to reduce this noise, vibrations need to be simulated with expensive numerical computations. Surrogate deep learning models present a promising alternative to classical numerical simulations as they can be evaluated magnitudes faster, while trading-off accuracy. To quan…
▽ More
In mechanical structures like airplanes, cars and houses, noise is generated and transmitted through vibrations. To take measures to reduce this noise, vibrations need to be simulated with expensive numerical computations. Surrogate deep learning models present a promising alternative to classical numerical simulations as they can be evaluated magnitudes faster, while trading-off accuracy. To quantify such trade-offs systematically and foster the development of methods, we present a benchmark on the task of predicting the vibration of harmonically excited plates. The benchmark features a total of 12000 plate geometries with varying forms of beadings, material and sizes with associated numerical solutions. To address the benchmark task, we propose a new network architecture, named Frequency-Query Operator, which is trained to map plate geometries to their vibration pattern given a specific excitation frequency. Applying principles from operator learning and implicit models for shape encoding, our approach effectively addresses the prediction of highly variable frequency response functions occurring in dynamic systems. To quantify the prediction quality, we introduce a set of evaluation metrics and evaluate the method on our vibrating-plates benchmark. Our method outperforms DeepONets, Fourier Neural Operators and more traditional neural network architectures. Code, dataset and visualizations: https://eckerlab.org/code/delden2023_plate
△ Less
Submitted 22 March, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Authors:
Marc Lanctot,
John Schultz,
Neil Burch,
Max Olan Smith,
Daniel Hennes,
Thomas Anthony,
Julien Perolat
Abstract:
Progress in fields of machine learning and adversarial planning has benefited significantly from benchmark domains, from checkers and the classic UCI data sets to Go and Diplomacy. In sequential decision-making, agent evaluation has largely been restricted to few interactions against experts, with the aim to reach some desired level of performance (e.g. beating a human professional player). We pro…
▽ More
Progress in fields of machine learning and adversarial planning has benefited significantly from benchmark domains, from checkers and the classic UCI data sets to Go and Diplomacy. In sequential decision-making, agent evaluation has largely been restricted to few interactions against experts, with the aim to reach some desired level of performance (e.g. beating a human professional player). We propose a benchmark for multiagent learning based on repeated play of the simple game Rock, Paper, Scissors along with a population of forty-three tournament entries, some of which are intentionally sub-optimal. We describe metrics to measure the quality of agents based both on average returns and exploitability. We then show that several RL, online learning, and language model approaches can learn good counter-strategies and generalize well, but ultimately lose to the top-performing bots, creating an opportunity for research in multiagent learning.
△ Less
Submitted 31 October, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Discrepancies among Pre-trained Deep Neural Networks: A New Threat to Model Zoo Reliability
Authors:
Diego Montes,
Pongpatapee Peerapatanapokin,
Jeff Schultz,
Chengjun Gun,
Wenxin Jiang,
James C. Davis
Abstract:
Training deep neural networks (DNNs) takes signifcant time and resources. A practice for expedited deployment is to use pre-trained deep neural networks (PTNNs), often from model zoos -- collections of PTNNs; yet, the reliability of model zoos remains unexamined. In the absence of an industry standard for the implementation and performance of PTNNs, engineers cannot confidently incorporate them in…
▽ More
Training deep neural networks (DNNs) takes signifcant time and resources. A practice for expedited deployment is to use pre-trained deep neural networks (PTNNs), often from model zoos -- collections of PTNNs; yet, the reliability of model zoos remains unexamined. In the absence of an industry standard for the implementation and performance of PTNNs, engineers cannot confidently incorporate them into production systems. As a first step, discovering potential discrepancies between PTNNs across model zoos would reveal a threat to model zoo reliability. Prior works indicated existing variances in deep learning systems in terms of accuracy. However, broader measures of reliability for PTNNs from model zoos are unexplored. This work measures notable discrepancies between accuracy, latency, and architecture of 36 PTNNs across four model zoos. Among the top 10 discrepancies, we find differences of 1.23%-2.62% in accuracy and 9%-131% in latency. We also fnd mismatches in architecture for well-known DNN architectures (e.g., ResNet and AlexNet). Our findings call for future works on empirical validation, automated tools for measurement, and best practices for implementation.
△ Less
Submitted 4 March, 2023;
originally announced March 2023.
-
Learning to Navigate Wikipedia by Taking Random Walks
Authors:
Manzil Zaheer,
Kenneth Marino,
Will Grathwohl,
John Schultz,
Wendy Shang,
Sheila Babayan,
Arun Ahuja,
Ishita Dasgupta,
Christine Kaeser-Chen,
Rob Fergus
Abstract:
A fundamental ability of an intelligent web-based agent is seeking out and acquiring new information. Internet search engines reliably find the correct vicinity but the top results may be a few links away from the desired target. A complementary approach is navigation via hyperlinks, employing a policy that comprehends local content and selects a link that moves it closer to the target. In this pa…
▽ More
A fundamental ability of an intelligent web-based agent is seeking out and acquiring new information. Internet search engines reliably find the correct vicinity but the top results may be a few links away from the desired target. A complementary approach is navigation via hyperlinks, employing a policy that comprehends local content and selects a link that moves it closer to the target. In this paper, we show that behavioral cloning of randomly sampled trajectories is sufficient to learn an effective link selection policy. We demonstrate the approach on a graph version of Wikipedia with 38M nodes and 387M edges. The model is able to efficiently navigate between nodes 5 and 20 steps apart 96% and 92% of the time, respectively. We then use the resulting embeddings and policy in downstream fact verification and question answering tasks where, in combination with basic TF-IDF search and ranking methods, they are competitive results to the state-of-the-art methods.
△ Less
Submitted 31 October, 2022;
originally announced November 2022.
-
A Framework for Deprecating Datasets: Standardizing Documentation, Identification, and Communication
Authors:
Alexandra Sasha Luccioni,
Frances Corry,
Hamsini Sridharan,
Mike Ananny,
Jason Schultz,
Kate Crawford
Abstract:
Datasets are central to training machine learning (ML) models. The ML community has recently made significant improvements to data stewardship and documentation practices across the model development life cycle. However, the act of deprecating, or deleting, datasets has been largely overlooked, and there are currently no standardized approaches for structuring this stage of the dataset life cycle.…
▽ More
Datasets are central to training machine learning (ML) models. The ML community has recently made significant improvements to data stewardship and documentation practices across the model development life cycle. However, the act of deprecating, or deleting, datasets has been largely overlooked, and there are currently no standardized approaches for structuring this stage of the dataset life cycle. In this paper, we study the practice of dataset deprecation in ML, identify several cases of datasets that continued to circulate despite having been deprecated, and describe the different technical, legal, ethical, and organizational issues raised by such continuations. We then propose a Dataset Deprecation Framework that includes considerations of risk, mitigation of impact, appeal mechanisms, timeline, post-deprecation protocols, and publication checks that can be adapted and implemented by the ML community. Finally, we propose creating a centralized, sustainable repository system for archiving datasets, tracking dataset modifications or deprecations, and facilitating practices of care and stewardship that can be integrated into research and publication processes.
△ Less
Submitted 9 May, 2022; v1 submitted 18 October, 2021;
originally announced November 2021.
-
The Advantage Regret-Matching Actor-Critic
Authors:
Audrūnas Gruslys,
Marc Lanctot,
Rémi Munos,
Finbarr Timbers,
Martin Schmid,
Julien Perolat,
Dustin Morrill,
Vinicius Zambaldi,
Jean-Baptiste Lespiau,
John Schultz,
Mohammad Gheshlaghi Azar,
Michael Bowling,
Karl Tuyls
Abstract:
Regret minimization has played a key role in online learning, equilibrium computation in games, and reinforcement learning (RL). In this paper, we describe a general model-free RL method for no-regret learning based on repeated reconsideration of past behavior. We propose a model-free RL algorithm, the AdvantageRegret-Matching Actor-Critic (ARMAC): rather than saving past state-action data, ARMAC…
▽ More
Regret minimization has played a key role in online learning, equilibrium computation in games, and reinforcement learning (RL). In this paper, we describe a general model-free RL method for no-regret learning based on repeated reconsideration of past behavior. We propose a model-free RL algorithm, the AdvantageRegret-Matching Actor-Critic (ARMAC): rather than saving past state-action data, ARMAC saves a buffer of past policies, replaying through them to reconstruct hindsight assessments of past behavior. These retrospective value estimates are used to predict conditional advantages which, combined with regret matching, produces a new policy. In particular, ARMAC learns from sampled trajectories in a centralized training setting, without requiring the application of importance sampling commonly used in Monte Carlo counterfactual regret (CFR) minimization; hence, it does not suffer from excessive variance in large environments. In the single-agent setting, ARMAC shows an interesting form of exploration by keeping past policies intact. In the multiagent setting, ARMAC in self-play approaches Nash equilibria on some partially-observable zero-sum benchmarks. We provide exploitability estimates in the significantly larger game of betting-abstracted no-limit Texas Hold'em.
△ Less
Submitted 27 August, 2020;
originally announced August 2020.
-
Application and Assessment of Deep Learning for the Generation of Potential NMDA Receptor Antagonists
Authors:
Katherine J. Schultz,
Sean M. Colby,
Yasemin Yesiltepe,
Jamie R. Nuñez,
Monee Y. McGrady,
Ryan R. Renslow
Abstract:
Uncompetitive antagonists of the N-methyl D-aspartate receptor (NMDAR) have demonstrated therapeutic benefit in the treatment of neurological diseases such as Parkinson's and Alzheimer's, but some also cause dissociative effects that have led to the synthesis of illicit drugs. The ability to generate NMDAR antagonists in silico is therefore desirable both for new medication development and for pre…
▽ More
Uncompetitive antagonists of the N-methyl D-aspartate receptor (NMDAR) have demonstrated therapeutic benefit in the treatment of neurological diseases such as Parkinson's and Alzheimer's, but some also cause dissociative effects that have led to the synthesis of illicit drugs. The ability to generate NMDAR antagonists in silico is therefore desirable both for new medication development and for preempting and identifying new designer drugs. Recently, generative deep learning models have been applied to de novo drug design as a means to expand the amount of chemical space that can be explored for potential drug-like compounds. In this study, we assess the application of a generative model to the NMDAR to achieve two primary objectives: (i) the creation and release of a comprehensive library of experimentally validated NMDAR phencyclidine (PCP) site antagonists to assist the drug discovery community and (ii) an analysis of both the advantages conferred by applying such generative artificial intelligence models to drug design and the current limitations of the approach. We apply, and provide source code for, a variety of ligand- and structure-based assessment techniques used in standard drug discovery analyses to the deep learning-generated compounds. We present twelve candidate antagonists that are not available in existing chemical databases to provide an example of what this type of workflow can achieve, though synthesis and experimental validation of these compounds is still required.
△ Less
Submitted 31 March, 2020;
originally announced March 2020.
-
Uncertainty Modeling and Analysis of the European X-ray Free Electron Laser Cavities Manufacturing Process
Authors:
Jacopo Corno,
Niklas Georg,
Shahnam Gorgi Zadeh,
Johann Heller,
Vladimir Gubarev,
Toon Roggen,
Ulrich Römer,
Christian Schmidt,
Sebastian Schöps,
Julius Schultz,
Alexey Sulimov,
Ursula van Rienen
Abstract:
This paper reports on comprehensive efforts on uncertainty quantification and global sensitivity analysis for accelerator cavity design. As a case study object the TESLA shaped superconducting cavities, as produced for the European X-ray Free Electron Laser (EXFEL), are selected. The choice for these cavities is explained by the available measurement data that can be leveraged to substantiate the…
▽ More
This paper reports on comprehensive efforts on uncertainty quantification and global sensitivity analysis for accelerator cavity design. As a case study object the TESLA shaped superconducting cavities, as produced for the European X-ray Free Electron Laser (EXFEL), are selected. The choice for these cavities is explained by the available measurement data that can be leveraged to substantiate the simulation model. Each step of the manufacturing chain is documented together with the involved uncertainties. Several of these steps are mimicked on the simulation side, e.g. by introducing a random eigenvalue problem. The uncertainties are then quantified numerically and in particular the sensitivities give valuable insight into the systems behavior. We also compare these findings to purely statistical studies carried out for the manufactured cavities. More advanced, adaptive, surrogate modeling techniques are adopted, which are crucial to incorporate a large number of uncertain parameters. The main contribution is the detailed comparison and fusion of measurement results for the EXFEL cavities on the one hand and simulation based uncertainty studies on the other hand. After introducing the quantities of physical interest for accelerator cavities and the Maxwell eigenvalue problem, the details on the manufacturing of the EXFEL cavities and measurements are reported. This is followed by uncertainty modeling with quantification studies.
△ Less
Submitted 12 December, 2019; v1 submitted 21 June, 2019;
originally announced June 2019.
-
Efficient Computation of Higher-Order Variational Integrators in Robotic Simulation and Trajectory Optimization
Authors:
Taosha Fan,
Jarvis Schultz,
Todd Murphey
Abstract:
This paper addresses the problem of efficiently computing higher-order variational integrators in simulation and trajectory optimization of mechanical systems as those often found in robotic applications. We develop $O(n)$ algorithms to evaluate the discrete Euler-Lagrange (DEL) equations and compute the Newton direction for solving the DEL equations, which results in linear-time variational integ…
▽ More
This paper addresses the problem of efficiently computing higher-order variational integrators in simulation and trajectory optimization of mechanical systems as those often found in robotic applications. We develop $O(n)$ algorithms to evaluate the discrete Euler-Lagrange (DEL) equations and compute the Newton direction for solving the DEL equations, which results in linear-time variational integrators of arbitrarily high order. To our knowledge, no linear-time higher-order variational or even implicit integrators have been developed before. Moreover, an $O(n^2)$ algorithm to linearize the DEL equations is presented, which is useful for trajectory optimization. These proposed algorithms eliminate the bottleneck of implementing higher-order variational integrators in simulation and trajectory optimization of complex robotic systems. The efficacy of this paper is validated through comparison with existing methods, and implementation on various robotic systems---including trajectory optimization of the Spring Flamingo robot, the LittleDog robot and the Atlas robot. The results illustrate that the same integrator can be used for simulation and trajectory optimization in robotics, preserving mechanical properties while achieving good scalability and accuracy.
△ Less
Submitted 29 April, 2019;
originally announced April 2019.
-
Artificial Intelligence for Social Good
Authors:
Gregory D. Hager,
Ann Drobnis,
Fei Fang,
Rayid Ghani,
Amy Greenwald,
Terah Lyons,
David C. Parkes,
Jason Schultz,
Suchi Saria,
Stephen F. Smith,
Milind Tambe
Abstract:
The Computing Community Consortium (CCC), along with the White House Office of Science and Technology Policy (OSTP), and the Association for the Advancement of Artificial Intelligence (AAAI), co-sponsored a public workshop on Artificial Intelligence for Social Good on June 7th, 2016 in Washington, DC. This was one of five workshops that OSTP co-sponsored and held around the country to spur public…
▽ More
The Computing Community Consortium (CCC), along with the White House Office of Science and Technology Policy (OSTP), and the Association for the Advancement of Artificial Intelligence (AAAI), co-sponsored a public workshop on Artificial Intelligence for Social Good on June 7th, 2016 in Washington, DC. This was one of five workshops that OSTP co-sponsored and held around the country to spur public dialogue on artificial intelligence, machine learning, and to identify challenges and opportunities related to AI. In the AI for Social Good workshop, the successful deployments and the potential use of AI in various topics that are essential for social good were discussed, including but not limited to urban computing, health, environmental sustainability, and public welfare. This report highlights each of these as well as a number of crosscutting issues.
△ Less
Submitted 16 January, 2019;
originally announced January 2019.
-
Dynamic Task Execution using Active Parameter Identification with the Baxter Research Robot
Authors:
Andrew D. Wilson,
Jarvis A. Schultz,
Alex R. Ansari,
Todd D. Murphey
Abstract:
This paper presents experimental results from real-time parameter estimation of a system model and subsequent trajectory optimization for a dynamic task using the Baxter Research Robot from Rethink Robotics. An active estimator maximizing Fisher information is used in real-time with a closed-loop, non-linear control technique known as Sequential Action Control. Baxter is tasked with estimating the…
▽ More
This paper presents experimental results from real-time parameter estimation of a system model and subsequent trajectory optimization for a dynamic task using the Baxter Research Robot from Rethink Robotics. An active estimator maximizing Fisher information is used in real-time with a closed-loop, non-linear control technique known as Sequential Action Control. Baxter is tasked with estimating the length of a string connected to a load suspended from the gripper with a load cell providing the single source of feedback to the estimator. Following the active estimation, a trajectory is generated using the trep software package that controls Baxter to dynamically swing a suspended load into a box. Several trials are presented with varying initial estimates showing that estimation is required to obtain adequate open-loop trajectories to complete the prescribed task. The result of one trial with and without the active estimation is also shown in the accompanying video.
△ Less
Submitted 11 September, 2017;
originally announced September 2017.
-
Trajectory Synthesis for Fisher Information Maximization
Authors:
Andrew D. Wilson,
Jarvis A. Schultz,
Todd D. Murphey
Abstract:
Estimation of model parameters in a dynamic system can be significantly improved with the choice of experimental trajectory. For general, nonlinear dynamic systems, finding globally "best" trajectories is typically not feasible; however, given an initial estimate of the model parameters and an initial trajectory, we present a continuous-time optimization method that produces a locally optimal traj…
▽ More
Estimation of model parameters in a dynamic system can be significantly improved with the choice of experimental trajectory. For general, nonlinear dynamic systems, finding globally "best" trajectories is typically not feasible; however, given an initial estimate of the model parameters and an initial trajectory, we present a continuous-time optimization method that produces a locally optimal trajectory for parameter estimation in the presence of measurement noise. The optimization algorithm is formulated to find system trajectories that improve a norm on the Fisher information matrix. A double-pendulum cart apparatus is used to numerically and experimentally validate this technique. In simulation, the optimized trajectory increases the minimum eigenvalue of the Fisher information matrix by three orders of magnitude compared to the initial trajectory. Experimental results show that this optimized trajectory translates to an order of magnitude improvement in the parameter estimate error in practice.
△ Less
Submitted 11 September, 2017;
originally announced September 2017.
-
Autonomous Visual Rendering using Physical Motion
Authors:
Ahalya Prabhakar,
Anastasia Mavrommati,
Jarvis Schultz,
Todd Murphey
Abstract:
This paper addresses the problem of enabling a robot to represent and recreate visual information through physical motion, focusing on drawing using pens, brushes, or other tools. This work uses ergodicity as a control objective that translates planar visual input to physical motion without preprocessing (e.g., image processing, motion primitives). % or human-generated training data (i.e., machine…
▽ More
This paper addresses the problem of enabling a robot to represent and recreate visual information through physical motion, focusing on drawing using pens, brushes, or other tools. This work uses ergodicity as a control objective that translates planar visual input to physical motion without preprocessing (e.g., image processing, motion primitives). % or human-generated training data (i.e., machine learning).
We achieve comparable results to existing drawing methods, while reducing the algorithmic complexity of the software. We demonstrate that optimal ergodic control algorithms with different time-horizon characteristics (infinitesimal, finite, and receding horizon) can generate qualitatively and stylistically different motions that render a wide range of visual information (e.g., letters, portraits, landscapes). In addition, we show that ergodic control enables the same software design to apply to multiple robotic systems by incorporating their particular dynamics, thereby reducing the dependence on task-specific robots. Finally, we demonstrate physical drawings with the Baxter robot.
△ Less
Submitted 8 September, 2017;
originally announced September 2017.
-
Real-time Dynamic-Mode Scheduling Using Single-Integration Hybrid Optimization for Linear Time-Varying Systems
Authors:
Anastasia Mavrommati,
Jarvis A. Schultz,
Todd D. Murphey
Abstract:
This paper considers the problem of real-time mode scheduling in linear time-varying switched systems subject to a quadratic cost functional. The execution time of hybrid control algorithms is often prohibitive for real-time applications and typically may only be reduced at the expense of approximation accuracy. We address this trade-off by taking advantage of system linearity to formulate a proje…
▽ More
This paper considers the problem of real-time mode scheduling in linear time-varying switched systems subject to a quadratic cost functional. The execution time of hybrid control algorithms is often prohibitive for real-time applications and typically may only be reduced at the expense of approximation accuracy. We address this trade-off by taking advantage of system linearity to formulate a projection-based approach so that no simulation is required during open-loop optimization. A numerical example shows how the proposed open-loop algorithm outperforms methods employing common numerical integration techniques. Additionally, we follow a receding-horizon scheme to apply real-time closed-loop hybrid control to a customized experimental setup, using the Robot Operating System (ROS). In particular, we demonstrate---both in Monte-Carlo simulation and in experiment---that optimal hybrid control efficiently regulates a cart and suspended mass system in real time.
△ Less
Submitted 31 August, 2017;
originally announced September 2017.
-
Structured Linearization of Discrete Mechanical Systems for Analysis and Optimal Control
Authors:
Elliot Johnson,
Jarvis Schultz,
Todd Murphey
Abstract:
Variational integrators are well-suited for simulation of mechanical systems because they preserve mechanical quantities about a system such as momentum, or its change if external forcing is involved, and holonomic constraints. While they are not energy-preserving they do exhibit long-time stable energy behavior. However, variational integrators often simulate mechanical system dynamics by solving…
▽ More
Variational integrators are well-suited for simulation of mechanical systems because they preserve mechanical quantities about a system such as momentum, or its change if external forcing is involved, and holonomic constraints. While they are not energy-preserving they do exhibit long-time stable energy behavior. However, variational integrators often simulate mechanical system dynamics by solving an implicit difference equation at each time step, one that is moreover expressed purely in terms of configurations at different time steps. This paper formulates the first- and second-order linearizations of a variational integrator in a manner that is amenable to control analysis and synthesis, creating a bridge between existing analysis and optimal control tools for discrete dynamic systems and variational integrators for mechanical systems in generalized coordinates with forcing and holonomic constraints. The forced pendulum is used to illustrate the technique. A second example solves the discrete LQR problem to find a locally stabilizing controller for a 40 DOF system with 6 constraints.
△ Less
Submitted 31 August, 2017;
originally announced September 2017.