Showing 1–13 of 13 results for author: Stanton, S

  1. arXiv:2407.00236  [pdf, other]

    cs.LG cs.NE

    Closed-Form Test Functions for Biophysical Sequence Optimization Algorithms

    Authors: Samuel Stanton, Robert Alberstein, Nathan Frey, Andrew Watkins, Kyunghyun Cho

    Abstract: There is a growing body of work seeking to replicate the success of machine learning (ML) on domains like computer vision (CV) and natural language processing (NLP) to applications involving biophysical data. One of the key ingredients of prior successes in CV and NLP was the broad acceptance of difficult benchmarks that distilled key subproblems into approachable tasks that any junior researcher…

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2405.06627  [pdf, other]

    cs.LG cs.AI stat.ML

    Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

    Authors: Drew Prinster, Samuel Stanton, Anqi Liu, Suchi Saria

    Abstract: As artificial intelligence (AI) / machine learning (ML) gain widespread adoption, practitioners are increasingly seeking means to quantify and control the risk these systems incur. This challenge is especially salient when such systems have autonomy to collect their own data, such as in black-box optimization and active learning, where their actions induce sequential feedback-loop shifts in the da…

    Submitted 5 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code available at https://github.com/drewprinster/conformal-mfcs

  3. arXiv:2305.20009  [pdf, other]

    cs.LG q-bio.BM

    Protein Design with Guided Discrete Diffusion

    Authors: Nate Gruver, Samuel Stanton, Nathan C. Frey, Tim G. J. Rudner, Isidro Hotzel, Julien Lafrance-Vanasse, Arvind Rajpal, Kyunghyun Cho, Andrew Gordon Wilson

    Abstract: A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling. The generative model samples plausible sequences while the discriminative model guides a search for sequences with high fitness. Given its broad success in conditional sampling, classifier-guided diffusion modeling is a promising foundation for protein design, leading many to…

    Submitted 12 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Advances in Neural Information Processing Systems 36, December 10-16, 2023

  4. arXiv:2212.04450  [pdf, other]

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    GAUCHE: A Library for Gaussian Processes in Chemistry

    Authors: Ryan-Rhys Griffiths, Leo Klarner, Henry B. Moss, Aditya Ravuri, Sang Truong, Samuel Stanton, Gary Tom, Bojana Rankovic, Yuanqi Du, Arian Jamasb, Aryan Deshwal, Julius Schwartz, Austin Tripp, Gregory Kell, Simon Frieder, Anthony Bourached, Alex Chan, Jacob Moss, Chengzhi Guo, Johannes Durholt, Saudamini Chaurasia, Felix Strieth-Kalthoff, Alpha A. Lee, Bingqing Cheng, Alán Aspuru-Guzik, et al. (2 additional authors not shown)

    Abstract: We introduce GAUCHE, a library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to chemical representations, however, is nontrivial, necessitating kernels defined over structured inputs such as graphs, strings…

    Submitted 21 February, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  5. arXiv:2210.12496  [pdf, other]

    cs.LG stat.ML

    Bayesian Optimization with Conformal Prediction Sets

    Authors: Samuel Stanton, Wesley Maddox, Andrew Gordon Wilson

    Abstract: Bayesian optimization is a coherent, ubiquitous approach to decision-making under uncertainty, with applications including multi-arm bandits, active learning, and black-box optimization. Bayesian optimization selects decisions (i.e. objective function queries) with maximal expected utility with respect to the posterior distribution of a Bayesian model, which quantifies reducible, epistemic uncerta…

    Submitted 12 December, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: For code, see https://www.github.com/samuelstanton/conformal-bayesopt.git

    Journal ref: Proceedings of Machine Learning Research, Volume 206, 959-986, PMLR, 2023

  6. arXiv:2210.04096  [pdf, other]

    cs.LG q-bio.QM

    PropertyDAG: Multi-objective Bayesian optimization of partially ordered, mixed-variable properties for biological sequence design

    Authors: Ji Won Park, Samuel Stanton, Saeed Saremi, Andrew Watkins, Henri Dwyer, Vladimir Gligorijevic, Richard Bonneau, Stephen Ra, Kyunghyun Cho

    Abstract: Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarch…

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures. Submitted to NeurIPS 2022 AI4Science Workshop

  7. arXiv:2203.12742  [pdf, other]

    cs.LG cs.NE q-bio.QM stat.ML

    Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

    Authors: Samuel Stanton, Wesley Maddox, Nate Gruver, Phillip Maffettone, Emily Delaney, Peyton Greenside, Andrew Gordon Wilson

    Abstract: Bayesian optimization (BayesOpt) is a gold standard for query-efficient continuous optimization. However, its adoption for drug design has been hindered by the discrete, high-dimensional nature of the decision variables. We develop a new approach (LaMBO) which jointly trains a denoising autoencoder with a discriminative multi-task Gaussian process head, allowing gradient-based optimization of mult…

    Submitted 12 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: ICML 2022. Code available at https://github.com/samuelstanton/lambo

  8. arXiv:2202.04836  [pdf, other]

    cs.LG math.DS physics.data-an stat.ML

    Deconstructing the Inductive Biases of Hamiltonian Neural Networks

    Authors: Nate Gruver, Marc Finzi, Samuel Stanton, Andrew Gordon Wilson

    Abstract: Physics-inspired neural networks (NNs), such as Hamiltonian or Lagrangian NNs, dramatically outperform other learned dynamics models by leveraging strong inductive biases. These models, however, are challenging to apply to many real world systems, such as those that don't conserve energy or contain contacts, a common setting for robotics and reinforcement learning. In this paper, we examine the in…

    Submitted 11 February, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: ICLR 2022. Code available at https://github.com/ngruver/decon-hnn

  9. arXiv:2110.15172  [pdf, other]

    cs.LG stat.ML

    Conditioning Sparse Variational Gaussian Processes for Online Decision-making

    Authors: Wesley J. Maddox, Samuel Stanton, Andrew Gordon Wilson

    Abstract: With a principled representation of uncertainty and closed form posterior updates, Gaussian processes (GPs) are a natural choice for online decision making. However, Gaussian processes typically require at least $\mathcal{O}(n^2)$ computations for $n$ training points, limiting their general applicability. Stochastic variational Gaussian processes (SVGPs) can provide scalable inference for a datase…

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  10. arXiv:2106.05945  [pdf, other]

    cs.LG stat.ML

    Does Knowledge Distillation Really Work?

    Authors: Samuel Stanton, Pavel Izmailov, Polina Kirichenko, Alexander A. Alemi, Andrew Gordon Wilson

    Abstract: Knowledge distillation is a popular technique for training a small student network to emulate a larger teacher model, such as an ensemble of networks. We show that while knowledge distillation can improve student generalization, it does not typically work as it is commonly understood: there often remains a surprisingly large discrepancy between the predictive distributions of the teacher and the s…

    Submitted 6 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021. Code available at https://github.com/samuelstanton/gnosis

  11. arXiv:2103.01454  [pdf, other]

    stat.ML cs.LG

    Kernel Interpolation for Scalable Online Gaussian Processes

    Authors: Samuel Stanton, Wesley J. Maddox, Ian Delbridge, Andrew Gordon Wilson

    Abstract: Gaussian processes (GPs) provide a gold standard for performance in online settings, such as sample-efficient control and black box optimization, where we need to update a posterior distribution as we acquire data in a sequential fashion. However, updating a GP posterior to accommodate even a single new observation after having observed $n$ points incurs at least $O(n)$ computations in the exact s…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: AISTATS 2021

  12. arXiv:2008.12775  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    On the model-based stochastic value gradient for continuous reinforcement learning

    Authors: Brandon Amos, Samuel Stanton, Denis Yarats, Andrew Gordon Wilson

    Abstract: For over a decade, model-based reinforcement learning has been seen as a way to leverage control-based domain knowledge to improve the sample-efficiency of reinforcement learning agents. While model-based agents are conceptually appealing, their policies tend to lag behind those of model-free agents in terms of final reward, especially in non-trivial environments. In response, researchers have pro…

    Submitted 27 May, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: L4DC 2021

  13. arXiv:2002.12880  [pdf, other]

    stat.ML cs.LG

    Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data

    Authors: Marc Finzi, Samuel Stanton, Pavel Izmailov, Andrew Gordon Wilson

    Abstract: The translation equivariance of convolutional layers enables convolutional neural networks to generalize well on image problems. While translation equivariance provides a powerful inductive bias for images, we often additionally desire equivariance to other transformations, such as rotations, especially for non-image data. We propose a general method to construct a convolutional layer that is equi…

    Submitted 24 September, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: ICML 2020. Code available at https://github.com/mfinzi/LieConv