Skip to main content

Showing 1–14 of 14 results for author: Neklyudov, K

  1. arXiv:2406.16976  [pdf, other

    cs.NE cs.AI cs.LG physics.chem-ph

    Efficient Evolutionary Search Over Chemical Space with Large Language Models

    Authors: Haorui Wang, Marta Skreta, Cher-Tian Ser, Wenhao Gao, Lingkai Kong, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Yuanqi Du, Alán Aspuru-Guzik, Kirill Neklyudov, Chao Zhang

    Abstract: Molecular discovery, when formulated as an optimization problem, presents significant computational challenges because optimization objectives can be non-differentiable. Evolutionary Algorithms (EAs), often used to optimize black-box objectives in molecular discovery, traverse chemical space by performing random mutations and crossovers, leading to a large number of expensive objective evaluations… ▽ More

    Submitted 2 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2402.18012  [pdf, other

    cs.LG cs.AI

    Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints

    Authors: Lingkai Kong, Yuanqi Du, Wenhao Mu, Kirill Neklyudov, Valentin De Bortoli, Haorui Wang, Dongxia Wu, Aaron Ferber, Yi-An Ma, Carla P. Gomes, Chao Zhang

    Abstract: Addressing real-world optimization problems becomes particularly challenging when analytic objective functions or constraints are unavailable. While numerous studies have addressed the issue of unknown objectives, limited research has focused on scenarios where feasibility constraints are not given explicitly. Overlooking these constraints can lead to spurious solutions that are unrealistic in pra… ▽ More

    Submitted 29 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2312.05705  [pdf, other

    cs.LG stat.ML

    Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC

    Authors: Wu Lin, Felix Dangel, Runa Eschenhagen, Kirill Neklyudov, Agustinus Kristiadi, Richard E. Turner, Alireza Makhzani

    Abstract: Second-order methods such as KFAC can be useful for neural net training. However, they are often memory-inefficient since their preconditioning Kronecker factors are dense, and numerically unstable in low precision as they require matrix inversion or decomposition. These limitations render such methods unpopular for modern mixed-precision training. We address them by (i) formulating an inverse-fre… ▽ More

    Submitted 15 June, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: A long version of the ICML 2024 paper

  4. arXiv:2310.10649  [pdf, other

    cs.LG math.OC stat.ML

    A Computational Framework for Solving Wasserstein Lagrangian Flows

    Authors: Kirill Neklyudov, Rob Brekelmans, Alexander Tong, Lazar Atanackovic, Qiang Liu, Alireza Makhzani

    Abstract: The dynamical formulation of the optimal transport can be extended through various choices of the underlying geometry (kinetic energy), and the regularization of density paths (potential energy). These combinations yield different variational problems (Lagrangians), encompassing many variations of the optimal transport problem such as the Schrödinger bridge, unbalanced optimal transport, and optim… ▽ More

    Submitted 3 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  5. arXiv:2307.07050  [pdf, other

    physics.comp-ph cs.LG physics.chem-ph

    Wasserstein Quantum Monte Carlo: A Novel Approach for Solving the Quantum Many-Body Schrödinger Equation

    Authors: Kirill Neklyudov, Jannes Nys, Luca Thiede, Juan Carrasquilla, Qiang Liu, Max Welling, Alireza Makhzani

    Abstract: Solving the quantum many-body Schrödinger equation is a fundamental and challenging problem in the fields of quantum physics, quantum chemistry, and material sciences. One of the common computational approaches to this problem is Quantum Variational Monte Carlo (QVMC), in which ground-state solutions are obtained by minimizing the energy of the system within a restricted family of parameterized wa… ▽ More

    Submitted 26 October, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Published in NeurIPS 2023

  6. arXiv:2301.08292  [pdf, other

    quant-ph cs.LG

    Quantum HyperNetworks: Training Binary Neural Networks in Quantum Superposition

    Authors: Juan Carrasquilla, Mohamed Hibat-Allah, Estelle Inack, Alireza Makhzani, Kirill Neklyudov, Graham W. Taylor, Giacomo Torlai

    Abstract: Binary neural networks, i.e., neural networks whose parameters and activations are constrained to only two possible values, offer a compelling avenue for the deployment of deep learning models on energy- and memory-limited devices. However, their training, architectural design, and hyperparameter tuning remain challenging as these involve multiple computationally expensive combinatorial optimizati… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: 10 pages, 6 figures. Minimal implementation: https://github.com/carrasqu/binncode

  7. arXiv:2210.06662  [pdf, other

    cs.LG

    Action Matching: Learning Stochastic Dynamics from Samples

    Authors: Kirill Neklyudov, Rob Brekelmans, Daniel Severo, Alireza Makhzani

    Abstract: Learning the continuous dynamics of a system from snapshots of its temporal marginals is a problem which appears throughout natural sciences and machine learning, including in quantum systems, single-cell biological data, and generative modeling. In these settings, we assume access to cross-sectional samples that are uncorrelated over time, rather than full trajectories of samples. In order to bet… ▽ More

    Submitted 8 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Published in ICML 2023

  8. arXiv:2111.13772  [pdf, other

    cs.LG stat.ML

    Particle Dynamics for Learning EBMs

    Authors: Kirill Neklyudov, Priyank Jaini, Max Welling

    Abstract: Energy-based modeling is a promising approach to unsupervised learning, which yields many downstream applications from a single model. The main difficulty in learning energy-based models with the "contrastive approaches" is the generation of samples from the current energy function at each iteration. Many advances have been made to accomplish this subroutine cheaply. Nevertheless, all such samplin… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

  9. arXiv:2106.10188  [pdf, other

    stat.CO cs.LG

    Deterministic Gibbs Sampling via Ordinary Differential Equations

    Authors: Kirill Neklyudov, Roberto Bondesan, Max Welling

    Abstract: Deterministic dynamics is an essential part of many MCMC algorithms, e.g. Hybrid Monte Carlo or samplers utilizing normalizing flows. This paper presents a general construction of deterministic measure-preserving dynamics using autonomous ODEs and tools from differential geometry. We show how Hybrid Monte Carlo and other deterministic samplers follow as special cases of our theory. We then demonst… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

  10. arXiv:2010.08047  [pdf, other

    cs.LG stat.CO

    Orbital MCMC

    Authors: Kirill Neklyudov, Max Welling

    Abstract: Markov Chain Monte Carlo (MCMC) algorithms ubiquitously employ complex deterministic transformations to generate proposal points that are then filtered by the Metropolis-Hastings-Green (MHG) test. However, the condition of the target measure invariance puts restrictions on the design of these transformations. In this paper, we first derive the acceptance test for the stochastic Markov kernel consi… ▽ More

    Submitted 7 June, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

  11. arXiv:2006.16653  [pdf, other

    cs.LG stat.CO stat.ME stat.ML

    Involutive MCMC: a Unifying Framework

    Authors: Kirill Neklyudov, Max Welling, Evgenii Egorov, Dmitry Vetrov

    Abstract: Markov Chain Monte Carlo (MCMC) is a computational approach to fundamental problems such as inference, integration, optimization, and simulation. The field has developed a broad spectrum of algorithms, varying in the way they are motivated, the way they are applied and how efficiently they sample. Despite all the differences, many of them share the same core principle, which we unify as the Involu… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

  12. arXiv:1906.03644  [pdf, other

    stat.ML cs.LG

    The Implicit Metropolis-Hastings Algorithm

    Authors: Kirill Neklyudov, Evgenii Egorov, Dmitry Vetrov

    Abstract: Recent works propose using the discriminator of a GAN to filter out unrealistic samples of the generator. We generalize these ideas by introducing the implicit Metropolis-Hastings algorithm. For any implicit probabilistic model and a target distribution represented by a set of samples, implicit Metropolis-Hastings operates by learning a discriminator to estimate the density-ratio and then generati… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

  13. arXiv:1810.07151  [pdf, other

    stat.ML cs.AI cs.LG

    Metropolis-Hastings view on variational inference and adversarial training

    Authors: Kirill Neklyudov, Evgenii Egorov, Pavel Shvechikov, Dmitry Vetrov

    Abstract: A significant part of MCMC methods can be considered as the Metropolis-Hastings (MH) algorithm with different proposal distributions. From this point of view, the problem of constructing a sampler can be reduced to the question - how to choose a proposal for the MH algorithm? To address this question, we propose to learn an independent sampler that maximizes the acceptance rate of the MH algorithm… ▽ More

    Submitted 9 June, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

  14. arXiv:1802.04893  [pdf, other

    stat.ML cs.LG

    Uncertainty Estimation via Stochastic Batch Normalization

    Authors: Andrei Atanov, Arsenii Ashukha, Dmitry Molchanov, Kirill Neklyudov, Dmitry Vetrov

    Abstract: In this work, we investigate Batch Normalization technique and propose its probabilistic interpretation. We propose a probabilistic model and show that Batch Normalization maximazes the lower bound of its marginalized log-likelihood. Then, according to the new probabilistic model, we design an algorithm which acts consistently during train and test. However, inference becomes computationally ineff… ▽ More

    Submitted 20 March, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: Under review as a workshop paper at ICLR 2018

    Journal ref: Workshop track - ICLR 2018