Showing 1–38 of 38 results for author: Durmus, A

  1. arXiv:2406.19824

    cs.GT stat.ML

    Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality

    Authors: Antoine Scheid, Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I Jordan, Alain Durmus

    Abstract: In economic theory, the concept of externality refers to any indirect effect resulting from an interaction between players that affects the social welfare. Most of the models within which externality has been studied assume that agents have perfect knowledge of their environment and preferences. This is a major hindrance to the practical implementation of many proposed solutions. To address this i…

    Submitted 3 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.04012

    stat.ML cs.LG

    Theoretical Guarantees for Variational Inference with Fixed-Variance Mixture of Gaussians

    Authors: Tom Huix, Anna Korba, Alain Durmus, Eric Moulines

    Abstract: Variational inference (VI) is a popular approach in Bayesian inference, that looks for the best approximation of the posterior distribution within a parametric family, minimizing a loss that is typically the (reverse) Kullback-Leibler (KL) divergence. Despite its empirical success, the theoretical properties of VI have only received attention recently, and mostly when the parametric family is the…

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2403.11407

    stat.ML cs.LG

    Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors

    Authors: Yazid Janati, Alain Durmus, Eric Moulines, Jimmy Olsson

    Abstract: Interest in the use of Denoising Diffusion Models (DDM) as priors for solving inverse Bayesian problems has recently increased significantly. However, sampling from the resulting posterior distribution poses a challenge. To solve this problem, previous works have proposed approximations to bias the drift term of the diffusion. In this work, we take a different approach and utilize the specific str…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: preprint

  4. arXiv:2403.03811

    stat.ML cs.GT cs.LG

    Incentivized Learning in Principal-Agent Bandit Games

    Authors: Antoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, El Mahdi El Mhamdi, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent. The principal and the agent have misaligned objectives and the choice of action is only left to the agent. However, the principal can influence the agent's decisions by offering incentives which add up to his rewards. The principal aims to iteratively learn an i…

    Submitted 6 March, 2024; originally announced March 2024.

  5. arXiv:2403.02506

    cs.CV cs.LG

    Differentially Private Representation Learning via Image Captioning

    Authors: Tom Sander, Yaodong Yu, Maziar Sanjabi, Alain Durmus, Yi Ma, Kamalika Chaudhuri, Chuan Guo

    Abstract: Differentially private (DP) machine learning is considered the gold-standard solution for training a model from sensitive data while still preserving privacy. However, a major barrier to achieving this ideal is its sub-optimal privacy-accuracy trade-off, which is particularly visible in DP representation learning. Specifically, it has been shown that under modest privacy budgets, most models learn…

    Submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2402.17870

    stat.CO cs.LG math.OC stat.ML

    Stochastic Approximation with Biased MCMC for Expectation Maximization

    Authors: Samuel Gruffaz, Kyurae Kim, Alain Oliviero Durmus, Jacob R. Gardner

    Abstract: The expectation maximization (EM) algorithm is a widespread method for empirical Bayesian inference, but its expectation step (E-step) is often intractable. Employing a stochastic approximation scheme with Markov chain Monte Carlo (MCMC) can circumvent this issue, resulting in an algorithm known as MCMC-SAEM. While theoretical guarantees for MCMC-SAEM have previously been established, these result…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to AISTATS'24

  7. arXiv:2402.14904

    cs.CR cs.AI cs.CL cs.LG

    Watermarking Makes Language Models Radioactive

    Authors: Tom Sander, Pierre Fernandez, Alain Durmus, Matthijs Douze, Teddy Furon

    Abstract: This paper investigates the radioactivity of LLM-generated texts, i.e. whether it is possible to detect that such input was used as training data. Conventional methods like membership inference can carry out this detection with some level of accuracy. We show that watermarked training data leaves traces easier to detect and much more reliable than membership inference. We link the contamination le…

    Submitted 22 February, 2024; originally announced February 2024.

  8. arXiv:2402.10758

    stat.ML cs.LG stat.CO

    Stochastic Localization via Iterative Posterior Sampling

    Authors: Louis Grenioux, Maxence Noble, Marylou Gabrié, Alain Oliviero Durmus

    Abstract: Building upon score-based learning, new interest in stochastic localization techniques has recently emerged. In these models, one seeks to noise a sample from the data distribution through a stochastic process, called observation process, and progressively learns a denoiser associated to this dynamics. Apart from specific applications, the use of stochastic localization for the problem of sampling…

    Submitted 28 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  9. arXiv:2402.08344

    stat.ML cs.LG

    Implicit Bias in Noisy-SGD: With Applications to Differentially Private Training

    Authors: Tom Sander, Maxime Sylvestre, Alain Durmus

    Abstract: Training Deep Neural Networks (DNNs) with small batches using Stochastic Gradient Descent (SGD) yields superior test performance compared to larger batches. The specific noise structure inherent to SGD is known to be responsible for this implicit bias. DP-SGD, used to ensure differential privacy (DP) in DNNs' training, adds Gaussian noise to the clipped gradients. Surprisingly, large-batch trainin…

    Submitted 13 February, 2024; originally announced February 2024.

  10. arXiv:2310.18455

    cs.LG stat.ML

    Approximate Heavy Tails in Offline (Multi-Pass) Stochastic Gradient Descent

    Authors: Krunoslav Lehman Pavasovic, Alain Durmus, Umut Simsekli

    Abstract: A recent line of empirical studies has demonstrated that SGD might exhibit a heavy-tailed behavior in practical settings, and the heaviness of the tails might correlate with the overall performance. In this paper, we investigate the emergence of such heavy tails. Previous works on this problem only considered, to our knowledge, online (also called single-pass) SGD, in which the emergence of hea…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: In Neural Information Processing Systems (NeurIPS), Spotlight Presentation, 2023

  11. arXiv:2307.10167

    stat.ML cs.LG

    VITS : Variational Inference Thompson Sampling for contextual bandits

    Authors: Pierre Clavier, Tom Huix, Alain Durmus

    Abstract: In this paper, we introduce and analyze a variant of the Thompson sampling (TS) algorithm for contextual bandits. At each round, traditional TS requires samples from the current posterior distribution, which is usually intractable. To circumvent this issue, approximate inference techniques can be used and provide samples with distribution close to the posteriors. However, current approximate techn…

    Submitted 3 July, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

  12. arXiv:2305.16557

    stat.ML cs.LG math.PR

    Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters

    Authors: Maxence Noble, Valentin De Bortoli, Arnaud Doucet, Alain Durmus

    Abstract: Multi-marginal Optimal Transport (mOT), a generalization of OT, aims at minimizing the integral of a cost function with respect to a distribution with some prescribed marginals. In this paper, we consider an entropic version of mOT with a tree-structured quadratic cost, i.e., a function that can be written as a sum of pairwise cost functions between the nodes of a tree. To address this problem, we…

    Submitted 28 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  13. arXiv:2302.04763

    stat.ML cs.LG

    On Sampling with Approximate Transport Maps

    Authors: Louis Grenioux, Alain Durmus, Éric Moulines, Marylou Gabrié

    Abstract: Transport maps can ease the sampling of distributions with non-trivial geometries by transforming them into distributions that are easier to handle. The potential of this approach has risen with the development of Normalizing Flows (NF) which are maps parameterized with deep neural networks trained to push a reference distribution towards a target. NF-enhanced samplers recently proposed blend (Mar…

    Submitted 18 February, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

  14. arXiv:2211.00100

    stat.ML cs.LG

    Federated Averaging Langevin Dynamics: Toward a unified theory and new algorithms

    Authors: Vincent Plassier, Alain Durmus, Eric Moulines

    Abstract: This paper focuses on Bayesian inference in a federated learning context (FL). While several distributed MCMC algorithms have been proposed, few consider the specific limitations of FL such as communication bottlenecks and statistical heterogeneity. Recently, Federated Averaging Langevin Dynamics (FALD) was introduced, which extends the Federated Averaging algorithm to Bayesian inference. We obtai…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 58 pages

  15. arXiv:2210.11925

    stat.ML cs.LG math.PR

    Unbiased constrained sampling with Self-Concordant Barrier Hamiltonian Monte Carlo

    Authors: Maxence Noble, Valentin De Bortoli, Alain Durmus

    Abstract: In this paper, we propose Barrier Hamiltonian Monte Carlo (BHMC), a version of the HMC algorithm which aims at sampling from a Gibbs distribution $\pi$ on a manifold $\mathrm{M}$, endowed with a Hessian metric $\mathfrak{g}$ derived from a self-concordant barrier. Our method relies on Hamiltonian dynamics which comprises $\mathfrak{g}$. Therefore, it incorporates the constraints defining…

    Submitted 28 October, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  16. arXiv:2207.04475

    stat.ML cs.LG math.PR math.ST

    Finite-time High-probability Bounds for Polyak-Ruppert Averaged Iterates of Linear Stochastic Approximation

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov

    Abstract: This paper provides a finite-time analysis of linear stochastic approximation (LSA) algorithms with fixed step size, a core method in statistics and machine learning. LSA is used to compute approximate solutions of a $d$-dimensional linear system $\bar{\mathbf{A}} \theta = \bar{\mathbf{b}}$ for which $(\bar{\mathbf{A}}, \bar{\mathbf{b}})$ can only be estimated by (asymptotically) unbiased observations…

    Submitted 29 March, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    MSC Class: 62L20; 60J20

  17. arXiv:2207.03859

    stat.ML cs.LG

    Variational Inference of overparameterized Bayesian Neural Networks: a theoretical and empirical study

    Authors: Tom Huix, Szymon Majewski, Alain Durmus, Eric Moulines, Anna Korba

    Abstract: This paper studies the Variational Inference (VI) used for training Bayesian Neural Networks (BNN) in the overparameterized regime, i.e., when the number of neurons tends to infinity. More specifically, we consider overparameterized two-layer BNN and point out a critical issue in the mean-field VI training. This problem arises from the decomposition of the lower bound on the evidence (ELBO) into t…

    Submitted 8 July, 2022; originally announced July 2022.

  18. arXiv:2206.03611

    cs.LG stat.ME stat.ML

    FedPop: A Bayesian Approach for Personalised Federated Learning

    Authors: Nikita Kotelevskii, Maxime Vono, Eric Moulines, Alain Durmus

    Abstract: Personalised federated learning (FL) aims at collaboratively learning a machine learning model tailored for each client. Although promising advances have been made in this direction, most existing approaches do not allow for uncertainty quantification, which is crucial in many applications. In addition, personalisation in the cross-device setting still involves important issues, especially f…

    Submitted 26 January, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  19. arXiv:2201.06133

    stat.ML cs.CV cs.LG eess.IV math.OC

    On Maximum-a-Posteriori estimation with Plug & Play priors and stochastic gradient descent

    Authors: Rémi Laumont, Valentin de Bortoli, Andrés Almansa, Julie Delon, Alain Durmus, Marcelo Pereyra

    Abstract: Bayesian methods to solve imaging inverse problems usually combine an explicit data likelihood function with a prior distribution that explicitly models expected properties of the solution. Many kinds of priors have been explored in the literature, from simple ones expressing local properties to more involved ones exploiting image redundancy at a non-local scale. In a departure from explicit model…

    Submitted 16 January, 2022; originally announced January 2022.

    MSC Class: 65K10 (Primary) 65K05; 62F15; 62C10; 68Q25; 68U10; 90C26 (Secondary)

  20. arXiv:2111.02702

    stat.ML cs.LG

    Local-Global MCMC kernels: the best of both worlds

    Authors: Sergey Samsonov, Evgeny Lagutin, Marylou Gabrié, Alain Durmus, Alexey Naumov, Eric Moulines

    Abstract: Recent works leveraging learning to enhance sampling have shown promising results, in particular by designing effective non-local moves and global proposals. However, learning accuracy is inevitably limited in regions where little data is available such as in the tails of distributions as well as in high-dimensional problems. In the present paper we study an Explore-Exploit Markov chain Monte Carl…

    Submitted 4 October, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:1111.5421 by other authors

  21. arXiv:2106.15921

    stat.ML cs.LG

    Monte Carlo Variational Auto-Encoders

    Authors: Achille Thin, Nikita Kotelevskii, Arnaud Doucet, Alain Durmus, Eric Moulines, Maxim Panov

    Abstract: Variational auto-encoders (VAE) are popular deep latent variable models which are trained by maximizing an Evidence Lower Bound (ELBO). To obtain tighter ELBO and hence better variational approximations, it has been proposed to use importance sampling to get a lower variance estimate of the evidence. However, importance sampling is known to perform poorly in high dimensions. While it has been sugg…

    Submitted 30 June, 2021; originally announced June 2021.

  22. arXiv:2106.15427

    stat.ML cs.LG

    Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections

    Authors: Kimia Nadjahi, Alain Durmus, Pierre E. Jacob, Roland Badeau, Umut Şimşekli

    Abstract: The Sliced-Wasserstein distance (SW) is being increasingly used in machine learning applications as an alternative to the Wasserstein distance and offers significant computational and statistical benefits. Since it is defined as an expectation over random projections, SW is commonly approximated by Monte Carlo. We adopt a new perspective to approximate SW by making use of the concentration of meas…

    Submitted 4 January, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2021

  23. arXiv:2106.06300

    stat.ME cs.AI cs.LG stat.CO

    DG-LMC: A Turn-key and Scalable Synchronous Distributed MCMC Algorithm via Langevin Monte Carlo within Gibbs

    Authors: Vincent Plassier, Maxime Vono, Alain Durmus, Eric Moulines

    Abstract: Performing reliable Bayesian inference on a big data scale is becoming a keystone in the modern era of machine learning. A workhorse class of methods to achieve this task are Markov chain Monte Carlo (MCMC) algorithms and their design to handle distributed datasets has been the subject of many works. However, existing methods are not completely either reliable or computationally efficient. In this…

    Submitted 18 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: 77 pages. Accepted for publication at ICML 2021, to appear

  24. arXiv:2106.01257

    stat.ML cs.LG math.PR math.ST

    Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Kevin Scaman, Hoi-To Wai

    Abstract: This paper provides a non-asymptotic analysis of linear stochastic approximation (LSA) algorithms with fixed stepsize. This family of methods arises in many machine learning tasks and is used to obtain approximate solutions of a linear system $\bar{A}\theta = \bar{b}$ for which $\bar{A}$ and $\bar{b}$ can only be accessed through random estimates $\{({\bf A}_n, {\bf b}_n): n \in \mathbb{N}^*\}$. Our ana…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: 21 pages

  25. arXiv:2106.00797

    cs.LG cs.AI stat.CO stat.ME stat.ML

    QLSD: Quantised Langevin stochastic dynamics for Bayesian federated learning

    Authors: Maxime Vono, Vincent Plassier, Alain Durmus, Aymeric Dieuleveut, Eric Moulines

    Abstract: The objective of Federated Learning (FL) is to perform statistical inference for data which are decentralised and stored locally on networked clients. FL raises many constraints which include privacy and data ownership, communication overhead, statistical heterogeneity, and partial client participation. In this paper, we address these problems in the framework of the Bayesian paradigm. To this end…

    Submitted 31 May, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

  26. arXiv:2103.04715

    stat.ME cs.CV eess.IV math.ST stat.ML

    Bayesian imaging using Plug & Play priors: when Langevin meets Tweedie

    Authors: Rémi Laumont, Valentin de Bortoli, Andrés Almansa, Julie Delon, Alain Durmus, Marcelo Pereyra

    Abstract: Since the seminal work of Venkatakrishnan et al. in 2013, Plug & Play (PnP) methods have become ubiquitous in Bayesian imaging. These methods derive Minimum Mean Square Error (MMSE) or Maximum A Posteriori (MAP) estimators for inverse problems in imaging by combining an explicit likelihood function with a prior that is implicitly defined by an image denoising algorithm. The PnP algorithms proposed…

    Submitted 12 January, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    MSC Class: 65K10; 65K05; 65D18; 62F15; 62C10; 68Q25; 68U10; 90C26

  27. arXiv:2102.07586

    stat.ML cs.LG math.PR

    On Riemannian Stochastic Approximation Schemes with Fixed Step-Size

    Authors: Alain Durmus, Pablo Jiménez, Éric Moulines, Salem Said

    Abstract: This paper studies fixed step-size stochastic approximation (SA) schemes, including stochastic gradient schemes, in a Riemannian framework. It is motivated by several applications, where geodesics can be computed explicitly, and their use accelerates crude Euclidean methods. A fixed step-size scheme defines a family of time-homogeneous Markov chains, parametrized by the step-size. Here, using this…

    Submitted 19 February, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 37 pages, 4 figures, to appear in AISTAT21

    MSC Class: 60F05

  28. arXiv:2102.00185

    stat.ML cs.LG math.PR math.ST

    On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Hoi-To Wai

    Abstract: This paper studies the exponential stability of random matrix products driven by a general (possibly unbounded) state space Markov chain. It is a cornerstone in the analysis of stochastic algorithms in machine learning (e.g. for parameter tracking in online learning or reinforcement learning). The existing results impose strong conditions such as uniform boundedness of the matrix-valued functions…

    Submitted 30 January, 2021; originally announced February 2021.

  29. arXiv:2007.06352

    stat.ML cs.LG math.PR

    Quantitative Propagation of Chaos for SGD in Wide Neural Networks

    Authors: Valentin De Bortoli, Alain Durmus, Xavier Fontaine, Umut Simsekli

    Abstract: In this paper, we investigate the limiting behavior of a continuous-time counterpart of the Stochastic Gradient Descent (SGD) algorithm applied to two-layer overparameterized neural networks, as the number of neurons (i.e., the size of the hidden layer) $N \to +\infty$. Following a probabilistic approach, we show 'propagation of chaos' for the particle system defined by this continuous-time dynamics…

    Submitted 14 July, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  30. arXiv:2005.13284

    stat.ML cs.LG math.OC

    Convergence Analysis of Riemannian Stochastic Approximation Schemes

    Authors: Alain Durmus, Pablo Jiménez, Éric Moulines, Salem Said, Hoi-To Wai

    Abstract: This paper analyzes the convergence for a large class of Riemannian stochastic approximation (SA) schemes, which aim at tackling stochastic optimization problems. In particular, the recursions we study use either the exponential map of the considered manifold (geodesic schemes) or more general retraction functions (retraction schemes) used as a proxy for the exponential map. Such approximations ar…

    Submitted 19 May, 2021; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: 41 pages, 2 figures

    MSC Class: 60F05

  31. arXiv:2003.05783

    stat.ML cs.LG

    Statistical and Topological Properties of Sliced Probability Divergences

    Authors: Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli

    Abstract: The idea of slicing divergences has been proven to be successful when comparing two probability measures in various machine learning applications including generative modeling, and consists in computing the expected value of a 'base divergence' between one-dimensional random projections of the two measures. However, the topological, statistical, and computational consequences of this technique hav…

    Submitted 4 January, 2022; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Published at NeurIPS 2020 (Spotlight)

  32. arXiv:2002.12253

    stat.ML cs.LG stat.CO

    MetFlow: A New Efficient Method for Bridging the Gap between Markov Chain Monte Carlo and Variational Inference

    Authors: Achille Thin, Nikita Kotelevskii, Jean-Stanislas Denain, Leo Grinsztajn, Alain Durmus, Maxim Panov, Eric Moulines

    Abstract: In this contribution, we propose a new computationally efficient method to combine Variational Inference (VI) with Markov Chain Monte Carlo (MCMC). This approach can be used with generic MCMC kernels, but is especially well suited to MetFlow, a novel family of MCMC algorithms we introduce, in which proposals are obtained using Normalizing Flows. The marginal distribution produced by such…

    Submitted 27 February, 2020; originally announced February 2020.

  33. arXiv:1912.01691

    math.ST cs.CV math.PR stat.CO

    Maximum entropy methods for texture synthesis: theory and practice

    Authors: Valentin De Bortoli, Agnes Desolneux, Alain Durmus, Bruno Galerne, Arthur Leclaire

    Abstract: Recent years have seen the rise of convolutional neural network techniques in exemplar-based image synthesis. These methods often rely on the minimization of some variational formulation on the image space for which the minimizers are assumed to be the solutions of the synthesis problem. In this paper we investigate, both theoretically and experimentally, another framework to deal with this proble…

    Submitted 3 December, 2019; originally announced December 2019.

  34. Markov Decision Process for MOOC users behavioral inference

    Authors: Firas Jarboui, Célya Gruson-daniel, Pierre Chanial, Alain Durmus, Vincent Rocchisani, Sophie-helene Goulet Ebongue, Anneliese Depoux, Wilfried Kirschenmann, Vianney Perchet

    Abstract: Studies on massive open online courses (MOOCs) users discuss the existence of typical profiles and their impact on the learning process of the students. However defining the typical behaviors as well as classifying the users accordingly is a difficult task. In this paper we suggest two methods to model MOOC users behaviour given their log data. We mold their behavior into a Markov Decision Process…

    Submitted 10 March, 2021; v1 submitted 10 July, 2019; originally announced July 2019.

  35. arXiv:1906.04516

    stat.ML cs.LG

    Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance

    Authors: Kimia Nadjahi, Alain Durmus, Umut Şimşekli, Roland Badeau

    Abstract: Minimum expected distance estimation (MEDE) algorithms have been widely used for probabilistic models with intractable likelihood functions and they have become increasingly popular due to their use in implicit generative modeling (e.g. Wasserstein generative adversarial networks, Wasserstein autoencoders). Emerging from computational optimal transport, the Sliced-Wasserstein (SW) distance has bec…

    Submitted 24 March, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted at NeurIPS 2019 (publication and spotlight presentation)

  36. arXiv:1904.07153

    stat.ML cs.LG

    Copula-like Variational Inference

    Authors: Marcel Hirt, Petros Dellaportas, Alain Durmus

    Abstract: This paper considers a new family of variational distributions motivated by Sklar's theorem. This family is based on new copula-like densities on the hypercube with non-uniform marginals which can be sampled efficiently, i.e. with a complexity linear in the dimension of state space. Then, the proposed variational densities that we suggest can be seen as arising from these copula-like densities use…

    Submitted 22 December, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  37. arXiv:1811.10072

    stat.ML cs.LG

    The promises and pitfalls of Stochastic Gradient Langevin Dynamics

    Authors: Nicolas Brosse, Alain Durmus, Eric Moulines

    Abstract: Stochastic Gradient Langevin Dynamics (SGLD) has emerged as a key MCMC algorithm for Bayesian learning from large scale datasets. While SGLD with decreasing step sizes converges weakly to the posterior distribution, the algorithm is often used with a constant step size in practice and has demonstrated successes in machine learning tasks. The current practice is to set the step size inversely propo…

    Submitted 25 November, 2018; originally announced November 2018.

  38. arXiv:1806.08141

    stat.ML cs.LG

    Sliced-Wasserstein Flows: Nonparametric Generative Modeling via Optimal Transport and Diffusions

    Authors: Antoine Liutkus, Umut Şimşekli, Szymon Majewski, Alain Durmus, Fabian-Robert Stöter

    Abstract: By building upon the recent theory that established the connection between implicit generative modeling (IGM) and optimal transport, in this study, we propose a novel parameter-free algorithm for learning the underlying distributions of complicated datasets and sampling from them. The proposed algorithm is based on a functional optimization problem, which aims at finding a measure that is close to…

    Submitted 11 June, 2019; v1 submitted 21 June, 2018; originally announced June 2018.

    Comments: Published at the International Conference on Machine Learning (ICML) 2019