Skip to main content

Showing 1–16 of 16 results for author: Gabrié, M

  1. arXiv:2402.10758  [pdf, other

    stat.ML cs.LG stat.CO

    Stochastic Localization via Iterative Posterior Sampling

    Authors: Louis Grenioux, Maxence Noble, Marylou Gabrié, Alain Oliviero Durmus

    Abstract: Building upon score-based learning, new interest in stochastic localization techniques has recently emerged. In these models, one seeks to noise a sample from the data distribution through a stochastic process, called observation process, and progressively learns a denoiser associated to this dynamics. Apart from specific applications, the use of stochastic localization for the problem of sampling… ▽ More

    Submitted 28 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  2. arXiv:2306.00684  [pdf, other

    cs.LG stat.ML

    Balanced Training of Energy-Based Models with Adaptive Flow Sampling

    Authors: Louis Grenioux, Éric Moulines, Marylou Gabrié

    Abstract: Energy-based models (EBMs) are versatile density estimation models that directly parameterize an unnormalized log density. Although very flexible, EBMs lack a specified normalization constant of the model, making the likelihood of the model computationally intractable. Several approximate samplers and variational inference techniques have been proposed to estimate the likelihood gradients for trai… ▽ More

    Submitted 18 February, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  3. arXiv:2302.04763  [pdf, other

    stat.ML cs.LG

    On Sampling with Approximate Transport Maps

    Authors: Louis Grenioux, Alain Durmus, Éric Moulines, Marylou Gabrié

    Abstract: Transport maps can ease the sampling of distributions with non-trivial geometries by transforming them into distributions that are easier to handle. The potential of this approach has risen with the development of Normalizing Flows (NF) which are maps parameterized with deep neural networks trained to push a reference distribution towards a target. NF-enhanced samplers recently proposed blend (Mar… ▽ More

    Submitted 18 February, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

  4. arXiv:2111.02702  [pdf, other

    stat.ML cs.LG

    Local-Global MCMC kernels: the best of both worlds

    Authors: Sergey Samsonov, Evgeny Lagutin, Marylou Gabrié, Alain Durmus, Alexey Naumov, Eric Moulines

    Abstract: Recent works leveraging learning to enhance sampling have shown promising results, in particular by designing effective non-local moves and global proposals. However, learning accuracy is inevitably limited in regions where little data is available such as in the tails of distributions as well as in high-dimensional problems. In the present paper we study an Explore-Exploit Markov chain Monte Carl… ▽ More

    Submitted 4 October, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:1111.5421 by other authors

  5. arXiv:2107.08001  [pdf, other

    stat.ML cs.LG physics.data-an

    Efficient Bayesian Sampling Using Normalizing Flows to Assist Markov Chain Monte Carlo Methods

    Authors: Marylou Gabrié, Grant M. Rotskoff, Eric Vanden-Eijnden

    Abstract: Normalizing flows can generate complex target distributions and thus show promise in many applications in Bayesian statistics as an alternative or complement to MCMC for sampling posteriors. Since no data set from the target posterior distribution is available beforehand, the flow is typically trained using the reverse Kullback-Leibler (KL) divergence that only requires samples from a base distrib… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  6. arXiv:2107.05134  [pdf, other

    cs.LG math.OC stat.ML

    Dual Training of Energy-Based Models with Overparametrized Shallow Neural Networks

    Authors: Carles Domingo-Enrich, Alberto Bietti, Marylou Gabrié, Joan Bruna, Eric Vanden-Eijnden

    Abstract: Energy-based models (EBMs) are generative models that are usually trained via maximum likelihood estimation. This approach becomes challenging in generic situations where the trained energy is non-convex, due to the need to sample the Gibbs distribution associated with this energy. Using general Fenchel duality results, we derive variational principles dual to maximum likelihood EBMs with shallow… ▽ More

    Submitted 15 February, 2022; v1 submitted 11 July, 2021; originally announced July 2021.

  7. arXiv:2103.05524  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    On the interplay between data structure and loss function in classification problems

    Authors: Stéphane d'Ascoli, Marylou Gabrié, Levent Sagun, Giulio Biroli

    Abstract: One of the central puzzles in modern machine learning is the ability of heavily overparametrized models to generalize well. Although the low-dimensional structure of typical datasets is key to this behavior, most theoretical studies of overparametrization focus on isotropic inputs. In this work, we instead consider an analytically tractable model of structured data, where the input covariance is b… ▽ More

    Submitted 12 October, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

  8. arXiv:2012.07386  [pdf, other

    cs.LG cs.CV physics.optics stat.ML

    Phase Retrieval with Holography and Untrained Priors: Tackling the Challenges of Low-Photon Nanoscale Imaging

    Authors: Hannah Lawrence, David A. Barmherzig, Henry Li, Michael Eickenberg, Marylou Gabrié

    Abstract: Phase retrieval is the inverse problem of recovering a signal from magnitude-only Fourier measurements, and underlies numerous imaging modalities, such as Coherent Diffraction Imaging (CDI). A variant of this setup, known as holography, includes a reference object that is placed adjacent to the specimen of interest before measurements are collected. The resulting inverse problem, known as holograp… ▽ More

    Submitted 20 April, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

  9. arXiv:1911.00890  [pdf, other

    cond-mat.dis-nn cs.LG stat.ML

    Mean-field inference methods for neural networks

    Authors: Marylou Gabrié

    Abstract: Machine learning algorithms relying on deep neural networks recently allowed a great leap forward in artificial intelligence. Despite the popularity of their applications, the efficiency of these algorithms remains largely unexplained from a theoretical point of view. The mathematical description of learning problems involves very large collections of interacting random variables, difficult to han… ▽ More

    Submitted 5 March, 2020; v1 submitted 3 November, 2019; originally announced November 2019.

    Journal ref: JPhysA 2020

  10. arXiv:1910.00285  [pdf, other

    cond-mat.stat-mech cond-mat.dis-nn cs.IT

    Blind calibration for compressed sensing: State evolution and an online algorithm

    Authors: Marylou Gabrié, Jean Barbier, Florent Krzakala, Lenka Zdeborová

    Abstract: Compressed sensing, allows to acquire compressible signals with a small number of measurements. In applications, a hardware implementation often requires a calibration as the sensing process is not perfectly known. Blind calibration, that is performing at the same time calibration and compressed sensing is thus particularly appealing. A potential approach was suggested by Schülke and collaborators… ▽ More

    Submitted 23 March, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Journal ref: J. Phys. A: Math. Theor. 53 334004 (2020)

  11. arXiv:1805.09785  [pdf, other

    cs.LG cond-mat.dis-nn cs.IT stat.ML

    Entropy and mutual information in models of deep neural networks

    Authors: Marylou Gabrié, Andre Manoel, Clément Luneau, Jean Barbier, Nicolas Macris, Florent Krzakala, Lenka Zdeborová

    Abstract: We examine a class of deep learning models with a tractable method to compute information-theoretic quantities. Our contributions are three-fold: (i) We show how entropies and mutual informations can be derived from heuristic statistical physics methods, under the assumption that weight matrices are independent and orthogonally-invariant. (ii) We extend particular cases in which this result is kno… ▽ More

    Submitted 29 October, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Journal ref: J. Stat. Mech. (2019) 124014. & NeurIPS 2018

  12. arXiv:1707.01983  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.DM math.PR

    Phase transitions in the $q$-coloring of random hypergraphs

    Authors: Marylou Gabrié, Varsha Dani, Guilhem Semerjian, Lenka Zdeborová

    Abstract: We study in this paper the structure of solutions in the random hypergraph coloring problem and the phase transitions they undergo when the density of constraints is varied. Hypergraph coloring is a constraint satisfaction problem where each constraint includes $K$ variables that must be assigned one out of $q$ colors in such a way that there are no monochromatic constraints, i.e. there are at lea… ▽ More

    Submitted 6 July, 2017; originally announced July 2017.

    Comments: 31 pages, 7 figures

    Journal ref: J. Phys. A 50, 505002 (2017)

  13. arXiv:1702.03260  [pdf, other

    cs.LG cond-mat.dis-nn cs.NE stat.ML

    A Deterministic and Generalized Framework for Unsupervised Learning with Restricted Boltzmann Machines

    Authors: Eric W. Tramel, Marylou Gabrié, Andre Manoel, Francesco Caltagirone, Florent Krzakala

    Abstract: Restricted Boltzmann machines (RBMs) are energy-based neural-networks which are commonly used as the building blocks for deep architectures neural architectures. In this work, we derive a deterministic framework for the training, evaluation, and use of RBMs based upon the Thouless-Anderson-Palmer (TAP) mean-field approximation of widely-connected systems with weak interactions coming from spin-gla… ▽ More

    Submitted 9 October, 2018; v1 submitted 10 February, 2017; originally announced February 2017.

    Journal ref: Phys. Rev. X 8, 041006 (2018)

  14. arXiv:1609.04167  [pdf, other

    math.NA cs.CV cs.IT cs.LG math.OC

    Proceedings of the third "international Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST'16)

    Authors: V. Abrol, O. Absil, P. -A. Absil, S. Anthoine, P. Antoine, T. Arildsen, N. Bertin, F. Bleichrodt, J. Bobin, A. Bol, A. Bonnefoy, F. Caltagirone, V. Cambareri, C. Chenot, V. Crnojević, M. Daňková, K. Degraux, J. Eisert, J. M. Fadili, M. Gabrié, N. Gac, D. Giacobello, A. Gonzalez, C. A. Gomez Gonzalez, A. González , et al. (36 additional authors not shown)

    Abstract: The third edition of the "international - Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST) took place in Aalborg, the 4th largest city in Denmark situated beautifully in the northern part of the country, from the 24th to 26th of August 2016. The workshop venue was at the Aalborg University campus. One implicit objective of this biennial workshop is to foster collab… ▽ More

    Submitted 14 September, 2016; originally announced September 2016.

    Comments: 69 pages, 22 extended abstracts, iTWIST'16 website: http://www.itwist16.es.aau.dk

  15. arXiv:1606.03956  [pdf, other

    cs.IT cond-mat.dis-nn cs.LG stat.ML

    Inferring Sparsity: Compressed Sensing using Generalized Restricted Boltzmann Machines

    Authors: Eric W. Tramel, Andre Manoel, Francesco Caltagirone, Marylou Gabrié, Florent Krzakala

    Abstract: In this work, we consider compressed sensing reconstruction from $M$ measurements of $K$-sparse structured signals which do not possess a writable correlation model. Assuming that a generative statistical model, such as a Boltzmann machine, can be trained in an unsupervised manner on example signals, we demonstrate how this signal model can be used within a Bayesian framework of signal reconstruct… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.

    Comments: IEEE Information Theory Workshop, 2016

    Journal ref: 2016 IEEE Information Theory Workshop (ITW), Pages: 265 - 269

  16. arXiv:1506.02914  [pdf, other

    cond-mat.dis-nn cs.LG cs.NE stat.ML

    Training Restricted Boltzmann Machines via the Thouless-Anderson-Palmer Free Energy

    Authors: Marylou Gabrié, Eric W. Tramel, Florent Krzakala

    Abstract: Restricted Boltzmann machines are undirected neural networks which have been shown to be effective in many applications, including serving as initializations for training deep multi-layer neural networks. One of the main reasons for their success is the existence of efficient and practical stochastic algorithms, such as contrastive divergence, for unsupervised training. We propose an alternative d… ▽ More

    Submitted 15 June, 2015; v1 submitted 9 June, 2015; originally announced June 2015.

    Comments: 8 pages, 7 figures, demo online at http://www.lps.ens.fr/~krzakala/WASP.html

    Journal ref: Advances in Neural Information Processing Systems (NIPS 2015) 28, pages 640--648