Showing 1–6 of 6 results for author: Saul, L K

  1. arXiv:2403.13748  [pdf, other]

    stat.ML cs.LG stat.CO

    Variational Inference for Uncertainty Quantification: an Analysis of Trade-offs

    Authors: Charles C. Margossian, Loucas Pillaud-Vivien, Lawrence K. Saul

    Abstract: Given an intractable distribution $p$, the problem of variational inference (VI) is to find the best approximation from some more tractable family $Q$. Commonly, one chooses $Q$ to be a family of factorized distributions (i.e., the mean-field assumption), even though $p$ itself does not factorize. We show that this mismatch leads to an impossibility theorem: if $p$ does not factorize, then any fac…

    Submitted 7 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2402.14758  [pdf, other]

    stat.ML cs.AI cs.LG stat.CO

    Batch and match: black-box variational inference with a score-based divergence

    Authors: Diana Cai, Chirag Modi, Loucas Pillaud-Vivien, Charles C. Margossian, Robert M. Gower, David M. Blei, Lawrence K. Saul

    Abstract: Most leading implementations of black-box variational inference (BBVI) are based on optimizing a stochastic evidence lower bound (ELBO). But such approaches to BBVI often converge slowly due to the high variance of their gradient estimates and their sensitivity to hyperparameters. In this work, we propose batch and match (BaM), an alternative approach to BBVI based on a score-based divergence. Not…

    Submitted 12 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 49 pages, 14 figures. To appear in the Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  3. arXiv:1409.3518  [pdf, other]

    stat.ML cs.IR cs.LG

    Topic Modeling of Hierarchical Corpora

    Authors: Do-kyum Kim, Geoffrey M. Voelker, Lawrence K. Saul

    Abstract: We study the problem of topic modeling in corpora whose documents are organized in a multi-level hierarchy. We explore a parametric approach to this problem, assuming that the number of topics is known or can be estimated by cross-validation. The models we consider can be viewed as special (finite-dimensional) instances of hierarchical Dirichlet processes (HDPs). For these models we show that ther…

    Submitted 13 April, 2015; v1 submitted 11 September, 2014; originally announced September 2014.

  4. arXiv:1112.3714  [pdf]

    cs.LG

    Nonnegative Matrix Factorization for Semi-supervised Dimensionality Reduction

    Authors: Youngmin Cho, Lawrence K. Saul

    Abstract: We show how to incorporate information from labeled examples into nonnegative matrix factorization (NMF), a popular unsupervised learning algorithm for dimensionality reduction. In addition to mapping the data into a space of lower dimensionality, our approach aims to preserve the nonnegative components of the data that are important for classification. We identify these components from the suppor…

    Submitted 16 December, 2011; originally announced December 2011.

    Comments: Preprint submitted to Machine Learning Journal

  5. arXiv:1112.3712  [pdf]

    cs.LG

    Analysis and Extension of Arc-Cosine Kernels for Large Margin Classification

    Authors: Youngmin Cho, Lawrence K. Saul

    Abstract: We investigate a recently proposed family of positive-definite kernels that mimic the computation in large neural networks. We examine the properties of these kernels using tools from differential geometry; specifically, we analyze the geometry of surfaces in Hilbert space that are induced by these kernels. When this geometry is described by a Riemannian manifold, we derive results for the metric,…

    Submitted 16 December, 2011; originally announced December 2011.

    Comments: Preprint submitted to Neural Networks

  6. arXiv:cs/9603102  [pdf, ps]

    cs.AI

    Mean Field Theory for Sigmoid Belief Networks

    Authors: L. K. Saul, T. Jaakkola, M. I. Jordan

    Abstract: We develop a mean field theory for sigmoid belief networks based on ideas from statistical mechanics. Our mean field theory provides a tractable approximation to the true probability distribution in these networks; it also yields a lower bound on the likelihood of evidence. We demonstrate the utility of this framework on a benchmark problem in statistical pattern recognition---the classification…

    Submitted 29 February, 1996; originally announced March 1996.

    Comments: See http://www.jair.org/ for any accompanying files

    Journal ref: Journal of Artificial Intelligence Research, Vol 4, (1996), 61-76