Skip to main content

Showing 1–50 of 78 results for author: Blei, D M

  1. arXiv:2402.14758  [pdf, other

    stat.ML cs.AI cs.LG stat.CO

    Batch and match: black-box variational inference with a score-based divergence

    Authors: Diana Cai, Chirag Modi, Loucas Pillaud-Vivien, Charles C. Margossian, Robert M. Gower, David M. Blei, Lawrence K. Saul

    Abstract: Most leading implementations of black-box variational inference (BBVI) are based on optimizing a stochastic evidence lower bound (ELBO). But such approaches to BBVI often converge slowly due to the high variance of their gradient estimates and their sensitivity to hyperparameters. In this work, we propose batch and match (BaM), an alternative approach to BBVI based on a score-based divergence. Not… ▽ More

    Submitted 12 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 49 pages, 14 figures. To appear in the Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  2. arXiv:2312.02331  [pdf, ps, other

    cs.CL cs.LG

    Revisiting Topic-Guided Language Models

    Authors: Carolina Zheng, Keyon Vafa, David M. Blei

    Abstract: A recent line of work in natural language processing has aimed to combine language models and topic models. These topic-guided language models augment neural language models with topic models, unsupervised learning methods that can discover document-level patterns of word use. This paper compares the effectiveness of these methods in a standardized setting. We study four topic-guided language mode… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR) (12/2023)

  3. A Computational Approach to Style in American Poetry

    Authors: David M. Kaplan, David M. Blei

    Abstract: We develop a quantitative method to assess the style of American poems and to visualize a collection of poems in relation to one another. Qualitative poetry criticism helped guide our development of metrics that analyze various orthographic, syntactic, and phonemic features. These features are used to discover comprehensive stylistic information from a poem's multi-layered latent structure, and to… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: accepted manuscript; see doi for version of record

    ACM Class: J.5; I.2.7

    Journal ref: Seventh IEEE International Conference on Data Mining (ICDM 2007), Omaha, NE, USA, 2007, pp. 553-558

  4. arXiv:2307.14324  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Evaluating the Moral Beliefs Encoded in LLMs

    Authors: Nino Scherrer, Claudia Shi, Amir Feder, David M. Blei

    Abstract: This paper presents a case study on the design, administration, post-processing, and evaluation of surveys on large language models (LLMs). It comprises two components: (1) A statistical method for eliciting beliefs encoded in LLMs. We introduce statistical measures and evaluation metrics that quantify the probability of an LLM "making a choice", the associated uncertainty, and the consistency of… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  5. arXiv:2307.11018  [pdf, other

    stat.ML cs.LG

    Amortized Variational Inference: When and Why?

    Authors: Charles C. Margossian, David M. Blei

    Abstract: In a probabilistic latent variable model, factorized (or mean-field) variational inference (F-VI) fits a separate parametric distribution for each latent variable. Amortized variational inference (A-VI) instead learns a common inference function, which maps each observation to its corresponding latent variable's approximate posterior. Typically, A-VI is used as a step in the training of variationa… ▽ More

    Submitted 23 May, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

  6. arXiv:2306.17775  [pdf, other

    stat.ML cs.LG q-bio.BM

    Practical and Asymptotically Exact Conditional Sampling in Diffusion Models

    Authors: Luhuan Wu, Brian L. Trippe, Christian A. Naesseth, David M. Blei, John P. Cunningham

    Abstract: Diffusion models have been successful on a range of conditional generation tasks including molecular design and text-to-image generation. However, these achievements have primarily depended on task-specific conditional training or error-prone heuristic approximations. Ideally, a conditional generation method should provide exact samples for a broad range of conditional distributions without requir… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Code: https://github.com/blt2114/twisted_diffusion_sampler

  7. arXiv:2306.12497  [pdf, other

    cs.LG stat.ML

    Density Uncertainty Layers for Reliable Uncertainty Estimation

    Authors: Yookoon Park, David M. Blei

    Abstract: Assessing the predictive uncertainty of deep neural networks is crucial for safety-related applications of deep learning. Although Bayesian deep learning offers a principled framework for estimating model uncertainty, the common approaches that approximate the parameter posterior often fail to deliver reliable estimates of predictive uncertainty. In this paper, we propose a novel criterion for rel… ▽ More

    Submitted 4 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Published in AISTATS 2024

  8. arXiv:2306.00542  [pdf, other

    stat.ML cs.AI cs.LG

    Nonparametric Identifiability of Causal Representations from Unknown Interventions

    Authors: Julius von Kügelgen, Michel Besserve, Liang Wendong, Luigi Gresele, Armin Kekić, Elias Bareinboim, David M. Blei, Bernhard Schölkopf

    Abstract: We study causal representation learning, the task of inferring latent causal variables and their causal relations from high-dimensional mixtures of the variables. Prior work relies on weak supervision, in the form of counterfactual pre- and post-intervention views or temporal structure; places restrictive assumptions, such as linearity, on the mixing function or latent causal model; or requires pa… ▽ More

    Submitted 28 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready version; 36 pages, 4 figures

    MSC Class: 68T05 ACM Class: I.2.6

  9. arXiv:2306.00198  [pdf, other

    cs.CL cs.LG

    An Invariant Learning Characterization of Controlled Text Generation

    Authors: Carolina Zheng, Claudia Shi, Keyon Vafa, Amir Feder, David M. Blei

    Abstract: Controlled generation refers to the problem of creating text that contains stylistic or semantic attributes of interest. Many approaches reduce this problem to training a predictor of the desired attribute. For example, researchers hoping to deploy a large language model to produce non-toxic content may use a toxicity classifier to filter generated text. In practice, the generated text to classify… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: To appear in the 2023 Conference of the Association for Computational Linguistics (ACL 2023)

  10. arXiv:2301.00537  [pdf, other

    stat.ML cs.LG

    Posterior Collapse and Latent Variable Non-identifiability

    Authors: Yixin Wang, David M. Blei, John P. Cunningham

    Abstract: Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful re… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: 19 pages, 4 figures; NeurIPS 2021

  11. arXiv:2211.11183  [pdf, other

    cs.LG

    Causal Fairness Assessment of Treatment Allocation with Electronic Health Records

    Authors: Linying Zhang, Lauren R. Richter, Yixin Wang, Anna Ostropolets, Noemie Elhadad, David M. Blei, George Hripcsak

    Abstract: Healthcare continues to grapple with the persistent issue of treatment disparities, sparking concerns regarding the equitable allocation of treatments in clinical practice. While various fairness metrics have emerged to assess fairness in decision-making processes, a growing focus has been on causality-based fairness concepts due to their capacity to mitigate confounding effects and reason about b… ▽ More

    Submitted 7 January, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

  12. arXiv:2206.06584  [pdf, other

    stat.ML cs.LG stat.ME

    Probabilistic Conformal Prediction Using Conditional Random Samples

    Authors: Zhendong Wang, Ruijiang Gao, Mingzhang Yin, Mingyuan Zhou, David M. Blei

    Abstract: This paper proposes probabilistic conformal prediction (PCP), a predictive inference algorithm that estimates a target variable by a discontinuous predictive set. Given inputs, PCP construct the predictive set based on random samples from an estimated generative model. It is efficient and compatible with either explicit or implicit conditional generative models. Theoretically, we show that PCP gua… ▽ More

    Submitted 20 June, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  13. arXiv:2202.08370  [pdf, other

    cs.LG econ.EM

    CAREER: A Foundation Model for Labor Sequence Data

    Authors: Keyon Vafa, Emil Palikot, Tianyu Du, Ayush Kanodia, Susan Athey, David M. Blei

    Abstract: Labor economists regularly analyze employment data by fitting predictive models to small, carefully constructed longitudinal survey datasets. Although machine learning methods offer promise for such problems, these survey datasets are too small to take advantage of them. In recent years large datasets of online resumes have also become available, providing data about the career trajectories of mil… ▽ More

    Submitted 29 February, 2024; v1 submitted 16 February, 2022; originally announced February 2022.

  14. arXiv:2202.01841  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

    Authors: Liyi Zhang, David M. Blei, Christian A. Naesseth

    Abstract: Variational inference often minimizes the "reverse" Kullbeck-Leibler (KL) KL(q||p) from the approximate distribution q to the posterior p. Recent work studies the "forward" KL KL(p||q), which unlike reverse KL does not lead to variational approximations that underestimate uncertainty. This paper introduces Transport Score Climbing (TSC), a method that optimizes KL(p||q) by using Hamiltonian Monte… ▽ More

    Submitted 2 September, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: 14 pages, 8 figures

  15. arXiv:2112.04014  [pdf, other

    cs.LG cs.CV

    Unsupervised Representation Learning via Neural Activation Coding

    Authors: Yookoon Park, Sangho Lee, Gunhee Kim, David M. Blei

    Abstract: We present neural activation coding (NAC) as a novel approach for learning deep representations from unlabeled data for downstream applications. We argue that the deep encoder should maximize its nonlinear expressivity on the data for downstream predictors to take full advantage of its representation power. To this end, NAC maximizes the mutual information between activation patterns of the encode… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Published in International Conference on Machine Learning (ICML), 2021

  16. arXiv:2110.10804  [pdf, other

    stat.ML cs.LG stat.ME

    Identifiable Deep Generative Models via Sparse Decoding

    Authors: Gemma E. Moran, Dhanya Sridhar, Yixin Wang, David M. Blei

    Abstract: We develop the sparse VAE for unsupervised representation learning on high-dimensional data. The sparse VAE learns a set of latent factors (representations) which summarize the associations in the observed data features. The underlying model is sparse in that each observed feature (i.e. each dimension of the data) depends on a small subset of the latent factors. As examples, in ratings data each m… ▽ More

    Submitted 17 February, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

  17. arXiv:2109.11990  [pdf, other

    stat.ME cs.LG stat.ML

    Optimization-based Causal Estimation from Heterogenous Environments

    Authors: Mingzhang Yin, Yixin Wang, David M. Blei

    Abstract: This paper presents a new optimization approach to causal estimation. Given data that contains covariates and an outcome, which covariates are causes of the outcome, and what is the strength of the causality? In classical machine learning (ML), the goal of optimization is to maximize predictive accuracy. However, some covariates might exhibit a non-causal association with the outcome. Such spuriou… ▽ More

    Submitted 10 June, 2024; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: Journal of Machine Learning Research (JMLR). Code at https://github.com/mingzhang-yin/CoCo

  18. arXiv:2109.06387  [pdf, other

    cs.CL cs.LG

    Rationales for Sequential Predictions

    Authors: Keyon Vafa, Yuntian Deng, David M. Blei, Alexander M. Rush

    Abstract: Sequence models are a critical component of modern NLP systems, but their predictions are difficult to explain. We consider model explanations though rationales, subsets of context that can explain individual model predictions. We find sequential rationales by solving a combinatorial optimization: the best rationale is the smallest subset of input tokens that would predict the same output as the f… ▽ More

    Submitted 17 November, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Appeared in the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

  19. arXiv:2005.04232  [pdf, other

    cs.CL cs.LG stat.ML

    Text-Based Ideal Points

    Authors: Keyon Vafa, Suresh Naidu, David M. Blei

    Abstract: Ideal point models analyze lawmakers' votes to quantify their political positions, or ideal points. But votes are not the only way to express a political position. Lawmakers also give speeches, release press statements, and post tweets. In this paper, we introduce the text-based ideal point model (TBIP), an unsupervised probabilistic topic model that analyzes texts to quantify the political positi… ▽ More

    Submitted 21 July, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: Appeared in Proceedings of the 2020 Conference of the Association for Computational Linguistics (ACL 2020)

  20. arXiv:2003.04948  [pdf, ps, other

    stat.ML cs.LG

    Towards Clarifying the Theory of the Deconfounder

    Authors: Yixin Wang, David M. Blei

    Abstract: Wang and Blei (2019) studies multiple causal inference and proposes the deconfounder algorithm. The paper discusses theoretical requirements and presents empirical studies. Several refinements have been suggested around the theory of the deconfounder. Among these, Imai and Jiang clarified the assumption of "no unobserved single-cause confounders." Using their assumption, this paper clarifies the t… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

  21. arXiv:1910.12991  [pdf, other

    stat.ML cs.LG

    Poisson-Randomized Gamma Dynamical Systems

    Authors: Aaron Schein, Scott W. Linderman, Mingyuan Zhou, David M. Blei, Hanna Wallach

    Abstract: This paper presents the Poisson-randomized gamma dynamical system (PRGDS), a model for sequentially observed count tensors that encodes a strong inductive bias toward sparsity and burstiness. The PRGDS is based on a new motif in Bayesian latent variable modeling, an alternating chain of discrete Poisson and continuous gamma latent states that is analytically convenient and computationally tractabl… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: To appear in the Proceedings of the 32nd Advances in Neural Information Processing Systems (NeurIPS 2019)

  22. arXiv:1910.07320  [pdf, ps, other

    stat.ML cs.LG

    The Blessings of Multiple Causes: A Reply to Ogburn et al. (2019)

    Authors: Yixin Wang, David M. Blei

    Abstract: Ogburn et al. (2019, arXiv:1910.05438) discuss "The Blessings of Multiple Causes" (Wang and Blei, 2018, arXiv:1805.06826). Many of their remarks are interesting. But they also claim that the paper has "foundational errors" and that its "premise is...incorrect." These claims are not substantiated. There are no foundational errors; the premise is correct.

    Submitted 20 December, 2019; v1 submitted 15 October, 2019; originally announced October 2019.

  23. arXiv:1910.04302  [pdf, other

    stat.ML cs.LG stat.ME

    Prescribed Generative Adversarial Networks

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei, Michalis K. Titsias

    Abstract: Generative adversarial networks (GANs) are a powerful approach to unsupervised learning. They have achieved state-of-the-art performance in the image domain. However, GANs are limited in two ways. They often learn distributions with low support---a phenomenon known as mode collapse---and they do not guarantee the existence of a probability density, which makes evaluating generalization using predi… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: Code for this paper can be found at https://github.com/adjidieng/PresGANs

  24. Population Predictive Checks

    Authors: Gemma E. Moran, David M. Blei, Rajesh Ranganath

    Abstract: Bayesian modeling helps applied researchers articulate assumptions about their data and develop models tailored for specific applications. Thanks to good methods for approximate posterior inference, researchers can now easily build, use, and revise complicated Bayesian models for large and rich data. These capabilities, however, bring into focus the problem of model criticism. Researchers need too… ▽ More

    Submitted 15 July, 2022; v1 submitted 2 August, 2019; originally announced August 2019.

  25. arXiv:1907.05545  [pdf, other

    cs.CL stat.ML

    The Dynamic Embedded Topic Model

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

    Abstract: Topic modeling analyzes documents to learn meaningful patterns of words. For documents collected in sequence, dynamic topic models capture how these patterns vary over time. We develop the dynamic embedded topic model (D-ETM), a generative model of documents that combines dynamic latent Dirichlet allocation (D-LDA) and word embeddings. The D-ETM models each word with a categorical distribution par… ▽ More

    Submitted 10 October, 2019; v1 submitted 11 July, 2019; originally announced July 2019.

  26. arXiv:1907.04907  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    Topic Modeling in Embedding Spaces

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

    Abstract: Topic modeling analyzes documents to learn meaningful patterns of words. However, existing topic models fail to learn interpretable topics when working with large and heavy-tailed vocabularies. To this end, we develop the Embedded Topic Model (ETM), a generative model of documents that marries traditional topic models with word embeddings. In particular, it models each word with a categorical dist… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: Code can be found at https://github.com/adjidieng/ETM

  27. arXiv:1906.04072  [pdf, other

    stat.ML cs.LG stat.ME

    A Bayesian Model of Dose-Response for Cancer Drug Studies

    Authors: Wesley Tansey, Christopher Tosh, David M. Blei

    Abstract: Exploratory cancer drug studies test multiple tumor cell lines against multiple candidate drugs. The goal in each paired (cell line, drug) experiment is to map out the dose-response curve of the cell line as the dose level of the drug increases. We propose Bayesian Tensor Filtering (BTF), a hierarchical Bayesian model for dose-response modeling in multi-sample, multi-treatment cancer drug studies.… ▽ More

    Submitted 22 March, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Extended to handle covariates; additional benchmarks comparing to related work

  28. arXiv:1906.02120  [pdf, other

    stat.ML cs.LG stat.ME

    Adapting Neural Networks for the Estimation of Treatment Effects

    Authors: Claudia Shi, David M. Blei, Victor Veitch

    Abstract: This paper addresses the use of neural networks for the estimation of treatment effects from observational data. Generally, estimation proceeds in two stages. First, we fit models for the expected outcome and the probability of treatment (propensity score) for each unit. Second, we plug these fitted models into a downstream estimator of the effect. Neural networks are a natural choice for the mode… ▽ More

    Submitted 17 October, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

  29. arXiv:1905.12793  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Multiple Causes: A Causal Graphical View

    Authors: Yixin Wang, David M. Blei

    Abstract: Unobserved confounding is a major hurdle for causal inference from observational data. Confounders---the variables that affect both the causes and the outcome---induce spurious non-causal correlations between the two. Wang & Blei (2018) lower this hurdle with "the blessings of multiple causes," where the correlation structure of multiple causes provides indirect evidence for unobserved confounding… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 23 pages

  30. arXiv:1905.12741  [pdf, other

    cs.LG cs.CL stat.ML

    Adapting Text Embeddings for Causal Inference

    Authors: Victor Veitch, Dhanya Sridhar, David M. Blei

    Abstract: Does adding a theorem to a paper affect its chance of acceptance? Does labeling a post with the author's gender affect the post popularity? This paper develops a method to estimate such causal effects from observational text data, adjusting for confounding features of the text such as the subject or writing quality. We assume that the text suffices for causal adjustment but that, in practice, it i… ▽ More

    Submitted 25 July, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

  31. arXiv:1905.10870  [pdf, other

    stat.ML cs.LG

    Equal Opportunity and Affirmative Action via Counterfactual Predictions

    Authors: Yixin Wang, Dhanya Sridhar, David M. Blei

    Abstract: Machine learning (ML) can automate decision-making by learning to predict decisions from historical data. However, these predictors may inherit discriminatory policies from past decisions and reproduce unfair decisions. In this paper, we propose two algorithms that adjust fitted ML predictors to make them fair. We focus on two legal notions of fairness: (a) providing equal opportunity (EO) to indi… ▽ More

    Submitted 29 May, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: 18 pages

  32. arXiv:1905.10859  [pdf, other

    stat.ML cs.LG math.ST

    Variational Bayes under Model Misspecification

    Authors: Yixin Wang, David M. Blei

    Abstract: Variational Bayes (VB) is a scalable alternative to Markov chain Monte Carlo (MCMC) for Bayesian posterior inference. Though popular, VB comes with few theoretical guarantees, most of which focus on well-specified models. However, models are rarely well-specified in practice. In this work, we study VB under model misspecification. We prove the VB posterior is asymptotically normal and centers at t… ▽ More

    Submitted 11 August, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  33. arXiv:1904.02098  [pdf, other

    stat.ML cs.LG

    The Medical Deconfounder: Assessing Treatment Effects with Electronic Health Records

    Authors: Linying Zhang, Yixin Wang, Anna Ostropolets, Jami J. Mulgrave, David M. Blei, George Hripcsak

    Abstract: The treatment effects of medications play a key role in guiding medical prescriptions. They are usually assessed with randomized controlled trials (RCTs), which are expensive. Recently, large-scale electronic health records (EHRs) have become available, opening up new opportunities for more cost-effective assessments. However, assessing a treatment effect from EHRs is challenging: it is biased by… ▽ More

    Submitted 17 August, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

  34. arXiv:1902.04114  [pdf, other

    stat.ML cs.LG

    Using Embeddings to Correct for Unobserved Confounding in Networks

    Authors: Victor Veitch, Yixin Wang, David M. Blei

    Abstract: We consider causal inference in the presence of unobserved confounding. We study the case where a proxy is available for the unobserved confounding in the form of a network connecting the units. For example, the link structure of a social network carries information about its members. We show how to effectively use the proxy to do causal inference. The main idea is to reduce the causal estimation… ▽ More

    Submitted 31 May, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: An earlier version also addressed the use of text embeddings. That material has been expanded and moved to arxiv:1905.12741, "Using Text Embeddings for Causal Inference"

  35. arXiv:1812.00209  [pdf, other

    stat.ML cs.LG q-bio.QM

    A Probabilistic Model of Cardiac Physiology and Electrocardiograms

    Authors: Andrew C. Miller, Ziad Obermeyer, David M. Blei, John P. Cunningham, Sendhil Mullainathan

    Abstract: An electrocardiogram (EKG) is a common, non-invasive test that measures the electrical activity of a patient's heart. EKGs contain useful diagnostic information about patient health that may be absent from other electronic health record (EHR) data. As multi-dimensional waveforms, they could be modeled using generic machine learning tools, such as a linear factor model or a variational autoencoder.… ▽ More

    Submitted 1 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

    Report number: ML4H/2018/97

  36. arXiv:1808.06581  [pdf, other

    cs.IR cs.LG stat.ML

    The Deconfounded Recommender: A Causal Inference Approach to Recommendation

    Authors: Yixin Wang, Dawen Liang, Laurent Charlin, David M. Blei

    Abstract: The goal of recommendation is to show users items that they will like. Though usually framed as a prediction, the spirit of recommendation is to answer an interventional question---for each user and movie, what would the rating be if we "forced" the user to watch the movie? To this end, we develop a causal approach to recommendation, one where watching a movie is a "treatment" and a user's rating… ▽ More

    Submitted 27 May, 2019; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: 15 pages

  37. arXiv:1807.04863  [pdf, other

    stat.ML cs.CL cs.LG

    Avoiding Latent Variable Collapse With Generative Skip Models

    Authors: Adji B. Dieng, Yoon Kim, Alexander M. Rush, David M. Blei

    Abstract: Variational autoencoders learn distributions of high-dimensional data. They model data with a deep latent-variable model and then fit the model by maximizing a lower bound of the log marginal likelihood. VAEs can capture complex distributions, but they can also suffer from an issue known as "latent variable collapse," especially if the likelihood model is powerful. Specifically, the lower bound in… ▽ More

    Submitted 30 January, 2019; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: In the Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019), Naha, Okinawa, Japan. PMLR: Volume 89. An earlier version of this paper was presented at the Workshop on Theoretical Foundations and Applications of Deep Generative Models, ICML, 2018

  38. arXiv:1806.10701  [pdf, other

    stat.ML cs.LG cs.SI

    Empirical Risk Minimization and Stochastic Gradient Descent for Relational Data

    Authors: Victor Veitch, Morgane Austern, Wenda Zhou, David M. Blei, Peter Orbanz

    Abstract: Empirical risk minimization is the main tool for prediction problems, but its extension to relational data remains unsolved. We solve this problem using recent ideas from graph sampling theory to (i) define an empirical risk for relational data and (ii) obtain stochastic gradients for this empirical risk that are automatically unbiased. This is achieved by considering the method by which data is s… ▽ More

    Submitted 22 February, 2019; v1 submitted 27 June, 2018; originally announced June 2018.

    Comments: Accepted as AISTATS 2019 Oral

  39. arXiv:1806.03143  [pdf, other

    stat.ML cs.LG

    Black Box FDR

    Authors: Wesley Tansey, Yixin Wang, David M. Blei, Raul Rabadan

    Abstract: Analyzing large-scale, multi-experiment studies requires scientists to test each experimental outcome for statistical significance and then assess the results as a whole. We present Black Box FDR (BB-FDR), an empirical-Bayes method for analyzing multi-experiment studies when many covariates are gathered per experiment. BB-FDR learns a series of black box predictive models to boost power and contro… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: To appear at ICML'18; code available at https://github.com/tansey/bb-fdr

  40. arXiv:1805.06826  [pdf, other

    stat.ML cs.LG stat.ME

    The Blessings of Multiple Causes

    Authors: Yixin Wang, David M. Blei

    Abstract: Causal inference from observational data often assumes "ignorability," that all confounders are observed. This assumption is standard yet untestable. However, many scientific studies involve multiple causes, different variables whose effects are simultaneously of interest. We propose the deconfounder, an algorithm that combines unsupervised machine learning and predictive model checking to perform… ▽ More

    Submitted 14 April, 2019; v1 submitted 17 May, 2018; originally announced May 2018.

    Comments: 72 pages

  41. arXiv:1805.01500  [pdf, other

    stat.ML cs.LG stat.ME

    Noisin: Unbiased Regularization for Recurrent Neural Networks

    Authors: Adji B. Dieng, Rajesh Ranganath, Jaan Altosaar, David M. Blei

    Abstract: Recurrent neural networks (RNNs) are powerful models of sequential data. They have been successfully used in domains such as text and speech. However, RNNs are susceptible to overfitting; regularization is important. In this paper we develop Noisin, a new method for regularizing RNNs. Noisin injects random noise into the hidden states of the RNN and then maximizes the corresponding marginal likeli… ▽ More

    Submitted 12 July, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: In Proceedings of the International Conference on Machine Learning, 2018

  42. arXiv:1803.09123  [pdf, other

    stat.ML cs.CL cs.LG

    Equation Embeddings

    Authors: Kriste Krstovski, David M. Blei

    Abstract: We present an unsupervised approach for discovering semantic representations of mathematical equations. Equations are challenging to analyze because each is unique, or nearly unique. Our method, which we call equation embeddings, finds good representations of equations by using the representations of their surrounding words. We used equation embeddings to analyze four collections of scientific art… ▽ More

    Submitted 24 March, 2018; originally announced March 2018.

    Comments: 12 pages, 2 figures

  43. arXiv:1802.04220  [pdf, other

    stat.ML cs.LG

    Augment and Reduce: Stochastic Inference for Large Categorical Distributions

    Authors: Francisco J. R. Ruiz, Michalis K. Titsias, Adji B. Dieng, David M. Blei

    Abstract: Categorical distributions are ubiquitous in machine learning, e.g., in classification, language models, and recommendation systems. However, when the number of possible outcomes is very large, using categorical distributions becomes computationally expensive, as the complexity scales linearly with the number of outcomes. To address this problem, we propose augment and reduce (A&R), a method to all… ▽ More

    Submitted 7 June, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: 11 pages, 2 figures

    Journal ref: Francisco J. R. Ruiz, Michalis K. Titsias, Adji B. Dieng, and David M. Blei. Augment and Reduce: Stochastic Inference for Large Categorical Distributions. International Conference on Machine Learning. Stockholm (Sweden), July 2018

  44. arXiv:1711.03560  [pdf, other

    stat.ML cs.LG econ.EM

    SHOPPER: A Probabilistic Model of Consumer Choice with Substitutes and Complements

    Authors: Francisco J. R. Ruiz, Susan Athey, David M. Blei

    Abstract: We develop SHOPPER, a sequential probabilistic model of shopping data. SHOPPER uses interpretable components to model the forces that drive how a customer chooses products; in particular, we designed SHOPPER to capture how items interact with other items. We develop an efficient posterior inference algorithm to estimate these forces from large-scale data, and we analyze a large dataset from a majo… ▽ More

    Submitted 9 June, 2019; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: Published at Annals of Applied Statistics. 27 pages, 4 figures

  45. arXiv:1710.10742  [pdf, ps, other

    stat.ML cs.LG q-bio.GN stat.AP stat.ME

    Implicit Causal Models for Genome-wide Association Studies

    Authors: Dustin Tran, David M. Blei

    Abstract: Progress in probabilistic generative models has accelerated, developing richer models with neural architectures, implicit densities, and with scalable algorithms for their Bayesian inference. However, there has been limited progress in models that capture causal relationships, for example, how individual genetic factors cause major human diseases. In this work, we focus on two challenges in partic… ▽ More

    Submitted 29 October, 2017; originally announced October 2017.

  46. arXiv:1705.08931  [pdf, other

    stat.ML cs.LG stat.CO

    Proximity Variational Inference

    Authors: Jaan Altosaar, Rajesh Ranganath, David M. Blei

    Abstract: Variational inference is a powerful approach for approximate posterior inference. However, it is sensitive to initialization and can be subject to poor local optima. In this paper, we develop proximity variational inference (PVI). PVI is a new method for optimizing the variational objective that constrains subsequent iterates of the variational parameters to robustify the optimization path. Conseq… ▽ More

    Submitted 24 May, 2017; originally announced May 2017.

    MSC Class: 68T10 ACM Class: G.3; I.5.0; I.5.1

  47. arXiv:1705.03439  [pdf, other

    stat.ML cs.LG math.ST

    Frequentist Consistency of Variational Bayes

    Authors: Yixin Wang, David M. Blei

    Abstract: A key challenge for modern Bayesian statistics is how to perform scalable inference of posterior distributions. To address this challenge, variational Bayes (VB) methods have emerged as a popular alternative to the classical Markov chain Monte Carlo (MCMC) methods. VB methods tend to be faster while achieving comparable predictive performance. However, there are few theoretical results around VB.… ▽ More

    Submitted 7 July, 2021; v1 submitted 9 May, 2017; originally announced May 2017.

    Journal ref: Journal of the American Statistical Association 114.527 (2019): 1147-1161

  48. arXiv:1704.04289  [pdf, other

    stat.ML cs.LG

    Stochastic Gradient Descent as Approximate Bayesian Inference

    Authors: Stephan Mandt, Matthew D. Hoffman, David M. Blei

    Abstract: Stochastic Gradient Descent with a constant learning rate (constant SGD) simulates a Markov chain with a stationary distribution. With this perspective, we derive several new results. (1) We show that constant SGD can be used as an approximate Bayesian posterior inference algorithm. Specifically, we show how to adjust the tuning parameters of constant SGD to best match the stationary distribution… ▽ More

    Submitted 19 January, 2018; v1 submitted 13 April, 2017; originally announced April 2017.

    Comments: 35 pages, published version (JMLR 2017)

    Journal ref: Journal of Machine Learning Research 18 (2017) 1-35

  49. arXiv:1702.08896  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Hierarchical Implicit Models and Likelihood-Free Variational Inference

    Authors: Dustin Tran, Rajesh Ranganath, David M. Blei

    Abstract: Implicit probabilistic models are a flexible class of models defined by a simulation process for data. They form the basis for theories which encompass our understanding of the physical world. Despite this fundamental nature, the use of implicit models remains limited due to challenges in specifying complex latent structure in them, and in performing inferences in such models with large data sets.… ▽ More

    Submitted 4 November, 2017; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: Appears in Neural Information Processing Systems, 2017

  50. arXiv:1701.03757  [pdf, ps, other

    stat.ML cs.AI cs.LG cs.PL stat.CO

    Deep Probabilistic Programming

    Authors: Dustin Tran, Matthew D. Hoffman, Rif A. Saurous, Eugene Brevdo, Kevin Murphy, David M. Blei

    Abstract: We propose Edward, a Turing-complete probabilistic programming language. Edward defines two compositional representations---random variables and inference. By treating inference as a first class citizen, on a par with modeling, we show that probabilistic programming can be as flexible and computationally efficient as traditional deep learning. For flexibility, Edward makes it easy to fit the same… ▽ More

    Submitted 7 March, 2017; v1 submitted 13 January, 2017; originally announced January 2017.

    Comments: Appears in International Conference on Learning Representations, 2017. A companion webpage for this paper is available at http://edwardlib.org/iclr2017