Skip to main content

Showing 1–47 of 47 results for author: Poole, B

  1. arXiv:2405.16852  [pdf, other

    cs.LG cs.AI stat.ML

    EM Distillation for One-step Diffusion Models

    Authors: Sirui Xie, Zhisheng Xiao, Diederik P Kingma, Tingbo Hou, Ying Nian Wu, Kevin Patrick Murphy, Tim Salimans, Ben Poole, Ruiqi Gao

    Abstract: While diffusion models can learn complex distributions, sampling requires a computationally expensive iterative process. Existing distillation methods enable efficient sampling, but have notable limitations, such as performance degradation with very few sampling steps, reliance on training data access, or mode-seeking optimization that may fail to capture the full distribution. We propose EM Disti… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2405.10314  [pdf, other

    cs.CV

    CAT3D: Create Anything in 3D with Multi-View Diffusion Models

    Authors: Ruiqi Gao, Aleksander Holynski, Philipp Henzler, Arthur Brussee, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. Barron, Ben Poole

    Abstract: Advances in 3D reconstruction have enabled high-quality 3D capture, but require a user to collect hundreds to thousands of images to create a 3D scene. We present CAT3D, a method for creating anything in 3D by simulating this real-world capture process with a multi-view diffusion model. Given any number of input images and a set of target novel viewpoints, our model generates highly consistent nov… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Project page: https://cat3d.github.io

  3. arXiv:2404.04491  [pdf, other

    astro-ph.IM astro-ph.GA cs.LG

    Galaxy 3D Shape Recovery using Mixture Density Network

    Authors: Suk Yee Yong, K. E. Harborne, Caroline Foster, Robert Bassett, Gregory B. Poole, Mitchell Cavanagh

    Abstract: Since the turn of the century, astronomers have been exploiting the rich information afforded by combining stellar kinematic maps and imaging in an attempt to recover the intrinsic, three-dimensional (3D) shape of a galaxy. A common intrinsic shape recovery method relies on an expected monotonic relationship between the intrinsic misalignment of the kinematic and morphological axes and the triaxia… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted for publication in PASA. 18 pages, 12 figures, 2 tables

    Journal ref: Publ. Astron. Soc. Aust. 41 (2024) e033

  4. arXiv:2404.01203  [pdf, other

    cs.CV

    Video Interpolation with Diffusion Models

    Authors: Siddhant Jain, Daniel Watson, Eric Tabellion, Aleksander Hołyński, Ben Poole, Janne Kontkanen

    Abstract: We present VIDIM, a generative model for video interpolation, which creates short videos given a start and end frame. In order to achieve high fidelity and generate motions unseen in the input data, VIDIM uses cascaded diffusion models to first generate the target video at low resolution, and then generate the high-resolution video conditioned on the low-resolution generated video. We compare VIDI… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, Project page at https://vidim-interpolation.github.io/

  5. arXiv:2402.16936  [pdf, other

    cs.CV cs.LG

    Disentangled 3D Scene Generation with Layout Learning

    Authors: Dave Epstein, Ben Poole, Ben Mildenhall, Alexei A. Efros, Aleksander Holynski

    Abstract: We introduce a method to generate 3D scenes that are disentangled into their component objects. This disentanglement is unsupervised, relying only on the knowledge of a large pretrained text-to-image model. Our key insight is that objects can be discovered by finding parts of a 3D scene that, when rearranged spatially, still produce valid configurations of the same scene. Concretely, our method jo… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2312.03869  [pdf, other

    cs.CV

    Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion

    Authors: Kira Prabhu, Jane Wu, Lynn Tsai, Peter Hedman, Dan B Goldman, Ben Poole, Michael Broxton

    Abstract: This paper presents a novel approach to inpainting 3D regions of a scene, given masked multi-view images, by distilling a 2D diffusion model into a learned 3D scene representation (e.g. a NeRF). Unlike 3D generative methods that explicitly condition the diffusion model on camera pose or multi-view information, our diffusion model is conditioned only on a single masked 2D image. Nevertheless, we sh… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  7. arXiv:2312.02981  [pdf, other

    cs.CV

    ReconFusion: 3D Reconstruction with Diffusion Priors

    Authors: Rundi Wu, Ben Mildenhall, Philipp Henzler, Keunhong Park, Ruiqi Gao, Daniel Watson, Pratul P. Srinivasan, Dor Verbin, Jonathan T. Barron, Ben Poole, Aleksander Holynski

    Abstract: 3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at rendering photorealistic novel views of complex scenes. However, recovering a high-quality NeRF typically requires tens to hundreds of input images, resulting in a time-consuming capture process. We present ReconFusion to reconstruct real-world scenes using only a few photos. Our approach leverages a diffusion prior for nove… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Project page: https://reconfusion.github.io/

  8. arXiv:2307.07568  [pdf, other

    cs.LG stat.ML

    Variational Prediction

    Authors: Alexander A. Alemi, Ben Poole

    Abstract: Bayesian inference offers benefits over maximum likelihood, but it also comes with computational costs. Computing the posterior is typically intractable, as is marginalizing that posterior to form the posterior predictive distribution. In this paper, we present variational prediction, a technique for directly learning a variational approximation to the posterior predictive distribution using a var… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: AABI2023

  9. arXiv:2306.00986  [pdf, other

    cs.CV cs.LG stat.ML

    Diffusion Self-Guidance for Controllable Image Generation

    Authors: Dave Epstein, Allan Jabri, Ben Poole, Alexei A. Efros, Aleksander Holynski

    Abstract: Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that provides greater control over generated images by guiding the internal representations of diffusion models. We demonstrate that properties such as the shape, locati… ▽ More

    Submitted 11 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Project page at https://dave.ml/selfguidance/

  10. arXiv:2304.14473  [pdf, other

    cs.CV cs.AI cs.LG

    Learning a Diffusion Prior for NeRFs

    Authors: Guandao Yang, Abhijit Kundu, Leonidas J. Guibas, Jonathan T. Barron, Ben Poole

    Abstract: Neural Radiance Fields (NeRFs) have emerged as a powerful neural 3D representation for objects and scenes derived from 2D data. Generating NeRFs, however, remains difficult in many scenarios. For instance, training a NeRF with only a small number of views as supervision remains challenging since it is an under-constrained problem. In such settings, it calls for some inductive prior to filter out b… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  11. arXiv:2303.13508  [pdf, other

    cs.CV cs.AI cs.GR

    DreamBooth3D: Subject-Driven Text-to-3D Generation

    Authors: Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz, Ben Mildenhall, Shiran Zada, Kfir Aberman, Michael Rubinstein, Jonathan Barron, Yuanzhen Li, Varun Jampani

    Abstract: We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject. Our approach combines recent advances in personalizing text-to-image models (DreamBooth) with text-to-3D generation (DreamFusion). We find that naively combining these methods fails to yield satisfactory subject-specific 3D assets due to personalized text-to-im… ▽ More

    Submitted 27 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Project page at https://dreambooth3d.github.io/ Video Summary at https://youtu.be/kKVDrbfvOoA

  12. arXiv:2301.06555  [pdf, other

    cs.HC cs.AI

    Error-related Potential Variability: Exploring the Effects on Classification and Transferability

    Authors: Benjamin Poole, Minwoo Lee

    Abstract: Brain-Computer Interfaces (BCI) have allowed for direct communication from the brain to external applications for the automatic detection of cognitive processes such as error recognition. Error-related potentials (ErrPs) are a particular brain signal elicited when one commits or observes an erroneous event. However, due to the noisy properties of the brain and recording devices, ErrPs vary from in… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: Published in 2022 IEEE Symposium Series on Computational Intelligence (SSCI)

  13. arXiv:2211.09760  [pdf, other

    cs.LG math.OC stat.ML

    VeLO: Training Versatile Learned Optimizers by Scaling Up

    Authors: Luke Metz, James Harrison, C. Daniel Freeman, Amil Merchant, Lucas Beyer, James Bradbury, Naman Agrawal, Ben Poole, Igor Mordatch, Adam Roberts, Jascha Sohl-Dickstein

    Abstract: While deep learning models have replaced hand-designed features across many domains, these models are still trained with hand-designed optimizers. In this work, we leverage the same scaling approach behind the success of deep learning to learn versatile optimizers. We train an optimizer for deep learning which is itself a small neural network that ingests gradients and outputs parameter updates. M… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  14. arXiv:2210.02303  [pdf, other

    cs.CV cs.LG

    Imagen Video: High Definition Video Generation with Diffusion Models

    Authors: Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans

    Abstract: We present Imagen Video, a text-conditional video generation system based on a cascade of video diffusion models. Given a text prompt, Imagen Video generates high definition videos using a base video generation model and a sequence of interleaved spatial and temporal video super-resolution models. We describe how we scale up the system as a high definition text-to-video model including design deci… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: See accompanying website: https://imagen.research.google/video/

  15. arXiv:2209.14988  [pdf, other

    cs.CV cs.LG stat.ML

    DreamFusion: Text-to-3D using 2D Diffusion

    Authors: Ben Poole, Ajay Jain, Jonathan T. Barron, Ben Mildenhall

    Abstract: Recent breakthroughs in text-to-image synthesis have been driven by diffusion models trained on billions of image-text pairs. Adapting this approach to 3D synthesis would require large-scale datasets of labeled 3D data and efficient architectures for denoising 3D data, neither of which currently exist. In this work, we circumvent these limitations by using a pretrained 2D text-to-image diffusion m… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: see project page at https://dreamfusion3d.github.io/

  16. Towards Interactive Reinforcement Learning with Intrinsic Feedback

    Authors: Benjamin Poole, Minwoo Lee

    Abstract: Reinforcement learning (RL) and brain-computer interfaces (BCI) have experienced significant growth over the past decade. With rising interest in human-in-the-loop (HITL), incorporating human input with RL algorithms has given rise to the sub-field of interactive RL. Adjacently, the field of BCI has long been interested in extracting informative brain signals from neural activity for use in human-… ▽ More

    Submitted 23 August, 2023; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Name change and vast rewrites of the paper

    Report number: Neurocomputing, 587, (2024), 127628

  17. arXiv:2112.01455  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Zero-Shot Text-Guided Object Generation with Dream Fields

    Authors: Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole

    Abstract: We combine neural rendering with multi-modal image and text representations to synthesize diverse 3D objects solely from natural language descriptions. Our method, Dream Fields, can generate the geometry and color of a wide range of objects without 3D supervision. Due to the scarcity of diverse, captioned 3D data, prior methods only generate objects from a handful of categories, such as ShapeNet.… ▽ More

    Submitted 4 May, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: CVPR 2022. 13 pages. Website: https://ajayj.com/dreamfields

  18. arXiv:2110.02037  [pdf, other

    cs.LG stat.ML

    Autoregressive Diffusion Models

    Authors: Emiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben Poole, Rianne van den Berg, Tim Salimans

    Abstract: We introduce Autoregressive Diffusion Models (ARDMs), a model class encompassing and generalizing order-agnostic autoregressive models (Uria et al., 2014) and absorbing discrete diffusion (Austin et al., 2021), which we show are special cases of ARDMs under mild assumptions. ARDMs are simple to implement and easy to train. Unlike standard ARMs, they do not require causal masking of model represent… ▽ More

    Submitted 1 February, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at International Conference on Learning Representations (ICLR) 2022

  19. arXiv:2107.00630  [pdf, other

    cs.LG stat.ML

    Variational Diffusion Models

    Authors: Diederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho

    Abstract: Diffusion-based generative models have demonstrated a capacity for perceptually impressive synthesis, but can they also be great likelihood-based models? We answer this in the affirmative, and introduce a family of diffusion-based generative models that obtain state-of-the-art likelihoods on standard image density estimation benchmarks. Unlike other diffusion-based models, our method allows for ef… ▽ More

    Submitted 13 April, 2023; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: Published at NeurIPS'21

  20. arXiv:2012.08125  [pdf, other

    cs.LG stat.ML

    Learning Energy-Based Models by Diffusion Recovery Likelihood

    Authors: Ruiqi Gao, Yang Song, Ben Poole, Ying Nian Wu, Diederik P. Kingma

    Abstract: While energy-based models (EBMs) exhibit a number of desirable properties, training and sampling on high-dimensional datasets remains challenging. Inspired by recent progress on diffusion probabilistic models, we present a diffusion recovery likelihood method to tractably learn and sample from a sequence of EBMs trained on increasingly noisy versions of a dataset. Each EBM is trained with recovery… ▽ More

    Submitted 27 March, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

  21. arXiv:2011.13456  [pdf, other

    cs.LG stat.ML

    Score-Based Generative Modeling through Stochastic Differential Equations

    Authors: Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, Ben Poole

    Abstract: Creating noise from data is easy; creating data from noise is generative modeling. We present a stochastic differential equation (SDE) that smoothly transforms a complex data distribution to a known prior distribution by slowly injecting noise, and a corresponding reverse-time SDE that transforms the prior distribution back into the data distribution by slowly removing the noise. Crucially, the re… ▽ More

    Submitted 10 February, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: ICLR 2021 (Oral)

  22. arXiv:2011.08711  [pdf, other

    stat.ML cs.LG

    VIB is Half Bayes

    Authors: Alexander A Alemi, Warren R Morningstar, Ben Poole, Ian Fischer, Joshua V Dillon

    Abstract: In discriminative settings such as regression and classification there are two random variables at play, the inputs X and the targets Y. Here, we demonstrate that the Variational Information Bottleneck can be viewed as a compromise between fully empirical and fully Bayesian objectives, attempting to minimize the risks due to finite sampling of Y only. We argue that this approach provides some of t… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  23. arXiv:2010.08029  [pdf, other

    cs.LG stat.ML

    Non-saturating GAN training as divergence minimization

    Authors: Matt Shannon, Ben Poole, Soroosh Mariooryad, Tom Bagby, Eric Battenberg, David Kao, Daisy Stanton, RJ Skerry-Ryan

    Abstract: Non-saturating generative adversarial network (GAN) training is widely used and has continued to obtain groundbreaking results. However so far this approach has lacked strong theoretical justification, in contrast to alternatives such as f-GANs and Wasserstein GANs which are motivated in terms of approximate divergence minimization. In this paper we show that non-saturating GAN training does in fa… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  24. arXiv:2009.11243  [pdf, other

    cs.LG cs.NE stat.ML

    Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves

    Authors: Luke Metz, Niru Maheswaranathan, C. Daniel Freeman, Ben Poole, Jascha Sohl-Dickstein

    Abstract: Much as replacing hand-designed features with learned functions has revolutionized how we solve perceptual tasks, we believe learned algorithms will transform how we train models. In this work we focus on general-purpose learned optimizers capable of training a wide variety of problems with no user-specified hyperparameters. We introduce a new, neural network parameterized, hierarchical optimizer… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.

  25. arXiv:2005.10243  [pdf, other

    cs.CV cs.LG

    What Makes for Good Views for Contrastive Learning?

    Authors: Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, Phillip Isola

    Abstract: Contrastive learning between multiple views of the data has recently achieved state of the art performance in the field of self-supervised representation learning. Despite its success, the influence of different view choices has been less studied. In this paper, we use theoretical and empirical analysis to better understand the importance of view selection, and argue that we should reduce the mutu… ▽ More

    Submitted 18 December, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: NeurIPS 2020. Project page: https://hobbitlong.github.io/InfoMin/

  26. arXiv:2002.11887  [pdf, other

    cs.LG stat.ML

    Using a thousand optimization tasks to learn hyperparameter search strategies

    Authors: Luke Metz, Niru Maheswaranathan, Ruoxi Sun, C. Daniel Freeman, Ben Poole, Jascha Sohl-Dickstein

    Abstract: We present TaskSet, a dataset of tasks for use in training and evaluating optimizers. TaskSet is unique in its size and diversity, containing over a thousand tasks ranging from image classification with fully connected or convolutional neural networks, to variational autoencoders, to non-volume preserving flows on a variety of datasets. As an example application of such a dataset we explore meta-l… ▽ More

    Submitted 31 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

  27. arXiv:2002.08927  [pdf, other

    cs.LG stat.ML

    Regularized Autoencoders via Relaxed Injective Probability Flow

    Authors: Abhishek Kumar, Ben Poole, Kevin Murphy

    Abstract: Invertible flow-based generative models are an effective method for learning to generate samples, while allowing for tractable likelihood computation and inference. However, the invertibility requirement restricts models to have the same latent dimensionality as the inputs. This imposes significant architectural, memory, and computational costs, making them more challenging to scale than other cla… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: AISTATS 2020

  28. arXiv:2002.02886  [pdf, other

    cs.LG stat.ML

    Weakly-Supervised Disentanglement Without Compromises

    Authors: Francesco Locatello, Ben Poole, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem, Michael Tschannen

    Abstract: Intelligent agents should be able to learn useful representations by observing changes in their environment. We model such observations as pairs of non-i.i.d. images sharing at least one of the underlying factors of variation. First, we theoretically show that only knowing how many factors have changed, but not which ones, is sufficient to learn disentangled representations. Second, we provide pra… ▽ More

    Submitted 20 October, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Comments: We updated the description of the generation of the dataset compared to the ICML version

    Journal ref: ICML 2020

  29. arXiv:2002.00041  [pdf, other

    cs.LG stat.ML

    On Implicit Regularization in $β$-VAEs

    Authors: Abhishek Kumar, Ben Poole

    Abstract: While the impact of variational inference (VI) on posterior inference in a fixed generative model is well-characterized, its role in regularizing a learned generative model when used in variational autoencoders (VAEs) is poorly understood. We study the regularizing effects of variational distributions on learning in generative models from two perspectives. First, we analyze the role that the choic… ▽ More

    Submitted 28 December, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

    Comments: ICML 2020; Final version, including appendix

  30. arXiv:1910.09772  [pdf, other

    cs.LG stat.ML

    Weakly Supervised Disentanglement with Guarantees

    Authors: Rui Shu, Yining Chen, Abhishek Kumar, Stefano Ermon, Ben Poole

    Abstract: Learning disentangled representations that correspond to factors of variation in real-world data is critical to interpretable and human-controllable machine learning. Recently, concerns about the viability of learning disentangled representations in a purely unsupervised manner has spurred a shift toward the incorporation of weak supervision. However, there is currently no formalism that identifie… ▽ More

    Submitted 10 April, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: ICLR 2020

  31. arXiv:1910.09578  [pdf, other

    cs.LG cs.IT stat.ML

    On Predictive Information in RNNs

    Authors: Zhe Dong, Deniz Oktay, Ben Poole, Alexander A. Alemi

    Abstract: Certain biological neurons demonstrate a remarkable capability to optimally compress the history of sensory inputs while being maximally informative about the future. In this work, we investigate if the same can be said of artificial neurons in recurrent neural networks (RNNs) trained with maximum likelihood. Empirically, we find that RNNs are suboptimal in the information plane. Instead of optima… ▽ More

    Submitted 10 February, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

  32. arXiv:1906.02611  [pdf, other

    cs.LG cs.CV stat.ML

    Improving Robustness Without Sacrificing Accuracy with Patch Gaussian Augmentation

    Authors: Raphael Gontijo Lopes, Dong Yin, Ben Poole, Justin Gilmer, Ekin D. Cubuk

    Abstract: Deploying machine learning systems in the real world requires both high accuracy on clean data and robustness to naturally occurring corruptions. While architectural advances have led to improved accuracy, building robust models remains challenging. Prior work has argued that there is an inherent trade-off between robustness and accuracy, which is exemplified by standard data augment techniques su… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  33. arXiv:1905.10347  [pdf, other

    cs.LG stat.ML

    Discrete Flows: Invertible Generative Models of Discrete Data

    Authors: Dustin Tran, Keyon Vafa, Kumar Krishna Agrawal, Laurent Dinh, Ben Poole

    Abstract: While normalizing flows have led to significant advances in modeling high-dimensional continuous distributions, their applicability to discrete distributions remains unknown. In this paper, we show that flows can in fact be extended to discrete events---and under a simple change-of-variables formula not requiring log-determinant-Jacobian computations. Discrete flows have numerous applications. We… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

  34. arXiv:1905.06922  [pdf, other

    cs.LG stat.ML

    On Variational Bounds of Mutual Information

    Authors: Ben Poole, Sherjil Ozair, Aaron van den Oord, Alexander A. Alemi, George Tucker

    Abstract: Estimating and optimizing Mutual Information (MI) is core to many problems in machine learning; however, bounding MI in high dimensions is challenging. To establish tractable and scalable objectives, recent work has turned to variational bounds parameterized by neural networks, but the relationships and tradeoffs between these bounds remains unclear. In this work, we unify these recent development… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: ICML 2019

  35. arXiv:1901.03416  [pdf, other

    cs.LG stat.ML

    Preventing Posterior Collapse with delta-VAEs

    Authors: Ali Razavi, Aäron van den Oord, Ben Poole, Oriol Vinyals

    Abstract: Due to the phenomenon of "posterior collapse," current latent variable generative models pose a challenging design choice that either weakens the capacity of the decoder or requires augmenting the objective so it does not only maximize the likelihood of the data. In this paper, we propose an alternative that utilizes the most powerful generative models as decoders, whilst optimising the variationa… ▽ More

    Submitted 10 January, 2019; originally announced January 2019.

  36. arXiv:1711.00464  [pdf, other

    cs.LG stat.ML

    Fixing a Broken ELBO

    Authors: Alexander A. Alemi, Ben Poole, Ian Fischer, Joshua V. Dillon, Rif A. Saurous, Kevin Murphy

    Abstract: Recent work in unsupervised representation learning has focused on learning deep directed latent-variable models. Fitting these models by maximizing the marginal likelihood or evidence is typically intractable, thus a common approximation is to maximize the evidence lower bound (ELBO) instead. However, maximum likelihood training (whether exact or approximate) does not necessarily result in a good… ▽ More

    Submitted 13 February, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 21 pages, 9 figures

  37. arXiv:1703.04200  [pdf, other

    cs.LG q-bio.NC stat.ML

    Continual Learning Through Synaptic Intelligence

    Authors: Friedemann Zenke, Ben Poole, Surya Ganguli

    Abstract: While deep learning has led to remarkable advances across diverse applications, it struggles in domains where the data distribution changes over the course of learning. In stark contrast, biological neural networks continually adapt to changing domains, possibly by leveraging complex molecular machinery to solve many tasks simultaneously. In this study, we introduce intelligent synapses that bring… ▽ More

    Submitted 12 June, 2017; v1 submitted 12 March, 2017; originally announced March 2017.

    Comments: ICML 2017

  38. arXiv:1612.02780  [pdf, other

    cs.LG stat.ML

    Improved generator objectives for GANs

    Authors: Ben Poole, Alexander A. Alemi, Jascha Sohl-Dickstein, Anelia Angelova

    Abstract: We present a framework to understand GAN training as alternating density ratio estimation and approximate divergence minimization. This provides an interpretation for the mismatched GAN generator and discriminator objectives often used in practice, and explains the problem of poor sample diversity. We also derive a family of generator objectives that target arbitrary $f$-divergences without minimi… ▽ More

    Submitted 8 December, 2016; originally announced December 2016.

    Comments: NIPS 2016 Workshop on Adversarial Training

  39. arXiv:1611.08083  [pdf, other

    stat.ML cs.LG cs.NE

    Survey of Expressivity in Deep Neural Networks

    Authors: Maithra Raghu, Ben Poole, Jon Kleinberg, Surya Ganguli, Jascha Sohl-Dickstein

    Abstract: We survey results on neural network expressivity described in "On the Expressive Power of Deep Neural Networks". The paper motivates and develops three natural measures of expressiveness, which all display an exponential dependence on the depth of the network. In fact, all of these measures are related to a fourth quantity, trajectory length. This quantity grows exponentially in the depth of the n… ▽ More

    Submitted 24 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  40. arXiv:1611.02163  [pdf, other

    cs.LG stat.ML

    Unrolled Generative Adversarial Networks

    Authors: Luke Metz, Ben Poole, David Pfau, Jascha Sohl-Dickstein

    Abstract: We introduce a method to stabilize Generative Adversarial Networks (GANs) by defining the generator objective with respect to an unrolled optimization of the discriminator. This allows training to be adjusted between using the optimal discriminator in the generator's objective, which is ideal but infeasible in practice, and using the current value of the discriminator, which is often unstable and… ▽ More

    Submitted 12 May, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

  41. arXiv:1611.01144  [pdf, other

    stat.ML cs.LG

    Categorical Reparameterization with Gumbel-Softmax

    Authors: Eric Jang, Shixiang Gu, Ben Poole

    Abstract: Categorical variables are a natural choice for representing discrete structure in the world. However, stochastic neural networks rarely use categorical latent variables due to the inability to backpropagate through samples. In this work, we present an efficient gradient estimator that replaces the non-differentiable sample from a categorical distribution with a differentiable sample from a novel G… ▽ More

    Submitted 5 August, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

  42. arXiv:1606.05340  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Exponential expressivity in deep neural networks through transient chaos

    Authors: Ben Poole, Subhaneil Lahiri, Maithra Raghu, Jascha Sohl-Dickstein, Surya Ganguli

    Abstract: We combine Riemannian geometry with the mean field theory of high dimensional chaos to study the nature of signal propagation in generic, deep neural networks with random weights. Our results reveal an order-to-chaos expressivity phase transition, with networks in the chaotic phase computing nonlinear functions whose global curvature grows exponentially with depth but not width. We prove this gene… ▽ More

    Submitted 17 June, 2016; v1 submitted 16 June, 2016; originally announced June 2016.

    Comments: Fixed equation references

  43. arXiv:1606.05336  [pdf, other

    stat.ML cs.AI cs.LG

    On the Expressive Power of Deep Neural Networks

    Authors: Maithra Raghu, Ben Poole, Jon Kleinberg, Surya Ganguli, Jascha Sohl-Dickstein

    Abstract: We propose a new approach to the problem of neural network expressivity, which seeks to characterize how structural properties of a neural network family affect the functions it is able to compute. Our approach is based on an interrelated set of measures of expressivity, unified by the novel notion of trajectory length, which measures how the output of a network changes as the input sweeps along a… ▽ More

    Submitted 18 June, 2017; v1 submitted 16 June, 2016; originally announced June 2016.

    Comments: Accepted to ICML 2017

  44. arXiv:1606.00704  [pdf, other

    stat.ML cs.LG

    Adversarially Learned Inference

    Authors: Vincent Dumoulin, Ishmael Belghazi, Ben Poole, Olivier Mastropietro, Alex Lamb, Martin Arjovsky, Aaron Courville

    Abstract: We introduce the adversarially learned inference (ALI) model, which jointly learns a generation network and an inference network using an adversarial process. The generation network maps samples from stochastic latent variables to the data space while the inference network maps training examples in data space to the space of latent variables. An adversarial game is cast between these two networks… ▽ More

    Submitted 21 February, 2017; v1 submitted 2 June, 2016; originally announced June 2016.

  45. arXiv:1511.03296  [pdf, other

    cs.CV

    The Fast Bilateral Solver

    Authors: Jonathan T. Barron, Ben Poole

    Abstract: We present the bilateral solver, a novel algorithm for edge-aware smoothing that combines the flexibility and speed of simple filtering approaches with the accuracy of domain-specific optimization algorithms. Our technique is capable of matching or improving upon state-of-the-art results on several different computer vision tasks (stereo, depth superresolution, colorization, and semantic segmentat… ▽ More

    Submitted 22 July, 2016; v1 submitted 10 November, 2015; originally announced November 2015.

  46. arXiv:1406.1831  [pdf, other

    cs.NE cs.LG

    Analyzing noise in autoencoders and deep networks

    Authors: Ben Poole, Jascha Sohl-Dickstein, Surya Ganguli

    Abstract: Autoencoders have emerged as a useful framework for unsupervised learning of internal representations, and a wide variety of apparently conceptually disparate regularization techniques have been proposed to generate useful features. Here we extend existing denoising autoencoders to additionally inject noise before the nonlinearity, and at the hidden unit activations. We show that a wide variety of… ▽ More

    Submitted 6 June, 2014; originally announced June 2014.

  47. arXiv:1311.2115  [pdf, other

    cs.LG

    Fast large-scale optimization by unifying stochastic gradient and quasi-Newton methods

    Authors: Jascha Sohl-Dickstein, Ben Poole, Surya Ganguli

    Abstract: We present an algorithm for minimizing a sum of functions that combines the computational efficiency of stochastic gradient descent (SGD) with the second order curvature information leveraged by quasi-Newton methods. We unify these disparate approaches by maintaining an independent Hessian approximation for each contributing function in the sum. We maintain computational tractability and limit mem… ▽ More

    Submitted 29 November, 2014; v1 submitted 8 November, 2013; originally announced November 2013.

    MSC Class: 90C26 ACM Class: G.1.6