Skip to main content

Showing 1–11 of 11 results for author: Bergman, A W

  1. arXiv:2405.18407  [pdf, other

    cs.LG cs.CV

    Phased Consistency Model

    Authors: Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang

    Abstract: The consistency model (CM) has recently made significant progress in accelerating the generation of diffusion models. However, its application to high-resolution, text-conditioned image generation in the latent space (a.k.a., LCM) remains unsatisfactory. In this paper, we identify three key flaws in the current design of LCM. We investigate the reasons behind these limitations and propose the Phas… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2309.01811  [pdf, other

    cs.CV

    Instant Continual Learning of Neural Radiance Fields

    Authors: Ryan Po, Zhengyang Dong, Alexander W. Bergman, Gordon Wetzstein

    Abstract: Neural radiance fields (NeRFs) have emerged as an effective method for novel-view synthesis and 3D scene reconstruction. However, conventional training methods require access to all training views during scene optimization. This assumption may be prohibitive in continual learning scenarios, where new data is acquired in a sequential manner and a continuous update of the NeRF is desired, as in auto… ▽ More

    Submitted 5 September, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: For project page please visit https://ryanpo.com/icngp/

  3. arXiv:2307.05462  [pdf, other

    cs.CV

    Efficient 3D Articulated Human Generation with Layered Surface Volumes

    Authors: Yinghao Xu, Wang Yifan, Alexander W. Bergman, Menglei Chai, Bolei Zhou, Gordon Wetzstein

    Abstract: Access to high-quality and diverse 3D articulated digital human assets is crucial in various applications, ranging from virtual reality to social platforms. Generative approaches, such as 3D generative adversarial networks (GANs), are rapidly replacing laborious manual content creation tools. However, existing 3D GAN frameworks typically rely on scene representations that leverage either template… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Project page: https://www.computationalimaging.org/publications/lsv/ Demo: https://www.youtube.com/watch?v=vahgMFCM3j4

  4. arXiv:2307.04859  [pdf, other

    cs.CV cs.GR cs.LG

    Articulated 3D Head Avatar Generation using Text-to-Image Diffusion Models

    Authors: Alexander W. Bergman, Wang Yifan, Gordon Wetzstein

    Abstract: The ability to generate diverse 3D articulated head avatars is vital to a plethora of applications, including augmented reality, cinematography, and education. Recent work on text-guided 3D object generation has shown great promise in addressing these needs. These methods directly leverage pre-trained 2D text-to-image diffusion models to generate 3D-multi-view-consistent radiance fields of generic… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Project website: http://www.computationalimaging.org/publications/articulated-diffusion/

  5. arXiv:2304.02602  [pdf, other

    cs.CV cs.AI cs.GR

    Generative Novel View Synthesis with 3D-Aware Diffusion Models

    Authors: Eric R. Chan, Koki Nagano, Matthew A. Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, Gordon Wetzstein

    Abstract: We present a diffusion-based model for 3D-aware generative novel view synthesis from as few as a single input image. Our model samples from the distribution of possible renderings consistent with the input and, even in the presence of ambiguity, is capable of rendering diverse and plausible novel views. To achieve this, our method makes use of existing 2D diffusion backbones but, crucially, incorp… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Project page: https://nvlabs.github.io/genvs

  6. arXiv:2303.04291  [pdf, other

    eess.IV cs.CV

    Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition

    Authors: Cindy M. Nguyen, Eric R. Chan, Alexander W. Bergman, Gordon Wetzstein

    Abstract: Capturing images is a key part of automation for high-level tasks such as scene text recognition. Low-light conditions pose a challenge for high-level perception stacks, which are often optimized on well-lit, artifact-free images. Reconstruction methods for low-light images can produce well-lit counterparts, but typically at the cost of high-frequency details critical for downstream tasks. We prop… ▽ More

    Submitted 30 October, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: WACV 2024. Project website: https://ccnguyen.github.io/diffusion-in-the-dark/

  7. arXiv:2206.14314  [pdf, other

    cs.CV cs.GR

    Generative Neural Articulated Radiance Fields

    Authors: Alexander W. Bergman, Petr Kellnhofer, Wang Yifan, Eric R. Chan, David B. Lindell, Gordon Wetzstein

    Abstract: Unsupervised learning of 3D-aware generative adversarial networks (GANs) using only collections of single-view 2D photographs has very recently made much progress. These 3D GANs, however, have not been demonstrated for human bodies and the generated radiance fields of existing frameworks are not directly editable, limiting their applicability in downstream tasks. We propose a solution to these cha… ▽ More

    Submitted 9 January, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: Project website: http://www.computationalimaging.org/publications/gnarf/

  8. arXiv:2106.14942  [pdf, other

    cs.CV cs.GR cs.LG

    Fast Training of Neural Lumigraph Representations using Meta Learning

    Authors: Alexander W. Bergman, Petr Kellnhofer, Gordon Wetzstein

    Abstract: Novel view synthesis is a long-standing problem in machine learning and computer vision. Significant progress has recently been made in developing neural scene representations and rendering techniques that synthesize photorealistic images from arbitrary views. These representations, however, are extremely slow to train and often also slow to render. Inspired by neural variants of image-based rende… ▽ More

    Submitted 26 October, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Project website: http://www.computationalimaging.org/publications/metanlr/

  9. ScanGAN360: A Generative Model of Realistic Scanpaths for 360$^{\circ}$ Images

    Authors: Daniel Martin, Ana Serrano, Alexander W. Bergman, Gordon Wetzstein, Belen Masia

    Abstract: Understanding and modeling the dynamics of human gaze behavior in 360$^\circ$ environments is a key challenge in computer vision and virtual reality. Generative adversarial approaches could alleviate this challenge by generating a large number of possible scanpaths for unseen images. Existing methods for scanpath generation, however, do not adequately predict realistic scanpaths for 360$^\circ$ im… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Journal ref: IEEE Transactions on Visualization and Computer Graphics 2022

  10. arXiv:2006.09661  [pdf, other

    cs.CV cs.LG eess.IV

    Implicit Neural Representations with Periodic Activation Functions

    Authors: Vincent Sitzmann, Julien N. P. Martel, Alexander W. Bergman, David B. Lindell, Gordon Wetzstein

    Abstract: Implicitly defined, continuous, differentiable signal representations parameterized by neural networks have emerged as a powerful paradigm, offering many possible benefits over conventional representations. However, current network architectures for such implicit neural representations are incapable of modeling signals with fine detail, and fail to represent a signal's spatial and temporal derivat… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: Project website: https://vsitzmann.github.io/siren/ Project video: https://youtu.be/Q2fLWGBeaiI

  11. arXiv:2001.02748  [pdf, other

    cs.IT

    Rate-Constrained Shaping Codes for Structured Sources

    Authors: Yi Liu, Pengfei Huang, Alexander W. Bergman, Paul H. Siegel

    Abstract: Shaping codes are used to encode information for use on channels with cost constraints. Applications include data transmission with a power constraint and, more recently, data storage on flash memories with a constraint on memory cell wear. In the latter application, system requirements often impose a rate constraint. In this paper, we study rate-constrained fixed-to-variable length shaping codes… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.