Skip to main content

Showing 1–6 of 6 results for author: Sountsov, P

  1. arXiv:2402.01915  [pdf, other

    cs.CV stat.CO

    Robust Inverse Graphics via Probabilistic Inference

    Authors: Tuan Anh Le, Pavel Sountsov, Matthew D. Hoffman, Ben Lee, Brian Patton, Rif A. Saurous

    Abstract: How do we infer a 3D scene from a single image in the presence of corruptions like rain, snow or fog? Straightforward domain randomization relies on knowing the family of corruptions ahead of time. Here, we propose a Bayesian approach-dubbed robust inverse graphics (RIG)-that relies on a strong scene prior and an uninformative uniform corruption prior, making it applicable to a wide range of corru… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML submission. Reworked main body, new appendix figures

  2. arXiv:2312.02179  [pdf, other

    cs.LG cs.AI cs.CL

    Training Chain-of-Thought via Latent-Variable Inference

    Authors: Du Phan, Matthew D. Hoffman, David Dohan, Sholto Douglas, Tuan Anh Le, Aaron Parisi, Pavel Sountsov, Charles Sutton, Sharad Vikram, Rif A. Saurous

    Abstract: Large language models (LLMs) solve problems more accurately and interpretably when instructed to work out the answer step by step using a ``chain-of-thought'' (CoT) prompt. One can also improve LLMs' performance on a specific task by supervised fine-tuning, i.e., by using gradient ascent on some tunable parameters to maximize the average log-likelihood of correct answers from a labeled training se… ▽ More

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 23 pages, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  3. arXiv:2210.17415  [pdf, other

    cs.CV cs.LG

    ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images

    Authors: Matthew D. Hoffman, Tuan Anh Le, Pavel Sountsov, Christopher Suter, Ben Lee, Vikash K. Mansinghka, Rif A. Saurous

    Abstract: The problem of inferring object shape from a single 2D image is underconstrained. Prior knowledge about what objects are plausible can help, but even given such prior knowledge there may still be uncertainty about the shapes of occluded parts of objects. Recently, conditional neural radiance field (NeRF) models have been developed that can learn to infer good point estimates of 3D models from sing… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 18 pages, 18 figures, 1 table; submitted to the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

    MSC Class: 62F15 (Primary) 68T45 (Secondary) ACM Class: G.3; I.5.1; I.4.10

  4. arXiv:2006.06897  [pdf, other

    stat.ML cs.LG

    MCMC Should Mix: Learning Energy-Based Model with Neural Transport Latent Space MCMC

    Authors: Erik Nijkamp, Ruiqi Gao, Pavel Sountsov, Srinivas Vasudevan, Bo Pang, Song-Chun Zhu, Ying Nian Wu

    Abstract: Learning energy-based model (EBM) requires MCMC sampling of the learned model as an inner loop of the learning algorithm. However, MCMC sampling of EBMs in high-dimensional data space is generally not mixing, because the energy function, which is usually parametrized by a deep network, is highly multi-modal in the data space. This is a serious handicap for both theory and practice of EBMs. In this… ▽ More

    Submitted 16 March, 2022; v1 submitted 11 June, 2020; originally announced June 2020.

  5. arXiv:2002.01184  [pdf, ps, other

    stat.CO cs.PL stat.ML

    tfp.mcmc: Modern Markov Chain Monte Carlo Tools Built for Modern Hardware

    Authors: Junpeng Lao, Christopher Suter, Ian Langmore, Cyril Chimisov, Ashish Saxena, Pavel Sountsov, Dave Moore, Rif A. Saurous, Matthew D. Hoffman, Joshua V. Dillon

    Abstract: Markov chain Monte Carlo (MCMC) is widely regarded as one of the most important algorithms of the 20th century. Its guarantees of asymptotic convergence, stability, and estimator-variance bounds using only unnormalized probability functions make it indispensable to probabilistic programming. In this paper, we introduce the TensorFlow Probability MCMC toolkit, and discuss some of the considerations… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: Based on extended abstract submitted to PROBPROG 2020

  6. arXiv:1606.03402  [pdf, other

    cs.AI cs.CL

    Length bias in Encoder Decoder Models and a Case for Global Conditioning

    Authors: Pavel Sountsov, Sunita Sarawagi

    Abstract: Encoder-decoder networks are popular for modeling sequences probabilistically in many applications. These models use the power of the Long Short-Term Memory (LSTM) architecture to capture the full dependence among variables, unlike earlier models like CRFs that typically assumed conditional independence among non-adjacent variables. However in practice encoder-decoder models exhibit a bias towards… ▽ More

    Submitted 21 September, 2016; v1 submitted 10 June, 2016; originally announced June 2016.