Skip to main content

Showing 1–32 of 32 results for author: Fletcher, K

  1. arXiv:2305.08277  [pdf, other

    cs.LG stat.ML

    Local Convergence of Gradient Descent-Ascent for Training Generative Adversarial Networks

    Authors: Evan Becker, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Generative Adversarial Networks (GANs) are a popular formulation to train generative models for complex high dimensional data. The standard method for training GANs involves a gradient descent-ascent (GDA) procedure on a minimax optimization problem. This procedure is hard to analyze in general due to the nonlinear nature of the dynamics. We study the local dynamics of GDA for training a GAN with… ▽ More

    Submitted 29 May, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

  2. arXiv:2212.13631   

    cs.AI

    Proceedings of AAAI 2022 Fall Symposium: The Role of AI in Responding to Climate Challenges

    Authors: Feras A. Batarseh, Priya L. Donti, Ján Drgoňa, Kristen Fletcher, Pierre-Adrien Hanania, Melissa Hatton, Srinivasan Keshav, Bran Knowles, Raphaela Kotsch, Sean McGinnis, Peetak Mitra, Alex Philp, Jim Spohrer, Frank Stein, Meghna Tare, Svitlana Volkov, Gege Wen

    Abstract: Climate change is one of the most pressing challenges of our time, requiring rapid action across society. As artificial intelligence tools (AI) are rapidly deployed, it is therefore crucial to understand how they will impact climate action. On the one hand, AI can support applications in climate change mitigation (reducing or preventing greenhouse gas emissions), adaptation (preparing for the effe… ▽ More

    Submitted 29 January, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

  3. arXiv:2208.09938  [pdf, other

    cs.LG

    Instability and Local Minima in GAN Training with Kernel Discriminators

    Authors: Evan Becker, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Generative Adversarial Networks (GANs) are a widely-used tool for generative modeling of complex data. Despite their empirical success, the training of GANs is not fully understood due to the min-max optimization of the generator and discriminator. This paper analyzes these joint dynamics when the true samples, as well as the generated samples, are discrete, finite sets, and the discriminator is k… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

  4. arXiv:2201.08082  [pdf, other

    stat.ML cs.LG

    Kernel Methods and Multi-layer Perceptrons Learn Linear Models in High Dimensions

    Authors: Mojtaba Sahraee-Ardakan, Melikasadat Emami, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Empirical observation of high dimensional phenomena, such as the double descent behaviour, has attracted a lot of interest in understanding classical techniques such as kernel methods, and their implications to explain generalization properties of neural networks. Many recent works analyze such models in a certain high-dimensional regime where the covariates are independent and the number of sampl… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  5. arXiv:2103.04557  [pdf, other

    stat.ML cs.LG

    Asymptotics of Ridge Regression in Convolutional Models

    Authors: Mojtaba Sahraee-Ardakan, Tung Mai, Anup Rao, Ryan Rossi, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Understanding generalization and estimation error of estimators for simple models such as linear and generalized linear models has attracted a lot of attention recently. This is in part due to an interesting observation made in machine learning community that highly over-parameterized neural networks achieve zero training error, and yet they are able to generalize well over the test samples. This… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

  6. arXiv:2101.07833  [pdf, ps, other

    cs.LG cs.NE eess.SY stat.ML

    Implicit Bias of Linear RNNs

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Contemporary wisdom based on empirical studies suggests that standard recurrent neural networks (RNNs) do not perform well on tasks requiring long-term memory. However, precise reasoning for this behavior is still unknown. This paper provides a rigorous explanation of this property in the special case of linear RNNs. Although this work is limited to linear RNNs, even these systems have traditional… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: 30 pages, 4 figures

  7. arXiv:2005.05053  [pdf, other

    q-bio.NC cs.LG cs.NE eess.SP stat.ML

    Low-Rank Nonlinear Decoding of $μ$-ECoG from the Primary Auditory Cortex

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Alyson K. Fletcher, Sundeep Rangan, Michael Trumpis, Brinnae Bent, Chia-Han Chiang, Jonathan Viventi

    Abstract: This paper considers the problem of neural decoding from parallel neural measurements systems such as micro-electrocorticography ($μ$-ECoG). In systems with large numbers of array elements at very high sampling rates, the dimension of the raw measurement data may be large. Learning neural decoders for this high-dimensional data can be challenging, particularly when the number of training samples i… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: 4 pages, 3 figures

  8. arXiv:2005.00180  [pdf, other

    cs.LG stat.ML

    Generalization Error of Generalized Linear Models in High Dimensions

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our understanding of their generalization capabilities is incomplete. This task is made harder by the non-convexity of the underlying learning problems. We provide a general… ▽ More

    Submitted 30 April, 2020; originally announced May 2020.

    Comments: 20 pages, 4 figures

  9. arXiv:2001.09396  [pdf, other

    cs.LG cs.IT cs.NE eess.SP stat.ML

    Inference in Multi-Layer Networks with Matrix-Valued Unknowns

    Authors: Parthe Pandit, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter, Alyson K. Fletcher

    Abstract: We consider the problem of inferring the input and hidden variables of a stochastic multi-layer neural network from an observation of the output. The hidden variables in each layer are represented as matrices. This problem applies to signal recovery via deep generative prior models, multi-task and mixed regression and learning certain classes of two-layer neural networks. A unified approximation a… ▽ More

    Submitted 25 January, 2020; originally announced January 2020.

    Comments: 3 figures, 6 pages (two-column) + Appendix. arXiv admin note: text overlap with arXiv:1911.03409

  10. arXiv:1911.03409  [pdf, other

    cs.LG cs.IT cs.NE eess.SP stat.ML

    Inference with Deep Generative Priors in High Dimensions

    Authors: Parthe Pandit, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter, Alyson K. Fletcher

    Abstract: Deep generative priors offer powerful models for complex-structured data, such as images, audio, and text. Using these priors in inverse problems typically requires estimating the input and/or hidden signals in a multi-layer deep neural network from observation of its output. While these approaches have been successful in practice, rigorous performance analysis is complicated by the non-convex nat… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 50 pages, double-spaced

  11. arXiv:1910.13672  [pdf, other

    cs.LG stat.ML

    Input-Output Equivalence of Unitary and Contractive RNNs

    Authors: M. Emami, M. Sahraee-Ardakan, S. Rangan, A. K. Fletcher

    Abstract: Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This work shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice th… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  12. arXiv:1903.09631  [pdf, other

    math.ST cs.LG eess.SP stat.ML

    High-Dimensional Bernoulli Autoregressive Process with Long-Range Dependence

    Authors: Parthe Pandit, Mojtaba Sahraee-Ardakan, Arash A. Amini, Sundeep Rangan, Alyson K. Fletcher

    Abstract: We consider the problem of estimating the parameters of a multivariate Bernoulli process with auto-regressive feedback in the high-dimensional setting where the number of samples available is much less than the number of parameters. This problem arises in learning interconnections of networks of dynamical systems with spiking or binary-valued data. We allow the process to depend on its past up to… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: To appear at AISTATS 2019 titled "Sparse Multivariate Bernoulli Processes in High Dimensions"

    Journal ref: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019, Naha, Okinawa, Japan. PMLR: Volume 89

  13. arXiv:1903.01293  [pdf, other

    cs.IT stat.ML

    Asymptotics of MAP Inference in Deep Networks

    Authors: Parthe Pandit, Mojtaba Sahraee, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Deep generative priors are a powerful tool for reconstruction problems with complex data such as images and text. Inverse problems using such models require solving an inference problem of estimating the input and hidden units of the multi-layer network from its output. Maximum a priori (MAP) estimation is a widely-used inference method as it is straightforward to implement, and has been successfu… ▽ More

    Submitted 1 March, 2019; originally announced March 2019.

    Comments: 11 pages. arXiv admin note: text overlap with arXiv:1706.06549

  14. Bilinear Recovery using Adaptive Vector-AMP

    Authors: Subrata Sarkar, Alyson K. Fletcher, Sundeep Rangan, Philip Schniter

    Abstract: We consider the problem of jointly recovering the vector $\boldsymbol{b}$ and the matrix $\boldsymbol{C}$ from noisy measurements $\boldsymbol{Y} = \boldsymbol{A}(\boldsymbol{b})\boldsymbol{C} + \boldsymbol{W}$, where $\boldsymbol{A}(\cdot)$ is a known affine linear function of $\boldsymbol{b}$ (i.e., $\boldsymbol{A}(\boldsymbol{b})=\boldsymbol{A}_0+\sum_{i=1}^Q b_i \boldsymbol{A}_i$ with known ma… ▽ More

    Submitted 8 March, 2019; v1 submitted 31 August, 2018; originally announced September 2018.

  15. Plug-in Estimation in High-Dimensional Linear Inverse Problems: A Rigorous Analysis

    Authors: Alyson K. Fletcher, Sundeep Rangan, Subrata Sarkar, Philip Schniter

    Abstract: Estimating a vector $\mathbf{x}$ from noisy linear measurements $\mathbf{Ax}+\mathbf{w}$ often requires use of prior knowledge or structural constraints on $\mathbf{x}$ for accurate reconstruction. Several recent works have considered combining linear least-squares estimation with a generic or "plug-in" denoiser function that can be designed in a modular manner based on the prior knowledge about… ▽ More

    Submitted 1 August, 2018; v1 submitted 27 June, 2018; originally announced June 2018.

  16. arXiv:1706.06549  [pdf, other

    cs.LG cs.IT stat.ML

    Inference in Deep Networks in High Dimensions

    Authors: Alyson K. Fletcher, Sundeep Rangan

    Abstract: Deep generative networks provide a powerful tool for modeling complex data in a wide range of applications. In inverse problems that use these networks as generative priors on data, one must often perform inference of the inputs of the networks from the outputs. Inference is also required for sampling during stochastic training on these generative models. This paper considers inference in a deep s… ▽ More

    Submitted 20 June, 2017; originally announced June 2017.

    Comments: 27 pages

  17. arXiv:1706.06054  [pdf, other

    cs.IT cs.LG

    Rigorous Dynamics and Consistent Estimation in Arbitrarily Conditioned Linear Systems

    Authors: Alyson K. Fletcher, Mojtaba Sahraee-Ardakan, Philip Schniter, Sundeep Rangan

    Abstract: The problem of estimating a random vector x from noisy linear measurements y = A x + w with unknown parameters on the distributions of x and w, which must also be learned, arises in a wide range of statistical learning and linear inverse problems. We show that a computationally simple iterative message-passing algorithm can provably obtain asymptotically consistent estimates in a certain high-dime… ▽ More

    Submitted 19 June, 2017; originally announced June 2017.

  18. arXiv:1612.01186  [pdf, ps, other

    cs.IT

    Vector Approximate Message Passing for the Generalized Linear Model

    Authors: Philip Schniter, Sundeep Rangan, Alyson K. Fletcher

    Abstract: The generalized linear model (GLM), where a random vector $\boldsymbol{x}$ is observed through a noisy, possibly nonlinear, function of a linear transform output $\boldsymbol{z}=\boldsymbol{Ax}$, arises in a range of applications such as robust regression, binary classification, quantized compressed sensing, phase retrieval, photon-limited imaging, and inference from neural spike trains. When… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  19. arXiv:1610.03082  [pdf, ps, other

    cs.IT

    Vector Approximate Message Passing

    Authors: Sundeep Rangan, Philip Schniter, Alyson K. Fletcher

    Abstract: The standard linear regression (SLR) problem is to recover a vector $\mathbf{x}^0$ from noisy linear observations $\mathbf{y}=\mathbf{Ax}^0+\mathbf{w}$. The approximate message passing (AMP) algorithm recently proposed by Donoho, Maleki, and Montanari is a computationally efficient iterative approach to SLR that has a remarkable property: for large i.i.d.\ sub-Gaussian matrices $\mathbf{A}$, its p… ▽ More

    Submitted 12 June, 2018; v1 submitted 10 October, 2016; originally announced October 2016.

  20. arXiv:1602.08207  [pdf, ps, other

    cs.IT stat.ML

    Learning and Free Energies for Vector Approximate Message Passing

    Authors: Alyson K. Fletcher, Philip Schniter

    Abstract: Vector approximate message passing (VAMP) is a computationally simple approach to the recovery of a signal $\mathbf{x}$ from noisy linear measurements $\mathbf{y}=\mathbf{Ax}+\mathbf{w}$. Like the AMP proposed by Donoho, Maleki, and Montanari in 2009, VAMP is characterized by a rigorous state evolution (SE) that holds under certain large random matrices and that matches the replica prediction of o… ▽ More

    Submitted 8 March, 2018; v1 submitted 26 February, 2016; originally announced February 2016.

  21. arXiv:1602.07795  [pdf, ps, other

    cs.IT stat.ML

    Expectation Consistent Approximate Inference: Generalizations and Convergence

    Authors: Alyson K. Fletcher, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter

    Abstract: Approximations of loopy belief propagation, including expectation propagation and approximate message passing, have attracted considerable attention for probabilistic inference problems. This paper proposes and analyzes a generalization of Opper and Winther's expectation consistent (EC) approximate inference method. The proposed method, called Generalized Expectation Consistency (GEC), can be appl… ▽ More

    Submitted 24 January, 2017; v1 submitted 25 February, 2016; originally announced February 2016.

    Comments: 10 pages

  22. arXiv:1501.01797  [pdf, other

    cs.IT

    Inference for Generalized Linear Models via Alternating Directions and Bethe Free Energy Minimization

    Authors: Sundeep Rangan, Alyson K. Fletcher, Philip Schniter, Ulugbek Kamilov

    Abstract: Generalized Linear Models (GLMs), where a random vector $\mathbf{x}$ is observed through a noisy, possibly nonlinear, function of a linear transform $\mathbf{z}=\mathbf{Ax}$ arise in a range of applications in nonlinear filtering and regression. Approximate Message Passing (AMP) methods, based on loopy belief propagation, are a promising class of approaches for approximate inference in these model… ▽ More

    Submitted 2 May, 2016; v1 submitted 8 January, 2015; originally announced January 2015.

  23. arXiv:1409.0289  [pdf, other

    cs.IT

    Scalable Inference for Neuronal Connectivity from Calcium Imaging

    Authors: Alyson K. Fletcher, Sundeep Rangan

    Abstract: Fluorescent calcium imaging provides a potentially powerful tool for inferring connectivity in neural circuits with up to thousands of neurons. However, a key challenge in using calcium imaging for connectivity detection is that current systems often have a temporal response and frame rate that can be orders of magnitude slower than the underlying neural spiking process. Bayesian inference methods… ▽ More

    Submitted 3 September, 2014; v1 submitted 1 September, 2014; originally announced September 2014.

    Comments: 14 pages, 3 figures

  24. arXiv:1402.3210  [pdf, ps, other

    cs.IT

    On the Convergence of Approximate Message Passing with Arbitrary Matrices

    Authors: Sundeep Rangan, Philip Schniter, Alyson K. Fletcher, Subrata Sarkar

    Abstract: Approximate message passing (AMP) methods and their variants have attracted considerable recent attention for the problem of estimating a random vector $\mathbf{x}$ observed through a linear transform $\mathbf{A}$. In the case of large i.i.d. zero-mean Gaussian $\mathbf{A}$, the methods exhibit fast convergence with precise analytic characterizations on the algorithm behavior. However, the converg… ▽ More

    Submitted 2 March, 2018; v1 submitted 13 February, 2014; originally announced February 2014.

  25. arXiv:1207.3859  [pdf, other

    cs.IT cs.LG

    Approximate Message Passing with Consistent Parameter Estimation and Applications to Sparse Learning

    Authors: Ulugbek S. Kamilov, Sundeep Rangan, Alyson K. Fletcher, Michael Unser

    Abstract: We consider the estimation of an i.i.d. (possibly non-Gaussian) vector $\xbf \in \R^n$ from measurements $\ybf \in \R^m$ obtained by a general cascade model consisting of a known linear transform followed by a probabilistic componentwise (possibly nonlinear) measurement channel. A novel method, called adaptive generalized approximate message passing (Adaptive GAMP), that enables joint learning of… ▽ More

    Submitted 1 December, 2012; v1 submitted 16 July, 2012; originally announced July 2012.

    Comments: 14 pages, 3 figures

  26. arXiv:1202.2759  [pdf, ps, other

    cs.IT

    Iterative Reconstruction of Rank-One Matrices in Noise

    Authors: Alyson K. Fletcher, Sundeep Rangan

    Abstract: We consider the problem of estimating a rank-one matrix in Gaussian noise under a probabilistic model for the left and right factors of the matrix. The probabilistic model can impose constraints on the factors including sparsity and positivity that arise commonly in learning problems. We propose a family of algorithms that reduce the problem to a sequence of scalar estimation computations. These a… ▽ More

    Submitted 15 September, 2015; v1 submitted 13 February, 2012; originally announced February 2012.

    Comments: 28 pages, 2 figures

  27. arXiv:1111.2581  [pdf, ps, other

    cs.IT

    Hybrid Approximate Message Passing

    Authors: Sundeep Rangan, Alyson K. Fletcher, Vivek K. Goyal, Evan Byrne, Philip Schniter

    Abstract: Gaussian and quadratic approximations of message passing algorithms on graphs have attracted considerable recent attention due to their computational simplicity, analytic tractability, and wide applicability in optimization and statistical inference problems. This paper presents a systematic framework for incorporating such approximate message passing (AMP) methods in general graphical models. The… ▽ More

    Submitted 23 March, 2017; v1 submitted 10 November, 2011; originally announced November 2011.

  28. Ranked Sparse Signal Support Detection

    Authors: Alyson K. Fletcher, Sundeep Rangan, Vivek K Goyal

    Abstract: This paper considers the problem of detecting the support (sparsity pattern) of a sparse vector from random noisy measurements. Conditional power of a component of the sparse vector is defined as the energy conditioned on the component being nonzero. Analysis of a simplified version of orthogonal matching pursuit (OMP) called sequential OMP (SequOMP) demonstrates the importance of knowledge of the… ▽ More

    Submitted 27 October, 2011; originally announced October 2011.

    Comments: 13 pages

    Journal ref: IEEE Trans. on Signal Processing, vol. 60, no. 11, pp. 5919-5931, November 2012

  29. Orthogonal Matching Pursuit: A Brownian Motion Analysis

    Authors: Alyson K. Fletcher, Sundeep Rangan

    Abstract: A well-known analysis of Tropp and Gilbert shows that orthogonal matching pursuit (OMP) can recover a k-sparse n-dimensional real vector from 4 k log(n) noise-free linear measurements obtained through a random Gaussian measurement matrix with a probability that approaches one as n approaches infinity. This work strengthens this result by showing that a lower number of measurements, 2 k log(n - k),… ▽ More

    Submitted 29 May, 2011; originally announced May 2011.

    Comments: 11 pages, 2 figures

  30. Asymptotic Analysis of MAP Estimation via the Replica Method and Applications to Compressed Sensing

    Authors: Sundeep Rangan, Alyson K. Fletcher, Vivek K Goyal

    Abstract: The replica method is a non-rigorous but well-known technique from statistical physics used in the asymptotic analysis of large, random, nonlinear problems. This paper applies the replica method, under the assumption of replica symmetry, to study estimators that are maximum a posteriori (MAP) under a postulated prior distribution. It is shown that with random linear measurements and Gaussian noise… ▽ More

    Submitted 8 October, 2011; v1 submitted 17 June, 2009; originally announced June 2009.

    Comments: 22 pages; added details on the replica symmetry assumption

    Journal ref: IEEE Trans. on Information Theory, vol. 58, no. 3, pp. 1903-1923, March 2012

  31. arXiv:0903.1022  [pdf, ps, other

    cs.IT

    On-Off Random Access Channels: A Compressed Sensing Framework

    Authors: Alyson K. Fletcher, Sundeep Rangan, Vivek K Goyal

    Abstract: This paper considers a simple on-off random multiple access channel, where n users communicate simultaneously to a single receiver over m degrees of freedom. Each user transmits with probability lambda, where typically lambda n < m << n, and the receiver must detect which users transmitted. We show that when the codebook has i.i.d. Gaussian entries, detecting which users transmitted is mathemati… ▽ More

    Submitted 9 March, 2009; v1 submitted 5 March, 2009; originally announced March 2009.

    Comments: 18 pages, 5 figures; addition of inadvertently omitted support information and acknowledgments

  32. Necessary and Sufficient Conditions on Sparsity Pattern Recovery

    Authors: Alyson K. Fletcher, Sundeep Rangan, Vivek K. Goyal

    Abstract: The problem of detecting the sparsity pattern of a k-sparse vector in R^n from m random noisy measurements is of interest in many areas such as system identification, denoising, pattern recognition, and compressed sensing. This paper addresses the scaling of the number of measurements m, with signal dimension n and sparsity-level nonzeros k, for asymptotically-reliable detection. We show a neces… ▽ More

    Submitted 11 April, 2008; originally announced April 2008.

    Comments: Submitted to IEEE Transactions on Information Theory

    Journal ref: IEEE Trans. on Information Theory, vol. 55, no. 12, pp. 5758-5772, December 2009