Skip to main content

Showing 1–16 of 16 results for author: Mangoubi, O

  1. arXiv:2406.02502  [pdf, ps, other

    math.ST cs.DS math.NA math.PR

    Singular Subspace Perturbation Bounds via Rectangular Random Matrix Diffusions

    Authors: Peiyao Lai, Oren Mangoubi

    Abstract: Given a matrix $A \in \mathbb{R}^{m\times d}$ with singular values $σ_1\geq \cdots \geq σ_d$, and a random matrix $G \in \mathbb{R}^{m\times d}$ with iid $N(0,T)$ entries for some $T>0$, we derive new bounds on the Frobenius distance between subspaces spanned by the top-$k$ (right) singular vectors of $A$ and $A+G$. This problem arises in numerous applications in statistics where a data matrix may… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2306.16648  [pdf, other

    cs.DS cs.CR cs.LG math.NA math.PR

    Private Covariance Approximation and Eigenvalue-Gap Bounds for Complex Gaussian Perturbations

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: We consider the problem of approximating a $d \times d$ covariance matrix $M$ with a rank-$k$ matrix under $(\varepsilon,δ)$-differential privacy. We present and analyze a complex variant of the Gaussian mechanism and show that the Frobenius norm of the difference between the matrix output by this mechanism and the best rank-$k$ approximation to $M$ is bounded by roughly $\tilde{O}(\sqrt{kd})$, wh… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: This is the full version of a paper which was accepted to COLT 2023

  3. arXiv:2211.06418  [pdf, other

    cs.DS cs.CR cs.LG math.NA stat.ML

    Re-Analyze Gauss: Bounds for Private Matrix Approximation via Dyson Brownian Motion

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: Given a symmetric matrix $M$ and a vector $λ$, we present new bounds on the Frobenius-distance utility of the Gaussian mechanism for approximating $M$ by a matrix whose spectrum is $λ$, under $(\varepsilon,δ)$-differential privacy. Our bounds depend on both $λ$ and the gaps in the eigenvalues of $M$, and hold whenever the top $k+1$ eigenvalues of $M$ have sufficiently large gaps. When applied to t… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: This is the full version of a paper which was accepted to NeurIPS 2022

  4. arXiv:2207.02794  [pdf, ps, other

    cs.DS cs.CR cs.LG math.MG stat.ML

    Private Matrix Approximation and Geometry of Unitary Orbits

    Authors: Oren Mangoubi, Yikai Wu, Satyen Kale, Abhradeep Guha Thakurta, Nisheeth K. Vishnoi

    Abstract: Consider the following optimization problem: Given $n \times n$ matrices $A$ and $Λ$, maximize $\langle A, UΛU^*\rangle$ where $U$ varies over the unitary group $\mathrm{U}(n)$. This problem seeks to approximate $A$ by a matrix whose spectrum is the same as $Λ$ and, by setting $Λ$ to be appropriate diagonal matrices, one can recover matrix approximation problems such as PCA and rank-$k$ approximat… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of Thirty Fifth Conference on Learning Theory (COLT), PMLR 178:3547-3588, 2022

  5. arXiv:2206.09384  [pdf, ps, other

    cs.DS cs.LG math.PR stat.ML

    Sampling from Log-Concave Distributions over Polytopes via a Soft-Threshold Dikin Walk

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: Given a Lipschitz or smooth convex function $\, f:K \to \mathbb{R}$ for a bounded polytope $K \subseteq \mathbb{R}^d$ defined by $m$ inequalities, we consider the problem of sampling from the log-concave distribution $π(θ) \propto e^{-f(θ)}$ constrained to $K$. Interest in this problem derives from its applications to Bayesian inference and differentially private learning. Our main result is a gen… ▽ More

    Submitted 14 November, 2022; v1 submitted 19 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2111.04089

  6. arXiv:2111.04089  [pdf, other

    cs.DS cs.CR cs.LG math.PR stat.ML

    Sampling from Log-Concave Distributions with Infinity-Distance Guarantees

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: For a $d$-dimensional log-concave distribution $π(θ) \propto e^{-f(θ)}$ constrained to a convex body $K$, the problem of outputting samples from a distribution $ν$ which is $\varepsilon$-close in infinity-distance $\sup_{θ\in K} |\log \frac{ν(θ)}{π(θ)}|$ to $π$ arises in differentially private optimization. While sampling within total-variation distance $\varepsilon$ of $π$ can be done by algorith… ▽ More

    Submitted 11 November, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: This is the full version of a paper which was accepted to NeurIPS 2022

  7. arXiv:2104.08364  [pdf, other

    cs.DC cs.LG cs.PF

    Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep Learning

    Authors: Shijian Li, Oren Mangoubi, Lijie Xu, Tian Guo

    Abstract: Stochastic Gradient Descent (SGD) has become the de facto way to train deep neural networks in distributed clusters. A critical factor in determining the training throughput and model accuracy is the choice of the parameter synchronization protocol. For example, while Bulk Synchronous Parallel (BSP) often achieves better converged accuracy, the corresponding training throughput can be negatively i… ▽ More

    Submitted 19 April, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: 15 pages, 16 figures, 6 tables, ICDCS'21

  8. arXiv:2006.12376  [pdf, other

    cs.LG cs.DS math.OC stat.ML

    A Convergent and Dimension-Independent Min-Max Optimization Algorithm

    Authors: Vijay Keswani, Oren Mangoubi, Sushant Sachdeva, Nisheeth K. Vishnoi

    Abstract: We study a variant of a recently introduced min-max optimization framework where the max-player is constrained to update its parameters in a greedy manner until it reaches a first-order stationary point. Our equilibrium definition for this framework depends on a proposal distribution which the min-player uses to choose directions in which to update its parameters. We show that, given a smooth and… ▽ More

    Submitted 30 June, 2022; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: This is the full version of a paper accepted for presentation in ICML 2022. The code is available at https://github.com/vijaykeswani/Min-Max-Optimization-Algorithm

  9. arXiv:2006.12363  [pdf, other

    cs.DS cs.GT cs.LG math.OC stat.ML

    Greedy Adversarial Equilibrium: An Efficient Alternative to Nonconvex-Nonconcave Min-Max Optimization

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: Min-max optimization of an objective function $f: \mathbb{R}^d \times \mathbb{R}^d \rightarrow \mathbb{R}$ is an important model for robustness in an adversarial setting, with applications to many areas including optimization, economics, and deep learning. In many applications $f$ may be nonconvex-nonconcave, and finding a global min-max point may be computationally intractable. There is a long li… ▽ More

    Submitted 4 May, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Full version of a paper appearing in ACM Symposium on Theory of Computing (STOC) 2021

  10. arXiv:1905.01745  [pdf, ps, other

    cs.DS cs.LG math.PR stat.CO stat.ML

    Faster polytope rounding, sampling, and volume computation via a sublinear "Ball Walk"

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: We study the problem of "isotropically rounding" a polytope $K\subset\mathbb{R}^n$, that is, computing a linear transformation which makes the uniform distribution on the polytope have roughly identity covariance matrix. We assume $K$ is defined by $m$ linear inequalities, with guarantee that $rB\subset K\subset RB$, where $B$ is the unit ball. We introduce a new variant of the ball walk Markov ch… ▽ More

    Submitted 14 September, 2019; v1 submitted 5 May, 2019; originally announced May 2019.

    Comments: Accepted to IEEE Symposium on Foundations of Computer Science (FOCS) 2019

  11. arXiv:1902.08452  [pdf, ps, other

    cs.DS cs.LG math.PR stat.CO stat.ML

    Nonconvex sampling with the Metropolis-adjusted Langevin algorithm

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: The Langevin Markov chain algorithms are widely deployed methods to sample from distributions in challenging high-dimensional and non-convex statistics and machine learning applications. Despite this, current bounds for the Langevin algorithms are slower than those of competing algorithms in many important situations, for instance when sampling from weakly log-concave distributions, or when sampli… ▽ More

    Submitted 9 April, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

  12. arXiv:1902.08179  [pdf, other

    cs.LG cs.DS math.PR stat.CO stat.ML

    Online Sampling from Log-Concave Distributions

    Authors: Holden Lee, Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: Given a sequence of convex functions $f_0, f_1, \ldots, f_T$, we study the problem of sampling from the Gibbs distribution $π_t \propto e^{-\sum_{k=0}^tf_k}$ for each epoch $t$ in an online manner. Interest in this problem derives from applications in machine learning, Bayesian statistics, and optimization where, rather than obtaining all the observations at once, one constantly acquires new data,… ▽ More

    Submitted 4 December, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: 42 pages

    Journal ref: NeurIPS 2019

  13. arXiv:1808.03230  [pdf, other

    math.PR cs.LG stat.CO stat.ME stat.ML

    Does Hamiltonian Monte Carlo mix faster than a random walk on multimodal densities?

    Authors: Oren Mangoubi, Natesh S. Pillai, Aaron Smith

    Abstract: Hamiltonian Monte Carlo (HMC) is a very popular and generic collection of Markov chain Monte Carlo (MCMC) algorithms. One explanation for the popularity of HMC algorithms is their excellent performance as the dimension $d$ of the target becomes large: under conditions that are satisfied for many common statistical models, optimally-tuned HMC algorithms have a running time that scales like… ▽ More

    Submitted 4 September, 2018; v1 submitted 9 August, 2018; originally announced August 2018.

  14. arXiv:1802.08898  [pdf, other

    cs.DS cs.LG math.PR stat.CO stat.ML

    Dimensionally Tight Bounds for Second-Order Hamiltonian Monte Carlo

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: Hamiltonian Monte Carlo (HMC) is a widely deployed method to sample from high-dimensional distributions in Statistics and Machine learning. HMC is known to run very efficiently in practice and its popular second-order "leapfrog" implementation has long been conjectured to run in $d^{1/4}$ gradient evaluations. Here we show that this conjecture is true when sampling from strongly log-concave target… ▽ More

    Submitted 9 August, 2018; v1 submitted 24 February, 2018; originally announced February 2018.

  15. arXiv:1711.02621  [pdf, other

    cs.DS cs.LG math.OC stat.CO stat.ML

    Convex Optimization with Unbounded Nonconvex Oracles using Simulated Annealing

    Authors: Oren Mangoubi, Nisheeth K. Vishnoi

    Abstract: We consider the problem of minimizing a convex objective function $F$ when one can only evaluate its noisy approximation $\hat{F}$. Unless one assumes some structure on the noise, $\hat{F}$ may be an arbitrary nonconvex function, making the task of minimizing $F$ intractable. To overcome this, prior work has often focused on the case when $F(x)-\hat{F}(x)$ is uniformly-bounded. In this paper we st… ▽ More

    Submitted 18 June, 2018; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: To appear in COLT 2018

  16. arXiv:1609.02901  [pdf, other

    math.PR cs.DS math.DG stat.CO

    Rapid Mixing of Geodesic Walks on Manifolds with Positive Curvature

    Authors: Oren Mangoubi, Aaron Smith

    Abstract: We introduce a Markov chain for sampling from the uniform distribution on a Riemannian manifold $\mathcal{M}$, which we call the $\textit{geodesic walk}$. We prove that the mixing time of this walk on any manifold with positive sectional curvature $C_{x}(u,v)$ bounded both above and below by $0 < \mathfrak{m}_{2} \leq C_{x}(u,v) \leq \mathfrak{M}_2 < \infty$ is… ▽ More

    Submitted 26 November, 2017; v1 submitted 9 September, 2016; originally announced September 2016.

    Comments: To appear in the Annals of Applied Probability

    MSC Class: 60J05; 60J20; 68W20; 53D25