Skip to main content

Showing 1–11 of 11 results for author: Hinder, O

  1. arXiv:2404.00666  [pdf, other

    cs.LG math.OC

    Accelerated Parameter-Free Stochastic Optimization

    Authors: Itai Kreisler, Maor Ivgi, Oliver Hinder, Yair Carmon

    Abstract: We propose a method that achieves near-optimal rates for smooth stochastic convex optimization and requires essentially no prior knowledge of problem parameters. This improves on prior work which requires knowing at least the initial distance to optimality d0. Our method, U-DoG, combines UniXGrad (Kavis et al., 2019) and DoG (Ivgi et al., 2023) with novel iterate stabilization techniques. It requi… ▽ More

    Submitted 5 July, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  2. arXiv:2402.10898  [pdf, other

    math.OC cs.LG stat.ML

    The Price of Adaptivity in Stochastic Convex Optimization

    Authors: Yair Carmon, Oliver Hinder

    Abstract: We prove impossibility results for adaptivity in non-smooth stochastic convex optimization. Given a set of problem parameters we wish to adapt to, we define a "price of adaptivity" (PoA) that, roughly speaking, measures the multiplicative increase in suboptimality due to uncertainty in these parameters. When the initial distance to the optimum is unknown but a gradient norm bound is known, we show… ▽ More

    Submitted 27 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2024; to appear in proceedings as an extended abstract

  3. arXiv:2310.19053  [pdf, other

    cs.LG physics.optics stat.ML

    Datasets and Benchmarks for Nanophotonic Structure and Parametric Design Simulations

    Authors: Jungtaek Kim, Mingxuan Li, Oliver Hinder, Paul W. Leu

    Abstract: Nanophotonic structures have versatile applications including solar cells, anti-reflective coatings, electromagnetic interference shielding, optical filters, and light emitting diodes. To design and understand these nanophotonic structures, electrodynamic simulations are essential. These simulations enable us to model electromagnetic fields over time and calculate optical properties. In this work,… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 31 pages, 31 figures, 4 tables. Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023), Datasets and Benchmarks Track

  4. arXiv:2302.12022  [pdf, other

    cs.LG math.OC

    DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule

    Authors: Maor Ivgi, Oliver Hinder, Yair Carmon

    Abstract: We propose a tuning-free dynamic SGD step size formula, which we call Distance over Gradients (DoG). The DoG step sizes depend on simple empirical quantities (distance from the initial point and norms of gradients) and have no ``learning rate'' parameter. Theoretically, we show that a slight variation of the DoG formula enjoys strong parameter-free convergence guarantees for stochastic convex opti… ▽ More

    Submitted 16 July, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: To appear at ICML 2023

  5. arXiv:2209.00809  [pdf, other

    math.OC cs.DS cs.LG stat.ML

    Optimal Diagonal Preconditioning

    Authors: Zhaonan Qu, Wenzhi Gao, Oliver Hinder, Yinyu Ye, Zhengyuan Zhou

    Abstract: Preconditioning has long been a staple technique in optimization, often applied to reduce the condition number of a matrix and speed up the convergence of algorithms. Although there are many popular preconditioning techniques in practice, most lack guarantees on reductions in condition number. Moreover, the degree to which we can improve over existing heuristic preconditioners remains an important… ▽ More

    Submitted 4 November, 2022; v1 submitted 2 September, 2022; originally announced September 2022.

  6. arXiv:2205.02160  [pdf, other

    math.OC cs.LG stat.ML

    Making SGD Parameter-Free

    Authors: Yair Carmon, Oliver Hinder

    Abstract: We develop an algorithm for parameter-free stochastic convex optimization (SCO) whose rate of convergence is only a double-logarithmic factor larger than the optimal rate for the corresponding known-parameter setting. In contrast, the best previously known rates for parameter-free SCO are based on online parameter-free regret bounds, which contain unavoidable excess logarithmic terms compared to t… ▽ More

    Submitted 1 March, 2024; v1 submitted 4 May, 2022; originally announced May 2022.

  7. arXiv:2010.14322  [pdf, other

    math.OC cs.AI cs.LG cs.NE

    An efficient nonconvex reformulation of stagewise convex optimization problems

    Authors: Rudy Bunel, Oliver Hinder, Srinadh Bhojanapalli, Krishnamurthy, Dvijotham

    Abstract: Convex optimization problems with staged structure appear in several contexts, including optimal control, verification of deep neural networks, and isotonic regression. Off-the-shelf solvers can solve these problems but may scale poorly. We develop a nonconvex reformulation designed to exploit this staged structure. Our reformulation has only simple bound constraints, enabling solution via project… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: First and second authors made equal contribution. To appear in Neurips 2020

  8. arXiv:1906.11985  [pdf, other

    math.OC cs.CC cs.DS cs.LG stat.ML

    Near-Optimal Methods for Minimizing Star-Convex Functions and Beyond

    Authors: Oliver Hinder, Aaron Sidford, Nimit S. Sohoni

    Abstract: In this paper, we provide near-optimal accelerated first-order methods for minimizing a broad class of smooth nonconvex functions that are strictly unimodal on all lines through a minimizer. This function class, which we call the class of smooth quasar-convex functions, is parameterized by a constant $γ\in (0,1]$, where $γ= 1$ encompasses the classes of smooth convex and star-convex functions, and… ▽ More

    Submitted 24 February, 2023; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: 48 pages. Published as a conference paper at COLT 2020

  9. arXiv:1807.00404  [pdf, other

    math.OC cs.CC

    Worst-case iteration bounds for log barrier methods on problems with nonconvex constraints

    Authors: Oliver Hinder, Yinyu Ye

    Abstract: Interior point methods (IPMs) that handle nonconvex constraints such as IPOPT, KNITRO and LOQO have had enormous practical success. We consider IPMs in the setting where the objective and constraints are thrice differentiable, and have Lipschitz first and second derivatives on the feasible region. We provide an IPM that, starting from a strictly feasible point, finds a $μ$-approximate Fritz John p… ▽ More

    Submitted 3 November, 2023; v1 submitted 1 July, 2018; originally announced July 2018.

    Comments: Accepted for publication in Mathematics of Operations Research. Note that several results were removed from the previous version most notably the results on convex case. These results were removed due to reviewer suggestions to focus the paper on the most significant contributions. These results still appear in the first author's PhD thesis (Principled Algorithms for Finding Local Minima)

  10. arXiv:1805.08370  [pdf, other

    math.OC cs.CC

    Cutting plane methods can be extended into nonconvex optimization

    Authors: Oliver Hinder

    Abstract: We show that it is possible to obtain an $O(ε^{-4/3})$ expected runtime --- including computational cost --- for finding $ε$-stationary points of smooth nonconvex functions using cutting plane methods. This improves on the best-known epsilon dependence achieved by cubic regularized Newton of $O(ε^{-3/2})$ as proved by Nesterov and Polyak (2006). Our techniques utilize the convex until proven guilt… ▽ More

    Submitted 26 June, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

  11. arXiv:1611.00756  [pdf, other

    math.OC cs.DS

    Accelerated Methods for Non-Convex Optimization

    Authors: Yair Carmon, John C. Duchi, Oliver Hinder, Aaron Sidford

    Abstract: We present an accelerated gradient method for non-convex optimization problems with Lipschitz continuous first and second derivatives. The method requires time $O(ε^{-7/4} \log(1/ ε) )$ to find an $ε$-stationary point, meaning a point $x$ such that $\|\nabla f(x)\| \le ε$. The method improves upon the $O(ε^{-2} )$ complexity of gradient descent and provides the additional second-order guarantee th… ▽ More

    Submitted 2 February, 2017; v1 submitted 2 November, 2016; originally announced November 2016.