Skip to main content

Showing 1–5 of 5 results for author: Amir, I

  1. arXiv:2206.03098  [pdf, ps, other

    cs.LG

    Better Best of Both Worlds Bounds for Bandits with Switching Costs

    Authors: Idan Amir, Guy Azov, Tomer Koren, Roi Livni

    Abstract: We study best-of-both-worlds algorithms for bandits with switching cost, recently addressed by Rouyer, Seldin and Cesa-Bianchi, 2021. We introduce a surprisingly simple and effective algorithm that simultaneously achieves minimax optimal regret bound of $\mathcal{O}(T^{2/3})$ in the oblivious adversarial setting and a bound of $\mathcal{O}(\min\{\log (T)/Δ^2,T^{2/3}\})$ in the stochastically-const… ▽ More

    Submitted 2 November, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

  2. arXiv:2202.13328  [pdf, ps, other

    cs.LG math.OC

    Thinking Outside the Ball: Optimal Learning with Gradient Descent for Generalized Linear Stochastic Convex Optimization

    Authors: Idan Amir, Roi Livni, Nathan Srebro

    Abstract: We consider linear prediction with a convex Lipschitz loss, or more generally, stochastic convex optimization problems of generalized linear form, i.e.~where each instantaneous loss is a scalar convex function of a linear function. We show that in this setting, early stopped Gradient Descent (GD), without any explicit regularization or projection, ensures excess error at most $ε$ (compared to the… ▽ More

    Submitted 30 October, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

  3. arXiv:2107.00469  [pdf, ps, other

    math.OC cs.LG

    Never Go Full Batch (in Stochastic Convex Optimization)

    Authors: Idan Amir, Yair Carmon, Tomer Koren, Roi Livni

    Abstract: We study the generalization performance of $\text{full-batch}$ optimization algorithms for stochastic convex optimization: these are first-order methods that only access the exact gradient of the empirical risk (rather than gradients with respect to individual data points), that include a wide range of algorithms such as gradient descent, mirror descent, and their regularized and/or accelerated va… ▽ More

    Submitted 29 June, 2021; originally announced July 2021.

  4. arXiv:2102.01117  [pdf, ps, other

    cs.LG stat.ML

    SGD Generalizes Better Than GD (And Regularization Doesn't Help)

    Authors: Idan Amir, Tomer Koren, Roi Livni

    Abstract: We give a new separation result between the generalization performance of stochastic gradient descent (SGD) and of full-batch gradient descent (GD) in the fundamental stochastic convex optimization model. While for SGD it is well-known that $O(1/ε^2)$ iterations suffice for obtaining a solution with $ε$ excess expected risk, we show that with the same number of steps GD may overfit and emit a solu… ▽ More

    Submitted 29 June, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

    Journal ref: Conference on Learning Theory 2021

  5. arXiv:2002.10286  [pdf, other

    cs.LG stat.ML

    Prediction with Corrupted Expert Advice

    Authors: Idan Amir, Idan Attias, Tomer Koren, Roi Livni, Yishay Mansour

    Abstract: We revisit the fundamental problem of prediction with expert advice, in a setting where the environment is benign and generates losses stochastically, but the feedback observed by the learner is subject to a moderate adversarial corruption. We prove that a variant of the classical Multiplicative Weights algorithm with decreasing step sizes achieves constant regret in this setting and performs opti… ▽ More

    Submitted 20 October, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: NeurIPS 2020 Camera Ready

    Journal ref: Conference on Neural Information Processing Systems 2020