Skip to main content

Showing 1–50 of 70 results for author: Koren, T

  1. arXiv:2406.12406  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Fast Rates for Bandit PAC Multiclass Classification

    Authors: Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

    Abstract: We study multiclass PAC learning with bandit feedback, where inputs are classified into one of $K$ possible labels and feedback is limited to whether or not the predicted labels are correct. Our main contribution is in designing a novel learning algorithm for the agnostic $(\varepsilon,δ)$-PAC version of the problem, with sample complexity of… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.03620  [pdf, ps, other

    cs.LG cs.CR cs.DS math.OC stat.ML

    Private Online Learning via Lazy Algorithms

    Authors: Hilal Asi, Tomer Koren, Daogao Liu, Kunal Talwar

    Abstract: We study the problem of private online learning, specifically, online prediction from experts (OPE) and online convex optimization (OCO). We propose a new transformation that transforms lazy online learning algorithms into private algorithms. We apply our transformation for differentially private OPE and OCO using existing lazy algorithms for these problems. Our final algorithms obtain regret, whi… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2405.10027  [pdf, ps, other

    cs.LG cs.AI stat.ML

    The Real Price of Bandit Information in Multiclass Classification

    Authors: Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

    Abstract: We revisit the classical problem of multiclass classification with bandit feedback (Kakade, Shalev-Shwartz and Tewari, 2008), where each input classifies to one of $K$ possible labels and feedback is restricted to whether the predicted label is correct or not. Our primary inquiry is with regard to the dependency on the number of labels $K$, and whether $T$-step regret bounds in this setting can be… ▽ More

    Submitted 19 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  4. arXiv:2403.02873  [pdf, other

    cs.LG cs.DS math.PR

    A Note on High-Probability Analysis of Algorithms with Exponential, Sub-Gaussian, and General Light Tails

    Authors: Amit Attia, Tomer Koren

    Abstract: This short note describes a simple technique for analyzing probabilistic algorithms that rely on a light-tailed (but not necessarily bounded) source of randomization. We show that the analysis of such an algorithm can be reduced, in a black-box manner and with only a small loss in logarithmic factors, to an analysis of a simpler variant of the same algorithm that uses bounded random variables and… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages

  5. arXiv:2402.03126  [pdf, other

    cs.LG math.OC stat.ML

    How Free is Parameter-Free Stochastic Optimization?

    Authors: Amit Attia, Tomer Koren

    Abstract: We study the problem of parameter-free stochastic optimization, inquiring whether, and under what conditions, do fully parameter-free methods exist: these are methods that achieve convergence rates competitive with optimally tuned methods, without requiring significant knowledge of the true problem parameters. Existing parameter-free methods can only be considered ``partially'' parameter-free, as… ▽ More

    Submitted 18 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 28 pages

  6. arXiv:2401.12058  [pdf, ps, other

    cs.LG stat.ML

    The Dimension Strikes Back with Gradients: Generalization of Gradient Methods in Stochastic Convex Optimization

    Authors: Matan Schliserman, Uri Sherman, Tomer Koren

    Abstract: We study the generalization performance of gradient methods in the fundamental stochastic convex optimization setting, focusing on its dimension dependence. First, for full-batch gradient descent (GD) we give a construction of a learning problem in dimension $d=O(n^2)$, where the canonical version of GD (tuned for optimal performance of the empirical risk) trained with $n$ training examples conver… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  7. arXiv:2312.11788  [pdf, other

    cs.LG math.OC

    Faster Convergence with Multiway Preferences

    Authors: Aadirupa Saha, Vitaly Feldman, Tomer Koren, Yishay Mansour

    Abstract: We address the problem of convex optimization with preference feedback, where the goal is to minimize a convex function given a weaker form of comparison queries. Each query consists of two points and the dueling feedback returns a (noisy) single-bit binary comparison of the function values of the two queried points. Here we consider the sign-function-based comparison feedback model and analyze th… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  8. arXiv:2311.13877  [pdf, other

    cs.LG math.OC stat.ML

    Locally Optimal Descent for Dynamic Stepsize Scheduling

    Authors: Gilad Yehudai, Alon Cohen, Amit Daniely, Yoel Drori, Tomer Koren, Mariano Schain

    Abstract: We introduce a novel dynamic learning-rate scheduling scheme grounded in theory with the goal of simplifying the manual and time-consuming tuning of schedules in practice. Our approach is based on estimating the locally-optimal stepsize, guaranteeing maximal descent in the direction of the stochastic gradient of the current step. We first establish theoretical convergence bounds for our method wit… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  9. arXiv:2308.14642  [pdf, ps, other

    cs.LG

    Rate-Optimal Policy Optimization for Linear Markov Decision Processes

    Authors: Uri Sherman, Alon Cohen, Tomer Koren, Yishay Mansour

    Abstract: We study regret minimization in online episodic linear Markov Decision Processes, and obtain rate-optimal $\widetilde O (\sqrt K)$ regret where $K$ denotes the number of episodes. Our work is the first to establish the optimal (w.r.t.~$K$) rate of convergence in the stochastic setting with bandit feedback using a policy optimization based approach, and the first to establish the optimal (w.r.t.~… ▽ More

    Submitted 16 May, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

  10. arXiv:2303.01135  [pdf, ps, other

    cs.LG

    Tight Risk Bounds for Gradient Descent on Separable Data

    Authors: Matan Schliserman, Tomer Koren

    Abstract: We study the generalization properties of unregularized gradient methods applied to separable linear classification -- a setting that has received considerable attention since the pioneering work of Soudry et al. (2018). We establish tight upper and lower (population) risk bounds for gradient descent in this setting, for any smooth loss function, expressed in terms of its tail decay rate. Our boun… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  11. arXiv:2302.14154  [pdf, ps, other

    cs.LG cs.CR math.OC stat.ML

    Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime

    Authors: Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar

    Abstract: We consider online learning problems in the realizable setting, where there is a zero-loss solution, and propose new Differentially Private (DP) algorithms that obtain near-optimal regret bounds. For the problem of online prediction from experts, we design new algorithms that obtain near-optimal regret ${O} \big( \varepsilon^{-1} \log^{1.5}{d} \big)$ where $d$ is the number of experts. This signif… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  12. arXiv:2302.08783  [pdf, ps, other

    cs.LG math.OC stat.ML

    SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance

    Authors: Amit Attia, Tomer Koren

    Abstract: We study Stochastic Gradient Descent with AdaGrad stepsizes: a popular adaptive (self-tuning) method for first-order stochastic optimization. Despite being well studied, existing analyses of this method suffer from various shortcomings: they either assume some knowledge of the problem parameters, impose strong global Lipschitz conditions, or fail to give bounds that hold with high probability. We… ▽ More

    Submitted 11 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 27 pages

  13. arXiv:2301.13087  [pdf, ps, other

    cs.LG stat.ML

    Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation

    Authors: Uri Sherman, Tomer Koren, Yishay Mansour

    Abstract: We study reinforcement learning with linear function approximation and adversarially changing cost functions, a setup that has mostly been considered under simplifying assumptions such as full information feedback or exploratory conditions.We present a computationally efficient policy optimization algorithm for the challenging general setting of unknown dynamics and bandit feedback, featuring a co… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  14. arXiv:2210.13537  [pdf, ps, other

    cs.LG cs.CR math.OC stat.ML

    Private Online Prediction from Experts: Separations and Faster Rates

    Authors: Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar

    Abstract: Online prediction from experts is a fundamental problem in machine learning and several works have studied this problem under privacy constraints. We propose and analyze new algorithms for this problem that improve over the regret bounds of the best existing algorithms for non-adaptive adversaries. For approximate differential privacy, our algorithms achieve regret bounds of… ▽ More

    Submitted 29 June, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Removed the results for the realizable setting which we uploaded with additional results for that setting in a separate paper. Added a proof sketch for the lower bound

  15. arXiv:2210.02562  [pdf, ps, other

    math.OC cs.LG

    Dueling Convex Optimization with General Preferences

    Authors: Aadirupa Saha, Tomer Koren, Yishay Mansour

    Abstract: We address the problem of \emph{convex optimization with dueling feedback}, where the goal is to minimize a convex function given a weaker form of \emph{dueling} feedback. Each query consists of two points and the dueling feedback returns a (noisy) single-bit binary comparison of the function values of the two queried points. The translation of the function values to the single comparison bit is t… ▽ More

    Submitted 27 September, 2022; originally announced October 2022.

  16. arXiv:2207.14211  [pdf, ps, other

    cs.LG cs.AI cs.GT stat.ML

    Regret Minimization and Convergence to Equilibria in General-sum Markov Games

    Authors: Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour

    Abstract: An abundance of recent impossibility results establish that regret minimization in Markov games with adversarial opponents is both statistically and computationally intractable. Nevertheless, none of these results preclude the possibility of regret minimization under the assumption that all parties adopt the same learning procedure. In this work, we present the first (to our knowledge) algorithm f… ▽ More

    Submitted 8 August, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

  17. arXiv:2207.08257  [pdf, ps, other

    cs.LG math.OC stat.ML

    Uniform Stability for First-Order Empirical Risk Minimization

    Authors: Amit Attia, Tomer Koren

    Abstract: We consider the problem of designing uniformly stable first-order optimization algorithms for empirical risk minimization. Uniform stability is often used to obtain generalization error bounds for optimization algorithms, and we are interested in a general approach to achieve it. For Euclidean geometry, we suggest a black-box conversion which given a smooth optimization algorithm, produces a unifo… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: 18 pages, Proceedings of Thirty Fifth Conference on Learning Theory, PMLR 178:3313-3332, 2022

  18. arXiv:2206.03098  [pdf, ps, other

    cs.LG

    Better Best of Both Worlds Bounds for Bandits with Switching Costs

    Authors: Idan Amir, Guy Azov, Tomer Koren, Roi Livni

    Abstract: We study best-of-both-worlds algorithms for bandits with switching cost, recently addressed by Rouyer, Seldin and Cesa-Bianchi, 2021. We introduce a surprisingly simple and effective algorithm that simultaneously achieves minimax optimal regret bound of $\mathcal{O}(T^{2/3})$ in the oblivious adversarial setting and a bound of $\mathcal{O}(\min\{\log (T)/Δ^2,T^{2/3}\})$ in the stochastically-const… ▽ More

    Submitted 2 November, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

  19. arXiv:2206.01426  [pdf, ps, other

    cs.LG math.OC stat.ML

    Rate-Optimal Online Convex Optimization in Adaptive Linear Control

    Authors: Asaf Cassel, Alon Cohen, Tomer Koren

    Abstract: We consider the problem of controlling an unknown linear dynamical system under adversarially changing convex costs and full feedback of both the state and cost function. We present the first computationally-efficient algorithm that attains an optimal $\smash{\sqrt{T}}$-regret rate compared to the best stabilizing linear controller in hindsight, while avoiding stringent assumptions on the costs su… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2203.01170

  20. arXiv:2203.01170  [pdf, ps, other

    math.OC cs.LG stat.ML

    Efficient Online Linear Control with Stochastic Convex Costs and Unknown Dynamics

    Authors: Asaf Cassel, Alon Cohen, Tomer Koren

    Abstract: We consider the problem of controlling an unknown linear dynamical system under a stochastic convex cost and full feedback of both the state and cost function. We present a computationally efficient algorithm that attains an optimal $\sqrt{T}$ regret-rate compared to the best stabilizing linear controller in hindsight. In contrast to previous work, our algorithm is based on the Optimism in the Fac… ▽ More

    Submitted 22 June, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  21. arXiv:2202.13441  [pdf, ps, other

    cs.LG

    Stability vs Implicit Bias of Gradient Methods on Separable Data and Beyond

    Authors: Matan Schliserman, Tomer Koren

    Abstract: An influential line of recent work has focused on the generalization properties of unregularized gradient-based learning procedures applied to separable linear classification with exponentially-tailed loss functions. The ability of such methods to generalize well has been attributed to the their implicit bias towards large margin predictors, both asymptotically as well as in finite time. We give a… ▽ More

    Submitted 23 June, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: 37 pages

  22. arXiv:2202.13361  [pdf, other

    cs.LG math.OC stat.ML

    Benign Underfitting of Stochastic Gradient Descent

    Authors: Tomer Koren, Roi Livni, Yishay Mansour, Uri Sherman

    Abstract: We study to what extent may stochastic gradient descent (SGD) be understood as a "conventional" learning rule that achieves generalization performance by obtaining a good fit to training data. We consider the fundamental stochastic convex optimization framework, where (one pass, without-replacement) SGD is classically known to minimize the population risk at rate $O(1/\sqrt n)$, and prove that, su… ▽ More

    Submitted 12 January, 2023; v1 submitted 27 February, 2022; originally announced February 2022.

  23. arXiv:2111.08485  [pdf, other

    cs.CV

    Consistent Semantic Attacks on Optical Flow

    Authors: Tom Koren, Lior Talker, Michael Dinerstein, Roy J Jevnisek

    Abstract: We present a novel approach for semantically targeted adversarial attacks on Optical Flow. In such attacks the goal is to corrupt the flow predictions of a specific object category or instance. Usually, an attacker seeks to hide the adversarial perturbations in the input. However, a quick scan of the output reveals the attack. In contrast, our method helps to hide the attackers intent in the outpu… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Paper and supplementary material

  24. arXiv:2107.09572  [pdf, other

    cs.LG

    Best-of-All-Worlds Bounds for Online Learning with Feedback Graphs

    Authors: Liad Erez, Tomer Koren

    Abstract: We study the online learning with feedback graphs framework introduced by Mannor and Shamir (2011), in which the feedback received by the online learner is specified by a graph $G$ over the available actions. We develop an algorithm that simultaneously achieves regret bounds of the form: $\smash{\mathcal{O}(\sqrt{θ(G) T})}$ with adversarial losses; $\mathcal{O}(θ(G)\operatorname{polylog}{T})$ with… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  25. arXiv:2107.00469  [pdf, ps, other

    math.OC cs.LG

    Never Go Full Batch (in Stochastic Convex Optimization)

    Authors: Idan Amir, Yair Carmon, Tomer Koren, Roi Livni

    Abstract: We study the generalization performance of $\text{full-batch}$ optimization algorithms for stochastic convex optimization: these are first-order methods that only access the exact gradient of the empirical risk (rather than gradients with respect to individual data points), that include a wide range of algorithms such as gradient descent, mirror descent, and their regularized and/or accelerated va… ▽ More

    Submitted 29 June, 2021; originally announced July 2021.

  26. arXiv:2106.15207  [pdf, ps, other

    cs.LG math.OC stat.ML

    Optimal Rates for Random Order Online Optimization

    Authors: Uri Sherman, Tomer Koren, Yishay Mansour

    Abstract: We study online convex optimization in the random order model, recently proposed by \citet{garber2020online}, where the loss functions may be chosen by an adversary, but are then presented to the online algorithm in a uniformly random order. Focusing on the scenario where the cumulative loss function is (strongly) convex, yet individual loss functions are smooth but might be non-convex, we give al… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

  27. arXiv:2106.11879  [pdf, other

    math.OC cs.LG

    Asynchronous Stochastic Optimization Robust to Arbitrary Delays

    Authors: Alon Cohen, Amit Daniely, Yoel Drori, Tomer Koren, Mariano Schain

    Abstract: We consider stochastic optimization with delayed gradients where, at each time step $t$, the algorithm makes an update using a stale stochastic gradient from step $t - d_t$ for some arbitrary delay $d_t$. This setting abstracts asynchronous distributed optimization where a central server receives gradient updates computed by worker machines. These machines can experience computation and communicat… ▽ More

    Submitted 15 November, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

  28. arXiv:2106.02436  [pdf, other

    cs.LG

    Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions

    Authors: Tal Lancewicki, Shahar Segal, Tomer Koren, Yishay Mansour

    Abstract: We study the stochastic Multi-Armed Bandit (MAB) problem with random delays in the feedback received by the algorithm. We consider two settings: the reward-dependent delay setting, where realized delays may depend on the stochastic rewards, and the reward-independent delay setting. Our main contribution is algorithms that achieve near-optimal regret in each of the settings, with an additional addi… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 33 pages, 5 figures, ICML 2021

  29. arXiv:2103.01516  [pdf, ps, other

    cs.LG cs.CR math.OC stat.ML

    Private Stochastic Convex Optimization: Optimal Rates in $\ell_1$ Geometry

    Authors: Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar

    Abstract: Stochastic convex optimization over an $\ell_1$-bounded domain is ubiquitous in machine learning applications such as LASSO but remains poorly understood when learning with differential privacy. We show that, up to logarithmic factors the optimal excess population loss of any $(\varepsilon,δ)$-differentially private optimizer is $\sqrt{\log(d)/n} + \sqrt{d}/\varepsilon n.$ The upper bound is based… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

  30. arXiv:2102.12608  [pdf, ps, other

    cs.LG stat.ML

    Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret

    Authors: Asaf Cassel, Tomer Koren

    Abstract: We consider the task of learning to control a linear dynamical system under fixed quadratic costs, known as the Linear Quadratic Regulator (LQR) problem. While model-free approaches are often favorable in practice, thus far only model-based methods, which rely on costly system identification, have been shown to achieve regret that scales with the optimal dependence on the time horizon T. We presen… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  31. arXiv:2102.12192  [pdf, other

    cs.LG cs.CR

    Multiplicative Reweighting for Robust Neural Network Optimization

    Authors: Noga Bar, Tomer Koren, Raja Giryes

    Abstract: Neural networks are widespread due to their powerful performance. However, they degrade in the presence of noisy labels at training time. Inspired by the setting of learning with expert advice, where multiplicative weight (MW) updates were recently shown to be robust to moderate data corruptions in expert advice, we propose to use MW for reweighting examples during neural networks optimization. We… ▽ More

    Submitted 26 May, 2024; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Our code is publicly available in https://github.com/NogaBar/mr_robust_optim

  32. arXiv:2102.03803  [pdf, other

    cs.LG math.OC stat.ML

    Lazy OCO: Online Convex Optimization on a Switching Budget

    Authors: Uri Sherman, Tomer Koren

    Abstract: We study a variant of online convex optimization where the player is permitted to switch decisions at most $S$ times in expectation throughout $T$ rounds. Similar problems have been addressed in prior work for the discrete decision set setting, and more recently in the continuous setting but only with an adaptive adversary. In this work, we aim to fill the gap and present computationally efficient… ▽ More

    Submitted 17 September, 2023; v1 submitted 7 February, 2021; originally announced February 2021.

  33. arXiv:2102.02167  [pdf, other

    cs.LG math.OC stat.ML

    Algorithmic Instabilities of Accelerated Gradient Descent

    Authors: Amit Attia, Tomer Koren

    Abstract: We study the algorithmic stability of Nesterov's accelerated gradient method. For convex quadratic objectives, Chen et al. (2018) proved that the uniform stability of the method grows quadratically with the number of optimization steps, and conjectured that the same is true for the general convex and smooth case. We disprove this conjecture and show, for two notions of algorithmic stability (inclu… ▽ More

    Submitted 19 June, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: 37 pages

  34. arXiv:2102.01117  [pdf, ps, other

    cs.LG stat.ML

    SGD Generalizes Better Than GD (And Regularization Doesn't Help)

    Authors: Idan Amir, Tomer Koren, Roi Livni

    Abstract: We give a new separation result between the generalization performance of stochastic gradient descent (SGD) and of full-batch gradient descent (GD) in the fundamental stochastic convex optimization model. While for SGD it is well-known that $O(1/ε^2)$ iterations suffice for obtaining a solution with $ε$ excess expected risk, we show that with the same number of steps GD may overfit and emit a solu… ▽ More

    Submitted 29 June, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

    Journal ref: Conference on Learning Theory 2021

  35. arXiv:2102.00490  [pdf, ps, other

    cs.LG stat.ML

    Online Markov Decision Processes with Aggregate Bandit Feedback

    Authors: Alon Cohen, Haim Kaplan, Tomer Koren, Yishay Mansour

    Abstract: We study a novel variant of online finite-horizon Markov Decision Processes with adversarially changing loss functions and initially unknown dynamics. In each episode, the learner suffers the loss accumulated along the trajectory realized by the policy chosen for the episode, and observes aggregate bandit feedback: the trajectory is revealed along with the cumulative loss suffered, rather than the… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  36. arXiv:2010.14563  [pdf, ps, other

    cs.LG stat.ML

    Adversarial Dueling Bandits

    Authors: Aadirupa Saha, Tomer Koren, Yishay Mansour

    Abstract: We introduce the problem of regret minimization in Adversarial Dueling Bandits. As in classic Dueling Bandits, the learner has to repeatedly choose a pair of items and observe only a relative binary `win-loss' feedback for this pair, but here this feedback is generated from an arbitrary preference matrix, possibly chosen adversarially. Our main result is an algorithm whose $T$-round regret compare… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: 26 pages

  37. arXiv:2010.13639  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Optimization with Laggard Data Pipelines

    Authors: Naman Agarwal, Rohan Anil, Tomer Koren, Kunal Talwar, Cyril Zhang

    Abstract: State-of-the-art optimization is steadily shifting towards massively parallel pipelines with extremely large batch sizes. As a consequence, CPU-bound preprocessing and disk/memory/network operations have emerged as new performance bottlenecks, as opposed to hardware-accelerated gradient computations. In this regime, a recently proposed approach is data echoing (Choi et al., 2019), which takes repe… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: Published as a conference paper at NeurIPS 2020

  38. arXiv:2008.04612  [pdf, other

    cs.DC cs.LG

    Holdout SGD: Byzantine Tolerant Federated Learning

    Authors: Shahar Azulay, Lior Raz, Amir Globerson, Tomer Koren, Yehuda Afek

    Abstract: This work presents a new distributed Byzantine tolerant federated learning algorithm, HoldOut SGD, for Stochastic Gradient Descent (SGD) optimization. HoldOut SGD uses the well known machine learning technique of holdout estimation, in a distributed fashion, in order to select parameter updates that are likely to lead to models with low loss values. This makes it more effective at discarding Byzan… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 12 pages, 2 figures

  39. arXiv:2007.00759  [pdf, ps, other

    cs.LG stat.ML

    Bandit Linear Control

    Authors: Asaf Cassel, Tomer Koren

    Abstract: We consider the problem of controlling a known linear dynamical system under stochastic noise, adversarially chosen costs, and bandit feedback. Unlike the full feedback setting where the entire cost function is revealed after each decision, here only the cost incurred by the learner is observed. We present a new and efficient algorithm that, for strongly convex and smooth costs, obtains regret tha… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  40. arXiv:2005.04763  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    Private Stochastic Convex Optimization: Optimal Rates in Linear Time

    Authors: Vitaly Feldman, Tomer Koren, Kunal Talwar

    Abstract: We study differentially private (DP) algorithms for stochastic convex optimization: the problem of minimizing the population loss given i.i.d. samples from a distribution over convex loss functions. A recent work of Bassily et al. (2019) has established the optimal bound on the excess population loss achievable given $n$ samples. Unfortunately, their algorithm achieving this bound is relatively in… ▽ More

    Submitted 10 May, 2020; originally announced May 2020.

  41. arXiv:2003.06152  [pdf, other

    cs.LG stat.ML

    Can Implicit Bias Explain Generalization? Stochastic Convex Optimization as a Case Study

    Authors: Assaf Dauber, Meir Feder, Tomer Koren, Roi Livni

    Abstract: The notion of implicit bias, or implicit regularization, has been suggested as a means to explain the surprising generalization ability of modern-days overparameterized learning algorithms. This notion refers to the tendency of the optimization algorithm towards a certain structured solution that often generalizes well. Recently, several papers have studied implicit regularization and were able to… ▽ More

    Submitted 22 December, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

  42. arXiv:2002.11803  [pdf, other

    cs.LG stat.ML

    Disentangling Adaptive Gradient Methods from Learning Rates

    Authors: Naman Agarwal, Rohan Anil, Elad Hazan, Tomer Koren, Cyril Zhang

    Abstract: We investigate several confounding factors in the evaluation of optimization algorithms for deep learning. Primarily, we take a deeper look at how adaptive gradient methods interact with the learning rate schedule, a notoriously difficult-to-tune hyperparameter which has dramatic effects on the convergence and generalization of neural network training. We introduce a "grafting" experiment which de… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  43. arXiv:2002.10286  [pdf, other

    cs.LG stat.ML

    Prediction with Corrupted Expert Advice

    Authors: Idan Amir, Idan Attias, Tomer Koren, Roi Livni, Yishay Mansour

    Abstract: We revisit the fundamental problem of prediction with expert advice, in a setting where the environment is benign and generates losses stochastically, but the feedback observed by the learner is subject to a moderate adversarial corruption. We prove that a variant of the classical Multiplicative Weights algorithm with decreasing step sizes achieves constant regret in this setting and performs opti… ▽ More

    Submitted 20 October, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: NeurIPS 2020 Camera Ready

    Journal ref: Conference on Neural Information Processing Systems 2020

  44. arXiv:2002.09018  [pdf, other

    cs.LG math.OC stat.ML

    Scalable Second Order Optimization for Deep Learning

    Authors: Rohan Anil, Vineet Gupta, Tomer Koren, Kevin Regan, Yoram Singer

    Abstract: Optimization in machine learning, both theoretical and applied, is presently dominated by first-order gradient methods such as stochastic gradient descent. Second-order optimization methods, that involve second derivatives and/or second order statistics of the data, are far less prevalent despite strong theoretical properties, due to their prohibitive computation, memory and communication costs. I… ▽ More

    Submitted 5 March, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: 24 pages, Code available here: https://bit.ly/3uXXtKy

  45. arXiv:2002.08095  [pdf, ps, other

    cs.LG stat.ML

    Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently

    Authors: Asaf Cassel, Alon Cohen, Tomer Koren

    Abstract: We consider the problem of learning in Linear Quadratic Control systems whose transition parameters are initially unknown. Recent results in this setting have demonstrated efficient learning algorithms with regret growing with the square root of the number of decision steps. We present new efficient algorithms that achieve, perhaps surprisingly, regret that scales only (poly)logarithmically with t… ▽ More

    Submitted 1 July, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted for presentation at International Conference on Machine Learning (ICML) 2020

  46. arXiv:1906.03361  [pdf, other

    cs.LG stat.ML

    Robust Bi-Tempered Logistic Loss Based on Bregman Divergences

    Authors: Ehsan Amid, Manfred K. Warmuth, Rohan Anil, Tomer Koren

    Abstract: We introduce a temperature into the exponential function and replace the softmax output layer of neural nets by a high temperature generalization. Similarly, the logarithm in the log loss we use for training is replaced by a low temperature logarithm. By tuning the two temperatures we create loss functions that are non-convex already in the single layer case. When replacing the last layer of the n… ▽ More

    Submitted 23 September, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Journal ref: Neural Information Processing Systems 2019

  47. arXiv:1904.10120  [pdf, other

    cs.LG stat.ML

    Semi-Cyclic Stochastic Gradient Descent

    Authors: Hubert Eichner, Tomer Koren, H. Brendan McMahan, Nathan Srebro, Kunal Talwar

    Abstract: We consider convex SGD updates with a block-cyclic structure, i.e. where each cycle consists of a small number of blocks, each with many samples from a possibly different, block-specific, distribution. This situation arises, e.g., in Federated Learning where the mobile devices available for updates at different times during the day have different characteristics. We show that such block-cyclic str… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

  48. arXiv:1902.08647  [pdf, ps, other

    cs.LG stat.ML

    Better Algorithms for Stochastic Bandits with Adversarial Corruptions

    Authors: Anupam Gupta, Tomer Koren, Kunal Talwar

    Abstract: We study the stochastic multi-armed bandits problem in the presence of adversarial corruption. We present a new algorithm for this problem whose regret is nearly optimal, substantially improving upon previous work. Our algorithm is agnostic to the level of adversarial contamination and can tolerate a significant amount of corruption with virtually no degradation in performance.

    Submitted 28 March, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

  49. arXiv:1902.06223  [pdf, ps, other

    cs.LG stat.ML

    Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret

    Authors: Alon Cohen, Tomer Koren, Yishay Mansour

    Abstract: We present the first computationally-efficient algorithm with $\widetilde O(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics. By that, we resolve an open question of Abbasi-Yadkori and Szepesvári (2011) and Dean, Mania, Matni, Recht, and Tu (2018).

    Submitted 23 February, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

  50. arXiv:1901.11150  [pdf, other

    cs.LG math.OC stat.ML

    Memory-Efficient Adaptive Optimization

    Authors: Rohan Anil, Vineet Gupta, Tomer Koren, Yoram Singer

    Abstract: Adaptive gradient-based optimizers such as Adagrad and Adam are crucial for achieving state-of-the-art performance in machine translation and language modeling. However, these methods maintain second-order statistics for each parameter, thus introducing significant memory overheads that restrict the size of the model being used as well as the number of examples in a mini-batch. We describe an effe… ▽ More

    Submitted 11 September, 2019; v1 submitted 30 January, 2019; originally announced January 2019.