Skip to main content

Showing 1–2 of 2 results for author: Azulay, S

  1. arXiv:2102.09769  [pdf, other

    cs.LG

    On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent

    Authors: Shahar Azulay, Edward Moroshko, Mor Shpigel Nacson, Blake Woodworth, Nathan Srebro, Amir Globerson, Daniel Soudry

    Abstract: Recent work has highlighted the role of initialization scale in determining the structure of the solutions that gradient methods converge to. In particular, it was shown that large initialization leads to the neural tangent kernel regime solution, whereas small initialization leads to so called "rich regimes". However, the initialization structure is richer than the overall scale alone and involve… ▽ More

    Submitted 19 February, 2021; originally announced February 2021.

    Comments: 33 pages, 2 figures

    MSC Class: 68T07 (Primary) ACM Class: I.2.6; G.1.6

  2. arXiv:2008.04612  [pdf, other

    cs.DC cs.LG

    Holdout SGD: Byzantine Tolerant Federated Learning

    Authors: Shahar Azulay, Lior Raz, Amir Globerson, Tomer Koren, Yehuda Afek

    Abstract: This work presents a new distributed Byzantine tolerant federated learning algorithm, HoldOut SGD, for Stochastic Gradient Descent (SGD) optimization. HoldOut SGD uses the well known machine learning technique of holdout estimation, in a distributed fashion, in order to select parameter updates that are likely to lead to models with low loss values. This makes it more effective at discarding Byzan… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 12 pages, 2 figures