Skip to main content

Showing 1–4 of 4 results for author: Girish, A

  1. arXiv:2406.03072  [pdf, other

    cs.LG cs.IT stat.ML

    Local to Global: Learning Dynamics and Effect of Initialization for Transformers

    Authors: Ashok Vardhan Makkuva, Marco Bondaschi, Chanakya Ekbote, Adway Girish, Alliot Nagle, Hyeji Kim, Michael Gastpar

    Abstract: In recent years, transformer-based models have revolutionized deep learning, particularly in sequence modeling. To better understand this phenomenon, there is a growing interest in using Markov input processes to study transformers. However, our current understanding in this regard remains limited with many fundamental questions about how transformers learn Markov chains still unanswered. In this… ▽ More

    Submitted 27 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2402.04161  [pdf, other

    cs.LG cs.CL cs.IT stat.ML

    Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains

    Authors: Ashok Vardhan Makkuva, Marco Bondaschi, Adway Girish, Alliot Nagle, Martin Jaggi, Hyeji Kim, Michael Gastpar

    Abstract: In recent years, attention-based transformers have achieved tremendous success across a variety of disciplines including natural languages. A key ingredient behind their success is the generative pretraining procedure, during which these models are trained on a large text corpus in an auto-regressive manner. To shed light on this phenomenon, we propose a new framework that allows both theory and s… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  3. arXiv:2305.00528  [pdf, ps, other

    cs.LG cs.IT cs.MA

    ICQ: A Quantization Scheme for Best-Arm Identification Over Bit-Constrained Channels

    Authors: Fathima Zarin Faizal, Adway Girish, Manjesh Kumar Hanawal, Nikhil Karamchandani

    Abstract: We study the problem of best-arm identification in a distributed variant of the multi-armed bandit setting, with a central learner and multiple agents. Each agent is associated with an arm of the bandit, generating stochastic rewards following an unknown distribution. Further, each agent can communicate the observed rewards with the learner over a bit-constrained channel. We propose a novel quanti… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 17 pages, technical report

  4. arXiv:2211.13104  [pdf, other

    cs.CR cs.SE

    Mixed Signals: Analyzing Software Attribution Challenges in the Android Ecosystem

    Authors: Kaspar Hageman, Álvaro Feal, Julien Gamba, Aniketh Girish, Jakob Bleier, Martina Lindorfer, Juan Tapiador, Narseo Vallina-Rodriguez

    Abstract: The ability to identify the author responsible for a given software object is critical for many research studies and for enhancing software transparency and accountability. However, as opposed to other application markets like iOS, attribution in the Android ecosystem is known to be hard. Prior research has leveraged market metadata and signing certificates to identify software authors without que… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.