Skip to main content

Showing 1–33 of 33 results for author: Lyle, C

  1. arXiv:2407.06756  [pdf, other

    cs.LG cs.AI cs.NE

    Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning

    Authors: Augustine N. Mavor-Parker, Matthew J. Sargent, Caswell Barry, Lewis Griffin, Clare Lyle

    Abstract: Periodic activation functions, often referred to as learned Fourier features have been widely demonstrated to improve sample efficiency and stability in a variety of deep RL algorithms. Potentially incompatible hypotheses have been made about the source of these improvements. One is that periodic activations learn low frequency representations and as a result avoid overfitting to bootstrapped targ… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.01800  [pdf, other

    cs.LG cs.AI

    Normalization and effective learning rates in reinforcement learning

    Authors: Clare Lyle, Zeyu Zheng, Khimya Khetarpal, James Martens, Hado van Hasselt, Razvan Pascanu, Will Dabney

    Abstract: Normalization layers have recently experienced a renaissance in the deep reinforcement learning and continual learning literature, with several works highlighting diverse benefits such as improving loss landscape conditioning and combatting overestimation bias. However, normalization brings with it a subtle but important side effect: an equivalence between growth in the norm of the network paramet… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2407.01704  [pdf, other

    cs.LG cs.AI

    Weight Clipping for Deep Continual and Reinforcement Learning

    Authors: Mohamed Elsayed, Qingfeng Lan, Clare Lyle, A. Rupam Mahmood

    Abstract: Many failures in deep continual and reinforcement learning are associated with increasing magnitudes of the weights, making them hard to change and potentially causing overfitting. While many methods address these learning failures, they often change the optimizer or the architecture, a complexity that hinders widespread adoption in various systems. In this paper, we focus on learning failures tha… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Published in the First Reinforcement Learning Conference (RLC 2024). Code is available at https://github.com/mohmdelsayed/weight-clipping

  4. arXiv:2406.02596  [pdf, other

    cs.LG cs.AI

    Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise Networks

    Authors: Hojoon Lee, Hyeonseo Cho, Hyunseung Kim, Donghu Kim, Dugki Min, Jaegul Choo, Clare Lyle

    Abstract: This study investigates the loss of generalization ability in neural networks, revisiting warm-starting experiments from Ash & Adams. Our empirical analysis reveals that common methods designed to enhance plasticity by maintaining trainability provide limited benefits to generalization. While reinitializing the network can be effective, it also risks losing valuable prior knowledge. To this end, w… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: accepted to ICML 2024

  5. arXiv:2406.02035  [pdf, other

    cs.LG cs.AI

    A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

    Authors: Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Avila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana Borsa, Arthur Guez, Will Dabney

    Abstract: Learning a good representation is a crucial challenge for Reinforcement Learning (RL) agents. Self-predictive learning provides means to jointly learn a latent representation and dynamics model by bootstrapping from future latent representations (BYOL). Recent work has developed theoretical insights into these algorithms by studying a continuous-time ODE model for self-predictive representation le… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2402.18762  [pdf, other

    cs.LG

    Disentangling the Causes of Plasticity Loss in Neural Networks

    Authors: Clare Lyle, Zeyu Zheng, Khimya Khetarpal, Hado van Hasselt, Razvan Pascanu, James Martens, Will Dabney

    Abstract: Underpinning the past decades of work on the design, initialization, and optimization of neural networks is a seemingly innocuous assumption: that the network is trained on a \textit{stationary} data distribution. In settings where this assumption is violated, e.g.\ deep reinforcement learning, learning algorithms become unstable and brittle with respect to hyperparameters and even random seeds. O… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  7. arXiv:2402.08609  [pdf, other

    cs.LG cs.AI

    Mixtures of Experts Unlock Parameter Scaling for Deep RL

    Authors: Johan Obando-Ceron, Ghada Sokar, Timon Willi, Clare Lyle, Jesse Farebrother, Jakob Foerster, Gintare Karolina Dziugaite, Doina Precup, Pablo Samuel Castro

    Abstract: The recent rapid progress in (self) supervised learning models is in large part predicted by empirical scaling laws: a model's performance scales proportionally to its size. Analogous scaling laws remain elusive for reinforcement learning domains, however, where increasing the parameter count of a model often hurts its final performance. In this paper, we demonstrate that incorporating Mixture-of-… ▽ More

    Submitted 26 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  8. arXiv:2402.07598  [pdf, other

    cs.LG stat.ML

    Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model

    Authors: Mark Rowland, Li Kevin Wenliang, Rémi Munos, Clare Lyle, Yunhao Tang, Will Dabney

    Abstract: We propose a new algorithm for model-based distributional reinforcement learning (RL), and prove that it is minimax-optimal for approximating return distributions with a generative model (up to logarithmic factors), resolving an open question of Zhang et al. (2023). Our analysis provides new theoretical results on categorical approaches to distributional RL, and also introduces a new distributiona… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  9. arXiv:2312.09187  [pdf, other

    cs.LG

    Vision-Language Models as a Source of Rewards

    Authors: Kate Baumli, Satinder Baveja, Feryal Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Dmitry Nikulin, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald , et al. (2 additional authors not shown)

    Abstract: Building generalist agents that can accomplish many goals in rich open-ended environments is one of the research frontiers for reinforcement learning. A key limiting factor for building generalist agents with RL has been the need for a large number of reward functions for achieving different goals. We investigate the feasibility of using off-the-shelf vision-language models, or VLMs, as sources of… ▽ More

    Submitted 12 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures

  10. arXiv:2312.04064  [pdf, other

    q-bio.QM cs.LG stat.ME

    DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design

    Authors: Clare Lyle, Arash Mehrjou, Pascal Notin, Andrew Jesson, Stefan Bauer, Yarin Gal, Patrick Schwab

    Abstract: The discovery of therapeutics to treat genetically-driven pathologies relies on identifying genes involved in the underlying disease mechanisms. Existing approaches search over the billions of potential interventions to maximize the expected influence on the target phenotype. However, to reduce the risk of failure in future stages of trials, practical experiment design aims to find a set of interv… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Journal ref: International Conference on Machine Learning, 2023

  11. arXiv:2305.18388  [pdf, other

    cs.LG stat.ML

    The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation

    Authors: Mark Rowland, Yunhao Tang, Clare Lyle, Rémi Munos, Marc G. Bellemare, Will Dabney

    Abstract: We study the problem of temporal-difference-based policy evaluation in reinforcement learning. In particular, we analyse the use of a distributional reinforcement learning algorithm, quantile temporal-difference learning (QTD), for this task. We reach the surprising conclusion that even if a practitioner has no interest in the return distribution beyond the mean, QTD (which learns predictions abou… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  12. arXiv:2305.15555  [pdf, other

    cs.LG cs.AI

    Deep Reinforcement Learning with Plasticity Injection

    Authors: Evgenii Nikishin, Junhyuk Oh, Georg Ostrovski, Clare Lyle, Razvan Pascanu, Will Dabney, André Barreto

    Abstract: A growing body of evidence suggests that neural networks employed in deep reinforcement learning (RL) gradually lose their plasticity, the ability to learn from new data; however, the analysis and mitigation of this phenomenon is hampered by the complex relationship between plasticity, exploration, and performance in RL. This paper introduces plasticity injection, a minimalistic intervention that… ▽ More

    Submitted 3 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready

  13. arXiv:2303.01486  [pdf, other

    cs.LG

    Understanding plasticity in neural networks

    Authors: Clare Lyle, Zeyu Zheng, Evgenii Nikishin, Bernardo Avila Pires, Razvan Pascanu, Will Dabney

    Abstract: Plasticity, the ability of a neural network to quickly change its predictions in response to new information, is essential for the adaptability and robustness of deep reinforcement learning systems. Deep neural networks are known to lose plasticity over the course of training even in relatively simple learning problems, but the mechanisms driving this phenomenon are still poorly understood. This p… ▽ More

    Submitted 27 November, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted to ICML 2023 (oral presentation)

  14. arXiv:2212.05377  [pdf, other

    cs.LG

    Generalization Through the Lens of Learning Dynamics

    Authors: Clare Lyle

    Abstract: A machine learning (ML) system must learn not only to match the output of a target function on a training set, but also to generalize to novel situations in order to yield accurate predictions at deployment. In most practical applications, the user cannot exhaustively enumerate every possible input to the model; strong generalization performance is therefore crucial to the development of ML system… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.

    Comments: PhD Thesis

  15. arXiv:2212.03319  [pdf, other

    cs.LG cs.AI

    Understanding Self-Predictive Learning for Reinforcement Learning

    Authors: Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo Ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko

    Abstract: We study the learning dynamics of self-predictive learning for reinforcement learning, a family of algorithms that learn representations by minimizing the prediction error of their own future latent representations. Despite its recent empirical success, such algorithms have an apparent defect: trivial representations (such as constants) minimize the prediction error, yet it is obviously undesirabl… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  16. arXiv:2206.02126  [pdf, other

    cs.LG

    Learning Dynamics and Generalization in Reinforcement Learning

    Authors: Clare Lyle, Mark Rowland, Will Dabney, Marta Kwiatkowska, Yarin Gal

    Abstract: Solving a reinforcement learning (RL) problem poses two competing challenges: fitting a potentially discontinuous value function, and generalizing well to new observations. In this paper, we analyze the learning dynamics of temporal difference algorithms to gain novel insight into the tension between these two objectives. We show theoretically that temporal difference learning encourages agents to… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  17. arXiv:2204.09560  [pdf, other

    cs.LG

    Understanding and Preventing Capacity Loss in Reinforcement Learning

    Authors: Clare Lyle, Mark Rowland, Will Dabney

    Abstract: The reinforcement learning (RL) problem is rife with sources of non-stationarity, making it a notoriously difficult problem domain for the application of neural networks. We identify a mechanism by which non-stationary prediction targets can prevent learning progress in deep RL agents: \textit{capacity loss}, whereby networks trained on a sequence of target values lose their ability to quickly upd… ▽ More

    Submitted 4 May, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: Presented at ICLR 2022

  18. arXiv:2112.13023  [pdf, other

    cs.LG cs.AI

    DARTS without a Validation Set: Optimizing the Marginal Likelihood

    Authors: Miroslav Fil, Binxin Ru, Clare Lyle, Yarin Gal

    Abstract: The success of neural architecture search (NAS) has historically been limited by excessive compute requirements. While modern weight-sharing NAS methods such as DARTS are able to finish the search in single-digit GPU days, extracting the final best architecture from the shared weights is notoriously unreliable. Training-Speed-Estimate (TSE), a recently developed generalization estimator with a Bay… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: Presented at the 5th Workshop on Meta-Learning at NeurIPS 2021

  19. arXiv:2106.02584  [pdf, other

    cs.LG stat.ML

    Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

    Authors: Jannik Kossen, Neil Band, Clare Lyle, Aidan N. Gomez, Tom Rainforth, Yarin Gal

    Abstract: We challenge a common assumption underlying most supervised deep learning: that a model makes a prediction depending only on its parameters and the features of a single input. To this end, we introduce a general-purpose deep learning architecture that takes as input the entire dataset instead of processing one datapoint at a time. Our approach uses self-attention to reason about relationships betw… ▽ More

    Submitted 1 February, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at NeurIPS 2021. First two authors contributed equally

  20. arXiv:2105.09108  [pdf, other

    cs.AI cs.LG

    Provable Guarantees on the Robustness of Decision Rules to Causal Interventions

    Authors: Benjie Wang, Clare Lyle, Marta Kwiatkowska

    Abstract: Robustness of decision rules to shifts in the data-generating process is crucial to the successful deployment of decision-making systems. Such shifts can be viewed as interventions on a causal graph, which capture (possibly hypothetical) changes in the data-generating process, whether due to natural reasons or by the action of an adversary. We consider causal Bayesian networks and formally define… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 21 pages (8+13 Appendix). To be published in IJCAI 2021

  21. arXiv:2103.06002  [pdf, other

    cs.LG stat.ML

    Robustness to Pruning Predicts Generalization in Deep Neural Networks

    Authors: Lorenz Kuhn, Clare Lyle, Aidan N. Gomez, Jonas Rothfuss, Yarin Gal

    Abstract: Existing generalization measures that aim to capture a model's simplicity based on parameter counts or norms fail to explain generalization in overparameterized deep neural networks. In this paper, we introduce a new, theoretically motivated measure of a network's simplicity which we call prunability: the smallest \emph{fraction} of the network's parameters that can be kept while pruning without a… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

  22. arXiv:2102.13089  [pdf, other

    cs.LG

    On The Effect of Auxiliary Tasks on Representation Dynamics

    Authors: Clare Lyle, Mark Rowland, Georg Ostrovski, Will Dabney

    Abstract: While auxiliary tasks play a key role in shaping the representations learnt by reinforcement learning agents, much is still unknown about the mechanisms through which this is achieved. This work develops our understanding of the relationship between auxiliary tasks, environment structure, and representations by analysing the dynamics of temporal difference algorithms. Through this approach, we est… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: AISTATS 2021

  23. arXiv:2102.12560  [pdf, other

    cs.LG cs.AI

    PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning

    Authors: Angelos Filos, Clare Lyle, Yarin Gal, Sergey Levine, Natasha Jaques, Gregory Farquhar

    Abstract: We study reinforcement learning (RL) with no-reward demonstrations, a setting in which an RL agent has access to additional data from the interaction of other agents with the same environment. However, it has no access to the rewards or goals of these agents, and their objectives and levels of expertise may vary widely. These assumptions are common in multi-agent settings, such as autonomous drivi… ▽ More

    Submitted 10 June, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: The last two authors contributed equally. Accepted at ICML 2021

  24. arXiv:2010.14499  [pdf, other

    cs.LG

    A Bayesian Perspective on Training Speed and Model Selection

    Authors: Clare Lyle, Lisa Schut, Binxin Ru, Yarin Gal, Mark van der Wilk

    Abstract: We take a Bayesian perspective to illustrate a connection between training speed and the marginal likelihood in linear models. This provides two major insights: first, that a measure of a model's training speed can be used to estimate its marginal likelihood. Second, that this measure, under certain conditions, predicts the relative weighting of models in linear model combinations trained to minim… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: To be presented at NeurIPS 2020

  25. arXiv:2006.04492  [pdf, other

    stat.ML cs.LG

    Speedy Performance Estimation for Neural Architecture Search

    Authors: Binxin Ru, Clare Lyle, Lisa Schut, Miroslav Fil, Mark van der Wilk, Yarin Gal

    Abstract: Reliable yet efficient evaluation of generalisation performance of a proposed architecture is crucial to the success of neural architecture search (NAS). Traditional approaches face a variety of limitations: training each architecture to completion is prohibitively expensive, early stopped validation accuracy may correlate poorly with fully trained performance, and model-based estimators require l… ▽ More

    Submitted 7 June, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 23 pages, 14 figures

  26. arXiv:2005.00178  [pdf, other

    cs.LG stat.ML

    On the Benefits of Invariance in Neural Networks

    Authors: Clare Lyle, Mark van der Wilk, Marta Kwiatkowska, Yarin Gal, Benjamin Bloem-Reddy

    Abstract: Many real world data analysis problems exhibit invariant structure, and models that take advantage of this structure have shown impressive empirical performance, particularly in deep learning. While the literature contains a variety of methods to incorporate invariance into models, theoretical understanding is poor and there is no way to assess when one method should be preferred over another. In… ▽ More

    Submitted 30 April, 2020; originally announced May 2020.

  27. arXiv:2003.12537  [pdf, other

    cs.LG stat.ML

    Unpacking Information Bottlenecks: Unifying Information-Theoretic Objectives in Deep Learning

    Authors: Andreas Kirsch, Clare Lyle, Yarin Gal

    Abstract: The Information Bottleneck principle offers both a mechanism to explain how deep neural networks train and generalize, as well as a regularized objective with which to train models. However, multiple competing objectives are proposed in the literature, and the information-theoretic quantities used in these objectives are difficult to compute for large deep neural networks, which in turn limits the… ▽ More

    Submitted 5 January, 2021; v1 submitted 27 March, 2020; originally announced March 2020.

  28. arXiv:2003.06016  [pdf, other

    cs.LG cs.AI stat.ML

    Invariant Causal Prediction for Block MDPs

    Authors: Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal, Doina Precup

    Abstract: Generalization across environments is critical to the successful application of reinforcement learning algorithms to real-world challenges. In this paper, we consider the problem of learning abstractions that generalize in block MDPs, families of environments with a shared latent state space and dynamics structure over that latent space, but varying observations. We leverage tools from causal infe… ▽ More

    Submitted 11 June, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Accepted to ICML 2020. 16 pages, 8 figures

  29. arXiv:1901.11530  [pdf, other

    cs.LG cs.AI stat.ML

    A Geometric Perspective on Optimal Representations for Reinforcement Learning

    Authors: Marc G. Bellemare, Will Dabney, Robert Dadashi, Adrien Ali Taiga, Pablo Samuel Castro, Nicolas Le Roux, Dale Schuurmans, Tor Lattimore, Clare Lyle

    Abstract: We propose a new perspective on representation learning in reinforcement learning based on geometric properties of the space of value functions. We leverage this perspective to provide formal evidence regarding the usefulness of value functions as auxiliary tasks. Our formulation considers adapting the representation to minimize the (linear) approximation of the value function of all stationary po… ▽ More

    Submitted 25 June, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

  30. arXiv:1901.11084  [pdf, other

    cs.LG stat.ML

    A Comparative Analysis of Expected and Distributional Reinforcement Learning

    Authors: Clare Lyle, Pablo Samuel Castro, Marc G. Bellemare

    Abstract: Since their introduction a year ago, distributional approaches to reinforcement learning (distributional RL) have produced strong results relative to the standard approach which models expected values (expected RL). However, aside from convergence guarantees, there have been few theoretical results investigating the reasons behind the improvements distributional RL provides. In this paper we begin… ▽ More

    Submitted 21 February, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: To appear in the Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence

  31. arXiv:1805.04874  [pdf, other

    stat.ML cs.LG

    GAN Q-learning

    Authors: Thang Doan, Bogdan Mazoure, Clare Lyle

    Abstract: Distributional reinforcement learning (distributional RL) has seen empirical success in complex Markov Decision Processes (MDPs) in the setting of nonlinear function approximation. However, there are many different ways in which one can leverage the distributional approach to reinforcement learning. In this paper, we propose GAN Q-learning, a novel distributional RL method based on generative adve… ▽ More

    Submitted 20 July, 2018; v1 submitted 13 May, 2018; originally announced May 2018.

  32. arXiv:1802.07228  [pdf

    cs.AI cs.CR cs.CY

    The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

    Authors: Miles Brundage, Shahar Avin, Jack Clark, Helen Toner, Peter Eckersley, Ben Garfinkel, Allan Dafoe, Paul Scharre, Thomas Zeitzoff, Bobby Filar, Hyrum Anderson, Heather Roff, Gregory C. Allen, Jacob Steinhardt, Carrick Flynn, Seán Ó hÉigeartaigh, Simon Beard, Haydn Belfield, Sebastian Farquhar, Clare Lyle, Rebecca Crootof, Owain Evans, Michael Page, Joanna Bryson, Roman Yampolskiy , et al. (1 additional authors not shown)

    Abstract: This report surveys the landscape of potential security threats from malicious uses of AI, and proposes ways to better forecast, prevent, and mitigate these threats. After analyzing the ways in which AI may influence the threat landscape in the digital, physical, and political domains, we make four high-level recommendations for AI researchers and other stakeholders. We also suggest several promis… ▽ More

    Submitted 20 February, 2018; originally announced February 2018.

  33. arXiv:1612.04861  [pdf, other

    cs.CG cs.DM math.CO

    Some Counterexamples for Compatible Triangulations

    Authors: Cody Barnson, Dawn Chandler, Qiao Chen, Christina Chung, Andrew Coccimiglio, Sean La, Lily Li, Aïna Linn, Anna Lubiw, Clare Lyle, Shikha Mahajan, Gregory Mierzwinski, Simon Pratt, Yoon Su Matthias Yoo, Hongbo Zhang, Kevin Zhang

    Abstract: We consider the conjecture by Aichholzer, Aurenhammer, Hurtado, and Krasser that any two points sets with the same cardinality and the same size convex hull can be triangulated in the "same" way, more precisely via \emph{compatible triangulations}. We show counterexamples to various strengthened versions of this conjecture.

    Submitted 9 December, 2016; originally announced December 2016.

    MSC Class: 68R10 ACM Class: F.2.2