Skip to main content

Showing 1–25 of 25 results for author: Pearce, T

  1. arXiv:2406.12907  [pdf, other

    cs.LG cs.CL

    Reconciling Kaplan and Chinchilla Scaling Laws

    Authors: Tim Pearce, Jinyeop Song

    Abstract: Kaplan et al. [2020] (`Kaplan') and Hoffmann et al. [2022] (`Chinchilla') studied the scaling behavior of transformers trained on next-token language prediction. These studies produced different estimates for how the number of parameters ($N$) and training tokens ($D$) should be set to achieve the lowest possible loss for a given compute budget ($C$). Kaplan: $N_\text{optimal} \propto C^{0.73}$, C… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.04105  [pdf, other

    cs.LG eess.IV

    From Tissue Plane to Organ World: A Benchmark Dataset for Multimodal Biomedical Image Registration using Deep Co-Attention Networks

    Authors: Yifeng Wang, Weipeng Li, Thomas Pearce, Haohan Wang

    Abstract: Correlating neuropathology with neuroimaging findings provides a multiscale view of pathologic changes in the human organ spanning the meso- to micro-scales, and is an emerging methodology expected to shed light on numerous disease states. To gain the most information from this multimodal, multiscale approach, it is desirable to identify precisely where a histologic tissue section was taken from w… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2406.03947  [pdf, other

    cs.LG cs.AI

    Weight-based Decomposition: A Case for Bilinear MLPs

    Authors: Michael T. Pearce, Thomas Dooms, Alice Rigg

    Abstract: Gated Linear Units (GLUs) have become a common building block in modern foundation models. Bilinear layers drop the non-linearity in the "gate" but still have comparable performance to other GLUs. An attractive quality of bilinear layers is that they can be fully expressed in terms of a third-order tensor and linear operations. Leveraging this, we develop a method to decompose the bilinear tensor… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2405.12399  [pdf, other

    cs.LG cs.AI cs.CV

    Diffusion for World Modeling: Visual Details Matter in Atari

    Authors: Eloi Alonso, Adam Jelley, Vincent Micheli, Anssi Kanervisto, Amos Storkey, Tim Pearce, François Fleuret

    Abstract: World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffus… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 25 pages, 11 figures, 10 tables

  5. arXiv:2402.16349  [pdf, other

    cs.LG eess.SY

    C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

    Authors: Tianjiao Luo, Tim Pearce, Huayu Chen, Jianfei Chen, Jun Zhu

    Abstract: Generative Adversarial Imitation Learning (GAIL) trains a generative policy to mimic a demonstrator. It uses on-policy Reinforcement Learning (RL) to optimize a reward signal derived from a GAN-like discriminator. A major drawback of GAIL is its training instability - it inherits the complex training dynamics of GANs, and the distribution shift introduced by RL. This can cause oscillations during… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

    Authors: Stephen Mak, Liming Xu, Tim Pearce, Michael Ostroumov, Alexandra Brintrup

    Abstract: Collaborative vehicle routing occurs when carriers collaborate through sharing their transportation requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solutio… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Final, published version can be found here: https://www.sciencedirect.com/science/article/pii/S0968090X23003662

    Journal ref: Volume 157, December 2023, 104376

  7. arXiv:2310.17458  [pdf, other

    cs.LG

    Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing

    Authors: Stephen Mak, Liming Xu, Tim Pearce, Michael Ostroumov, Alexandra Brintrup

    Abstract: Collaborative Vehicle Routing is where delivery companies cooperate by sharing their delivery information and performing delivery requests on behalf of each other. This achieves economies of scale and thus reduces cost, greenhouse gas emissions, and road congestion. But which company should partner with whom, and how much should each company be compensated? Traditional game theoretic solution conc… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2021 Workshop on Cooperative AI

  8. arXiv:2302.07515  [pdf, other

    cs.AI cs.LG cs.MA

    TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play

    Authors: Fanqi Lin, Shiyu Huang, Tim Pearce, Wenze Chen, Wei-Wei Tu

    Abstract: Multi-agent football poses an unsolved challenge in AI research. Existing work has focused on tackling simplified scenarios of the game, or else leveraging expert demonstrations. In this paper, we develop a multi-agent system to play the full 11 vs. 11 game mode, without demonstrations. This game mode contains aspects that present major challenges to modern reinforcement learning algorithms; multi… ▽ More

    Submitted 20 February, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: The 22nd International Conference on Autonomous Agents and Multiagent Systems(AAMAS2023)

  9. arXiv:2301.10677  [pdf, other

    cs.AI cs.LG stat.ML

    Imitating Human Behaviour with Diffusion Models

    Authors: Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

    Abstract: Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their ex… ▽ More

    Submitted 3 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Published in ICLR 2023

    Journal ref: ICLR 2023

  10. arXiv:2207.05631  [pdf, other

    cs.LG cs.AI

    DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

    Authors: Wentse Chen, Shiyu Huang, Yuan Chiang, Tim Pearce, Wei-Wei Tu, Ting Chen, Jun Zhu

    Abstract: Most reinforcement learning algorithms seek a single optimal strategy that solves a given task. However, it can often be valuable to learn a diverse set of solutions, for instance, to make an agent's interaction with users more engaging, or improve the robustness of a policy to an unexpected perturbance. We propose Diversity-Guided Policy Optimization (DGPO), an on-policy algorithm that discovers… ▽ More

    Submitted 5 January, 2024; v1 submitted 12 July, 2022; originally announced July 2022.

  11. arXiv:2205.13496  [pdf, other

    stat.ML cs.LG

    Censored Quantile Regression Neural Networks for Distribution-Free Survival Analysis

    Authors: Tim Pearce, Jong-Hyeon Jeong, Yichen Jia, Jun Zhu

    Abstract: This paper considers doing quantile regression on censored data using neural networks (NNs). This adds to the survival analysis toolkit by allowing direct prediction of the target variable, along with a distribution-free characterisation of uncertainty, using a flexible function approximator. We begin by showing how an algorithm popular in linear models can be applied to NNs. However, the resultin… ▽ More

    Submitted 6 February, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Published in NeurIPS 2022

    Journal ref: NeurIPS 2022

  12. Sensitive Samples Revisited: Detecting Neural Network Attacks Using Constraint Solvers

    Authors: Amel Nestor Docena, Thomas Wahl, Trevor Pearce, Yunsi Fei

    Abstract: Neural Networks are used today in numerous security- and safety-relevant domains and are, as such, a popular target of attacks that subvert their classification capabilities, by manipulating the network parameters. Prior work has introduced sensitive samples -- inputs highly sensitive to parameter changes -- to detect such manipulations, and proposed a gradient ascent-based approach to compute the… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: In Proceedings SCSS 2021, arXiv:2109.02501

    Journal ref: EPTCS 342, 2021, pp. 35-48

  13. arXiv:2107.13304  [pdf, other

    cs.LG stat.ML

    Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection

    Authors: Bang Xiang Yong, Tim Pearce, Alexandra Brintrup

    Abstract: After an autoencoder (AE) has learnt to reconstruct one dataset, it might be expected that the likelihood on an out-of-distribution (OOD) input would be low. This has been studied as an approach to detect OOD inputs. Recent work showed this intuitive approach can fail for the dataset pairs FashionMNIST vs MNIST. This paper suggests this is due to the use of Bernoulli likelihood and analyses why th… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: Presented at the ICML 2020 Workshop on Uncertainty and Ro-bustness in Deep Learning

  14. arXiv:2106.04972  [pdf, other

    cs.LG cs.AI stat.ML

    Understanding Softmax Confidence and Uncertainty

    Authors: Tim Pearce, Alexandra Brintrup, Jun Zhu

    Abstract: It is often remarked that neural networks fail to increase their uncertainty when predicting on data far from the training distribution. Yet naively using softmax confidence as a proxy for uncertainty achieves modest success in tasks exclusively testing for this, e.g., out-of-distribution (OOD) detection. This paper investigates this contradiction, identifying two implicit biases that do encourage… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

  15. arXiv:2104.04258  [pdf, other

    cs.AI cs.LG stat.ML

    Counter-Strike Deathmatch with Large-Scale Behavioural Cloning

    Authors: Tim Pearce, Jun Zhu

    Abstract: This paper describes an AI agent that plays the popular first-person-shooter (FPS) video game `Counter-Strike; Global Offensive' (CSGO) from pixel input. The agent, a deep neural network, matches the performance of the medium difficulty built-in AI on the deathmatch game mode, whilst adopting a humanlike play style. Unlike much prior work in games, no API is available for CSGO, so algorithms must… ▽ More

    Submitted 9 December, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2021

  16. arXiv:2102.06893  [pdf

    cs.SI cs.HC

    A Bayesian social platform for inclusive and evidence-based decision making

    Authors: Susannah Kate Devitt, Tamara Rose Pearce, Alok Kumar Chowdhury, Kerrie Mengersen

    Abstract: Against the backdrop of a social media reckoning, this paper seeks to demonstrate the potential of social tools to build virtuous behaviours online. We must assume that human behaviour is flawed, the truth can be elusive, and as communities we must commit to mechanisms to encourage virtuous social digital behaviours. Societies that use social platforms should be inclusive, responsive to evidence,… ▽ More

    Submitted 13 February, 2021; originally announced February 2021.

    Comments: 38 pages, 3 tables, 13 figures submitted for peer review for inclusion in M. Alfano, C. Klein and J de Ridder (Eds.) Social Virtue Epistemology. Routledge [forthcoming]

    MSC Class: 62C12 ACM Class: H.4.1; H.4.2; H.4.3; H.5.3; J.4

  17. arXiv:2007.14235  [pdf, other

    cs.CV

    Structured Weight Priors for Convolutional Neural Networks

    Authors: Tim Pearce, Andrew Y. K. Foong, Alexandra Brintrup

    Abstract: Selection of an architectural prior well suited to a task (e.g. convolutions for image data) is crucial to the success of deep neural networks (NNs). Conversely, the weight priors within these architectures are typically left vague, e.g.~independent Gaussian distributions, which has led to debate over the utility of Bayesian deep learning. This paper explores the benefits of adding structure to we… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

  18. arXiv:2004.07171  [pdf, other

    cs.SD cs.IR eess.AS

    Musical Features for Automatic Music Transcription Evaluation

    Authors: Adrien Ycart, Lele Liu, Emmanouil Benetos, Marcus T. Pearce

    Abstract: This technical report gives a detailed, formal description of the features introduced in the paper: Adrien Ycart, Lele Liu, Emmanouil Benetos and Marcus T. Pearce. "Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription", Transactions of the International Society for Music Information Retrieval (TISMIR), Accepted, 2020.

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: Technical report

  19. arXiv:2002.08517  [pdf, other

    cs.LG stat.ML

    Avoiding Kernel Fixed Points: Computing with ELU and GELU Infinite Networks

    Authors: Russell Tsuchida, Tim Pearce, Chris van der Heide, Fred Roosta, Marcus Gallagher

    Abstract: Analysing and computing with Gaussian processes arising from infinitely wide neural networks has recently seen a resurgence in popularity. Despite this, many explicit covariance functions of networks with activation functions used in modern networks remain unknown. Furthermore, while the kernels of deep networks can be computed iteratively, theoretical understanding of deep kernels is lacking, par… ▽ More

    Submitted 28 February, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: AAAI camera ready version. 18 pages, 9 figures, 2 tables. Corrected name particle capitalisation and formatting

  20. arXiv:1905.06076  [pdf, other

    stat.ML cs.LG

    Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions

    Authors: Tim Pearce, Russell Tsuchida, Mohamed Zaki, Alexandra Brintrup, Andy Neely

    Abstract: A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e.g. summing a periodic and linear kernel can capture seasonal variation with a long term trend. Despite a well-studied link between GPs and Bayesian neural networks (BNNs), the BNN analogue of this has not yet been explored. This paper derives BNN archit… ▽ More

    Submitted 28 June, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Journal ref: The 35th Conference on Uncertainty in Artificial Intelligence (UAI 2019)

  21. arXiv:1904.02288  [pdf

    cs.DC

    Metabolomics in the Cloud: Scaling Computational Tools to Big Data

    Authors: Jianliang Gao, Noureddin Sadawi, Ibrahim Karaman, Jake T M Pearce, Pablo Moreno, Anders Larsson, Marco Capuccini, Paul Elliott, Jeremy K Nicholson, Timothy M D Ebbels, Robert Glen

    Abstract: Background: Metabolomics datasets are becoming increasingly large and complex, with multiple types of algorithms and workflows needed to process and analyse the data. A cloud infrastructure with portable software tools can provide much needed resources enabling faster processing of much larger datasets than would be possible at any individual lab. The PhenoMeNal project has developed such an infra… ▽ More

    Submitted 9 April, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: 25 pages, 5 figures

  22. arXiv:1811.12188  [pdf, other

    cs.LG stat.ML

    Bayesian Neural Network Ensembles

    Authors: Tim Pearce, Mohamed Zaki, Andy Neely

    Abstract: Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset. The variance of the ensemble's predictions is interpreted as its epistemic uncertainty. The appeal of ensembling stems from being a collection of regular NNs - this makes them both scalable an… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.05546

  23. arXiv:1810.05546  [pdf, other

    stat.ML cs.LG

    Uncertainty in Neural Networks: Approximately Bayesian Ensembling

    Authors: Tim Pearce, Felix Leibfried, Alexandra Brintrup, Mohamed Zaki, Andy Neely

    Abstract: Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesi… ▽ More

    Submitted 26 February, 2020; v1 submitted 12 October, 2018; originally announced October 2018.

    Comments: Please cite as published in AISTATS 2020

    Journal ref: The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020

  24. arXiv:1807.00790  [pdf, other

    cs.SD eess.AS

    An energy-based generative sequence model for testing sensory theories of Western harmony

    Authors: Peter M. C. Harrison, Marcus T. Pearce

    Abstract: The relationship between sensory consonance and Western harmony is an important topic in music theory and psychology. We introduce new methods for analysing this relationship, and apply them to large corpora representing three prominent genres of Western music: classical, popular, and jazz music. These methods centre on a generative sequence model with an exponential-family energy-based form that… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: 8 pages, 2 figures. To appear in Proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR), Paris, France, 2018

  25. arXiv:1805.11324  [pdf, other

    stat.ML cs.AI cs.LG

    Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning

    Authors: Tim Pearce, Nicolas Anastassacos, Mohamed Zaki, Andy Neely

    Abstract: The use of ensembles of neural networks (NNs) for the quantification of predictive uncertainty is widespread. However, the current justification is intuitive rather than analytical. This work proposes one minor modification to the normal ensembling methodology, which we prove allows the ensemble to perform Bayesian inference, hence converging to the corresponding Gaussian Process as both the total… ▽ More

    Submitted 2 July, 2018; v1 submitted 29 May, 2018; originally announced May 2018.