Skip to main content

Showing 1–7 of 7 results for author: Boone, V

  1. arXiv:2406.01234  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Achieving Tractable Minimax Optimal Regret in Average Reward MDPs

    Authors: Victor Boone, Zihan Zhang

    Abstract: In recent years, significant attention has been directed towards learning average-reward Markov Decision Processes (MDPs). However, existing algorithms either suffer from sub-optimal regret guarantees or computational inefficiencies. In this paper, we present the first tractable algorithm with minimax optimal regret of $\widetilde{\mathrm{O}}(\sqrt{\mathrm{sp}(h^*) S A T})$, where… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2402.16059  [pdf, other

    stat.ML cs.LG

    Gradient-enhanced deep Gaussian processes for multifidelity modelling

    Authors: Viv Bone, Chris van der Heide, Kieran Mackle, Ingo H. J. Jahn, Peter M. Dower, Chris Manzie

    Abstract: Multifidelity models integrate data from multiple sources to produce a single approximator for the underlying process. Dense low-fidelity samples are used to reduce interpolation error, while sparse high-fidelity samples are used to compensate for bias or noise in the low-fidelity samples. Deep Gaussian processes (GPs) are attractive for multifidelity modelling as they are non-parametric, robust t… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  3. arXiv:2311.18437  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    The Sliding Regret in Stochastic Bandits: Discriminating Index and Randomized Policies

    Authors: Victor Boone

    Abstract: This paper studies the one-shot behavior of no-regret algorithms for stochastic bandits. Although many algorithms are known to be asymptotically optimal with respect to the expected regret, over a single run, their pseudo-regret seems to follow one of two tendencies: it is either smooth or bumpy. To measure this tendency, we introduce a new notion: the sliding regret, that measures the worst pseud… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 31 pages

  4. arXiv:2311.02407  [pdf, other

    cs.GT cs.LG math.OC

    The equivalence of dynamic and strategic stability under regularized learning in games

    Authors: Victor Boone, Panayotis Mertikopoulos

    Abstract: In this paper, we examine the long-run behavior of regularized, no-regret learning in finite games. A well-known result in the field states that the empirical frequencies of no-regret play converge to the game's set of coarse correlated equilibria; however, our understanding of how the players' actual strategies evolve over time is much more limited - and, in many cases, non-existent. This issue i… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 31 pages, 8 figures, 2 tables

    MSC Class: Primary 91A10; 91A26; secondary 68Q32; 62L20

  5. arXiv:2304.08048  [pdf, ps, other

    eess.SY math.OC

    When do discounted-optimal policies also optimize the gain?

    Authors: Victor Boone

    Abstract: In this technical note, we establish an upper-bound on the threshold on the discount factor starting from which all discounted-optimal deterministic policies are gain-optimal, that we prove to be tight on an example. To address computability issues of that theoretical threshold, we provide a weaker bound which is tractable on ergodic MDPs in polynomial time.

    Submitted 17 April, 2023; originally announced April 2023.

  6. arXiv:2108.12127  [pdf, other

    eess.SY

    Towards model predictive control of supercritical CO2 cycles

    Authors: Viv Bone, Michael Kearney, Ingo Jahn

    Abstract: Control of non-condensing non-ideal-gas power cycles is challenging because their output power dynamics depend on complex system interactions, non-ideal-gas effects complicate turbomachinery behavior, and state constraints must be respected. This article presents a control methodology for these systems, comprising a control modeling approach and model predictive control (MPC) strategy. This method… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: 26 pages, 11 figures

  7. arXiv:1910.01334  [pdf, other

    cs.GT math.DS q-bio.PE

    From Darwin to Poincaré and von Neumann: Recurrence and Cycles in Evolutionary and Algorithmic Game Theory

    Authors: Victor Boone, Georgios Piliouras

    Abstract: Replicator dynamics, the continuous-time analogue of Multiplicative Weights Updates, is the main dynamic in evolutionary game theory. In simple evolutionary zero-sum games, such as Rock-Paper-Scissors, replicator dynamic is periodic \cite{zeeman1980population}, however, its behavior in higher dimensions is not well understood. We provide a complete characterization of its behavior in zero-sum evol… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.