Showing 1–1 of 1 results for author: Ramesh, A A

  1. arXiv:2405.03878 [pdf, other]

    cs.LG cs.AI

    Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

    Authors: Aditya A. Ramesh, Kenny Young, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Temporal credit assignment in reinforcement learning is challenging due to delayed and stochastic outcomes. Monte Carlo targets can bridge long delays between action and consequence but lead to high-variance targets due to stochasticity. Temporal difference (TD) learning uses bootstrapping to overcome variance but introduces a bias that can only be corrected through many iterations. TD($λ$) provid…

    Submitted 4 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: ICML 2024 version