Computer Science > Artificial Intelligence

arXiv:2401.15196 (cs)

[Submitted on 26 Jan 2024]

Title:Regularized Q-Learning with Linear Function Approximation

Authors:Jiachen Xi, Alfredo Garcia, Petar Momcilovic

Abstract:Several successful reinforcement learning algorithms make use of regularization to promote multi-modal policies that exhibit enhanced exploration and robustness. With functional approximation, the convergence properties of some of these algorithms (e.g. soft Q-learning) are not well understood. In this paper, we consider a single-loop algorithm for minimizing the projected Bellman error with finite time convergence guarantees in the case of linear function approximation. The algorithm operates on two scales: a slower scale for updating the target network of the state-action values, and a faster scale for approximating the Bellman backups in the subspace of the span of basis vectors. We show that, under certain assumptions, the proposed algorithm converges to a stationary point in the presence of Markovian noise. In addition, we provide a performance guarantee for the policies derived from the proposed algorithm.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2401.15196 [cs.AI]
	(or arXiv:2401.15196v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2401.15196

Submission history

From: Jiachen Xi [view email]
[v1] Fri, 26 Jan 2024 20:45:40 UTC (156 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2024-01

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Regularized Q-Learning with Linear Function Approximation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Regularized Q-Learning with Linear Function Approximation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators