Showing 1–1 of 1 results for author: Van Soest, A

  1. arXiv:1812.02690  [pdf, other]

    cs.LG cs.AI stat.ML

    Provably Efficient Maximum Entropy Exploration

    Authors: Elad Hazan, Sham M. Kakade, Karan Singh, Abby Van Soest

    Abstract: Suppose an agent is in a (possibly unknown) Markov Decision Process in the absence of a reward signal. What might we hope that the agent can efficiently learn to do? This work studies a broad class of objectives that are defined solely as functions of the state-visitation frequencies induced by the agent's behavior. For example, one natural, intrinsically defined, objective problem is for…

    Submitted 25 January, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: Updated experiment results; minor revisions in writing