Skip to main content

Showing 1–1 of 1 results for author: Wistub, M

  1. arXiv:1906.03979  [pdf, other

    cs.LG stat.ML

    Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit

    Authors: Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistub

    Abstract: We consider the stochastic multi-armed bandit problem and the contextual bandit problem with historical observations and pre-clustered arms. The historical observations can contain any number of instances for each arm, and the pre-clustering information is a fixed clustering of arms provided as part of the input. We develop a variety of algorithms which incorporate this offline information effecti… ▽ More

    Submitted 31 May, 2019; originally announced June 2019.

    Comments: IJCAI 2019, International Joint Conferences on Artificial Intelligence