Showing 1–2 of 2 results for author: Lefarov, M

  1. arXiv:2110.07985

    cs.LG cs.RO eess.SY

    On-Policy Model Errors in Reinforcement Learning

    Authors: Lukas P. Fröhlich, Maksym Lefarov, Melanie N. Zeilinger, Felix Berkenkamp

    Abstract: Model-free reinforcement learning algorithms can compute policy gradients given sampled environment transitions, but require large amounts of data. In contrast, model-based methods can use the learned model to generate new data, but model errors and bias can render learning unstable or suboptimal. In this paper, we present a novel method that combines real-world data and a learned model in order t…

    Submitted 3 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Published at The Tenth International Conference on Learning Representations (ICLR 2022)

  2. arXiv:2104.01646

    cs.LG math.OC

    SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

    Authors: Joel Oren, Chana Ross, Maksym Lefarov, Felix Richter, Ayal Taitler, Zohar Feldman, Christian Daniel, Dotan Di Castro

    Abstract: We study combinatorial problems with real world applications such as machine scheduling, routing, and assignment. We propose a method that combines Reinforcement Learning (RL) and planning. This method can equally be applied to both the offline, as well as online, variants of the combinatorial problem, in which the problem components (e.g., jobs in scheduling problems) are not known in advance, bu…

    Submitted 18 May, 2021; v1 submitted 4 April, 2021; originally announced April 2021.