Showing 1–6 of 6 results for author: Taitler, A

  1. arXiv:2401.12243  [pdf, other]

    math.OC cs.LG cs.RO cs.SC eess.SY

    Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs

    Authors: Michael Gimelfarb, Ayal Taitler, Scott Sanner

    Abstract: We propose Constraint-Generation Policy Optimization (CGPO) for optimizing policy parameters within compact and interpretable policy classes for mixed discrete-continuous Markov Decision Processes (DC-MDPs). CGPO is not only able to provide bounded policy error guarantees over an infinite range of initial states for many DC-MDPs with expressive nonlinear dynamics, but it can also provably derive o…

    Submitted 20 January, 2024; originally announced January 2024.

  2. arXiv:2305.19291  [pdf, other]

    cs.LG cs.AI eess.SY

    Perimeter Control Using Deep Reinforcement Learning: A Model-free Approach towards Homogeneous Flow Rate Optimization

    Authors: Xiaocan Li, Ray Coden Mercurius, Ayal Taitler, Xiaoyu Wang, Mohammad Noaeen, Scott Sanner, Baher Abdulhai

    Abstract: Perimeter control maintains high traffic efficiency within protected regions by controlling transfer flows among regions to ensure that their traffic densities are below critical values. Existing approaches can be categorized as either model-based or model-free, depending on whether they rely on network transmission models (NTMs) and macroscopic fundamental diagrams (MFDs). Although model-based ap…

    Submitted 29 May, 2023; originally announced May 2023.

  3. arXiv:2304.03289  [pdf, other]

    cs.HC cs.SD eess.AS

    A2D: Anywhere Anytime Drumming

    Authors: Harel Yadid, Almog Algranti, Mark Levin, Ayal Taitler

    Abstract: The drum kit, which has existed for only about 100 years, is a popular instrument in many music genres such as pop, rock, and jazz. However, owning a kit is costly, both financially and in terms of space. Also, drums are more difficult to move around than other instruments, as they do not fit into a single bag. We propose a no-drums approach that uses only two sticks and a smar…

    Submitted 30 June, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  4. arXiv:2211.05939  [pdf, other]

    cs.AI

    pyRDDLGym: From RDDL to Gym Environments

    Authors: Ayal Taitler, Michael Gimelfarb, Jihwan Jeong, Sriram Gopalakrishnan, Martin Mladenov, Xiaotian Liu, Scott Sanner

    Abstract: We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym environments from RDDL declarative descriptions. The discrete time step evolution of variables in RDDL is described by conditional probability functions, which fits naturally into the Gym step scheme. Furthermore, since RDDL is a lifted description, the modification and scaling up of environments to support multiple entities…

    Submitted 5 February, 2024; v1 submitted 10 November, 2022; originally announced November 2022.

  5. arXiv:2104.01646  [pdf, other]

    cs.LG math.OC

    SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

    Authors: Joel Oren, Chana Ross, Maksym Lefarov, Felix Richter, Ayal Taitler, Zohar Feldman, Christian Daniel, Dotan Di Castro

    Abstract: We study combinatorial problems with real-world applications such as machine scheduling, routing, and assignment. We propose a method that combines Reinforcement Learning (RL) and planning. This method can be applied equally to both the offline and online variants of the combinatorial problem, in which the problem components (e.g., jobs in scheduling problems) are not known in advance, bu…

    Submitted 18 May, 2021; v1 submitted 4 April, 2021; originally announced April 2021.

  6. arXiv:1702.08074  [pdf, other]

    cs.LG cs.RO

    Learning Control for Air Hockey Striking using Deep Reinforcement Learning

    Authors: Ayal Taitler, Nahum Shimkin

    Abstract: We consider the task of learning control policies for a robotic mechanism striking a puck in an air hockey game. The control signal is a direct command to the robot's motors. We employ a model-free deep reinforcement learning framework to learn the motor skills of striking the puck accurately in order to score. We propose certain improvements to the standard learning scheme which make the deep Q…

    Submitted 25 April, 2017; v1 submitted 26 February, 2017; originally announced February 2017.

    Comments: Corrected typos; graphs added in results section