
Showing 1–5 of 5 results for author: Jusup, M

  1. arXiv:2406.16151  [pdf, other]

    cs.AI cs.LG eess.SY

    Monte Carlo Planning for Stochastic Control on Constrained Markov Decision Processes

    Authors: Larkin Liu, Shiqi Liu, Matej Jusup

    Abstract: In the world of stochastic control, especially in economics and engineering, Markov Decision Processes (MDPs) can effectively model various stochastic decision processes, from asset management to transportation optimization. These underlying MDPs, upon closer examination, often reveal a specifically constrained causal structure concerning the transition and reward dynamics. By exploiting this stru…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Working manuscript

    ACM Class: C.4

  2. arXiv:2306.17052  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Matej Jusup, Barna Pásztor, Tadeusz Janik, Kenan Zhang, Francesco Corman, Andreas Krause, Ilija Bogunovic

    Abstract: Many applications, e.g., in shared mobility, require coordinating a large number of agents. Mean-field reinforcement learning addresses the resulting scalability challenge by optimizing the policy of a representative agent interacting with the infinite population of identical agents instead of considering individual pairwise interactions. In this paper, we address an important generalization where…

    Submitted 27 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 23 pages, 26 figures, 6 tables

  3. arXiv:2302.04376  [pdf, other]

    cs.LG cs.AI stat.ML

    Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

    Authors: Volodymyr Tkachuk, Seyed Alireza Bakhtiari, Johannes Kirschner, Matej Jusup, Ilija Bogunovic, Csaba Szepesvári

    Abstract: A practical challenge in reinforcement learning are combinatorial action spaces that make planning computationally demanding. For example, in cooperative multi-agent reinforcement learning, a potentially large number of agents jointly optimize a global reward function, which leads to a combinatorial blow-up in the action space by the number of agents. As a minimal requirement, we assume access to…

    Submitted 8 February, 2023; originally announced February 2023.

  4. arXiv:2110.01866  [pdf, other]

    physics.soc-ph cs.SI nlin.AO q-bio.PE

    Social physics

    Authors: Marko Jusup, Petter Holme, Kiyoshi Kanazawa, Misako Takayasu, Ivan Romic, Zhen Wang, Suncana Gecek, Tomislav Lipic, Boris Podobnik, Lin Wang, Wei Luo, Tin Klanjscek, Jingfang Fan, Stefano Boccaletti, Matjaz Perc

    Abstract: Recent decades have seen a rise in the use of physics methods to study different societal phenomena. This development has been due to physicists venturing outside of their traditional domains of interest, but also due to scientists from other disciplines taking from physics the methods that have proven so successful throughout the 19th and the 20th century. Here we dub this field 'social physics'…

    Submitted 11 January, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: 359 pages, 78 figures; published in Physics Reports

    Journal ref: Phys. Rep. 948, 1-148 (2022)

  5. arXiv:2002.05106  [pdf, other]

    physics.soc-ph cs.SI q-bio.PE

    A novel route to cyclic dominance in voluntary social dilemmas

    Authors: Hao Guo, Zhao Song, Sunčana Geček, Xuelong Li, Marko Jusup, Matjaz Perc, Yamir Moreno, Stefano Boccaletti, Zhen Wang

    Abstract: Cooperation is the backbone of modern human societies, making it a priority to understand how successful cooperation-sustaining mechanisms operate. Cyclic dominance, a non-transitive setup comprising at least three strategies wherein the first strategy overrules the second which overrules the third which, in turn, overrules the first strategy, is known to maintain bio-diversity, drive competition…

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 9 pages, 6 figures, supplementary information

    Journal ref: J. R. Soc. Interface 17, 20190789 (2020)