Skip to main content

Showing 1–11 of 11 results for author: Stellato, B

  1. arXiv:2404.13831  [pdf, other

    math.OC cs.LG

    Data-Driven Performance Guarantees for Classical and Learned Optimizers

    Authors: Rajiv Sambharya, Bartolomeo Stellato

    Abstract: We introduce a data-driven approach to analyze the performance of continuous optimization algorithms using generalization guarantees from statistical learning theory. We study classical and learned optimizers to solve families of parametric optimization problems. We build generalization guarantees for classical optimizers, using a sample convergence bound, and for learned optimizers, using the Pro… ▽ More

    Submitted 21 May, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  2. arXiv:2404.01814  [pdf, other

    eess.SY cs.LG math.OC

    A neural network-based approach to hybrid systems identification for control

    Authors: Filippo Fabiani, Bartolomeo Stellato, Daniele Masti, Paul J. Goulart

    Abstract: We consider the problem of designing a machine learning-based model of an unknown dynamical system from a finite number of (state-input)-successor state data points, such that the model obtained is also suitable for optimal control design. We propose a specific neural network (NN) architecture that yields a hybrid system with piecewise-affine dynamics that is differentiable with respect to the net… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  3. arXiv:2403.14545  [pdf, other

    cs.RO eess.SY

    Learning Hierarchical Control For Multi-Agent Capacity-Constrained Systems

    Authors: Charlott Vallon, Alessandro Pinto, Bartolomeo Stellato, Francesco Borrelli

    Abstract: This paper introduces a novel data-driven hierarchical control scheme for managing a fleet of nonlinear, capacity-constrained autonomous agents in an iterative environment. We propose a control framework consisting of a high-level dynamic task assignment and routing layer and low-level motion planning and tracking layer. Each layer of the control hierarchy uses a data-driven Model Predictive Contr… ▽ More

    Submitted 10 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 16 pages, 4 figures

  4. arXiv:2402.09246  [pdf, other

    cs.RO cs.AI eess.SY math.OC

    Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots

    Authors: Haimin Hu, Gabriele Dragotto, Zixu Zhang, Kaiqu Liang, Bartolomeo Stellato, Jaime F. Fisac

    Abstract: We consider the multi-agent spatial navigation problem of computing the socially optimal order of play, i.e., the sequence in which the agents commit to their decisions, and its associated equilibrium in an N-player Stackelberg trajectory game. We model this problem as a mixed-integer optimization problem over the space of all possible Stackelberg games associated with the order of play's permutat… ▽ More

    Submitted 24 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Robotics: Science and Systems (RSS) 2024

  5. arXiv:2309.07835  [pdf, other

    math.OC cs.LG

    Learning to Warm-Start Fixed-Point Optimization Algorithms

    Authors: Rajiv Sambharya, Georgina Hall, Brandon Amos, Bartolomeo Stellato

    Abstract: We introduce a machine-learning framework to warm-start fixed-point optimization algorithms. Our architecture consists of a neural network mapping problem parameters to warm starts, followed by a predefined number of fixed-point iterations. We propose two loss functions designed to either minimize the fixed-point residual or the distance to a ground truth solution. In this way, the neural network… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  6. arXiv:2111.02552  [pdf, other

    cs.LG cs.AI cs.RO

    Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

    Authors: Tim Seyde, Igor Gilitschenski, Wilko Schwarting, Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

    Abstract: Reinforcement learning (RL) for continuous control typically employs distributions whose support covers the entire action space. In this work, we investigate the colloquially known phenomenon that trained agents often prefer actions at the boundaries of that space. We draw theoretical connections to the emergence of bang-bang behavior in optimal control, and provide extensive empirical evaluation… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

  7. arXiv:2107.10847  [pdf, other

    cs.LG math.OC

    Accelerating Quadratic Optimization with Reinforcement Learning

    Authors: Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato, Goran Banjac, Michael Luo, Francesco Borrelli, Joseph E. Gonzalez, Ion Stoica, Ken Goldberg

    Abstract: First-order methods for quadratic optimization such as OSQP are widely used for large-scale machine learning and embedded optimal control, where many related problems must be rapidly solved. These methods face two persistent challenges: manual hyperparameter tuning and convergence time to high-accuracy solutions. To address these, we explore how Reinforcement Learning (RL) can learn a policy to tu… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 25 pages, 7 figures. Code available at https://github.com/berkeleyautomation/rlqp

  8. arXiv:2107.08143  [pdf, other

    cs.RO

    CoCo: Online Mixed-Integer Control via Supervised Learning

    Authors: A. Cauligi, P. Culbertson, E. Schmerling, M. Schwager, B. Stellato, M. Pavone

    Abstract: Many robotics problems, from robot motion planning to object manipulation, can be modeled as mixed-integer convex programs (MICPs). However, state-of-the-art algorithms are still unable to solve MICPs for control problems quickly enough for online use and existing heuristics can typically only find suboptimal solutions that might degrade robot performance. In this work, we turn to data-driven meth… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  9. arXiv:2004.03736  [pdf, other

    cs.RO

    Learning Mixed-Integer Convex Optimization Strategies for Robot Planning and Control

    Authors: A. Cauligi, P. Culbertson, B. Stellato, D. Bertsimas, M. Schwager, M. Pavone

    Abstract: Mixed-integer convex programming (MICP) has seen significant algorithmic and hardware improvements with several orders of magnitude solve time speedups compared to 25 years ago. Despite these advances, MICP has been rarely applied to real-world robotic control because the solution times are still too slow for online applications. In this work, we present the CoCo (Combinatorial Offline, Convex Onl… ▽ More

    Submitted 11 April, 2022; v1 submitted 7 April, 2020; originally announced April 2020.

  10. arXiv:1912.09529  [pdf, other

    math.OC cs.LG

    Learning Convex Optimization Control Policies

    Authors: Akshay Agrawal, Shane Barratt, Stephen Boyd, Bartolomeo Stellato

    Abstract: Many control policies used in various applications determine the input or action by solving a convex optimization problem that depends on the current state and some parameters. Common examples of such convex optimization control policies (COCPs) include the linear quadratic regulator (LQR), convex model predictive control (MPC), and convex control-Lyapunov or approximate dynamic programming (ADP)… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: Authors listed in alphabetical order

  11. arXiv:1907.02206  [pdf, other

    math.OC cs.LG

    Online Mixed-Integer Optimization in Milliseconds

    Authors: Dimitris Bertsimas, Bartolomeo Stellato

    Abstract: We propose a method to solve online mixed-integer optimization (MIO) problems at very high speed using machine learning. By exploiting the repetitive nature of online optimization, we are able to greatly speedup the solution time. Our approach encodes the optimal solution into a small amount of information denoted as strategy using the Voice of Optimization framework proposed in [BS21]. In this wa… ▽ More

    Submitted 22 March, 2021; v1 submitted 3 July, 2019; originally announced July 2019.