Skip to main content

Showing 201–250 of 262 results for author: Pavone, M

  1. arXiv:1812.11315  [pdf, other

    cs.RO eess.SY

    On Infusing Reachability-Based Safety Assurance within Probabilistic Planning Frameworks for Human-Robot Vehicle Interactions

    Authors: Karen Leung, Edward Schmerling, Mo Chen, John Talbot, J. Christian Gerdes, Marco Pavone

    Abstract: Action anticipation, intent prediction, and proactive behavior are all desirable characteristics for autonomous driving policies in interactive scenarios. Paramount, however, is ensuring safety on the road --- a key challenge in doing so is accounting for uncertainty in human driver actions without unduly impacting planner performance. This paper introduces a minimally-interventional safety contro… ▽ More

    Submitted 29 December, 2018; originally announced December 2018.

    Comments: Presented at the International Symposium on Experimental Robotics, Buenos Aires, Argentina, 2018

  2. arXiv:1811.06590  [pdf, other

    eess.SY

    Reduced Order Model Predictive Control For Setpoint Tracking

    Authors: Joseph Lorenzetti, Benoit Landry, Sumeet Singh, Marco Pavone

    Abstract: Despite the success of model predictive control (MPC), its application to high-dimensional systems, such as flexible structures and coupled fluid/rigid-body systems, remains a largely open challenge due to excessive computational complexity. A promising solution approach is to leverage reduced order models for designing the model predictive controller. In this paper we present a reduced order MPC… ▽ More

    Submitted 2 May, 2019; v1 submitted 15 November, 2018; originally announced November 2018.

  3. arXiv:1810.05993  [pdf, other

    cs.RO cs.HC cs.LG

    The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs

    Authors: Boris Ivanovic, Marco Pavone

    Abstract: Developing safe human-robot interaction systems is a necessary step towards the widespread integration of autonomous agents in society. A key component of such systems is the ability to reason about the many potential futures (e.g. trajectories) of other agents in the scene. Towards this end, we present the Trajectron, a graph-structured model that predicts many potential future trajectories of mu… ▽ More

    Submitted 23 August, 2019; v1 submitted 14 October, 2018; originally announced October 2018.

    Comments: IEEE/CVF International Conference on Computer Vision (ICCV) 2019 -- 10 pages, 10 figures, 2 tables

  4. arXiv:1808.04468  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Risk-Sensitive Generative Adversarial Imitation Learning

    Authors: Jonathan Lacotte, Mohammad Ghavamzadeh, Yinlam Chow, Marco Pavone

    Abstract: We study risk-sensitive imitation learning where the agent's goal is to perform at least as well as the expert in terms of a risk profile. We first formulate our risk-sensitive imitation learning setting. We consider the generative adversarial approach to imitation learning (GAIL) and derive an optimization problem for our formulation, which we call it risk-sensitive GAIL (RS-GAIL). We then derive… ▽ More

    Submitted 23 December, 2018; v1 submitted 13 August, 2018; originally announced August 2018.

  5. arXiv:1808.00649  [pdf, other

    eess.SY cs.RO math.OC

    Robust Tracking with Model Mismatch for Fast and Safe Planning: an SOS Optimization Approach

    Authors: Sumeet Singh, Mo Chen, Sylvia L. Herbert, Claire J. Tomlin, Marco Pavone

    Abstract: In the pursuit of real-time motion planning, a commonly adopted practice is to compute a trajectory by running a planning algorithm on a simplified, low-dimensional dynamical model, and then employ a feedback tracking controller that tracks such a trajectory by accounting for the full, high-dimensional system dynamics. While this strategy of planning with model mismatch generally yields fast compu… ▽ More

    Submitted 28 July, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: Presented at WAFR 2018; final version v2 -- fixed typos

  6. arXiv:1808.00113  [pdf, other

    eess.SY cs.LG cs.RO math.OC

    Learning Stabilizable Dynamical Systems via Control Contraction Metrics

    Authors: Sumeet Singh, Vikas Sindhwani, Jean-Jacques E. Slotine, Marco Pavone

    Abstract: We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key idea is to develop a new control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, which guarantees that the learned system can be accompanied by a robust controller capable of stabilizing any open-loop trajectory that the system may… ▽ More

    Submitted 10 November, 2018; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: To appear at WAFR 2018. v2: re-structured Sections 3 & 4 to improve clarity; expanded discussion on limitations & future work in Section 5; added details on training & validation, significantly expanded experiments

  7. arXiv:1807.11553  [pdf, other

    eess.SY cs.RO math.OC

    Reach-Avoid Problems via Sum-of-Squares Optimization and Dynamic Programming

    Authors: Benoit Landry, Mo Chen, Scott Hemley, Marco Pavone

    Abstract: Reach-avoid problems involve driving a system to a set of desirable configurations while keeping it away from undesirable ones. Providing mathematical guarantees for such scenarios is challenging but have numerous potential practical applications. Due to the challenges, analysis of reach-avoid problems involves making trade-offs between generality of system dynamics, generality of problem setups,… ▽ More

    Submitted 30 July, 2018; originally announced July 2018.

    Comments: International Conference on Intelligent Robots & Systems (IROS), 2018

  8. arXiv:1807.10366  [pdf, other

    cs.RO

    Robot Motion Planning in Learned Latent Spaces

    Authors: Brian Ichter, Marco Pavone

    Abstract: This paper presents Latent Sampling-based Motion Planning (L-SBMP), a methodology towards computing motion plans for complex robotic systems by learning a plannable latent representation. Recent works in control of robotic systems have effectively leveraged local, low-dimensional embeddings of high-dimensional dynamics. In this paper we combine these recent advances with techniques from sampling-b… ▽ More

    Submitted 6 November, 2018; v1 submitted 26 July, 2018; originally announced July 2018.

  9. arXiv:1807.08912  [pdf, other

    cs.RO cs.LG

    Meta-Learning Priors for Efficient Online Bayesian Regression

    Authors: James Harrison, Apoorva Sharma, Marco Pavone

    Abstract: Gaussian Process (GP) regression has seen widespread use in robotics due to its generality, simplicity of use, and the utility of Bayesian predictions. The predominant implementation of GP regression is a nonparameteric kernel-based approach, as it enables fitting of arbitrary nonlinear functions. However, this approach suffers from two main drawbacks: (1) it is computationally inefficient, as com… ▽ More

    Submitted 30 October, 2018; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018

  10. arXiv:1806.06161  [pdf, other

    cs.RO cs.LG eess.SY

    BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning

    Authors: Boris Ivanovic, James Harrison, Apoorva Sharma, Mo Chen, Marco Pavone

    Abstract: Model-free Reinforcement Learning (RL) offers an attractive approach to learn control policies for high-dimensional systems, but its relatively poor sample complexity often forces training in simulated environments. Even in simulation, goal-directed tasks whose natural reward function is sparse remain intractable for state-of-the-art model-free algorithms for continuous control. The bottleneck in… ▽ More

    Submitted 16 September, 2018; v1 submitted 15 June, 2018; originally announced June 2018.

  11. arXiv:1804.11278  [pdf, other

    eess.SY cs.RO

    On the Interaction between Autonomous Mobility-on-Demand and Public Transportation Systems

    Authors: Mauro Salazar, Federico Rossi, Maximilian Schiffer, Christopher H. Onder, Marco Pavone

    Abstract: In this paper we study models and coordination policies for intermodal Autonomous Mobility-on-Demand (AMoD), wherein a fleet of self-driving vehicles provides on-demand mobility jointly with public transit. Specifically, we first present a network flow model for intermodal AMoD, where we capture the coupling between AMoD and public transit and the goal is to maximize social welfare. Second, levera… ▽ More

    Submitted 5 September, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: 9 pages, 8 figures, ITSC 2018

  12. arXiv:1804.11074  [pdf, other

    eess.SY

    Stochastic Model Predictive Control for Autonomous Mobility on Demand

    Authors: Matthew Tsao, Ramon Iglesias, Marco Pavone

    Abstract: This paper presents a stochastic, model predictive control (MPC) algorithm that leverages short-term probabilistic forecasts for dispatching and rebalancing Autonomous Mobility-on-Demand systems (AMoD, i.e. fleets of self-driving vehicles). We first present the core stochastic optimization problem in terms of a time-expanded network flow model. Then, to ameliorate its tractability, we present two… ▽ More

    Submitted 4 May, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: Submitting to the IEEE International Conference on Intelligent Transportation Systems 2018

  13. arXiv:1804.05804  [pdf, other

    cs.RO cs.AI

    Safe Motion Planning in Unknown Environments: Optimality Benchmarks and Tractable Policies

    Authors: Lucas Janson, Tommy Hu, Marco Pavone

    Abstract: This paper addresses the problem of planning a safe (i.e., collision-free) trajectory from an initial state to a goal region when the obstacle space is a-priori unknown and is incrementally revealed online, e.g., through line-of-sight perception. Despite its ubiquitous nature, this formulation of motion planning has received relatively little theoretical investigation, as opposed to the setup wher… ▽ More

    Submitted 16 April, 2018; originally announced April 2018.

  14. arXiv:1803.05464  [pdf, ps, other

    cs.RO cs.MA

    Review of Multi-Agent Algorithms for Collective Behavior: a Structural Taxonomy

    Authors: Federico Rossi, Saptarshi Bandyopadhyay, Michael Wolf, Marco Pavone

    Abstract: In this paper, we review multi-agent collective behavior algorithms in the literature and classify them according to their underlying mathematical structure. For each mathematical technique, we identify the multi-agent coordination tasks it can be applied to, and we analyze its scalability, bandwidth use, and demonstrated maturity. We highlight how versatile techniques such as artificial potential… ▽ More

    Submitted 14 March, 2018; originally announced March 2018.

    Comments: Six pages, one table. To be presented at NAASS 2018

  15. arXiv:1803.02015  [pdf, other

    cs.RO cs.HC

    Generative Modeling of Multimodal Multi-Human Behavior

    Authors: Boris Ivanovic, Edward Schmerling, Karen Leung, Marco Pavone

    Abstract: This work presents a methodology for modeling and predicting human behavior in settings with N humans interacting in highly multimodal scenarios (i.e. where there are many possible highly-distinct futures). A motivating example includes robots interacting with humans in crowded environments, such as self-driving cars operating alongside human-driven vehicles or human-robot collaborative bin packin… ▽ More

    Submitted 26 July, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018 -- 8 pages, 5 figures

  16. arXiv:1711.10055  [pdf, other

    cs.AI cs.LG cs.RO

    Risk-sensitive Inverse Reinforcement Learning via Semi- and Non-Parametric Methods

    Authors: Sumeet Singh, Jonathan Lacotte, Anirudha Majumdar, Marco Pavone

    Abstract: The literature on Inverse Reinforcement Learning (IRL) typically assumes that humans take actions in order to minimize the expected value of a cost function, i.e., that humans are risk neutral. Yet, in practice, humans are often far from being risk neutral. To fill this gap, the objective of this paper is to devise a framework for risk-sensitive IRL in order to explicitly account for a human's ris… ▽ More

    Submitted 22 March, 2018; v1 submitted 27 November, 2017; originally announced November 2017.

    Comments: Submitted to International Journal of Robotics Research; Revision 1: (i) Clarified minor technical points; (ii) Revised proof for Theorem 3 to hold under weaker assumptions; (iii) Added additional figures and expanded discussions to improve readability

  17. arXiv:1710.11040  [pdf, other

    cs.RO cs.AI eess.SY math.OC

    How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics

    Authors: Anirudha Majumdar, Marco Pavone

    Abstract: Endowing robots with the capability of assessing risk and making risk-aware decisions is widely considered a key step toward ensuring safety for robots operating under uncertainty. But, how should a robot quantify risk? A natural and common approach is to consider the framework whereby costs are assigned to stochastic outcomes - an assignment captured by a cost random variable. Quantifying risk th… ▽ More

    Submitted 1 November, 2017; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: Extended version of paper published in International Symposium on Robotics Research (ISRR) 2017

  18. arXiv:1710.09483  [pdf, other

    cs.RO cs.LG

    Multimodal Probabilistic Model-Based Planning for Human-Robot Interaction

    Authors: Edward Schmerling, Karen Leung, Wolf Vollprecht, Marco Pavone

    Abstract: This paper presents a method for constructing human-robot interaction policies in settings where multimodality, i.e., the possibility of multiple highly distinct futures, plays a critical role in decision making. We are motivated in this work by the example of traffic weaving, e.g., at highway on-ramps/off-ramps, where entering and exiting cars must swap lanes in a short distance---a challenging n… ▽ More

    Submitted 25 October, 2017; originally announced October 2017.

  19. arXiv:1709.07032  [pdf, other

    cs.RO cs.MA eess.SY stat.AP

    Data-Driven Model Predictive Control of Autonomous Mobility-on-Demand Systems

    Authors: Ramon Iglesias, Federico Rossi, Kevin Wang, David Hallac, Jure Leskovec, Marco Pavone

    Abstract: The goal of this paper is to present an end-to-end, data-driven framework to control Autonomous Mobility-on-Demand systems (AMoD, i.e. fleets of self-driving vehicles). We first model the AMoD system using a time-expanded network, and present a formulation that computes the optimal rebalancing strategy (i.e., preemptive repositioning) and the minimum feasible fleet size for a given travel demand.… ▽ More

    Submitted 20 September, 2017; originally announced September 2017.

    Comments: Submitted to the International Conference on Robotics and Automation 2018

  20. arXiv:1709.05448  [pdf, other

    cs.RO cs.LG

    Learning Sampling Distributions for Robot Motion Planning

    Authors: Brian Ichter, James Harrison, Marco Pavone

    Abstract: A defining feature of sampling-based motion planning is the reliance on an implicit representation of the state space, which is enabled by a set of probing samples. Traditionally, these samples are drawn either probabilistically or deterministically to uniformly cover the state space. Yet, the motion of many robotic systems is often restricted to "small" regions of the state space, due to, for exa… ▽ More

    Submitted 11 March, 2019; v1 submitted 15 September, 2017; originally announced September 2017.

    Comments: International Conference on Robotics and Automation (ICRA), 2018

  21. arXiv:1709.04906  [pdf, other

    eess.SY cs.MA cs.RO

    On the interaction between Autonomous Mobility-on-Demand systems and the power network: models and coordination algorithms

    Authors: Federico Rossi, Ramon Iglesias, Mahnoosh Alizadeh, Marco Pavone

    Abstract: We study the interaction between a fleet of electric, self-driving vehicles servicing on-demand transportation requests (referred to as Autonomous Mobility-on-Demand, or AMoD, system) and the electric power network. We propose a model that captures the coupling between the two systems stemming from the vehicles' charging requirements and captures time-varying customer demand and power generation c… ▽ More

    Submitted 8 June, 2019; v1 submitted 14 September, 2017; originally announced September 2017.

    Comments: Extended version of the paper presented at Robotics: Science and Systems XIV and accepted by TCNS. In Version 4, the body of the paper is largely rewritten for clarity and consistency, and new numerical simulations are presented. All source code is available (MIT) at https://dx.doi.org/10.5281/zenodo.3241651

  22. arXiv:1707.04674  [pdf, other

    cs.RO

    ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems

    Authors: James Harrison, Animesh Garg, Boris Ivanovic, Yuke Zhu, Silvio Savarese, Li Fei-Fei, Marco Pavone

    Abstract: Model-free policy learning has enabled robust performance of complex tasks with relatively simple algorithms. However, this simplicity comes at the cost of requiring an Oracle and arguably very poor sample complexity. This renders such methods unsuitable for physical systems. Variants of model-based methods address this problem through the use of simulators, however, this gives rise to the problem… ▽ More

    Submitted 8 November, 2017; v1 submitted 14 July, 2017; originally announced July 2017.

    Comments: International Symposium on Robotics Research (ISRR), 2017

  23. arXiv:1705.02408  [pdf, other

    cs.RO

    Perception-Aware Motion Planning via Multiobjective Search on GPUs

    Authors: Brian Ichter, Benoit Landry, Edward Schmerling, Marco Pavone

    Abstract: In this paper we describe a framework towards computing well-localized, robust motion plans through the perception-aware motion planning problem, whereby we seek a low-cost motion plan subject to a separate constraint on perception localization quality. To solve this problem we introduce the Multiobjective Perception-Aware Planning (MPAP) algorithm which explores the state space via a multiobjecti… ▽ More

    Submitted 6 December, 2017; v1 submitted 5 May, 2017; originally announced May 2017.

  24. arXiv:1705.02403  [pdf, other

    cs.RO

    Group Marching Tree: Sampling-Based Approximately Optimal Motion Planning on GPUs

    Authors: Brian Ichter, Edward Schmerling, Marco Pavone

    Abstract: This paper presents a novel approach, named the Group Marching Tree (GMT*) algorithm, to planning on GPUs at rates amenable to application within control loops, allowing planning in real-world settings via repeated computation of near-optimal plans. GMT*, like the Fast Marching Tree (FMT) algorithm, explores the state space with a "lazy" dynamic programming recursion on a set of samples to grow a… ▽ More

    Submitted 5 May, 2017; originally announced May 2017.

  25. arXiv:1703.01029  [pdf, other

    math.OC eess.SY

    A Framework for Time-Consistent, Risk-Sensitive Model Predictive Control: Theory and Algorithms

    Authors: Sumeet Singh, Yin-Lam Chow, Anirudha Majumdar, Marco Pavone

    Abstract: In this paper we present a framework for risk-sensitive model predictive control (MPC) of linear systems affected by stochastic multiplicative uncertainty. Our key innovation is to consider a time-consistent, dynamic risk evaluation of the cumulative cost as the objective function to be minimized. This framework is axiomatically justified in terms of time-consistency of risk assessments, is amenab… ▽ More

    Submitted 25 April, 2018; v1 submitted 2 March, 2017; originally announced March 2017.

    Comments: Submitted to IEEE Transactions on Automatic Control. arXiv admin note: text overlap with arXiv:1511.06981; v2: clarified exposition, reduced review of dynamic risk theory, updated simulations with computation time

  26. arXiv:1612.03232  [pdf, other

    cs.RO

    The Team Surviving Orienteers Problem: Routing Robots in Uncertain Environments with Survival Constraints

    Authors: Stefan Jorgensen, Robert H. Chen, Mark B. Milam, Marco Pavone

    Abstract: In this paper we study the following multi-robot coordination problem: given a graph, where each edge is weighted by the probability of surviving while traversing it, find a set of paths for $K$ robots that maximizes the expected number of nodes collectively visited, subject to constraints on the probability that each robot survives to its destination. We call this problem the Team Surviving Orien… ▽ More

    Submitted 9 December, 2016; originally announced December 2016.

    Comments: 8 pages, 6 figures. Submitted to the IEEE International Conference on Robotic Computing, 2017

  27. arXiv:1609.05399  [pdf, other

    cs.RO

    Evaluating Trajectory Collision Probability through Adaptive Importance Sampling for Safe Motion Planning

    Authors: Edward Schmerling, Marco Pavone

    Abstract: This paper presents a tool for addressing a key component in many algorithms for planning robot trajectories under uncertainty: evaluation of the safety of a robot whose actions are governed by a closed-loop feedback policy near a nominal planned trajectory. We describe an adaptive importance sampling Monte Carlo framework that enables the evaluation of a given control policy for satisfaction of a… ▽ More

    Submitted 1 June, 2017; v1 submitted 17 September, 2016; originally announced September 2016.

  28. arXiv:1609.02546   

    eess.SY

    Congestion-Aware Randomized Routing in Autonomous Mobility-on-Demand Systems

    Authors: Federico Rossi, Rick Zhang, Marco Pavone

    Abstract: In this paper we study the routing and rebalancing problem for a fleet of autonomous vehicles providing on-demand transportation within a congested urban road network (that is, a road network where traffic speed depends on vehicle density). We show that the congestion-free routing and rebalancing problem is NP-hard and provide a randomized algorithm which finds a low-congestion solution to the rou… ▽ More

    Submitted 15 September, 2016; v1 submitted 8 September, 2016; originally announced September 2016.

    Comments: This paper has been withdrawn by the authors due to an error in the proofs of Theorem 3.4 (bound on the probability of violating the congestion constraints) and Lemma 3.5 (approximation factor of the algorithm)

  29. arXiv:1607.06886  [pdf, other

    cs.RO

    Real-Time Stochastic Kinodynamic Motion Planning via Multiobjective Search on GPUs

    Authors: Brian Ichter, Edward Schmerling, Ali-akbar Agha-mohammadi, Marco Pavone

    Abstract: In this paper we present the PUMP (Parallel Uncertainty-aware Multiobjective Planning) algorithm for addressing the stochastic kinodynamic motion planning problem, whereby one seeks a low-cost, dynamically-feasible motion plan subject to a constraint on collision probability (CP). To ensure exhaustive evaluation of candidate motion plans (as needed to tradeoff the competing objectives of performan… ▽ More

    Submitted 23 February, 2017; v1 submitted 22 July, 2016; originally announced July 2016.

  30. arXiv:1607.04357  [pdf, other

    eess.SY cs.MA

    A BCMP Network Approach to Modeling and Controlling Autonomous Mobility-on-Demand Systems

    Authors: Ramon Iglesias, Federico Rossi, Rick Zhang, Marco Pavone

    Abstract: In this paper we present a queueing network approach to the problem of routing and rebalancing a fleet of self-driving vehicles providing on-demand mobility within a capacitated road network. We refer to such systems as autonomous mobility-on-demand systems, or AMoD. We first cast an AMoD system into a closed, multi-class BCMP queueing network model. Second, we present analysis tools that allow th… ▽ More

    Submitted 26 March, 2017; v1 submitted 14 July, 2016; originally announced July 2016.

    Comments: 18 pages, 3 figures. In preparation for conference submission. In version 2, clarity is improved and some typos are removed with no changes to the technical content of the paper

  31. arXiv:1607.01478  [pdf, other

    cs.RO cs.AI eess.SY

    Mixed Strategy for Constrained Stochastic Optimal Control

    Authors: Masahiro Ono, Mahmoud El Chamie, Marco Pavone, Behcet Acikmese

    Abstract: Choosing control inputs randomly can result in a reduced expected cost in optimal control problems with stochastic constraints, such as stochastic model predictive control (SMPC). We consider a controller with initial randomization, meaning that the controller randomly chooses from K+1 control sequences at the beginning (called K-randimization).It is known that, for a finite-state, finite-action M… ▽ More

    Submitted 6 July, 2016; originally announced July 2016.

    Comments: 11 pages. 9 figures.Preliminary version of a working journal paper

  32. Routing Autonomous Vehicles in Congested Transportation Networks: Structural Properties and Coordination Algorithms

    Authors: Rick Zhang, Federico Rossi, Marco Pavone

    Abstract: This paper considers the problem of routing and rebalancing a shared fleet of autonomous (i.e., self-driving) vehicles providing on-demand mobility within a capacitated transportation network, where congestion might disrupt throughput. We model the problem within a network flow framework and show that under relatively mild assumptions the rebalancing vehicles, if properly coordinated, do not lead… ▽ More

    Submitted 29 July, 2016; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: 11 pages, 3 figures. Presented at Robotics: Science and Systems (RSS) 2016. Version 2 is the extended version of the final submission included in the conference proceedings. The title of the initial submission was modified in deference to RSS's double-blind submission process: in this version, the title matches the published paper

  33. arXiv:1602.05130  [pdf, other

    math.OC

    Risk Aversion in Finite Markov Decision Processes Using Total Cost Criteria and Average Value at Risk

    Authors: Stefano Carpin, Yin-Lam Chow, Marco Pavone

    Abstract: In this paper we present an algorithm to compute risk averse policies in Markov Decision Processes (MDP) when the total cost criterion is used together with the average value at risk (AVaR) metric. Risk averse policies are needed when large deviations from the expected behavior may have detrimental effects, and conventional MDP algorithms usually ignore this aspect. We provide conditions for the s… ▽ More

    Submitted 16 February, 2016; originally announced February 2016.

  34. arXiv:1602.04762  [pdf, other

    cs.RO

    Optimized and Trusted Collision Avoidance for Unmanned Aerial Vehicles using Approximate Dynamic Programming (Technical Report)

    Authors: Zachary N. Sunberg, Mykel J. Kochenderfer, Marco Pavone

    Abstract: Safely integrating unmanned aerial vehicles into civil airspace is contingent upon development of a trustworthy collision avoidance system. This paper proposes an approach whereby a parameterized resolution logic that is considered trusted for a given range of its parameters is adaptively tuned online. Specifically, to address the potential conservatism of the resolution logic with static paramete… ▽ More

    Submitted 18 February, 2016; v1 submitted 15 February, 2016; originally announced February 2016.

    Comments: An abbreviated version was submitted to ICRA 2016

  35. Fast, Safe, and Propellant-Efficient Spacecraft Planning under Clohessy-Wiltshire-Hill Dynamics

    Authors: Joseph A. Starek, Edward Schmerling, Gabriel D. Maher, Brent W. Barbee, Marco Pavone

    Abstract: This paper presents a sampling-based motion planning algorithm for real-time and propellant-optimized autonomous spacecraft trajectory generation in near-circular orbits. Specifically, this paper leverages recent algorithmic advances in the field of robot motion planning to the problem of impulsively-actuated, propellant-optimized rendezvous and proximity operations under the Clohessy-Wiltshire-Hi… ▽ More

    Submitted 31 December, 2015; originally announced January 2016.

    Comments: Submitted to the AIAA Journal of Guidance, Control, and Dynamics (JGCD) special issue entitled "Computational Guidance and Control". This submission is the journal version corresponding to the conference manuscript "Real-Time, Propellant-Efficient Spacecraft Planning under Clohessy-Wiltshire-Hill Dynamics" accepted to the 2016 IEEE Aerospace Conference in Big Sky, MT, USA

  36. arXiv:1512.01629  [pdf, ps, other

    cs.AI cs.LG math.OC

    Risk-Constrained Reinforcement Learning with Percentile Risk Criteria

    Authors: Yinlam Chow, Mohammad Ghavamzadeh, Lucas Janson, Marco Pavone

    Abstract: In many sequential decision-making problems one is interested in minimizing an expected cumulative cost while taking into account \emph{risk}, i.e., increased awareness of events of small probability and high consequences. Accordingly, the objective of this paper is to present efficient reinforcement learning algorithms for risk-constrained Markov decision processes (MDPs), where risk is represent… ▽ More

    Submitted 6 April, 2017; v1 submitted 5 December, 2015; originally announced December 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1406.3339

  37. arXiv:1511.06982  [pdf, other

    cs.RO math.OC

    Trading Safety Versus Performance: Rapid Deployment of Robotic Swarms with Robust Performance Constraints

    Authors: Yin-Lam Chow, Marco Pavone, Brian M. Sadler, Stefano Carpin

    Abstract: In this paper we consider a stochastic deployment problem, where a robotic swarm is tasked with the objective of positioning at least one robot at each of a set of pre-assigned targets while meeting a temporal deadline. Travel times and failure rates are stochastic but related, inasmuch as failure rates increase with speed. To maximize chances of success while meeting the deadline, a control strat… ▽ More

    Submitted 22 November, 2015; originally announced November 2015.

  38. arXiv:1511.06981  [pdf, other

    math.OC

    A Framework for Time-Consistent, Risk-Averse Model Predictive Control: Theory and Algorithms

    Authors: Yin-Lam Chow, Marco Pavone

    Abstract: In this paper we present a framework for risk-averse model predictive control (MPC) of linear systems affected by multiplicative uncertainty. Our key innovation is to consider time-consistent, dynamic risk metrics as objective functions to be minimized. This framework is axiomatically justified in terms of time-consistency of risk preferences, is amenable to dynamic optimization, and is unifying i… ▽ More

    Submitted 22 November, 2015; originally announced November 2015.

  39. arXiv:1511.06980  [pdf, other

    math.OC

    Stochastic Optimal Control With Dynamic, Time-Consistent Risk Constraints

    Authors: Yin-Lam Chow, Marco Pavone

    Abstract: In this paper we present a dynamic programing approach to stochastic optimal control problems with dynamic, time-consistent risk constraints. Constrained stochastic optimal control problems, which naturally arise when one has to consider multiple objectives, have been extensively investigated in the past 20 years, however, in most formulations, the constraints are formulated as either risk-neutral… ▽ More

    Submitted 22 November, 2015; originally announced November 2015.

    Comments: arXiv admin note: text overlap with arXiv:1501.02024, arXiv:1503.07461

  40. arXiv:1511.02547  [pdf, other

    eess.SY cs.MA cs.RO

    Decentralized Algorithms for 3D Symmetric Formations in Robotic Networks: a Contraction Theory Approach

    Authors: Sumeet Singh, Edward Schmerling, Marco Pavone

    Abstract: This paper presents decentralized algorithms for formation control of multiple robots in three dimensions. Specifically, we leverage the mathematical properties of cyclic pursuit along with results from contraction and partial contraction theory to design decentralized control algorithms that ensure global convergence to symmetric formations. We first consider regular polygon formations as a base… ▽ More

    Submitted 8 November, 2015; originally announced November 2015.

    Comments: Submitted to IEEE Transactions in Robotics

  41. arXiv:1509.08932  [pdf, ps, other

    cs.AI math.OC

    Two Phase $Q-$learning for Bidding-based Vehicle Sharing

    Authors: Yinlam Chow, Jia Yuan Yu, Marco Pavone

    Abstract: We consider one-way vehicle sharing systems where customers can rent a car at one station and drop it off at another. The problem we address is to optimize the distribution of cars, and quality of service, by pricing rentals appropriately. We propose a bidding approach that is inspired from auctions and takes into account the significant uncertainty inherent in the problem data (e.g., pick-up and… ▽ More

    Submitted 19 October, 2015; v1 submitted 29 September, 2015; originally announced September 2015.

    Comments: Submitted to AISTATS 2016

  42. Model Predictive Control of Autonomous Mobility-on-Demand Systems

    Authors: Rick Zhang, Federico Rossi, Marco Pavone

    Abstract: In this paper we present a model predictive control (MPC) approach to optimize vehicle scheduling and routing in an autonomous mobility-on-demand (AMoD) system. In AMoD systems, robotic, self-driving vehicles transport customers within an urban environment and are coordinated to optimize service throughout the entire network. Specifically, we first propose a novel discrete-time model of an AMoD sy… ▽ More

    Submitted 15 February, 2016; v1 submitted 14 September, 2015; originally announced September 2015.

    Comments: Extended version of ICRA16 paper, with full proofs of the theorems

  43. An Asymptotically-Optimal Sampling-Based Algorithm for Bi-directional Motion Planning

    Authors: Joseph A. Starek, Javier V. Gomez, Edward Schmerling, Lucas Janson, Luis Moreno, Marco Pavone

    Abstract: Bi-directional search is a widely used strategy to increase the success and convergence rates of sampling-based motion planning algorithms. Yet, few results are available that merge both bi-directional search and asymptotic optimality into existing optimal planners, such as PRM*, RRT*, and FMT*. The objective of this paper is to fill this gap. Specifically, this paper presents a bi-directional, sa… ▽ More

    Submitted 27 July, 2015; originally announced July 2015.

    Comments: Accepted to the 2015 IEEE Intelligent Robotics and Systems Conference in Hamburg, Germany. This submission represents the long version of the conference manuscript, with additional proof details (Section IV) regarding the asymptotic optimality of the BFMT* algorithm

  44. arXiv:1506.02188  [pdf, other

    cs.AI math.OC

    Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach

    Authors: Yinlam Chow, Aviv Tamar, Shie Mannor, Marco Pavone

    Abstract: In this paper we address the problem of decision making within a Markov decision process (MDP) framework where risk and modeling errors are taken into account. Our approach is to minimize a risk-sensitive conditional-value-at-risk (CVaR) objective, as opposed to a standard risk-neutral expectation. We refer to such problem as CVaR MDP. Our first contribution is to show that a CVaR objective, besid… ▽ More

    Submitted 6 June, 2015; originally announced June 2015.

    Comments: Submitted to NIPS 15

  45. arXiv:1506.01085  [pdf, other

    cs.RO

    A Convex Optimization Approach to Smooth Trajectories for Motion Planning with Car-Like Robots

    Authors: Zhijie Zhu, Edward Schmerling, Marco Pavone

    Abstract: In the recent past, several sampling-based algorithms have been proposed to compute trajectories that are collision-free and dynamically-feasible. However, the outputs of such algorithms are notoriously jagged. In this paper, by focusing on robots with car-like dynamics, we present a fast and simple heuristic algorithm, named Convex Elastic Smoothing (CES) algorithm, for trajectory smoothing and s… ▽ More

    Submitted 26 October, 2015; v1 submitted 2 June, 2015; originally announced June 2015.

  46. arXiv:1505.00023  [pdf, other

    cs.RO

    Deterministic Sampling-Based Motion Planning: Optimality, Complexity, and Performance

    Authors: Lucas Janson, Brian Ichter, Marco Pavone

    Abstract: Probabilistic sampling-based algorithms, such as the probabilistic roadmap (PRM) and the rapidly-exploring random tree (RRT) algorithms, represent one of the most successful approaches to robotic motion planning, due to their strong theoretical properties (in terms of probabilistic completeness or even asymptotic optimality) and remarkable practical performance. Such algorithms are probabilistic i… ▽ More

    Submitted 3 May, 2016; v1 submitted 30 April, 2015; originally announced May 2015.

  47. arXiv:1504.08053  [pdf, other

    cs.RO

    Monte Carlo Motion Planning for Robot Trajectory Optimization Under Uncertainty

    Authors: Lucas Janson, Edward Schmerling, Marco Pavone

    Abstract: This article presents a novel approach, named MCMP (Monte Carlo Motion Planning), to the problem of motion planning under uncertainty, i.e., to the problem of computing a low-cost path that fulfills probabilistic collision avoidance constraints. MCMP estimates the collision probability (CP) of a given path by sampling via Monte Carlo the execution of a reference tracking controller (in this paper… ▽ More

    Submitted 28 May, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

  48. arXiv:1503.07461  [pdf, other

    math.OC

    A Time Consistent Formulation of Risk Constrained Stochastic Optimal Control

    Authors: Yinlam Chow, Marco Pavone

    Abstract: Time-consistency is an essential requirement in risk sensitive optimal control problems to make rational decisions. An optimization problem is time consistent if its solution policy does not depend on the time sequence of solving the optimization problem. On the other hand, a dynamic risk measure is time consistent if a certain outcome is considered less risky in the future implies this outcome is… ▽ More

    Submitted 25 March, 2015; originally announced March 2015.

  49. arXiv:1501.02024  [pdf, ps, other

    math.OC

    A Uniform-grid Discretization Algorithm for Stochastic Control with Risk Constraints

    Authors: Yin-Lam Chow, Marco Pavone

    Abstract: In this paper, we present a discretization algorithm for finite horizon risk constrained dynamic programming algorithm in [Chow_Pavone_13]. Although in a theoretical standpoint, Bellman's recursion provides a systematic way to find optimal value functions and generate optimal history dependent policies, there is a serious computational issue. Even if the state space and action space of this constr… ▽ More

    Submitted 8 January, 2015; originally announced January 2015.

  50. arXiv:1410.0956  [pdf, other

    eess.SY cs.DC

    Distributed consensus with mixed time/communication bandwidth performance metrics

    Authors: Federico Rossi, Marco Pavone

    Abstract: In this paper we study the inherent trade-off between time and communication complexity for the distributed consensus problem. In our model, communication complexity is measured as the maximum data throughput (in bits per second) sent through the network at a given instant. Such a notion of communication complexity, referred to as bandwidth complexity, is related to the frequency bandwidth a desig… ▽ More

    Submitted 3 October, 2014; originally announced October 2014.

    Comments: Draft, submitted to Allerton 2014