Skip to main content

Showing 1–50 of 81 results for author: Ozdaglar, A

  1. arXiv:2405.12421  [pdf, other

    cs.LG cs.AI stat.ML

    A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback

    Authors: Kihyun Kim, Jiawei Zhang, Asuman Ozdaglar, Pablo A. Parrilo

    Abstract: Inverse Reinforcement Learning (IRL) and Reinforcement Learning from Human Feedback (RLHF) are pivotal methodologies in reward learning, which involve inferring and shaping the underlying reward function of sequential decision-making problems based on observed human demonstrations and feedback. Most prior work in reward learning has relied on prior knowledge or assumptions about decision or prefer… ▽ More

    Submitted 3 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  2. arXiv:2405.01817  [pdf, other

    cs.LG

    Uniformly Stable Algorithms for Adversarial Training and Beyond

    Authors: Jiancong Xiao, Jiawei Zhang, Zhi-Quan Luo, Asuman Ozdaglar

    Abstract: In adversarial machine learning, neural networks suffer from a significant issue known as robust overfitting, where the robust test accuracy decreases over epochs (Rice et al., 2020). Recent research conducted by Xing et al.,2021; Xiao et al., 2022 has focused on studying the uniform stability of adversarial training. Their investigations revealed that SGD-based adversarial training fails to exhib… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  3. arXiv:2405.00254  [pdf, other

    cs.AI cs.LG

    RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation

    Authors: Chanwoo Park, Mingyang Liu, Dingwen Kong, Kaiqing Zhang, Asuman Ozdaglar

    Abstract: Reinforcement learning from human feedback (RLHF) has been an effective technique for aligning AI systems with human values, with remarkable successes in fine-tuning large-language models recently. Most existing RLHF paradigms make the underlying assumption that human preferences are relatively homogeneous, and can be encoded by a single reward model. In this paper, we focus on addressing the issu… ▽ More

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

    Comments: Added experiments

  4. arXiv:2403.16843  [pdf, other

    cs.LG cs.AI cs.GT

    Do LLM Agents Have Regret? A Case Study in Online Learning and Games

    Authors: Chanwoo Park, Xiangyu Liu, Asuman Ozdaglar, Kaiqing Zhang

    Abstract: Large language models (LLMs) have been increasingly employed for (interactive) decision-making, via the development of LLM-based autonomous agents. Despite their emerging successes, the performance of LLM agents in decision-making has not been fully investigated through quantitative metrics, especially in the multi-agent setting when they interact with each other, a typical scenario in real-world… ▽ More

    Submitted 26 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Added experimental results for open-source models

  5. arXiv:2401.00313  [pdf, other

    cs.GT cs.LG cs.SI econ.GN

    Matching of Users and Creators in Two-Sided Markets with Departures

    Authors: Daniel Huttenlocher, Hannah Li, Liang Lyu, Asuman Ozdaglar, James Siderius

    Abstract: Many online platforms of today, including social media sites, are two-sided markets bridging content creators and users. Most of the existing literature on platform recommendation algorithms largely focuses on user preferences and decisions, and does not simultaneously address creator incentives. We propose a model of content recommendation that explicitly focuses on the dynamics of user-content m… ▽ More

    Submitted 19 January, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

  6. arXiv:2312.04905  [pdf, ps, other

    cs.LG cs.MA

    Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games

    Authors: Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

    Abstract: We consider two-player zero-sum stochastic games and propose a two-timescale $Q$-learning algorithm with function approximation that is payoff-based, convergent, rational, and symmetric between the two players. In two-timescale $Q$-learning, the fast-timescale iterates are updated in spirit to the stochastic gradient descent and the slow-timescale iterates (which we use to compute the policies) ar… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  7. arXiv:2308.11518  [pdf, ps, other

    cs.LG stat.ML

    EM for Mixture of Linear Regression with Clustered Data

    Authors: Amirhossein Reisizadeh, Khashayar Gatmiry, Asuman Ozdaglar

    Abstract: Modern data-driven and distributed learning frameworks deal with diverse massive data generated by clients spread across heterogeneous environments. Indeed, data heterogeneity is a major bottleneck in scaling up many distributed learning paradigms. In many settings however, heterogeneous data may be generated in clusters with shared structures, as is the case in several applications such as federa… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  8. arXiv:2307.09470  [pdf, other

    cs.GT cs.LG

    Multi-Player Zero-Sum Markov Games with Networked Separable Interactions

    Authors: Chanwoo Park, Kaiqing Zhang, Asuman Ozdaglar

    Abstract: We study a new class of Markov games, \emph(multi-player) zero-sum Markov Games} with \emph{Networked separable interactions} (zero-sum NMGs), to model the local interaction structure in non-cooperative multi-agent sequential decision-making. We define a zero-sum NMG as a model where {the payoffs of the auxiliary games associated with each state are zero-sum and} have some separable (i.e., polymat… ▽ More

    Submitted 21 March, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  9. arXiv:2305.00474  [pdf, other

    cs.SI econ.TH

    Learning, Diversity and Adaptation in Changing Environments: The Role of Weak Links

    Authors: Daron Acemoglu, Asuman Ozdaglar, Sarath Pattathil

    Abstract: Adaptation to dynamic conditions requires a certain degree of diversity. If all agents take the best current action, learning that the underlying state has changed and behavior should adapt will be slower. Diversity is harder to maintain when there is fast communication between agents, because they tend to find out and pursue the best action rapidly. We explore these issues using a model of (Bayes… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  10. arXiv:2303.03100  [pdf, ps, other

    cs.GT cs.LG

    A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

    Authors: Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

    Abstract: We study two-player zero-sum stochastic games, and propose a form of independent learning dynamics called Doubly Smoothed Best-Response dynamics, which integrates a discrete and doubly smoothed variant of the best-response dynamics into temporal-difference (TD)-learning and minimax value iteration. The resulting dynamics are payoff-based, convergent, rational, and symmetric among players. Our main… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

  11. arXiv:2212.13861  [pdf, ps, other

    cs.LG math.OC stat.ML

    Revisiting the Linear-Programming Framework for Offline RL with General Function Approximation

    Authors: Asuman Ozdaglar, Sarath Pattathil, Jiawei Zhang, Kaiqing Zhang

    Abstract: Offline reinforcement learning (RL) aims to find an optimal policy for sequential decision-making using a pre-collected dataset, without further interaction with the environment. Recent theoretical progress has focused on developing sample-efficient offline RL algorithms with various relaxed assumptions on data coverage and function approximators, especially to handle the case with excessively lar… ▽ More

    Submitted 8 February, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 35 pages

  12. arXiv:2210.12812  [pdf, ps, other

    math.OC cs.LG cs.MA stat.ML

    Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence

    Authors: Sarath Pattathil, Kaiqing Zhang, Asuman Ozdaglar

    Abstract: Multi-agent interactions are increasingly important in the context of reinforcement learning, and the theoretical foundations of policy gradient methods have attracted surging research interest. We investigate the global convergence of natural policy gradient (NPG) algorithms in multi-agent learning. We first show that vanilla NPG may not have parameter convergence, i.e., the convergence of the ve… ▽ More

    Submitted 20 March, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Initially submitted for publication in January 2022

  13. arXiv:2206.09495  [pdf, other

    cs.GT cs.LG

    The Power of Regularization in Solving Extensive-Form Games

    Authors: Mingyang Liu, Asuman Ozdaglar, Tiancheng Yu, Kaiqing Zhang

    Abstract: In this paper, we investigate the power of {\it regularization}, a common technique in reinforcement learning and optimization, in solving extensive-form games (EFGs). We propose a series of new algorithms based on regularizing the payoff functions of the game, and establish a set of convergence results that strictly improve over the existing ones, with either weaker assumptions or stronger conver… ▽ More

    Submitted 9 March, 2023; v1 submitted 19 June, 2022; originally announced June 2022.

  14. arXiv:2206.05637  [pdf, ps, other

    cs.MA

    Convergence and Stability of Coupled Belief--Strategy Learning Dynamics in Continuous Games

    Authors: Manxi Wu, Saurabh Amin, Asuman Ozdaglar

    Abstract: We propose a learning dynamics to model how strategic agents repeatedly play a continuous game while relying on an information platform to learn an unknown payoff-relevant parameter. In each time step, the platform updates a belief estimate of the parameter based on players' strategies and realized payoffs using Bayes's rule. Then, players adopt a generic learning rule to adjust their strategies b… ▽ More

    Submitted 31 October, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2109.00719

  15. arXiv:2206.04502  [pdf, other

    stat.ML cs.LG math.OC

    What is a Good Metric to Study Generalization of Minimax Learners?

    Authors: Asuman Ozdaglar, Sarath Pattathil, Jiawei Zhang, Kaiqing Zhang

    Abstract: Minimax optimization has served as the backbone of many machine learning (ML) problems. Although the convergence behavior of optimization algorithms has been extensively studied in the minimax settings, their generalization guarantees in stochastic minimax optimization problems, i.e., how the solution trained on empirical data performs on unseen testing data, have been relatively underexplored. A… ▽ More

    Submitted 20 June, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 34 pages, 2 figures

  16. arXiv:2205.11389  [pdf, ps, other

    cs.GT

    Fictitious Play in Markov Games with Single Controller

    Authors: Muhammed O. Sayin, Kaiqing Zhang, Asuman Ozdaglar

    Abstract: Certain but important classes of strategic-form games, including zero-sum and identical-interest games, have the fictitious-play-property (FPP), i.e., beliefs formed in fictitious play dynamics always converge to a Nash equilibrium (NE) in the repeated play of these games. Such convergence results are seen as a (behavioral) justification for the game-theoretical equilibrium analysis. Markov games… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted to ACM Conference on Economics and Computation (EC) 2022

  17. arXiv:2201.03968  [pdf, other

    cs.GT cs.CR cs.LG

    Optimal and Differentially Private Data Acquisition: Central and Local Mechanisms

    Authors: Alireza Fallah, Ali Makhdoumi, Azarakhsh Malekian, Asuman Ozdaglar

    Abstract: We consider a platform's problem of collecting data from privacy sensitive users to estimate an underlying parameter of interest. We formulate this question as a Bayesian-optimal mechanism design problem, in which an individual can share her (verifiable) data in exchange for a monetary reward or services, but at the same time has a (private) heterogeneous privacy cost which we quantify using diffe… ▽ More

    Submitted 5 September, 2023; v1 submitted 9 January, 2022; originally announced January 2022.

    Comments: To appear in the Operations Research journal. The abstract appeared in the Proceedings of the 23rd ACM Conference on Economics and Computation (EC 2022)

  18. arXiv:2111.11743  [pdf, ps, other

    cs.GT cs.LG math.DS

    Independent Learning in Stochastic Games

    Authors: Asuman Ozdaglar, Muhammed O. Sayin, Kaiqing Zhang

    Abstract: Reinforcement learning (RL) has recently achieved tremendous successes in many artificial intelligence applications. Many of the forefront applications of RL involve multiple agents, e.g., playing chess and Go games, autonomous driving, and robotics. Unfortunately, the framework upon which classical RL builds is inappropriate for multi-agent learning, as it assumes an agent's environment is statio… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: An invited chapter for the International Congress of Mathematicians 2022 (ICM 2022)

  19. arXiv:2109.00719  [pdf, ps, other

    cs.GT econ.TH

    Multi-agent Bayesian Learning with Best Response Dynamics: Convergence and Stability

    Authors: Manxi Wu, Saurabh Amin, Asuman Ozdaglar

    Abstract: We study learning dynamics induced by strategic agents who repeatedly play a game with an unknown payoff-relevant parameter. In this dynamics, a belief estimate of the parameter is repeatedly updated given players' strategies and realized payoffs using Bayes's rule. Players adjust their strategies by accounting for best response strategies given the belief. We show that, with probability 1, belief… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.09128

  20. arXiv:2106.07537  [pdf, other

    stat.ML cs.LG math.OC

    A Wasserstein Minimax Framework for Mixed Linear Regression

    Authors: Theo Diamandis, Yonina C. Eldar, Alireza Fallah, Farzan Farnia, Asuman Ozdaglar

    Abstract: Multi-modal distributions are commonly used to model clustered data in statistical learning tasks. In this paper, we consider the Mixed Linear Regression (MLR) problem. We propose an optimal transport-based framework for MLR problems, Wasserstein Mixed Linear Regression (WMLR), which minimizes the Wasserstein distance between the learned and target mixture regression models. Through a model-based… ▽ More

    Submitted 16 June, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: To appear in 38th International Conference on Machine Learning (ICML 2021)

  21. arXiv:2106.02748  [pdf, other

    cs.GT cs.LG cs.MA math.DS

    Decentralized Q-Learning in Zero-sum Markov Games

    Authors: Muhammed O. Sayin, Kaiqing Zhang, David S. Leslie, Tamer Basar, Asuman Ozdaglar

    Abstract: We study multi-agent reinforcement learning (MARL) in infinite-horizon discounted zero-sum Markov games. We focus on the practical but challenging setting of decentralized MARL, where agents make decisions without coordination by a centralized controller, but only based on their own payoffs and local actions executed. The agents need not observe the opponent's actions or payoffs, possibly being ev… ▽ More

    Submitted 12 December, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: To appear at NeurIPS 2021. Strengthened the results in Theorem 1 and Corollary 1

  22. arXiv:2102.08441  [pdf, other

    cs.GT cs.DM

    Optimal intervention in transportation networks

    Authors: Leonardo Cianfanelli, Giacomo Como, Asuman Ozdaglar, Francesca Parise

    Abstract: We study a network design problem (NDP) where the planner aims at selecting the optimal single-link intervention on a transportation network to minimize the travel time under Wardrop equilibrium flows. Our first result is that, if the delay functions are affine and the support of the equilibrium is not modified with interventions, the NDP may be formulated in terms of electrical quantities compute… ▽ More

    Submitted 14 November, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: 40 pages, 12 figures

    MSC Class: 91A14; 91A43; 91A16; 90B20 ACM Class: G.2.2

  23. arXiv:2102.03832  [pdf, other

    cs.LG math.OC stat.ML

    Generalization of Model-Agnostic Meta-Learning Algorithms: Recurring and Unseen Tasks

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: In this paper, we study the generalization properties of Model-Agnostic Meta-Learning (MAML) algorithms for supervised learning problems. We focus on the setting in which we train the MAML model over $m$ tasks, each with $n$ data points, and characterize its generalization error from two points of view: First, we assume the new task at test time is one of the training tasks, and we show that, for… ▽ More

    Submitted 16 November, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  24. arXiv:2010.12561  [pdf, other

    cs.LG math.OC stat.ML

    Train simultaneously, generalize better: Stability of gradient-based minimax learners

    Authors: Farzan Farnia, Asuman Ozdaglar

    Abstract: The success of minimax learning problems of generative adversarial networks (GANs) has been observed to depend on the minimax optimization algorithm used for their training. This dependence is commonly attributed to the convergence speed and robustness properties of the underlying optimization algorithm. In this paper, we show that the optimization algorithm also plays a key role in the generaliza… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

  25. arXiv:2010.09128  [pdf, ps, other

    eess.SY cs.AI

    Multi-agent Bayesian Learning with Adaptive Strategies: Convergence and Stability

    Authors: Manxi Wu, Saurabh Amin, Asuman Ozdaglar

    Abstract: We study learning dynamics induced by strategic agents who repeatedly play a game with an unknown payoff-relevant parameter. In each step, an information system estimates a belief distribution of the parameter based on the players' strategies and realized payoffs using Bayes' rule. Players adjust their strategies by accounting for an equilibrium strategy or a best response strategy based on the up… ▽ More

    Submitted 18 October, 2020; originally announced October 2020.

  26. arXiv:2010.04223  [pdf, other

    cs.GT cs.LG math.DS

    Fictitious play in zero-sum stochastic games

    Authors: Muhammed O. Sayin, Francesca Parise, Asuman Ozdaglar

    Abstract: We present a novel variant of fictitious play dynamics combining classical fictitious play with Q-learning for stochastic games and analyze its convergence properties in two-player zero-sum stochastic games. Our dynamics involves players forming beliefs on the opponent strategy and their own continuation payoff (Q-function), and playing a greedy best response by using the estimated continuation pa… ▽ More

    Submitted 2 June, 2022; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: The extended arXiv version of the original paper to appear in SIAM Journal on Control and Optimization

  27. arXiv:2002.09124  [pdf, other

    cs.LG cs.GT stat.ML

    GANs May Have No Nash Equilibria

    Authors: Farzan Farnia, Asuman Ozdaglar

    Abstract: Generative adversarial networks (GANs) represent a zero-sum game between two machine players, a generator and a discriminator, designed to learn the distribution of data. While GANs have achieved state-of-the-art performance in several benchmark learning tasks, GAN minimax optimization still poses great theoretical and empirical challenges. GANs trained using first-order optimization methods commo… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

  28. arXiv:2002.07948  [pdf, other

    cs.LG math.OC stat.ML

    Personalized Federated Learning: A Meta-Learning Approach

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: In Federated Learning, we aim to train models across multiple computing units (users), while users can only communicate with a common central server, without exchanging their data samples. This mechanism exploits the computational power of all users and allows users to obtain a richer model as their models are trained over a larger set of data points. However, this scheme only develops a common ou… ▽ More

    Submitted 22 October, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: To appear in 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  29. arXiv:2002.05683  [pdf, ps, other

    math.OC cs.LG stat.ML

    An Optimal Multistage Stochastic Gradient Method for Minimax Problems

    Authors: Alireza Fallah, Asuman Ozdaglar, Sarath Pattathil

    Abstract: In this paper, we study the minimax optimization problem in the smooth and strongly convex-strongly concave setting when we have access to noisy estimates of gradients. In particular, we first analyze the stochastic Gradient Descent Ascent (GDA) method with constant stepsize, and show that it converges to a neighborhood of the solution of the minimax problem. We further provide tight bounds on the… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  30. arXiv:2002.05135  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning

    Authors: Alireza Fallah, Kristian Georgiev, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: We consider Model-Agnostic Meta-Learning (MAML) methods for Reinforcement Learning (RL) problems, where the goal is to find a policy using data from several tasks represented by Markov Decision Processes (MDPs) that can be updated by one step of stochastic policy gradient for the realized MDP. In particular, using stochastic gradients in MAML update steps is crucial for RL problems since computati… ▽ More

    Submitted 16 November, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  31. arXiv:2002.00057  [pdf, ps, other

    cs.LG math.OC stat.ML

    Last Iterate is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems

    Authors: Noah Golowich, Sarath Pattathil, Constantinos Daskalakis, Asuman Ozdaglar

    Abstract: In this paper we study the smooth convex-concave saddle point problem. Specifically, we analyze the last iterate convergence properties of the Extragradient (EG) algorithm. It is well known that the ergodic (averaged) iterates of EG converge at a rate of $O(1/T)$ (Nemirovski, 2004). In this paper, we show that the last iterate of EG converges at a rate of $O(1/\sqrt{T})$. To the best of our knowle… ▽ More

    Submitted 6 July, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

    Comments: 27 pages

  32. arXiv:2001.03232  [pdf, other

    cs.GT

    Optimal dynamic information provision in traffic routing

    Authors: Emily Meigs, Francesca Parise, Asuman Ozdaglar, Daron Acemoglu

    Abstract: We consider a two-road dynamic routing game where the state of one of the roads (the "risky road") is stochastic and may change over time. This generates room for experimentation. A central planner may wish to induce some of the (finite number of atomic) agents to use the risky road even when the expected cost of travel there is high in order to obtain accurate information about the state of the r… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

  33. arXiv:1910.14380  [pdf, other

    math.OC cs.LG stat.ML

    A Decentralized Proximal Point-type Method for Saddle Point Problems

    Authors: Weijie Liu, Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil, Zebang Shen, Nenggan Zheng

    Abstract: In this paper, we focus on solving a class of constrained non-convex non-concave saddle point problems in a decentralized manner by a group of nodes in a network. Specifically, we assume that each node has access to a summand of a global objective function and nodes are allowed to exchange information only with their neighboring nodes. We propose a decentralized variant of the proximal point metho… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 18 pages

  34. arXiv:1910.08701  [pdf, other

    math.OC cs.LG stat.ML

    Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks

    Authors: Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar, Umut Simsekli, Lingjiong Zhu

    Abstract: We study distributed stochastic gradient (D-SG) method and its accelerated variant (D-ASG) for solving decentralized strongly convex stochastic optimization problems where the objective function is distributed over several computational units, lying on a fixed but arbitrary connected communication graph, subject to local communication constraints where noisy estimates of the gradients are availabl… ▽ More

    Submitted 4 October, 2021; v1 submitted 19 October, 2019; originally announced October 2019.

  35. arXiv:1908.10400  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: We study the convergence of a class of gradient-based Model-Agnostic Meta-Learning (MAML) methods and characterize their overall complexity as well as their best achievable accuracy in terms of gradient norm for nonconvex loss functions. We start with the MAML method and its first-order approximation (FO-MAML) and highlight the challenges that emerge in their analysis. By overcoming these challeng… ▽ More

    Submitted 15 May, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: To appear in the proceedings of the $23^{rd}$ International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  36. arXiv:1906.01115  [pdf, ps, other

    math.OC cs.LG stat.ML

    Convergence Rate of $\mathcal{O}(1/k)$ for Optimistic Gradient and Extra-gradient Methods in Smooth Convex-Concave Saddle Point Problems

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil

    Abstract: We study the iteration complexity of the optimistic gradient descent-ascent (OGDA) method and the extra-gradient (EG) method for finding a saddle point of a convex-concave unconstrained min-max problem. To do so, we first show that both OGDA and EG can be interpreted as approximate variants of the proximal point method. This is similar to the approach taken in [Nemirovski, 2004] which analyzes EG… ▽ More

    Submitted 29 September, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 19 pages

  37. arXiv:1901.08511  [pdf, ps, other

    math.OC cs.LG stat.ML

    A Unified Analysis of Extra-gradient and Optimistic Gradient Methods for Saddle Point Problems: Proximal Point Approach

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil

    Abstract: In this paper we consider solving saddle point problems using two variants of Gradient Descent-Ascent algorithms, Extra-gradient (EG) and Optimistic Gradient Descent Ascent (OGDA) methods. We show that both of these algorithms admit a unified analysis as approximations of the classical proximal point method for solving saddle point problems. This viewpoint enables us to develop a new framework for… ▽ More

    Submitted 5 September, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: 25 pages, 3 figures

  38. arXiv:1901.08022  [pdf, other

    math.OC cs.LG stat.ML

    A Universally Optimal Multistage Accelerated Stochastic Gradient Method

    Authors: Necdet Serhat Aybat, Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar

    Abstract: We study the problem of minimizing a strongly convex, smooth function when we have noisy estimates of its gradient. We propose a novel multistage accelerated algorithm that is universally optimal in the sense that it achieves the optimal rate both in the deterministic and stochastic case and operates without knowledge of noise characteristics. The algorithm consists of stages that use a stochastic… ▽ More

    Submitted 27 October, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  39. arXiv:1809.02162  [pdf, ps, other

    cs.LG math.OC stat.ML

    Escaping Saddle Points in Constrained Optimization

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Ali Jadbabaie

    Abstract: In this paper, we study the problem of escaping from saddle points in smooth nonconvex optimization problems subject to a convex set $\mathcal{C}$. We propose a generic framework that yields convergence to a second-order stationary point of the problem, if the convex set $\mathcal{C}$ is simple for a quadratic objective function. Specifically, our results hold if one can find a $ρ$-approximate sol… ▽ More

    Submitted 9 October, 2018; v1 submitted 6 September, 2018; originally announced September 2018.

  40. arXiv:1809.01485  [pdf, other

    cs.SI eess.SP stat.ML

    Blind Community Detection from Low-rank Excitations of a Graph Filter

    Authors: Hoi-To Wai, Santiago Segarra, Asuman E. Ozdaglar, Anna Scaglione, Ali Jadbabaie

    Abstract: This paper considers a new framework to detect communities in a graph from the observation of signals at its nodes. We model the observed signals as noisy outputs of an unknown network process, represented as a graph filter that is excited by a set of unknown low-rank inputs/excitations. Application scenarios of this model include diffusion dynamics, pricing experiments, and opinion dynamics. Rath… ▽ More

    Submitted 12 April, 2019; v1 submitted 5 September, 2018; originally announced September 2018.

    Comments: Single column format, 32 pages, 9 figures

  41. arXiv:1808.10590  [pdf, ps, other

    cs.GT

    Value of Information in Bayesian Routing Games

    Authors: Manxi Wu, Saurabh Amin, Asuman E. Ozdaglar

    Abstract: We study a routing game in an environment with multiple heterogeneous information systems and an uncertain state that affects edge costs of a congested network. Each information system sends a noisy signal about the state to its subscribed traveler population. Travelers make route choices based on their private beliefs about the state and other populations' signals. The question then arises, "How… ▽ More

    Submitted 6 March, 2020; v1 submitted 31 August, 2018; originally announced August 2018.

  42. arXiv:1807.04428  [pdf, ps, other

    math.OC cs.LG stat.ML

    Convergence Rate of Block-Coordinate Maximization Burer-Monteiro Method for Solving Large SDPs

    Authors: Murat A. Erdogdu, Asuman Ozdaglar, Pablo A. Parrilo, Nuri Denizcan Vanli

    Abstract: Semidefinite programming (SDP) with diagonal constraints arise in many optimization problems, such as Max-Cut, community detection and group synchronization. Although SDPs can be solved to arbitrary precision in polynomial time, generic convex solvers do not scale well with the dimension of the problem. In order to address this issue, Burer and Monteiro proposed to reduce the dimension of the prob… ▽ More

    Submitted 26 November, 2019; v1 submitted 12 July, 2018; originally announced July 2018.

  43. arXiv:1805.10579  [pdf, other

    math.OC cs.LG stat.ML

    Robust Accelerated Gradient Methods for Smooth Strongly Convex Functions

    Authors: Necdet Serhat Aybat, Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar

    Abstract: We study the trade-offs between convergence rate and robustness to gradient errors in designing a first-order algorithm. We focus on gradient descent (GD) and accelerated gradient (AG) methods for minimizing strongly convex functions when the gradient has random errors in the form of additive white noise. With gradient errors, the function values of the iterates need not converge to the optimal va… ▽ More

    Submitted 5 November, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: To appear in SIAM Journal on Optimization (SIOPT)

  44. arXiv:1802.00080  [pdf, other

    cs.GT

    Graphon games: A statistical framework for network games and interventions

    Authors: Francesca Parise, Asuman Ozdaglar

    Abstract: In this paper, we present a unifying framework for analyzing equilibria and designing interventions for large network games sampled from a stochastic network formation process represented by a graphon. We first introduce a new class of infinite population games, termed graphon games, where a continuum of heterogeneous agents interact according to a graphon. After studying properties of equilibria… ▽ More

    Submitted 30 June, 2020; v1 submitted 31 January, 2018; originally announced February 2018.

  45. arXiv:1712.08277  [pdf, other

    cs.GT cs.SI eess.SY physics.soc-ph

    A variational inequality framework for network games: Existence, uniqueness, convergence and sensitivity analysis

    Authors: Francesca Parise, Asuman Ozdaglar

    Abstract: We provide a unified variational inequality framework for the study of fundamental properties of the Nash equilibrium in network games. We identify several conditions on the underlying network (in terms of spectral norm, infinity norm and minimum eigenvalue of its adjacency matrix) that guarantee existence, uniqueness, convergence and continuity of equilibrium in general network games with multidi… ▽ More

    Submitted 9 August, 2018; v1 submitted 21 December, 2017; originally announced December 2017.

  46. arXiv:1706.08693  [pdf, other

    cs.GT

    Sensitivity analysis for network aggregative games

    Authors: Francesca Parise, Asuman Ozdaglar

    Abstract: We investigate the sensitivity of the Nash equilibrium of constrained network aggregative games to changes in exogenous parameters affecting the cost function of the players. This setting is motivated by two applications. The first is the analysis of interventions by a social planner with a networked objective function while the second is network routing games with atomic players and information c… ▽ More

    Submitted 27 June, 2017; originally announced June 2017.

  47. arXiv:1706.01131  [pdf, ps, other

    cs.GT

    Strategic Dynamic Pricing with Network Effects

    Authors: Ali Makhdoumi, Azarakhsh Malekian, Asuman Ozdaglar

    Abstract: We study the optimal pricing strategy of a monopolist selling homogeneous goods to customers over multiple periods. The customers choose their time of purchase to maximize their payoff that depends on their valuation of the product, the purchase price, and the utility they derive from past purchases of others, termed the network effect. We first show that the optimal price sequence is non-decreasi… ▽ More

    Submitted 26 June, 2018; v1 submitted 4 June, 2017; originally announced June 2017.

  48. arXiv:1601.02039  [pdf, ps, other

    cs.GT

    Informational Braess' Paradox: The Effect of Information on Traffic Congestion

    Authors: Daron Acemoglu, Ali Makhdoumi, Azarakhsh Malekian, Asuman Ozdaglar

    Abstract: To systematically study the implications of additional information about routes provided to certain users (e.g., via GPS-based route guidance systems), we introduce a new class of congestion games in which users have differing information sets about the available edges and can only use routes consisting of edges in their information set. After defining the notion of Information Constrained Wardrop… ▽ More

    Submitted 3 November, 2017; v1 submitted 8 January, 2016; originally announced January 2016.

  49. arXiv:1510.06055  [pdf, other

    cs.SI

    A lower bound on the performance of dynamic curing policies for epidemics on graphs

    Authors: Kimon Drakopoulos, Asuman Ozdaglar, John N. Tsitsiklis

    Abstract: We consider an SIS-type epidemic process that evolves on a known graph. We assume that a fixed curing budget can be allocated at each instant to the nodes of the graph, towards the objective of minimizing the expected extinction time of the epidemic. We provide a lower bound on the optimal expected extinction time as a function of the available budget, the epidemic parameters, the maximum degree,… ▽ More

    Submitted 20 October, 2015; originally announced October 2015.

  50. arXiv:1510.06054  [pdf, other

    cs.SI

    When is a network epidemic hard to eliminate?

    Authors: Kimon Drakopoulos, Asuman Ozdaglar, John N. Tsitsiklis

    Abstract: We consider the propagation of a contagion process (epidemic) on a network and study the problem of dynamically allocating a fixed curing budget to the nodes of the graph, at each time instant. For bounded degree graphs, we provide a lower bound on the expected time to extinction under any such dynamic allocation policy, in terms of a combinatorial quantity that we call the resistance of the set o… ▽ More

    Submitted 20 October, 2015; originally announced October 2015.