Showing 1–24 of 24 results for author: Azizan, N

  1. arXiv:2405.18670  [pdf, other]

    cs.LG cs.CR cs.DB

    Adapting Differentially Private Synthetic Data to Relational Databases

    Authors: Kaveh Alimohammadi, Hao Wang, Ojas Gulati, Akash Srivastava, Navid Azizan

    Abstract: Existing differentially private (DP) synthetic data generation mechanisms typically assume a single-source table. In practice, data is often distributed across multiple tables with relationships across tables. In this paper, we introduce the first-of-its-kind algorithm that can be combined with any existing DP mechanisms to generate synthetic relational databases. Our algorithm iteratively refines…

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2310.09729  [pdf, other]

    cs.CR cs.LG

    Private Synthetic Data Meets Ensemble Learning

    Authors: Haoyuan Sun, Navid Azizan, Akash Srivastava, Hao Wang

    Abstract: When machine learning models are trained on synthetic data and then deployed on real data, there is often a performance drop due to the distribution shift between synthetic and real data. In this paper, we introduce a new ensemble strategy for training downstream models, with the goal of enhancing their performance when used on real data. We generate multiple synthetic datasets by applying a diffe…

    Submitted 15 October, 2023; originally announced October 2023.

  3. arXiv:2306.13853  [pdf, other]

    cs.LG

    A Unified Approach to Controlling Implicit Regularization via Mirror Descent

    Authors: Haoyuan Sun, Khashayar Gatmiry, Kwangjun Ahn, Navid Azizan

    Abstract: Inspired by the remarkable success of large neural networks, there has been significant interest in understanding the generalization performance of over-parameterized models. Substantial efforts have been invested in characterizing how optimization algorithms impact generalization through their "preferred" solutions, a phenomenon commonly referred to as implicit regularization. In particular, it h…

    Submitted 11 January, 2024; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2205.12808

  4. arXiv:2306.00206  [pdf, other]

    cs.LG cs.AI

    Quantifying Representation Reliability in Self-Supervised Learning Models

    Authors: Young-Jin Park, Hao Wang, Shervin Ardeshir, Navid Azizan

    Abstract: Self-supervised learning models extract general-purpose representations from data. Quantifying the reliability of these representations is crucial, as many downstream models rely on them as input for their own tasks. To this end, we introduce a formal definition of representation reliability: the representation for a given test point is considered to be reliable if the downstream models built on t…

    Submitted 17 May, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: Presented in UAI 2024

  5. arXiv:2305.16424  [pdf, other]

    cs.LG cs.AI stat.ML

    SketchOGD: Memory-Efficient Continual Learning

    Authors: Benjamin Wright, Youngjae Min, Jeremy Bernstein, Navid Azizan

    Abstract: When machine learning models are trained continually on a sequence of tasks, they are liable to forget what they learned on previous tasks -- a phenomenon known as catastrophic forgetting. Proposed solutions to catastrophic forgetting tend to involve storing information about past tasks, meaning that memory usage is a chief consideration in determining their practicality. This paper proposes a mem…

    Submitted 25 May, 2023; originally announced May 2023.

  6. arXiv:2304.10640  [pdf, other]

    cs.DC cs.LG math.NA

    On the Effects of Data Heterogeneity on the Convergence Rates of Distributed Linear System Solvers

    Authors: Boris Velasevic, Rohit Parasnis, Christopher G. Brinton, Navid Azizan

    Abstract: We consider the fundamental problem of solving a large-scale system of linear equations. In particular, we consider the setting where a taskmaster intends to solve the system in a distributed/federated fashion with the help of a set of machines, who each have a subset of the equations. Although there exist several approaches for solving this problem, missing is a rigorous comparison between the co…

    Submitted 15 February, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 11 pages, 5 figures

    ACM Class: G.1.3; I.2.11; I.2.6

  7. arXiv:2304.05187  [pdf, other]

    cs.LG cs.AI cs.NE math.NA stat.ML

    Automatic Gradient Descent: Deep Learning without Hyperparameters

    Authors: Jeremy Bernstein, Chris Mingard, Kevin Huang, Navid Azizan, Yisong Yue

    Abstract: The architecture of a deep neural network is defined explicitly in terms of the number of layers, the width of each layer and the general network topology. Existing optimisation frameworks neglect this information in favour of implicit architectural information (e.g. second-order methods) or architecture-agnostic distance functions (e.g. mirror descent). Meanwhile, the most popular optimiser in pr…

    Submitted 11 April, 2023; originally announced April 2023.

  8. arXiv:2303.11522  [pdf, ps, other]

    cs.GT cs.LG

    Online Learning for Equilibrium Pricing in Markets under Incomplete Information

    Authors: Devansh Jalota, Haoyuan Sun, Navid Azizan

    Abstract: The study of market equilibria is central to economic theory, particularly in efficiently allocating scarce resources. However, the computation of equilibrium prices at which the supply of goods matches their demand typically relies on having access to complete information on private attributes of agents, e.g., suppliers' cost functions, which are often unavailable in practice. Motivated by this p…

    Submitted 27 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  9. arXiv:2303.03157  [pdf, other]

    eess.SY cs.LG cs.RO

    Data-Driven Control with Inherent Lyapunov Stability

    Authors: Youngjae Min, Spencer M. Richards, Navid Azizan

    Abstract: Recent advances in learning-based control leverage deep function approximators, such as neural networks, to model the evolution of controlled dynamical systems over time. However, the problem of learning a dynamics model and a stabilizing controller persists, since the synthesis of a stabilizing feedback law for known nonlinear systems is a difficult task, let alone for complex parametric represen…

    Submitted 4 April, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  10. arXiv:2302.02529  [pdf, other]

    eess.SY cs.LG cs.RO

    Learning Control-Oriented Dynamical Structure from Data

    Authors: Spencer M. Richards, Jean-Jacques Slotine, Navid Azizan, Marco Pavone

    Abstract: Even for known nonlinear dynamical systems, feedback controller synthesis is a difficult problem that often requires leveraging the particular structure of the dynamics to induce a stable closed-loop system. For general nonlinear models, including those fit to data, there may not be enough known structure to reliably synthesize a stabilizing feedback controller. In this paper, we discuss a state-d…

    Submitted 23 June, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: International Conference on Machine Learning (ICML), Honolulu, 2023

  11. arXiv:2210.01881  [pdf, other]

    cs.LG cs.AI

    Uncertainty-Aware Meta-Learning for Multimodal Task Distributions

    Authors: Cesar Almecija, Apoorva Sharma, Navid Azizan

    Abstract: Meta-learning or learning to learn is a popular approach for learning new tasks with limited data (i.e., few-shot learning) by leveraging the commonalities among different tasks. However, meta-learned models can perform poorly when context data is limited, or when data is drawn from an out-of-distribution (OoD) task. Especially in safety-critical settings, this necessitates an uncertainty-aware ap…

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: 21 pages, 10 figures

  12. arXiv:2207.13853  [pdf, other]

    cs.LG eess.SY stat.ML

    One-Pass Learning via Bridging Orthogonal Gradient Descent and Recursive Least-Squares

    Authors: Youngjae Min, Kwangjun Ahn, Navid Azizan

    Abstract: While deep neural networks are capable of achieving state-of-the-art performance in various domains, their training typically requires iterating for many passes over the dataset. However, due to computational and memory constraints and potential privacy concerns, storing and accessing all the data is impractical in many real-world scenarios where the data arrives in a stream. In this paper, we inv…

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: IEEE Conference on Decision and Control, 2022

  13. arXiv:2207.09336  [pdf, other]

    cs.LG cs.AI cs.CV eess.IV stat.ML

    Uncertainty in Contrastive Learning: On the Predictability of Downstream Performance

    Authors: Shervin Ardeshir, Navid Azizan

    Abstract: The superior performance of some of today's state-of-the-art deep learning models is to some extent owed to extensive (self-)supervised contrastive pretraining on large-scale datasets. In contrastive learning, the network is presented with pairs of positive (similar) and negative (dissimilar) datapoints and is trained to find an embedding vector for each datapoint, i.e., a representation, which ca…

    Submitted 19 July, 2022; originally announced July 2022.

  14. arXiv:2205.12808  [pdf, other]

    cs.LG

    Mirror Descent Maximizes Generalized Margin and Can Be Implemented Efficiently

    Authors: Haoyuan Sun, Kwangjun Ahn, Christos Thrampoulidis, Navid Azizan

    Abstract: Driven by the empirical success and wide use of deep neural networks, understanding the generalization performance of overparameterized models has become an increasingly popular question. To this end, there has been substantial effort to characterize the implicit bias of the optimization algorithms used, such as gradient descent (GD), and the structural properties of their preferred solutions. Thi…

    Submitted 29 September, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Journal ref: Advances in Neural Information Processing Systems 35 (NeurIPS 2022)

  15. arXiv:2204.06716  [pdf, other]

    cs.RO cs.LG eess.SY

    Control-oriented meta-learning

    Authors: Spencer M. Richards, Navid Azizan, Jean-Jacques Slotine, Marco Pavone

    Abstract: Real-time adaptation is imperative to the control of robots operating in complex, dynamic environments. Adaptive control laws can endow even nonlinear systems with good trajectory tracking performance, provided that any uncertain dynamics terms are linearly parameterizable with known nonlinear features. However, it is often difficult to specify such features a priori, such as for aerodynamic distu…

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: First published in Robotics: Science and Systems (RSS) 2021. This extended version is under review for a special issue in the International Journal of Robotics Research (IJRR). arXiv admin note: substantial text overlap with arXiv:2103.04490

  16. arXiv:2203.17150  [pdf, other]

    cs.LG cs.GT math.OC

    Online Learning for Traffic Routing under Unknown Preferences

    Authors: Devansh Jalota, Karthik Gopalakrishnan, Navid Azizan, Ramesh Johari, Marco Pavone

    Abstract: In transportation networks, users typically choose routes in a decentralized and self-interested manner to minimize their individual travel costs, which, in practice, often results in inefficient overall outcomes for society. As a result, there has been a growing interest in designing road tolling schemes to cope with these efficiency losses and steer users toward a system-efficient traffic patter…

    Submitted 31 March, 2022; originally announced March 2022.

  17. arXiv:2203.03034  [pdf, other]

    math.OC cs.LG

    A Unified View of SDP-based Neural Network Verification through Completely Positive Programming

    Authors: Robin Brown, Edward Schmerling, Navid Azizan, Marco Pavone

    Abstract: Verifying that input-output relationships of a neural network conform to prescribed operational specifications is a key enabler towards deploying these networks in safety-critical applications. Semidefinite programming (SDP)-based approaches to Rectified Linear Unit (ReLU) network verification transcribe this problem into an optimization problem, where the accuracy of any such formulation reflects…

    Submitted 6 March, 2022; originally announced March 2022.

  18. arXiv:2202.10788  [pdf, other]

    cs.LG math.OC stat.ML

    Explicit Regularization via Regularizer Mirror Descent

    Authors: Navid Azizan, Sahin Lale, Babak Hassibi

    Abstract: Despite perfectly interpolating the training data, deep neural networks (DNNs) can often generalize fairly well, in part due to the "implicit regularization" induced by the learning algorithm. Nonetheless, various forms of regularization, such as "explicit regularization" (via weight decay), are often used to avoid overfitting, especially when the data is corrupted. There are several challenges wi…

    Submitted 22 February, 2022; originally announced February 2022.

  19. arXiv:2103.04490  [pdf, other]

    cs.RO cs.LG eess.SY

    Adaptive-Control-Oriented Meta-Learning for Nonlinear Systems

    Authors: Spencer M. Richards, Navid Azizan, Jean-Jacques Slotine, Marco Pavone

    Abstract: Real-time adaptation is imperative to the control of robots operating in complex, dynamic environments. Adaptive control laws can endow even nonlinear systems with good trajectory tracking performance, provided that any uncertain dynamics terms are linearly parameterizable with known nonlinear features. However, it is often difficult to specify such features a priori, such as for aerodynamic distu…

    Submitted 19 June, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

    Comments: Robotics: Science and Systems, Virtual, 2021

  20. arXiv:2102.12567  [pdf, other]

    cs.LG

    Sketching Curvature for Efficient Out-of-Distribution Detection for Deep Neural Networks

    Authors: Apoorva Sharma, Navid Azizan, Marco Pavone

    Abstract: In order to safely deploy Deep Neural Networks (DNNs) within the perception pipelines of real-time decision making systems, there is a need for safeguards that can detect out-of-training-distribution (OoD) inputs both efficiently and accurately. Building on recent work leveraging the local curvature of DNNs to reason about epistemic uncertainty, we propose Sketching Curvature of OoD Detection (SCO…

    Submitted 24 February, 2021; originally announced February 2021.

  21. arXiv:1910.07104  [pdf, other]

    cs.LG stat.ML

    Orthogonal Gradient Descent for Continual Learning

    Authors: Mehrdad Farajtabar, Navid Azizan, Alex Mott, Ang Li

    Abstract: Neural networks are achieving state of the art and sometimes super-human performance on learning tasks across a variety of domains. Whenever these problems require learning in a continual or sequential manner, however, neural networks suffer from the problem of catastrophic forgetting; they forget how to solve previous tasks after being trained on a new task, despite having the essential capacity…

    Submitted 15 October, 2019; originally announced October 2019.

  22. arXiv:1906.03830  [pdf, other]

    cs.LG math.OC stat.ML

    Stochastic Mirror Descent on Overparameterized Nonlinear Models: Convergence, Implicit Regularization, and Generalization

    Authors: Navid Azizan, Sahin Lale, Babak Hassibi

    Abstract: Most modern learning problems are highly overparameterized, meaning that there are many more parameters than the number of training data points, and as a result, the training loss may have infinitely many global minima (parameter vectors that perfectly interpolate the training data). Therefore, it is important to understand which interpolating solutions we converge to, how they depend on the initi…

    Submitted 10 June, 2019; originally announced June 2019.

  23. arXiv:1904.01855  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    A Stochastic Interpretation of Stochastic Mirror Descent: Risk-Sensitive Optimality

    Authors: Navid Azizan, Babak Hassibi

    Abstract: Stochastic mirror descent (SMD) is a fairly new family of algorithms that has recently found a wide range of applications in optimization, machine learning, and control. It can be considered a generalization of the classical stochastic gradient algorithm (SGD), where instead of updating the weight vector along the negative direction of the stochastic gradient, the update is performed in a "mirror…

    Submitted 3 April, 2019; originally announced April 2019.

  24. arXiv:1806.00952  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Stochastic Gradient/Mirror Descent: Minimax Optimality and Implicit Regularization

    Authors: Navid Azizan, Babak Hassibi

    Abstract: Stochastic descent methods (of the gradient and mirror varieties) have become increasingly popular in optimization. In fact, it is now widely recognized that the success of deep learning is not only due to the special deep architecture of the models, but also due to the behavior of the stochastic descent methods used, which play a key role in reaching "good" solutions that generalize well to unsee…

    Submitted 17 January, 2019; v1 submitted 4 June, 2018; originally announced June 2018.