Skip to main content

Showing 1–50 of 70 results for author: Azizzadenesheli, K

  1. arXiv:2407.07873  [pdf, other

    cs.LG math.DS math.OC math.PR stat.ML

    Dynamical Measure Transport and Neural PDE Solvers for Sampling

    Authors: Jingtong Sun, Julius Berner, Lorenz Richter, Marius Zeinhofer, Johannes Müller, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: The task of sampling from a probability density can be approached as transporting a tractable density function to the target, known as dynamical measure transport. In this work, we tackle it through a principled unified framework using deterministic or stochastic evolutions described by partial differential equations (PDEs). This framework incorporates prior trajectory-based sampling methods, such… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2404.02986  [pdf, other

    cs.LG stat.ML

    Universal Functional Regression with Neural Operator Flows

    Authors: Yaozhong Shi, Angela F. Gao, Zachary E. Ross, Kamyar Azizzadenesheli

    Abstract: Regression on function spaces is typically limited to models with Gaussian process priors. We introduce the notion of universal functional regression, in which we aim to learn a prior distribution over non-Gaussian function spaces that remains mathematically tractable for functional regression. To do this, we develop Neural Operator Flows (OpFlow), an infinite-dimensional extension of normalizing… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  3. arXiv:2403.12553  [pdf, other

    cs.LG

    Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs

    Authors: Md Ashiqur Rahman, Robert Joseph George, Mogab Elleithy, Daniel Leibovici, Zongyi Li, Boris Bonev, Colin White, Julius Berner, Raymond A. Yeh, Jean Kossaifi, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Existing neural operator architectures face challenges when solving multiphysics problems with coupled partial differential equations (PDEs), due to complex geometries, interactions between physical variables, and the lack of large amounts of high-resolution training data. To address these issues, we propose Codomain Attention Neural Operator (CoDA-NO), which tokenizes functions along the codomain… ▽ More

    Submitted 5 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2402.16845  [pdf, other

    cs.LG cs.AI math.NA

    Neural Operators with Localized Integral and Differential Kernels

    Authors: Miguel Liu-Schiaffini, Julius Berner, Boris Bonev, Thorsten Kurth, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Neural operators learn mappings between function spaces, which is practical for learning solution operators of PDEs and other scientific modeling applications. Among them, the Fourier neural operator (FNO) is a popular architecture that performs global convolutions in the Fourier space. However, such global operations are often prone to over-smoothing and may fail to capture local details. In cont… ▽ More

    Submitted 8 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted at 2024 International Conference on Machine Learning

  5. arXiv:2402.01960  [pdf, other

    cs.LG

    Calibrated Uncertainty Quantification for Operator Learning via Conformal Prediction

    Authors: Ziqi Ma, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Operator learning has been increasingly adopted in scientific and engineering applications, many of which require calibrated uncertainty quantification. Since the output of operator learning is a continuous function, quantifying uncertainty simultaneously at all points in the domain is challenging. Current methods consider calibration at a single point or over one scalar function or make strong as… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  6. arXiv:2401.11037  [pdf, other

    cs.LG math.NA q-bio.QM

    Equivariant Graph Neural Operator for Modeling 3D Dynamics

    Authors: Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

    Abstract: Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics. Machine learning methods have achieved good success by learning graph neural networks to model spatial interactions. However, these approaches do not faithfully capture temporal correlations since the… ▽ More

    Submitted 2 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024. Copyright 2024 by the author(s)

  7. arXiv:2310.00120  [pdf, other

    cs.LG

    Multi-Grid Tensorized Fourier Neural Operator for High-Resolution PDEs

    Authors: Jean Kossaifi, Nikola Kovachki, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Memory complexity and data scarcity have so far prohibited learning solution operators of partial differential equations (PDEs) at high resolutions. We address these limitations by introducing a new data efficient and highly parallelizable operator learning approach with reduced memory requirement and better generalization, called multi-grid tensorized neural operator (MG-TFNO). MG-TFNO scales to… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  8. arXiv:2309.15325  [pdf, other

    cs.LG physics.comp-ph

    Neural Operators for Accelerating Scientific Simulations and Design

    Authors: Kamyar Azizzadenesheli, Nikola Kovachki, Zongyi Li, Miguel Liu-Schiaffini, Jean Kossaifi, Anima Anandkumar

    Abstract: Scientific discovery and engineering design are currently limited by the time and cost of physical experiments, selected mostly through trial-and-error and intuition that require deep domain expertise. Numerical simulations present an alternative to physical experiments but are usually infeasible for complex real-world domains due to the computational requirements of existing numerical methods. Ar… ▽ More

    Submitted 4 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  9. arXiv:2309.03447  [pdf, other

    physics.geo-ph cs.LG

    Broadband Ground Motion Synthesis via Generative Adversarial Neural Operators: Development and Validation

    Authors: Yaozhong Shi, Grigorios Lavrentiadis, Domniki Asimaki, Zachary E. Ross, Kamyar Azizzadenesheli

    Abstract: We present a data-driven framework for ground-motion synthesis that generates three-component acceleration time histories conditioned on moment magnitude, rupture distance , time-average shear-wave velocity at the top $30m$ ($V_{S30}$), and style of faulting. We use a Generative Adversarial Neural Operator (GANO), a resolution invariant architecture that guarantees model training independent of th… ▽ More

    Submitted 14 February, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

  10. arXiv:2309.00583  [pdf, other

    cs.LG math.NA

    Geometry-Informed Neural Operator for Large-Scale 3D PDEs

    Authors: Zongyi Li, Nikola Borislavov Kovachki, Chris Choy, Boyi Li, Jean Kossaifi, Shourya Prakash Otta, Mohammad Amin Nabian, Maximilian Stadler, Christian Hundt, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We propose the geometry-informed neural operator (GINO), a highly efficient approach to learning the solution operator of large-scale partial differential equations with varying geometries. GINO uses a signed distance function and point-cloud representations of the input shape and neural operators based on graph and Fourier architectures to learn the solution operator. The graph neural operator ha… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  11. arXiv:2308.08794  [pdf, other

    cs.LG math.DS

    Tipping Point Forecasting in Non-Stationary Dynamics on Function Spaces

    Authors: Miguel Liu-Schiaffini, Clare E. Singer, Nikola Kovachki, Tapio Schneider, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Tipping points are abrupt, drastic, and often irreversible changes in the evolution of non-stationary and chaotic dynamical systems. For instance, increased greenhouse gas concentrations are predicted to lead to drastic decreases in low cloud cover, referred to as a climatological tipping point. In this paper, we learn the evolution of such non-stationary dynamical systems using a novel recurrent… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 29 pages, 15 figures

  12. arXiv:2307.15034  [pdf, other

    cs.LG math.NA

    Guaranteed Approximation Bounds for Mixed-Precision Neural Operators

    Authors: Renbo Tu, Colin White, Jean Kossaifi, Boris Bonev, Nikola Kovachki, Gennady Pekhimenko, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Neural operators, such as Fourier Neural Operators (FNO), form a principled approach for learning solution operators for PDEs and other mappings between function spaces. However, many real-world problems require high-resolution training data, and the training time and limited GPU memory pose big barriers. One solution is to train neural operators in mixed precision to reduce the memory requirement… ▽ More

    Submitted 5 May, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: ICLR 2024

  13. arXiv:2307.08423  [pdf, other

    cs.LG physics.comp-ph

    Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

    Authors: Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, YuQing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence , et al. (38 additional authors not shown)

    Abstract: Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences. Today, AI has started to advance natural sciences by improving, accelerating, and enabling our understanding of natural phenomena at a wide range of spatial and temporal scales, giving rise to a new area of research known as AI for science (AI4Science). Being an emerging research paradigm, AI4Sc… ▽ More

    Submitted 15 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  14. arXiv:2307.05953  [pdf, ps, other

    cs.GT

    Reward Selection with Noisy Observations

    Authors: Kamyar Azizzadenesheli, Trung Dang, Aranyak Mehta, Alexandros Psomas, Qian Zhang

    Abstract: We study a fundamental problem in optimization under uncertainty. There are $n$ boxes; each box $i$ contains a hidden reward $x_i$. Rewards are drawn i.i.d. from an unknown distribution $\mathcal{D}$. For each box $i$, we see $y_i$, an unbiased estimate of its reward, which is drawn from a Normal distribution with known standard deviation $σ_i$ (and an unknown mean $x_i$). Our task is to select a… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  15. arXiv:2305.18246  [pdf, other

    cs.LG

    Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

    Authors: Haque Ishfaq, Qingfeng Lan, Pan Xu, A. Rupam Mahmood, Doina Precup, Anima Anandkumar, Kamyar Azizzadenesheli

    Abstract: We present a scalable and effective exploration strategy based on Thompson sampling for reinforcement learning (RL). One of the key shortcomings of existing Thompson sampling algorithms is the need to perform a Gaussian approximation of the posterior distribution, which is not a good surrogate in most practical settings. We instead directly sample the Q function from its posterior distribution, by… ▽ More

    Submitted 17 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Published in The Twelfth International Conference on Learning Representations (ICLR) 2024

  16. arXiv:2302.07400  [pdf, other

    cs.LG math.FA stat.ML

    Score-based Diffusion Models in Function Space

    Authors: Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar

    Abstract: Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many… ▽ More

    Submitted 22 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages

    MSC Class: 46B09 (Primary); 60J22 (Secondary) ACM Class: I.2.6; J.2

  17. arXiv:2211.16210  [pdf, other

    cs.CV cs.GR cs.LG

    PaCMO: Partner Dependent Human Motion Generation in Dyadic Human Activity using Neural Operators

    Authors: Md Ashiqur Rahman, Jasorsi Ghosh, Hrishikesh Viswanath, Kamyar Azizzadenesheli, Aniket Bera

    Abstract: We address the problem of generating 3D human motions in dyadic activities. In contrast to the concurrent works, which mainly focus on generating the motion of a single actor from the textual description, we generate the motion of one of the actors from the motion of the other participating actor in the action. This is a particularly challenging, under-explored problem, that requires learning intr… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

  18. arXiv:2211.13449  [pdf, other

    cs.LG cs.CV

    Fast Sampling of Diffusion Models via Operator Learning

    Authors: Hongkai Zheng, Weili Nie, Arash Vahdat, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Diffusion models have found widespread adoption in various areas. However, their sampling process is slow because it requires hundreds to thousands of network evaluations to emulate a continuous process defined by differential equations. In this work, we use neural operators, an efficient method to solve the probability flow differential equations, to accelerate the sampling process of diffusion m… ▽ More

    Submitted 22 July, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

  19. arXiv:2210.17051  [pdf, other

    cs.LG physics.flu-dyn

    Real-time high-resolution CO$_2$ geological storage prediction using nested Fourier neural operators

    Authors: Gege Wen, Zongyi Li, Qirui Long, Kamyar Azizzadenesheli, Anima Anandkumar, Sally M. Benson

    Abstract: Carbon capture and storage (CCS) plays an essential role in global decarbonization. Scaling up CCS deployment requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration. However, such modeling is very challenging at scale due to the high computational costs of existing numerical methods. This challenge leads to significant uncertainties… ▽ More

    Submitted 1 June, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Journal ref: Energy & Environmental Science, 16(4), 1732-1741 (2023)

  20. arXiv:2209.10444  [pdf, other

    cs.LG cs.AI stat.ML

    Off-Policy Risk Assessment in Markov Decision Processes

    Authors: Audrey Huang, Liu Leqi, Zachary Chase Lipton, Kamyar Azizzadenesheli

    Abstract: Addressing such diverse ends as safety alignment with human preferences, and the efficiency of learning, a growing line of reinforcement learning research focuses on risk functionals that depend on the entire distribution of returns. Recent work on \emph{off-policy risk assessment} (OPRA) for contextual bandits introduced consistent estimators for the target policy's CDF of returns along with fini… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

  21. arXiv:2207.05850  [pdf, other

    math.OC cs.LG eess.SY

    Compactly Restrictable Metric Policy Optimization Problems

    Authors: Victor D. Dorobantu, Kamyar Azizzadenesheli, Yisong Yue

    Abstract: We study policy optimization problems for deterministic Markov decision processes (MDPs) with metric state and action spaces, which we refer to as Metric Policy Optimization Problems (MPOPs). Our goal is to establish theoretical results on the well-posedness of MPOPs that can characterize practically relevant continuous control systems. To do so, we define a special class of MPOPs called Compactly… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: 11 pages, 1 figure, submitted to Transactions on Automatic Control

  22. arXiv:2206.13648  [pdf, other

    stat.ML cs.LG

    Supervised Learning with General Risk Functionals

    Authors: Liu Leqi, Audrey Huang, Zachary C. Lipton, Kamyar Azizzadenesheli

    Abstract: Standard uniform convergence results bound the generalization gap of the expected loss over a hypothesis class. The emergence of risk-sensitive learning requires generalization guarantees for functionals of the loss distribution beyond the expectation. While prior works specialize in uniform convergence of particular functionals, our work provides uniform convergence for a general class of Hölder… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  23. arXiv:2206.11254  [pdf, other

    cs.LG stat.ML

    Langevin Monte Carlo for Contextual Bandits

    Authors: Pan Xu, Hongkai Zheng, Eric Mazumdar, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We study the efficiency of Thompson sampling for contextual bandits. Existing Thompson sampling-based algorithms need to construct a Laplace approximation (i.e., a Gaussian distribution) of the posterior distribution, which is inefficient to sample in high dimensional applications for general covariance matrices. Moreover, the Gaussian approximation may not be a good surrogate for the posterior di… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 21 pages, 3 figures, 2 tables. To appear in the proceedings of the 39th International Conference on Machine Learning (ICML2022)

  24. arXiv:2206.08520  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    Thompson Sampling Achieves $\tilde O(\sqrt{T})$ Regret in Linear Quadratic Control

    Authors: Taylan Kargin, Sahin Lale, Kamyar Azizzadenesheli, Anima Anandkumar, Babak Hassibi

    Abstract: Thompson Sampling (TS) is an efficient method for decision-making under uncertainty, where an action is sampled from a carefully prescribed distribution which is updated based on the observed data. In this work, we study the problem of adaptive control of stabilizable linear-quadratic regulators (LQRs) using TS, where the system dynamics are unknown. Previous works have established that… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2022

  25. arXiv:2206.01704  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems

    Authors: Sahin Lale, Yuanyuan Shi, Guannan Qu, Kamyar Azizzadenesheli, Adam Wierman, Anima Anandkumar

    Abstract: Learning a dynamical system requires stabilizing the unknown dynamics to avoid state blow-ups. However, current reinforcement learning (RL) methods lack stabilization guarantees, which limits their applicability for the control of safety-critical systems. We propose a model-based RL framework with formal stability guarantees, Krasovskii Constrained RL (KCRL), that adopts Krasovskii's family of Lya… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  26. arXiv:2205.14545  [pdf, other

    cs.LG math.ST

    Functional Linear Regression of Cumulative Distribution Functions

    Authors: Qian Zhang, Anuran Makur, Kamyar Azizzadenesheli

    Abstract: The estimation of cumulative distribution functions (CDF) is an important learning task with a great variety of downstream applications, such as risk assessments in predictions and decision making. In this paper, we study functional regression of contextual CDFs where each data point is sampled from a linear combination of context dependent CDF basis functions. We propose functional ridge-regressi… ▽ More

    Submitted 7 March, 2024; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: 56 pages, 7 figures, accepted by TMLR

  27. arXiv:2205.14232  [pdf, other

    math.OC cs.LG

    Competitive Gradient Optimization

    Authors: Abhijeet Vyas, Kamyar Azizzadenesheli

    Abstract: We study the problem of convergence to a stationary point in zero-sum games. We propose competitive gradient optimization (CGO ), a gradient-based method that incorporates the interactions between the two players in zero-sum games for optimization updates. We provide continuous-time analysis of CGO and its convergence properties while showing that in the continuous limit, CGO predecessors degenera… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  28. arXiv:2205.06908  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Neural-Fly Enables Rapid Learning for Agile Flight in Strong Winds

    Authors: Michael O'Connell, Guanya Shi, Xichen Shi, Kamyar Azizzadenesheli, Anima Anandkumar, Yisong Yue, Soon-Jo Chung

    Abstract: Executing safe and precise flight maneuvers in dynamic high-speed winds is important for the ongoing commoditization of uninhabited aerial vehicles (UAVs). However, because the relationship between various wind conditions and its effect on aircraft maneuverability is not well understood, it is challenging to design effective robot controllers using traditional control design methods. We present Ne… ▽ More

    Submitted 11 April, 2024; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: This is the accepted version of Science Robotics Vol. 7, Issue 66, eabm6597 (2022). Video: https://youtu.be/TuF9teCZX0U

  29. arXiv:2205.03017  [pdf, other

    cs.LG math.PR

    Generative Adversarial Neural Operators

    Authors: Md Ashiqur Rahman, Manuel A. Florez, Anima Anandkumar, Zachary E. Ross, Kamyar Azizzadenesheli

    Abstract: We propose the generative adversarial neural operator (GANO), a generative model paradigm for learning probabilities on infinite-dimensional function spaces. The natural sciences and engineering are known to have many types of data that are sampled from infinite-dimensional function spaces, where classical finite-dimensional deep generative adversarial networks (GANs) may not be directly applicabl… ▽ More

    Submitted 12 October, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: Transactions on Machine Learning Research 2022

  30. arXiv:2204.11127  [pdf, other

    cs.LG

    U-NO: U-shaped Neural Operators

    Authors: Md Ashiqur Rahman, Zachary E. Ross, Kamyar Azizzadenesheli

    Abstract: Neural operators generalize classical neural networks to maps between infinite-dimensional spaces, e.g., function spaces. Prior works on neural operators proposed a series of novel methods to learn such maps and demonstrated unprecedented success in learning solution operators of partial differential equations. Due to their close proximity to fully connected architectures, these models mainly suff… ▽ More

    Submitted 5 May, 2023; v1 submitted 23 April, 2022; originally announced April 2022.

  31. arXiv:2202.11214  [pdf, other

    physics.ao-ph cs.LG

    FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators

    Authors: Jaideep Pathak, Shashank Subramanian, Peter Harrington, Sanjeev Raja, Ashesh Chattopadhyay, Morteza Mardani, Thorsten Kurth, David Hall, Zongyi Li, Kamyar Azizzadenesheli, Pedram Hassanzadeh, Karthik Kashinath, Animashree Anandkumar

    Abstract: FourCastNet, short for Fourier Forecasting Neural Network, is a global data-driven weather forecasting model that provides accurate short to medium-range global predictions at $0.25^{\circ}$ resolution. FourCastNet accurately forecasts high-resolution, fast-timescale variables such as the surface wind speed, precipitation, and atmospheric water vapor. It has important implications for planning win… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  32. arXiv:2111.03794  [pdf, other

    cs.LG math.NA

    Physics-Informed Neural Operator for Learning Partial Differential Equations

    Authors: Zongyi Li, Hongkai Zheng, Nikola Kovachki, David Jin, Haoxuan Chen, Burigede Liu, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: In this paper, we propose physics-informed neural operators (PINO) that combine training data and physics constraints to learn the solution operator of a given family of parametric Partial Differential Equations (PDE). PINO is the first hybrid approach incorporating data and PDE constraints at different resolutions to learn the operator. Specifically, in PINO, we combine coarse-resolution training… ▽ More

    Submitted 29 July, 2023; v1 submitted 5 November, 2021; originally announced November 2021.

  33. arXiv:2109.03697  [pdf, other

    physics.geo-ph cs.LG

    U-FNO -- An enhanced Fourier neural operator-based deep-learning model for multiphase flow

    Authors: Gege Wen, Zongyi Li, Kamyar Azizzadenesheli, Anima Anandkumar, Sally M. Benson

    Abstract: Numerical simulation of multiphase flow in porous media is essential for many geoscience applications. Machine learning models trained with numerical simulation data can provide a faster alternative to traditional simulators. Here we present U-FNO, a novel neural network architecture for solving multiphase flow problems with superior accuracy, speed, and data efficiency. U-FNO is designed based on… ▽ More

    Submitted 4 May, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

  34. arXiv:2108.11959  [pdf, ps, other

    cs.LG eess.SY math.OC

    Finite-time System Identification and Adaptive Control in Autoregressive Exogenous Systems

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: Autoregressive exogenous (ARX) systems are the general class of input-output dynamical systems used for modeling stochastic linear dynamical systems (LDS) including partially observable LDS such as LQG systems. In this work, we study the problem of system identification and adaptive control of unknown ARX systems. We provide finite-time learning guarantees for the ARX systems under both open-loop… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 3rd Annual Learning for Dynamics & Control Conference (L4DC)

  35. Neural Operator: Learning Maps Between Function Spaces

    Authors: Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has primarily focused on learning mappings between finite dimensional Euclidean spaces or finite sets. We propose a generalization of neural networks to learn operators, termed neural operators, that map between infinite dimensional function spaces. We formulate the neural operator as a composition of linear integral operators and nonlinear activation f… ▽ More

    Submitted 2 May, 2024; v1 submitted 18 August, 2021; originally announced August 2021.

    Journal ref: The Journal of Machine Learning Research (2023), Volume 24, Issue 1, Article No 89, pp 4061-4157

  36. arXiv:2108.05421  [pdf, other

    physics.geo-ph cs.LG

    Seismic wave propagation and inversion with Neural Operators

    Authors: Yan Yang, Angela F. Gao, Jorge C. Castellanos, Zachary E. Ross, Kamyar Azizzadenesheli, Robert W. Clayton

    Abstract: Seismic wave propagation forms the basis for most aspects of seismological research, yet solving the wave equation is a major computational burden that inhibits the progress of research. This is exacerbated by the fact that new simulations must be performed when the velocity structure or source location is perturbed. Here, we explore a prototype framework for learning general solutions using a rec… ▽ More

    Submitted 13 October, 2021; v1 submitted 11 August, 2021; originally announced August 2021.

  37. arXiv:2106.06898  [pdf, other

    cs.LG math.DS

    Learning Dissipative Dynamics in Chaotic Systems

    Authors: Zongyi Li, Miguel Liu-Schiaffini, Nikola Kovachki, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: Chaotic systems are notoriously challenging to predict because of their sensitivity to perturbations and errors due to time stepping. Despite this unpredictable behavior, for many dissipative systems the statistics of the long term trajectories are governed by an invariant measure supported on a set, known as the global attractor; for many problems this set is finite dimensional, even if the state… ▽ More

    Submitted 27 September, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  38. arXiv:2106.06098  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Meta-Adaptive Nonlinear Control: Theory and Algorithms

    Authors: Guanya Shi, Kamyar Azizzadenesheli, Michael O'Connell, Soon-Jo Chung, Yisong Yue

    Abstract: We present an online multi-task learning approach for adaptive nonlinear control, which we call Online Meta-Adaptive Control (OMAC). The goal is to control a nonlinear system subject to adversarial disturbance and unknown $\textit{environment-dependent}$ nonlinear dynamics, under the assumption that the environment-dependent dynamics can be well captured with some shared representation. Our approa… ▽ More

    Submitted 26 October, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia

  39. arXiv:2104.08977  [pdf, other

    cs.LG stat.ML

    Off-Policy Risk Assessment in Contextual Bandits

    Authors: Audrey Huang, Liu Leqi, Zachary C. Lipton, Kamyar Azizzadenesheli

    Abstract: Even when unable to run experiments, practitioners can evaluate prospective policies, using previously logged data. However, while the bandits literature has adopted a diverse set of objectives, most research on off-policy evaluation to date focuses on the expected reward. In this paper, we introduce Lipschitz risk functionals, a broad class of objectives that subsumes conditional value-at-risk (C… ▽ More

    Submitted 29 June, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

  40. arXiv:2103.02827  [pdf, other

    cs.LG cs.AI stat.ML

    On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk

    Authors: Audrey Huang, Liu Leqi, Zachary C. Lipton, Kamyar Azizzadenesheli

    Abstract: In order to model risk aversion in reinforcement learning, an emerging line of research adapts familiar algorithms to optimize coherent risk functionals, a class that includes conditional value-at-risk (CVaR). Because optimizing the coherent risk is difficult in Markov decision processes, recent work tends to focus on the Markov coherent risk (MCR), a time-consistent surrogate. While, policy gradi… ▽ More

    Submitted 5 March, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  41. arXiv:2102.08462  [pdf, ps, other

    cs.LG cs.AI cs.MA

    Multi-Agent Multi-Armed Bandits with Limited Communication

    Authors: Mridul Agarwal, Vaneet Aggarwal, Kamyar Azizzadenesheli

    Abstract: We consider the problem where $N$ agents collaboratively interact with an instance of a stochastic $K$ arm bandit problem for $K \gg N$. The agents aim to simultaneously minimize the cumulative regret over all the agents for a total of $T$ time steps, the number of communication rounds, and the number of bits in each communication round. We present Limited Communication Collaboration - Upper Confi… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

  42. arXiv:2101.03271  [pdf, other

    physics.geo-ph cs.LG

    HypoSVI: Hypocenter inversion with Stein variational inference and Physics Informed Neural Networks

    Authors: Jonathan D. Smith, Zachary E. Ross, Kamyar Azizzadenesheli, Jack B. Muir

    Abstract: We introduce a scheme for probabilistic hypocenter inversion with Stein variational inference. Our approach uses a differentiable forward model in the form of a physics informed neural network, which we train to solve the Eikonal equation. This allows for rapid approximation of the posterior by iteratively optimizing a collection of particles against a kernelized Stein discrepancy. We show that th… ▽ More

    Submitted 17 August, 2022; v1 submitted 8 January, 2021; originally announced January 2021.

    Comments: Updating to accepted version of the paper

  43. arXiv:2011.14251  [pdf, other

    cs.LG

    Importance Weight Estimation and Generalization in Domain Adaptation under Label Shift

    Authors: Kamyar Azizzadenesheli

    Abstract: We study generalization under labeled shift for categorical and general normed label spaces. We propose a series of methods to estimate the importance weights from labeled source to unlabeled target domain and provide confidence bounds for these estimators. We deploy these estimators and provide generalization bounds in the unlabeled target domain.

    Submitted 5 June, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

  44. arXiv:2010.08895  [pdf, other

    cs.LG math.NA

    Fourier Neural Operator for Parametric Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has primarily focused on learning mappings between finite-dimensional Euclidean spaces. Recently, this has been generalized to neural operators that learn mappings between function spaces. For partial differential equations (PDEs), neural operators directly learn the mapping from any functional parametric dependence to the solution. Thus, they learn an… ▽ More

    Submitted 16 May, 2021; v1 submitted 17 October, 2020; originally announced October 2020.

  45. arXiv:2007.12291  [pdf, other

    cs.LG math.OC stat.ML

    Reinforcement Learning with Fast Stabilization in Linear Dynamical Systems

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynamical systems. When learning a dynamical system, one needs to stabilize the unknown dynamics in order to avoid system blow-ups. We propose an algorithm that certifies fast stabilization of the underlying system by effectively exploring the environment with an improved exploration strategy. We show tha… ▽ More

    Submitted 3 June, 2022; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

  46. arXiv:2006.15637  [pdf, other

    cs.LG stat.ML

    Deep Bayesian Quadrature Policy Optimization

    Authors: Akella Ravi Tej, Kamyar Azizzadenesheli, Mohammad Ghavamzadeh, Anima Anandkumar, Yisong Yue

    Abstract: We study the problem of obtaining accurate policy gradient estimates using a finite number of samples. Monte-Carlo methods have been the default choice for policy gradient estimation, despite suffering from high variance in the gradient estimates. On the other hand, more sample efficient alternatives like Bayesian quadrature methods have received little attention due to their high computational co… ▽ More

    Submitted 16 December, 2020; v1 submitted 28 June, 2020; originally announced June 2020.

    Comments: Conference paper: AAAI-21. Code available at https://github.com/Akella17/Deep-Bayesian-Quadrature-Policy-Optimization

  47. arXiv:2006.10611  [pdf, other

    cs.LG cs.GT cs.MA stat.ML

    Competitive Policy Optimization

    Authors: Manish Prajapat, Kamyar Azizzadenesheli, Alexander Liniger, Yisong Yue, Anima Anandkumar

    Abstract: A core challenge in policy optimization in competitive Markov decision processes is the design of efficient optimization methods with desirable convergence and stability properties. To tackle this, we propose competitive policy optimization (CoPO), a novel policy gradient approach that exploits the game-theoretic nature of competitive games to derive policy updates. Motivated by the competitive gr… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 11 pages main paper, 6 pages references, and 31 pages appendix. 14 figures

  48. arXiv:2006.09535  [pdf, other

    cs.LG math.NA stat.ML

    Multipole Graph Neural Operator for Parametric Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: One of the main challenges in using deep learning-based methods for simulating physical systems and solving partial differential equations (PDEs) is formulating physics-based data in the desired structure for neural networks. Graph neural networks (GNNs) have gained popularity in this area since graphs offer a natural way of modeling particle interactions and provide a clear way of discretizing th… ▽ More

    Submitted 19 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

  49. arXiv:2005.01463  [pdf, other

    cs.LG eess.IV physics.flu-dyn stat.ML

    MeshfreeFlowNet: A Physics-Constrained Deep Continuous Space-Time Super-Resolution Framework

    Authors: Chiyu Max Jiang, Soheil Esmaeilzadeh, Kamyar Azizzadenesheli, Karthik Kashinath, Mustafa Mustafa, Hamdi A. Tchelepi, Philip Marcus, Prabhat, Anima Anandkumar

    Abstract: We propose MeshfreeFlowNet, a novel deep learning-based super-resolution framework to generate continuous (grid-free) spatio-temporal solutions from the low-resolution inputs. While being computationally efficient, MeshfreeFlowNet accurately recovers the fine-scale quantities of interest. MeshfreeFlowNet allows for: (i) the output to be sampled at all spatio-temporal resolutions, (ii) a set of Par… ▽ More

    Submitted 21 August, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: Supplementary Video: https://youtu.be/mjqwPch9gDo. Accepted to SC20

  50. arXiv:2004.00361  [pdf, other

    physics.comp-ph cs.LG physics.geo-ph physics.optics stat.ML

    EikoNet: Solving the Eikonal equation with Deep Neural Networks

    Authors: Jonathan D. Smith, Kamyar Azizzadenesheli, Zachary E. Ross

    Abstract: The recent deep learning revolution has created an enormous opportunity for accelerating compute capabilities in the context of physics-based simulations. Here, we propose EikoNet, a deep learning approach to solving the Eikonal equation, which characterizes the first-arrival-time field in heterogeneous 3D velocity structures. Our grid-free approach allows for rapid determination of the travel tim… ▽ More

    Submitted 11 August, 2020; v1 submitted 24 March, 2020; originally announced April 2020.

    Comments: Revised version