Skip to main content

Showing 1–22 of 22 results for author: Goldstein, M

  1. arXiv:2407.07998  [pdf, other

    cs.LG stat.ML

    What's the score? Automated Denoising Score Matching for Nonlinear Diffusions

    Authors: Raghav Singhal, Mark Goldstein, Rajesh Ranganath

    Abstract: Reversing a diffusion process by learning its score forms the heart of diffusion-based generative modeling and for estimating properties of scientific systems. The diffusion processes that are tractable center on linear processes with a Gaussian stationary distribution. This limits the kinds of models that can be built to those that target a Gaussian prior or more generally limits the kinds of pro… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2403.13724  [pdf, other

    cs.LG stat.ML

    Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes

    Authors: Yifan Chen, Mark Goldstein, Mengjian Hua, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden

    Abstract: We propose a framework for probabilistic forecasting of dynamical systems based on generative modeling. Given observations of the system state over time, we formulate the forecasting problem as sampling from the conditional distribution of the future system state given its current state. To this end, we leverage the framework of stochastic interpolants, which facilitates the construction of a gene… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  3. arXiv:2401.08740  [pdf, other

    cs.CV cs.LG

    SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers

    Authors: Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden, Saining Xie

    Abstract: We present Scalable Interpolant Transformers (SiT), a family of generative models built on the backbone of Diffusion Transformers (DiT). The interpolant framework, which allows for connecting two distributions in a more flexible way than standard diffusion models, makes possible a modular study of various design choices impacting generative models built on dynamical transport: using discrete vs. c… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Code available: https://github.com/willisma/SiT

  4. arXiv:2310.03725  [pdf, other

    cs.LG stat.ML

    Stochastic interpolants with data-dependent couplings

    Authors: Michael S. Albergo, Mark Goldstein, Nicholas M. Boffi, Rajesh Ranganath, Eric Vanden-Eijnden

    Abstract: Generative models inspired by dynamical transport of measure -- such as flows and diffusions -- construct a continuous-time map between two probability densities. Conventionally, one of these is the target density, only accessible through samples, while the other is taken as a simple base density that is data-agnostic. In this work, using the framework of stochastic interpolants, we formalize how… ▽ More

    Submitted 15 December, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

  5. Large Language Models to Identify Social Determinants of Health in Electronic Health Records

    Authors: Marco Guevara, Shan Chen, Spencer Thomas, Tafadzwa L. Chaunzwa, Idalid Franco, Benjamin Kann, Shalini Moningi, Jack Qian, Madeleine Goldstein, Susan Harper, Hugo JWL Aerts, Guergana K. Savova, Raymond H. Mak, Danielle S. Bitterman

    Abstract: Social determinants of health (SDoH) have an important impact on patient outcomes but are incompletely collected from the electronic health records (EHR). This study researched the ability of large language models to extract SDoH from free text in EHRs, where they are most commonly documented, and explored the role of synthetic clinical text for improving the extraction of these scarcely documente… ▽ More

    Submitted 5 March, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: Peer-reviewed version published at NPJ Digital Medicine: https://www.nature.com/articles/s41746-023-00970-0

    Journal ref: NPJ Digit Med. 2024 Jan 11;7(1):6

  6. arXiv:2303.12888  [pdf, other

    cs.LG cs.AI

    A dynamic risk score for early prediction of cardiogenic shock using machine learning

    Authors: Yuxuan Hu, Albert Lui, Mark Goldstein, Mukund Sudarshan, Andrea Tinsay, Cindy Tsui, Samuel Maidman, John Medamana, Neil Jethani, Aahlad Puli, Vuthy Nguy, Yindalon Aphinyanaphongs, Nicholas Kiefer, Nathaniel Smilowitz, James Horowitz, Tania Ahuja, Glenn I Fishman, Judith Hochman, Stuart Katz, Samuel Bernard, Rajesh Ranganath

    Abstract: Myocardial infarction and heart failure are major cardiovascular diseases that affect millions of people in the US. The morbidity and mortality are highest among patients who develop cardiogenic shock. Early recognition of cardiogenic shock is critical. Prompt implementation of treatment measures can prevent the deleterious spiral of ischemia, low blood pressure, and reduced cardiac output due to… ▽ More

    Submitted 28 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  7. arXiv:2302.07261  [pdf, other

    cs.LG stat.ML

    Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions

    Authors: Raghav Singhal, Mark Goldstein, Rajesh Ranganath

    Abstract: Diffusion-based generative models (DBGMs) perturb data to a target noise distribution and reverse this process to generate samples. The choice of noising process, or inference diffusion process, affects both likelihoods and sample quality. For example, extending the inference process with auxiliary variables leads to improved sample quality. While there are many such multivariate diffusions to exp… ▽ More

    Submitted 3 March, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  8. arXiv:2208.10759  [pdf, other

    cs.LG stat.ML

    Survival Mixture Density Networks

    Authors: Xintian Han, Mark Goldstein, Rajesh Ranganath

    Abstract: Survival analysis, the art of time-to-event modeling, plays an important role in clinical treatment decisions. Recently, continuous time models built from neural ODEs have been proposed for survival analysis. However, the training of neural ODEs is slow due to the high computational complexity of neural ODE solvers. Here, we propose an efficient alternative for flexible continuous time models, cal… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Machine Learning for Healthcare 2022

  9. arXiv:2202.07710  [pdf, other

    cs.NI

    Parallel Virtual Machines Placement with Provable Guarantees

    Authors: Itamar Cohen, Gil Einziger, Maayan Goldstein, Yaniv Sa'ar, Gabriel Scalosub, Erez Waisbard

    Abstract: Network Function Virtualization (NFV) carries the potential for on-demand deployment of network algorithms in virtual machines (VMs). In large clouds, however, VM resource allocation incurs delays that hinder the dynamic scaling of such NFV deployment. Parallel resource management is a promising direction for boosting performance, but it may significantly increase the communication overhead and th… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  10. arXiv:2112.00881  [pdf, other

    cs.LG stat.ML

    Learning Invariant Representations with Missing Data

    Authors: Mark Goldstein, Jörn-Henrik Jacobsen, Olina Chau, Adriel Saporta, Aahlad Puli, Rajesh Ranganath, Andrew C. Miller

    Abstract: Spurious correlations allow flexible models to predict well during training but poorly on related test distributions. Recent work has shown that models that satisfy particular independencies involving correlation-inducing \textit{nuisance} variables have guarantees on their test performance. Enforcing such independencies requires nuisances to be observed during training. However, nuisances, such a… ▽ More

    Submitted 8 June, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: CLeaR (Causal Learning and Reasoning) 2022

  11. arXiv:2111.08175  [pdf, other

    cs.LG stat.ML

    Inverse-Weighted Survival Games

    Authors: Xintian Han, Mark Goldstein, Aahlad Puli, Thomas Wies, Adler J Perotte, Rajesh Ranganath

    Abstract: Deep models trained through maximum likelihood have achieved state-of-the-art results for survival analysis. Despite this training scheme, practitioners evaluate models under other criteria, such as binary classification losses at a chosen set of time horizons, e.g. Brier score (BS) and Bernoulli log likelihood (BLL). Models trained with maximum likelihood may have poor BS or BLL since maximum lik… ▽ More

    Submitted 31 January, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: Neurips 2021

  12. arXiv:2107.06908  [pdf, other

    cs.LG

    Understanding Failures in Out-of-Distribution Detection with Deep Generative Models

    Authors: Lily H. Zhang, Mark Goldstein, Rajesh Ranganath

    Abstract: Deep generative models (DGMs) seem a natural fit for detecting out-of-distribution (OOD) inputs, but such models have been shown to assign higher probabilities or densities to OOD images than images from the training distribution. In this work, we explain why this behavior should be attributed to model misestimation. We first prove that no method can guarantee performance beyond random chance with… ▽ More

    Submitted 16 July, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

    Comments: Accepted at ICML 2021

  13. arXiv:2105.11385  [pdf, other

    cs.SE cs.AI

    Augmenting Modelers with Semantic Autocompletion of Processes

    Authors: Maayan Goldstein, Cecilia Gonzalez-Alvarez

    Abstract: Business process modelers need to have expertise and knowledge of the domain that may not always be available to them. Therefore, they may benefit from tools that mine collections of existing processes and recommend element(s) to be added to a new process that they are constructing. In this paper, we present a method for process autocompletion at design time, that is based on the semantic similari… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: Accepted for publication at Business Process Management Forum - BPM Forum 2021

  14. arXiv:2101.05346  [pdf, other

    cs.LG stat.ML

    X-CAL: Explicit Calibration for Survival Analysis

    Authors: Mark Goldstein, Xintian Han, Aahlad Puli, Adler J. Perotte, Rajesh Ranganath

    Abstract: Survival analysis models the distribution of time until an event of interest, such as discharge from the hospital or admission to the ICU. When a model's predicted number of events within any time interval is similar to the observed number, it is called well-calibrated. A survival model's calibration can be measured using, for instance, distributional calibration (D-CALIBRATION) [Haider et al., 20… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  15. arXiv:2007.02879  [pdf, other

    cs.LG cs.AI

    Fast Adaptation via Policy-Dynamics Value Functions

    Authors: Roberta Raileanu, Max Goldstein, Arthur Szlam, Rob Fergus

    Abstract: Standard RL algorithms assume fixed environment dynamics and require a significant amount of interaction to adapt to new environments. We introduce Policy-Dynamics Value Functions (PD-VF), a novel approach for rapidly adapting to dynamics different from those previously seen in training. PD-VF explicitly estimates the cumulative reward in a space of policies and environments. An ensemble of conven… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

  16. arXiv:2006.12862  [pdf, other

    cs.LG cs.AI

    Automatic Data Augmentation for Generalization in Deep Reinforcement Learning

    Authors: Roberta Raileanu, Max Goldstein, Denis Yarats, Ilya Kostrikov, Rob Fergus

    Abstract: Deep reinforcement learning (RL) agents often fail to generalize to unseen scenarios, even when they are trained on many instances of semantically similar environments. Data augmentation has recently been shown to improve the sample efficiency and generalization of RL agents. However, different tasks tend to benefit from different kinds of data augmentation. In this paper, we compare three approac… ▽ More

    Submitted 20 February, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

  17. arXiv:1906.10991  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Verifying Robustness of Gradient Boosted Models

    Authors: Gil Einziger, Maayan Goldstein, Yaniv Sa'ar, Itai Segall

    Abstract: Gradient boosted models are a fundamental machine learning technique. Robustness to small perturbations of the input is an important quality measure for machine learning models, but the literature lacks a method to prove the robustness of gradient boosted models. This work introduces VeriGB, a tool for quantifying the robustness of gradient boosted models. VeriGB encodes the model and the robustne… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

  18. arXiv:1804.08902  [pdf, other

    cs.SE cs.CR cs.DS cs.LG

    Learning Software Constraints via Installation Attempts

    Authors: Ran Ben Basat, Maayan Goldstein, Itai Segall

    Abstract: Modern software systems are expected to be secure and contain all the latest features, even when new versions of software are released multiple times an hour. Each system may include many interacting packages. The problem of installing multiple dependent packages has been extensively studied in the past, yielding some promising solutions that work well in practice. However, these assume that the d… ▽ More

    Submitted 14 November, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

  19. arXiv:1711.11487  [pdf, ps, other

    eess.SY cs.CR

    FRAPpuccino: Fault-detection through Runtime Analysis of Provenance

    Authors: Xueyuan Han, Thomas Pasquier, Tanvi Ranjan, Mark Goldstein, Margo Seltzer

    Abstract: We present FRAPpuccino (or FRAP), a provenance-based fault detection mechanism for Platform as a Service (PaaS) users, who run many instances of an application on a large cluster of machines. FRAP models, records, and analyzes the behavior of an application and its impact on the system as a directed acyclic provenance graph. It assumes that most instances behave normally and uses their behavior to… ▽ More

    Submitted 30 November, 2017; originally announced November 2017.

    Comments: 7 pages, 2 figures, 1 table

    Journal ref: Han, X., Pasquier, T., Ranjan, T., Goldstein, M. and Seltzer, M., 2017. FRAPpuccino: Fault-detection through Runtime Analysis of Provenance

  20. Practical Whole-System Provenance Capture

    Authors: Thomas Pasquier, Xueyuan Han, Mark Goldstein, Thomas Moyer, David Eyers, Margo Seltzer, Jean Bacon

    Abstract: Data provenance describes how data came to be in its present form. It includes data sources and the transformations that have been applied to them. Data provenance has many uses, from forensics and security to aiding the reproducibility of scientific experiments. We present CamFlow, a whole-system provenance capture mechanism that integrates easily into a PaaS offering. While there have been sever… ▽ More

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: 15 pages, 7 figures

    Journal ref: SoCC '17 Proceedings of the 2017 Symposium on Cloud Computing

  21. arXiv:1708.07280  [pdf, other

    cs.AI

    Learning Generalized Reactive Policies using Deep Neural Networks

    Authors: Edward Groshev, Maxwell Goldstein, Aviv Tamar, Siddharth Srivastava, Pieter Abbeel

    Abstract: We present a new approach to learning for planning, where knowledge acquired while solving a given set of planning problems is used to plan faster in related, but new problem instances. We show that a deep neural network can be used to learn and represent a \emph{generalized reactive policy} (GRP) that maps a problem instance and a state to an action, and that the learned GRPs efficiently solve la… ▽ More

    Submitted 24 July, 2018; v1 submitted 24 August, 2017; originally announced August 2017.

  22. arXiv:1201.3078  [pdf, ps, other

    cs.SE

    Empirical Confirmation (and Refutation) of Presumptions on Software

    Authors: Joseph Gil, Maayan Goldstein, Dany Moshkovich

    Abstract: Code metrics are easy to define, but not so easy to justify. It is hard to prove that a metric is valid, i.e., that measured numerical values imply anything on the vaguely defined, yet crucial software properties such as complexity and maintainability. This paper employs statistical analysis and tests to check some "believable" presumptions on the behavior of software and metrics measured for this… ▽ More

    Submitted 15 January, 2012; originally announced January 2012.