Skip to main content

Showing 1–50 of 61 results for author: Bertsimas, D

  1. arXiv:2405.20486  [pdf, other

    cs.LG

    Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning

    Authors: Dimitris Bertsimas, Matthew Peroni

    Abstract: As a multitude of capable machine learning (ML) models become widely available in forms such as open-source software and public APIs, central questions remain regarding their use in real-world applications, especially in high-stakes decision-making. Is there always one best model that should be used? When are the models likely to be error-prone? Should a black-box or interpretable model be used? I… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Submitted to JMLR on 5/30/2024

  2. arXiv:2405.07068  [pdf, other

    math.OC cs.LG

    Catastrophe Insurance: An Adaptive Robust Optimization Approach

    Authors: Dimitris Bertsimas, Cynthia Zeng

    Abstract: The escalating frequency and severity of natural disasters, exacerbated by climate change, underscore the critical role of insurance in facilitating recovery and promoting investments in risk reduction. This work introduces a novel Adaptive Robust Optimization (ARO) framework tailored for the calculation of catastrophe insurance premiums, with a case study applied to the United States National Flo… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  3. arXiv:2404.18975  [pdf

    cs.LG cs.AI

    M3H: Multimodal Multitask Machine Learning for Healthcare

    Authors: Dimitris Bertsimas, Yu Ma

    Abstract: Developing an integrated many-to-many framework leveraging multimodal data for multiple tasks is crucial to unifying healthcare applications ranging from diagnoses to operations. In resource-constrained hospital environments, a scalable and unified machine learning framework that improves previous forecast performances could improve hospital operations and save costs. We introduce M3H, an explaina… ▽ More

    Submitted 8 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  4. arXiv:2403.19871  [pdf, other

    cs.LG cs.AI math.OC

    Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis

    Abstract: We consider the task of retraining machine learning (ML) models when new batches of data become available. Existing methods focus largely on greedy approaches to find the best-performing model for each batch, without considering the stability of the model's structure across retraining iterations. In this study, we propose a methodology for finding sequences of ML models that are stable across retr… ▽ More

    Submitted 22 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  5. arXiv:2402.01543  [pdf, other

    cs.LG stat.ML

    Adaptive Optimization for Prediction with Missing Data

    Authors: Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

    Abstract: When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions. In this paper, we view prediction with missing data as a two-stage adaptive optimization problem and propose a new class of models, adaptive linear regression models, where the regression coefficients a… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2104.03158

  6. arXiv:2401.12181  [pdf, other

    cs.LG cs.AI cs.CL

    Universal Neurons in GPT2 Language Models

    Authors: Wes Gurnee, Theo Horsley, Zifan Carl Guo, Tara Rezaei Kheirkhah, Qinyi Sun, Will Hathaway, Neel Nanda, Dimitris Bertsimas

    Abstract: A basic question within the emerging field of mechanistic interpretability is the degree to which neural networks learn the same underlying mechanisms. In other words, are neural mechanisms universal across different models? In this work, we study the universality of individual neurons across GPT2 models trained from different initial random seeds, motivated by the hypothesis that universal neuron… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  7. arXiv:2311.06960  [pdf, other

    cs.LG math.OC

    Robust Regression over Averaged Uncertainty

    Authors: Dimitris Bertsimas, Yu Ma

    Abstract: We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least-squared regression problem. We show that this formulation surprisingly recovers ridge regression and establishes the missing link between robust optimization and the mean squared error approaches for existing… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  8. arXiv:2311.01742  [pdf, other

    math.OC cs.LG

    Global Optimization: A Machine Learning Approach

    Authors: Dimitris Bertsimas, Georgios Margaritis

    Abstract: Many approaches for addressing Global Optimization problems typically rely on relaxations of nonlinear constraints over specific mathematical primitives. This is restricting in applications with constraints that are black-box, implicit or consist of more general primitives. Trying to address such limitations, Bertsimas and Ozturk (2023) proposed OCTHaGOn as a way of solving black-box global optimi… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Submitted to the Journal of Global Optimization. 35 pages

  9. arXiv:2311.01681  [pdf, other

    stat.AP cs.AI stat.ME

    The R.O.A.D. to precision medicine

    Authors: Dimitris Bertsimas, Angelos G. Koulouras, Georgios Antonios Margonis

    Abstract: We propose a prognostic stratum matching framework that addresses the deficiencies of Randomized trial data subgroup analysis and transforms ObservAtional Data to be used as if they were randomized, thus paving the road for precision medicine. Our approach counters the effects of unobserved confounding in observational data by correcting the estimated probabilities of the outcome under a treatment… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  10. arXiv:2307.12409  [pdf, other

    cs.LG math.OC

    A Machine Learning Approach to Two-Stage Adaptive Robust Optimization

    Authors: Dimitris Bertsimas, Cheol Woo Kim

    Abstract: We propose an approach based on machine learning to solve two-stage linear adaptive robust optimization (ARO) problems with binary here-and-now variables and polyhedral uncertainty sets. We encode the optimal here-and-now decisions, the worst-case scenarios associated with the optimal here-and-now decisions, and the optimal wait-and-see decisions into what we denote as the strategy. We solve multi… ▽ More

    Submitted 7 December, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

  11. arXiv:2307.12405  [pdf, other

    cs.LG

    Optimal Control of Multiclass Fluid Queueing Networks: A Machine Learning Approach

    Authors: Dimitris Bertsimas, Cheol Woo Kim

    Abstract: We propose a machine learning approach to the optimal control of multiclass fluid queueing networks (MFQNETs) that provides explicit and insightful control policies. We prove that a threshold type optimal policy exists for MFQNET control problems, where the threshold curves are hyperplanes passing through the origin. We use Optimal Classification Trees with hyperplane splits (OCT-H) to learn an op… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  12. arXiv:2306.04647  [pdf, other

    eess.SP cs.LG stat.ML

    Compressed Sensing: A Discrete Optimization Approach

    Authors: Dimitris Bertsimas, Nicholas A. G. Johnson

    Abstract: We study the Compressed Sensing (CS) problem, which is the problem of finding the most sparse vector that satisfies a set of linear measurements up to some numerical tolerance. We introduce an $\ell_2$ regularized formulation of CS which we reformulate as a mixed integer second order cone program. We derive a second order cone relaxation of this problem and show that under mild conditions on the r… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 June, 2023; originally announced June 2023.

    Journal ref: Springer Machine Learning 2024

  13. arXiv:2305.17299  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    Improving Stability in Decision Tree Models

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr

    Abstract: Owing to their inherently interpretable structure, decision trees are commonly used in applications where interpretability is essential. Recent work has focused on improving various aspects of decision trees, including their predictive power and robustness; however, their instability, albeit well-documented, has been addressed to a lesser extent. In this paper, we take a step towards the stabiliza… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  14. arXiv:2305.15629  [pdf, other

    cs.LG cs.AI

    Patient Outcome Predictions Improve Operations at a Large Hospital Network

    Authors: Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein, Dimitris Bertsimas

    Abstract: Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals. A large hospital network in the US has been collaborating with academics and consultants to predict short-term and long-term outcomes for all inpatients across their seven hospitals. Methodology/results: We develop machin… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 41 pages, 13 figures

  15. arXiv:2305.12292  [pdf, other

    cs.LG math.OC stat.ML

    Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet

    Abstract: Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible. Unfortunately, existing methods for matrix completion are heuristics that, while highly scalable and often identifying high-quality solutions, do not possess any optimality guarantees. We reexamine matrix completion with an optimality-oriented eye. We… ▽ More

    Submitted 26 January, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Updated version with new numerics showcasing relaxation for rank k>1

  16. arXiv:2305.01610  [pdf, other

    cs.LG cs.AI

    Finding Neurons in a Haystack: Case Studies with Sparse Probing

    Authors: Wes Gurnee, Neel Nanda, Matthew Pauly, Katherine Harvey, Dmitrii Troitskii, Dimitris Bertsimas

    Abstract: Despite rapid adoption and deployment of large language models (LLMs), the internal computations of these models remain opaque and poorly understood. In this work, we seek to understand how high-level human-interpretable features are represented within the internal neuron activations of LLMs. We train $k$-sparse linear classifiers (probes) on these internal activations to predict the presence of f… ▽ More

    Submitted 2 June, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

  17. arXiv:2304.04308  [pdf, other

    cs.LG cs.AI math.OC

    Ensemble Modeling for Time Series Forecasting: an Adaptive Robust Optimization Approach

    Authors: Dimitris Bertsimas, Leonard Boussioux

    Abstract: Accurate time series forecasting is critical for a wide range of problems with temporal data. Ensemble modeling is a well-established technique for leveraging multiple predictive models to increase accuracy and robustness, as the performance of a single predictor can be highly variable due to shifts in the underlying data distribution. This paper proposes a new methodology for building robust ense… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  18. arXiv:2303.12285  [pdf, other

    cs.LG cs.AI cs.CY

    Reducing Air Pollution through Machine Learning

    Authors: Dimitris Bertsimas, Leonard Boussioux, Cynthia Zeng

    Abstract: This paper presents a data-driven approach to mitigate the effects of air pollution from industrial plants on nearby cities by linking operational decisions with weather conditions. Our method combines predictive and prescriptive machine learning models to forecast short-term wind speed and direction and recommend operational decisions to reduce or pause the industrial plant's production. We exhib… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Submitted to Manufacturing and Service Operations Management

  19. arXiv:2303.06515  [pdf, other

    math.OC cs.LG

    Multistage Stochastic Optimization via Kernels

    Authors: Dimitris Bertsimas, Kimberly Villalobos Carballo

    Abstract: We develop a non-parametric, data-driven, tractable approach for solving multistage stochastic optimization problems in which decisions do not affect the uncertainty. The proposed framework represents the decision variables as elements of a reproducing kernel Hilbert space and performs functional stochastic gradient descent to minimize the empirical regularized loss. By incorporating sparsificatio… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  20. arXiv:2301.12548  [pdf, other

    cs.LG cs.CY

    Global Flood Prediction: a Multimodal Machine Learning Approach

    Authors: Cynthia Zeng, Dimitris Bertsimas

    Abstract: Flooding is one of the most destructive and costly natural disasters, and climate changes would further increase risks globally. This work presents a novel multimodal machine learning approach for multi-year global flood risk prediction, combining geographical information and historical natural disaster dataset. Our multimodal framework employs state-of-the-art processing techniques to extract emb… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

    Comments: 6 pages

  21. arXiv:2210.08326  [pdf, ps, other

    stat.ME cs.LG math.OC stat.ML

    Distributionally Robust Causal Inference with Observational Data

    Authors: Dimitris Bertsimas, Kosuke Imai, Michael Lingzhi Li

    Abstract: We consider the estimation of average treatment effects in observational studies and propose a new framework of robust causal inference with unobserved confounders. Our approach is based on distributionally robust optimization and proceeds in two steps. We first specify the maximal degree to which the distribution of unobserved potential outcomes may deviate from that of observed outcomes. We then… ▽ More

    Submitted 2 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

  22. arXiv:2206.10381  [pdf, other

    cs.LG

    TabText: A Flexible and Contextual Approach to Tabular Data Representation

    Authors: Kimberly Villalobos Carballo, Liangyuan Na, Yu Ma, Léonard Boussioux, Cynthia Zeng, Luis R. Soenksen, Dimitris Bertsimas

    Abstract: Tabular data is essential for applying machine learning tasks across various industries. However, traditional data processing methods do not fully utilize all the information available in the tables, ignoring important contextual information such as column header descriptions. In addition, pre-processing data into a tabular format can remain a labor-intensive bottleneck in model development. This… ▽ More

    Submitted 21 July, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

  23. arXiv:2206.00176  [pdf, other

    cs.LG eess.SY math.OC

    Learning Sparse Nonlinear Dynamics via Mixed-Integer Optimization

    Authors: Dimitris Bertsimas, Wes Gurnee

    Abstract: Discovering governing equations of complex dynamical systems directly from data is a central problem in scientific machine learning. In recent years, the sparse identification of nonlinear dynamics (SINDy) framework, powered by heuristic sparse regression methods, has become a dominant tool for learning parsimonious models. We propose an exact formulation of the SINDy problem using mixed-integer o… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

  24. arXiv:2202.12998  [pdf

    cs.LG cs.AI cs.CY

    Integrated multimodal artificial intelligence framework for healthcare applications

    Authors: Luis R. Soenksen, Yu Ma, Cynthia Zeng, Leonard D. J. Boussioux, Kimberly Villalobos Carballo, Liangyuan Na, Holly M. Wiberg, Michael L. Li, Ignacio Fuentes, Dimitris Bertsimas

    Abstract: Artificial intelligence (AI) systems hold great promise to improve healthcare over the next decades. Specifically, AI systems leveraging multiple data sources and input modalities are poised to become a viable method to deliver more accurate results and deployable pipelines across a wide range of applications. In this work, we propose and evaluate a unified Holistic AI in Medicine (HAIM) framework… ▽ More

    Submitted 26 September, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Journal ref: Nature npj Digital Medicine, 2022

  25. arXiv:2112.09279  [pdf, other

    cs.LG math.OC stat.ML

    Robust Upper Bounds for Adversarial Training

    Authors: Dimitris Bertsimas, Xavier Boix, Kimberly Villalobos Carballo, Dick den Hertog

    Abstract: Many state-of-the-art adversarial training methods for deep learning leverage upper bounds of the adversarial loss to provide security guarantees against adversarial attacks. Yet, these methods rely on convex relaxations to propagate lower and upper bounds for intermediate layers, which affect the tightness of the bound at the output layer. We introduce a new approach to adversarial training by mi… ▽ More

    Submitted 5 April, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

  26. arXiv:2111.04469  [pdf, other

    math.OC cs.LG stat.ML

    Mixed-Integer Optimization with Constraint Learning

    Authors: Donato Maragno, Holly Wiberg, Dimitris Bertsimas, S. Ilker Birbil, Dick den Hertog, Adejuyigbe Fajemisin

    Abstract: We establish a broad methodological foundation for mixed-integer optimization with learned constraints. We propose an end-to-end pipeline for data-driven decision making in which constraints and objectives are directly learned from data using machine learning, and the trained models are embedded in an optimization formulation. We exploit the mixed-integer optimization-representability of many mach… ▽ More

    Submitted 26 October, 2023; v1 submitted 4 November, 2021; originally announced November 2021.

  27. arXiv:2110.15829  [pdf, other

    cs.LG cs.AI

    Holistic Deep Learning

    Authors: Dimitris Bertsimas, Kimberly Villalobos Carballo, Léonard Boussioux, Michael Lingzhi Li, Alex Paskov, Ivan Paskov

    Abstract: This paper presents a novel holistic deep learning framework that simultaneously addresses the challenges of vulnerability to input perturbations, overparametrization, and performance instability from different train-validation splits. The proposed framework holistically improves accuracy, robustness, sparsity, and stability over standard deep learning models, as demonstrated by extensive experime… ▽ More

    Submitted 20 March, 2023; v1 submitted 29 October, 2021; originally announced October 2021.

    Comments: Under review at Machine Learning

  28. arXiv:2109.12701  [pdf, other

    stat.ML cs.LG math.OC

    Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Nicholas A. G. Johnson

    Abstract: We study the Sparse Plus Low-Rank decomposition problem (SLR), which is the problem of decomposing a corrupted data matrix into a sparse matrix of perturbations plus a low-rank matrix containing the ground truth. SLR is a fundamental problem in Operations Research and Machine Learning which arises in various applications, including data compression, latent semantic indexing, collaborative filterin… ▽ More

    Submitted 1 October, 2023; v1 submitted 26 September, 2021; originally announced September 2021.

    Journal ref: Journal of Machine Learning Research, 24(267), 1-51 (2023)

  29. arXiv:2107.03900  [pdf, other

    cs.CY cs.LG stat.ML

    The Price of Diversity

    Authors: Hari Bandi, Dimitris Bertsimas

    Abstract: Systemic bias with respect to gender, race and ethnicity, often unconscious, is prevalent in datasets involving choices among individuals. Consequently, society has found it challenging to alleviate bias and achieve diversity in a way that maintains meritocracy in such settings. We propose (a) a novel optimization approach based on optimally flipping outcome labels and training classification mode… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  30. arXiv:2106.00839  [pdf, other

    cs.LG q-fin.RM stat.ML

    Algorithmic Insurance

    Authors: Dimitris Bertsimas, Agni Orfanoudaki

    Abstract: As machine learning algorithms start to get integrated into the decision-making process of companies and organizations, insurance products are being developed to protect their owners from liability risk. Algorithmic liability differs from human liability since it is based on a single model compared to multiple heterogeneous decision-makers and its performance is known a priori for a given set of d… ▽ More

    Submitted 14 December, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

  31. arXiv:2105.05947  [pdf, other

    math.OC cs.LG stat.ML

    A new perspective on low-rank optimization

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: A key question in many low-rank problems throughout optimization, machine learning, and statistics is to characterize the convex hulls of simple low-rank sets and judiciously apply these convex hulls to obtain strong yet computationally tractable convex relaxations. We invoke the matrix perspective function - the matrix analog of the perspective function - and characterize explicitly the convex hu… ▽ More

    Submitted 2 March, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: Major revision submitted to Mathematical Programming

  32. arXiv:2104.03158  [pdf, other

    stat.ML cs.LG

    Simple Imputation Rules for Prediction with Missing Data: Contrasting Theoretical Guarantees with Empirical Performance

    Authors: Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

    Abstract: Missing data is a common issue in real-world datasets. This paper studies the performance of impute-then-regress pipelines by contrasting theoretical and empirical evidence. We establish the asymptotic consistency of such pipelines for a broad family of imputation methods. While common sense suggests that a `good' imputation method produces datasets that are plausible, we show, on the contrary, th… ▽ More

    Submitted 2 February, 2024; v1 submitted 7 April, 2021; originally announced April 2021.

  33. arXiv:2102.10773  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Slowly Varying Regression under Sparsity

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr, Michael Linghzi Li, Omar Skali Lami

    Abstract: We present the framework of slowly varying regression under sparsity, allowing sparse regression models to exhibit slow and sparse variations. The problem of parameter estimation is formulated as a mixed-integer optimization problem. We demonstrate that it can be precisely reformulated as a binary convex optimization problem through a novel relaxation technique. This relaxation involves a new equa… ▽ More

    Submitted 11 November, 2023; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Submitted to Operations Research. First submission: 02/2021

  34. arXiv:2012.04284  [pdf, other

    cs.LG stat.ML

    Optimal Survival Trees

    Authors: Dimitris Bertsimas, Jack Dunn, Emma Gibson, Agni Orfanoudaki

    Abstract: Tree-based models are increasingly popular due to their ability to identify complex relationships that are beyond the scope of parametric models. Survival tree methods adapt these models to allow for the analysis of censored outcomes, which often appear in medical data. We present a new Optimal Survival Trees algorithm that leverages mixed-integer optimization (MIO) and local search techniques to… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  35. arXiv:2011.06125  [pdf, other

    cs.LG cs.AI physics.ao-ph

    Hurricane Forecasting: A Novel Multimodal Machine Learning Framework

    Authors: Léonard Boussioux, Cynthia Zeng, Théo Guénais, Dimitris Bertsimas

    Abstract: This paper describes a novel machine learning (ML) framework for tropical cyclone intensity and track forecasting, combining multiple ML techniques and utilizing diverse data sources. Our multimodal framework, called Hurricast, efficiently combines spatial-temporal data with statistical data by extracting features with deep-learning encoder-decoder architectures and predicting with gradient-booste… ▽ More

    Submitted 24 September, 2022; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: Published by the AMS' Weather and Forecasting journal; Spotlight talk at NeurIPS 2021, Tackling Climate Change with AI ; https://journals.ametsoc.org/view/journals/wefo/37/6/WAF-D-21-0091.1.xml

    Journal ref: 2022, Weather and Forecasting, 37(6), 817-831

  36. arXiv:2009.10395  [pdf, other

    math.OC cs.LG stat.ML

    Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: We propose a framework for modeling and solving low-rank optimization problems to certifiable optimality. We introduce symmetric projection matrices that satisfy $Y^2=Y$, the matrix analog of binary variables that satisfy $z^2=z$, to model rank constraints. By leveraging regularization and strong duality, we prove that this modeling paradigm yields tractable convex optimization problems over the n… ▽ More

    Submitted 2 April, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: major revision submitted to Operations Research

    Journal ref: Operations Research, Articles in Advance 2021

  37. Frequency Estimation in Data Streams: Learning the Optimal Hashing Scheme

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr

    Abstract: We present a novel approach for the problem of frequency estimation in data streams that is based on optimization and machine learning. Contrary to state-of-the-art streaming frequency estimation algorithms, which heavily rely on random hashing to maintain the frequency distribution of the data steam using limited storage, the proposed approach exploits an observed stream prefix to near-optimally… ▽ More

    Submitted 2 June, 2021; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: Submitted to IEEE Transactions on Knowledge and Data Engineering on 07/2020. Revised on 05/2021

  38. The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr

    Abstract: We present the backbone method, a generic framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems. We solve sparse regression problems with $10^7$ features in minutes and $10^8$ features in hours, as well as decision tree problems with $10^5$ features in minutes.The proposed method operates in two phases: we first determine th… ▽ More

    Submitted 14 October, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: First submission to Machine Learning: 06/2020. Revised: 10/2021

  39. arXiv:2005.05195  [pdf, other

    math.OC cs.LG math.ST stat.CO

    Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: Sparse principal component analysis (PCA) is a popular dimensionality reduction technique for obtaining principal components which are linear combinations of a small subset of the original features. Existing approaches cannot supply certifiably optimal principal components with more than $p=100s$ of variables. By reformulating sparse PCA as a convex mixed-integer semidefinite optimization problem,… ▽ More

    Submitted 25 August, 2021; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: Revision submitted to JMLR

    Journal ref: Journal of Machine Learning Research 23(13):1-35, 2022

  40. arXiv:2004.03736  [pdf, other

    cs.RO

    Learning Mixed-Integer Convex Optimization Strategies for Robot Planning and Control

    Authors: A. Cauligi, P. Culbertson, B. Stellato, D. Bertsimas, M. Schwager, M. Pavone

    Abstract: Mixed-integer convex programming (MICP) has seen significant algorithmic and hardware improvements with several orders of magnitude solve time speedups compared to 25 years ago. Despite these advances, MICP has been rarely applied to real-world robotic control because the solution times are still too slow for online applications. In this work, we present the CoCo (Combinatorial Offline, Convex Onl… ▽ More

    Submitted 11 April, 2022; v1 submitted 7 April, 2020; originally announced April 2020.

  41. arXiv:1910.09092  [pdf, ps, other

    cs.LG math.OC stat.ME stat.ML

    Fast Exact Matrix Completion: A Unified Optimization Framework for Matrix Completion

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We formulate the problem of matrix completion with and without side information as a non-convex optimization problem. We design fastImpute based on non-convex gradient descent and show it converges to a global minimum that is guaranteed to recover closely the underlying matrix while it scales to matrices of sizes beyond $10^5 \times 10^5$. We report experiments on both synthetic and real-world dat… ▽ More

    Submitted 31 December, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

    Journal ref: Journal of Machine Learning Research 21 (2020) 1-43

  42. arXiv:1910.08483  [pdf, other

    stat.ML cs.LG stat.AP

    Personalized Treatment for Coronary Artery Disease Patients: A Machine Learning Approach

    Authors: Dimitris Bertsimas, Agni Orfanoudaki, Rory B. Weiner

    Abstract: Current clinical practice guidelines for managing Coronary Artery Disease (CAD) account for general cardiovascular risk factors. However, they do not present a framework that considers personalized patient-specific characteristics. Using the electronic health records of 21,460 patients, we created data-driven models for personalized CAD management that significantly improve health outcomes relativ… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

  43. arXiv:1910.03143  [pdf, other

    math.OC cs.LG stat.ML

    On Polyhedral and Second-Order Cone Decompositions of Semidefinite Optimization Problems

    Authors: Dimitris Bertsimas, Ryan Cory-Wright

    Abstract: We study a cutting-plane method for semidefinite optimization problems (SDOs), and supply a proof of the method's convergence, under a boundedness assumption. By relating the method's rate of convergence to an initial outer approximation's diameter, we argue that the method performs well when initialized with a second-order-cone approximation, instead of a linear approximation. We invoke the metho… ▽ More

    Submitted 2 December, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Submitted minor revision to Operations Research Letters; removed footnotes and corrected some minor typos in previous version

    Journal ref: Operations Research Letters 48(1):78-85 (2020)

  44. arXiv:1907.07307  [pdf, other

    math.OC cs.LG stat.ML

    Dynamic optimization with side information

    Authors: Dimitris Bertsimas, Christopher McCord, Bradley Sturt

    Abstract: We develop a tractable and flexible approach for incorporating side information into dynamic optimization under uncertainty. The proposed framework uses predictive machine learning methods (such as $k$-nearest neighbors, kernel regression, and random forests) to weight the relative importance of various data-driven uncertainty sets in a robust optimization formulation. Through a novel measure conc… ▽ More

    Submitted 21 July, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

  45. arXiv:1907.04669  [pdf, other

    cs.LG stat.ML

    Optimal Explanations of Linear Models

    Authors: Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin

    Abstract: When predictive models are used to support complex and important decisions, the ability to explain a model's reasoning can increase trust, expose hidden biases, and reduce vulnerability to adversarial attacks. However, attempts at interpreting models are often ad hoc and application-specific, and the concept of interpretability itself is not well-defined. We propose a general optimization framewor… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1907.03419

  46. arXiv:1907.03419  [pdf, other

    cs.LG stat.ML

    The Price of Interpretability

    Authors: Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin

    Abstract: When quantitative models are used to support decision-making on complex and important topics, understanding a model's ``reasoning'' can increase trust in its predictions, expose hidden biases, or reduce vulnerability to adversarial attacks. However, the concept of interpretability remains loosely defined and application-specific. In this paper, we introduce a mathematical framework in which machin… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

  47. arXiv:1907.02206  [pdf, other

    math.OC cs.LG

    Online Mixed-Integer Optimization in Milliseconds

    Authors: Dimitris Bertsimas, Bartolomeo Stellato

    Abstract: We propose a method to solve online mixed-integer optimization (MIO) problems at very high speed using machine learning. By exploiting the repetitive nature of online optimization, we are able to greatly speedup the solution time. Our approach encodes the optimal solution into a small amount of information denoted as strategy using the Voice of Optimization framework proposed in [BS21]. In this wa… ▽ More

    Submitted 22 March, 2021; v1 submitted 3 July, 2019; originally announced July 2019.

  48. arXiv:1907.02109  [pdf, other

    math.OC cs.LG stat.ML

    A unified approach to mixed-integer optimization problems with logical constraints

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: We propose a unified framework to address a family of classical mixed-integer optimization problems with logically constrained decision variables, including network design, facility location, unit commitment, sparse portfolio selection, binary quadratic optimization, sparse principal analysis and sparse learning problems. These problems exhibit logical relationships between continuous and discrete… ▽ More

    Submitted 25 January, 2021; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Revised version (including title change). The old title was "A unified approach to mixed-integer optimization: Nonlinear formulations and scalable algorithms"

  49. arXiv:1906.10283  [pdf, other

    stat.ML cs.LG math.OC

    Certifiably Optimal Sparse Inverse Covariance Estimation

    Authors: Dimitris Bertsimas, Jourdain Lamperski, Jean Pauphilet

    Abstract: We consider the maximum likelihood estimation of sparse inverse covariance matrices. We demonstrate that current heuristic approaches primarily encourage robustness, instead of the desired sparsity. We give a novel approach that solves the cardinality constrained likelihood problem to certifiable optimality. The approach uses techniques from mixed-integer optimization and convex optimization, and… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    MSC Class: 90C11; 90C22; 62H12

    Journal ref: Mathematical Programming 184 (2020) 491-530

  50. arXiv:1904.11637  [pdf, other

    stat.ML cs.LG

    From Predictions to Prescriptions in Multistage Optimization Problems

    Authors: Dimitris Bertsimas, Christopher McCord

    Abstract: In this paper, we introduce a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data. We assume the joint distribution of the uncertain quantities is unknown, but noisy observations, along with observations of auxiliary covariates, are available. We utilize effective predictive methods from machine learning (ML), including $k$-near… ▽ More

    Submitted 25 April, 2019; originally announced April 2019.