Skip to main content

Showing 1–20 of 20 results for author: Dervovic, D

  1. arXiv:2406.13427  [pdf, other

    cs.LG

    Are Logistic Models Really Interpretable?

    Authors: Danial Dervovic, Freddy Lécué, Nicolás Marchesotti, Daniele Magazzeni

    Abstract: The demand for open and trustworthy AI models points towards widespread publishing of model weights. Consumers of these model weights must be able to act accordingly with the information provided. That said, one of the simplest AI classification models, Logistic Regression (LR), has an unwieldy interpretation of its model weights, with greater difficulties when extending LR to generalised additive… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 36 pages, 5 Figures. Extended version of paper accepted to IJCAI 2024. arXiv admin note: substantial text overlap with arXiv:2211.06360

  2. arXiv:2406.01899  [pdf, other

    cs.LG

    Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models

    Authors: Wenzhuo Tang, Haitao Mao, Danial Dervovic, Ivan Brugere, Saumitra Mishra, Yuying Xie, Jiliang Tang

    Abstract: Models for natural language and images benefit from data scaling behavior: the more data fed into the model, the better they perform. This 'better with more' phenomenon enables the effectiveness of large-scale pre-training on vast amounts of data. However, current graph pre-training methods struggle to scale up data due to heterogeneity across graphs. To achieve effective data scaling, we aim to d… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2404.06162  [pdf, other

    cs.CL cs.AI cs.LG

    Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

    Authors: Tianyu Cao, Natraj Raman, Danial Dervovic, Chenhao Tan

    Abstract: As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study… ▽ More

    Submitted 8 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  4. arXiv:2403.09925  [pdf, other

    cs.AI

    Surrogate Assisted Monte Carlo Tree Search in Combinatorial Optimization

    Authors: Saeid Amiri, Parisa Zehtabi, Danial Dervovic, Michael Cashmore

    Abstract: Industries frequently adjust their facilities network by opening new branches in promising areas and closing branches in areas where they expect low profits. In this paper, we examine a particular class of facility location problems. Our objective is to minimize the loss of sales resulting from the removal of several retail stores. However, estimating sales accurately is expensive and time-consumi… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted to the ICAPS Planning and Scheduling for Financial Services (FINPLAN) 2023 workshop

  5. arXiv:2403.07724  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Balancing Fairness and Accuracy in Data-Restricted Binary Classification

    Authors: Zachary McBride Lazri, Danial Dervovic, Antigoni Polychroniadou, Ivan Brugere, Dana Dachman-Soled, Min Wu

    Abstract: Applications that deal with sensitive information may have restrictions placed on the data available to a machine learning (ML) classifier. For example, in some applications, a classifier may not have direct access to sensitive attributes, affecting its ability to produce accurate and fair decisions. This paper proposes a framework that models the trade-off between accuracy and fairness under four… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  6. arXiv:2402.04375  [pdf, other

    cs.LG cs.CR

    Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic Data

    Authors: Yvonne Zhou, Mingyu Liang, Ivan Brugere, Dana Dachman-Soled, Danial Dervovic, Antigoni Polychroniadou, Min Wu

    Abstract: The growing use of machine learning (ML) has raised concerns that an ML model may reveal private information about an individual who has contributed to the training dataset. To prevent leakage of sensitive data, we consider using differentially-private (DP), synthetic training data instead of real training data to train an ML model. A key desirable property of synthetic data is its ability to pres… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  7. A Canonical Data Transformation for Achieving Inter- and Within-group Fairness

    Authors: Zachary McBride Lazri, Ivan Brugere, Xin Tian, Dana Dachman-Soled, Antigoni Polychroniadou, Danial Dervovic, Min Wu

    Abstract: Increases in the deployment of machine learning algorithms for applications that deal with sensitive data have brought attention to the issue of fairness in machine learning. Many works have been devoted to applications that require different demographic groups to be treated fairly. However, algorithms that aim to satisfy inter-group fairness (also called group fairness) may inadvertently treat in… ▽ More

    Submitted 5 July, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  8. arXiv:2307.06941  [pdf, other

    cs.AI cs.CV cs.GT cs.HC cs.LG

    On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations

    Authors: Emanuele Albini, Shubham Sharma, Saumitra Mishra, Danial Dervovic, Daniele Magazzeni

    Abstract: Explainable Artificial Intelligence (XAI) has received widespread interest in recent years, and two of the most popular types of explanations are feature attributions, and counterfactual explanations. These classes of approaches have been largely studied independently and the few attempts at reconciling them have been primarily empirical. This work establishes a clear theoretical connection betwee… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted at AIES 2023

    ACM Class: I.2; I.5; H.5; F.2

    Journal ref: AIES '23: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society

  9. arXiv:2211.06360  [pdf, ps, other

    cs.LG

    Rethinking Log Odds: Linear Probability Modelling and Expert Advice in Interpretable Machine Learning

    Authors: Danial Dervovic, Nicolas Marchesotti, Freddy Lecue, Daniele Magazzeni

    Abstract: We introduce a family of interpretable machine learning models, with two broad additions: Linearised Additive Models (LAMs) which replace the ubiquitous logistic link function in General Additive Models (GAMs); and SubscaleHedge, an expert advice algorithm for combining base models trained on subsets of features called subscales. LAMs can augment any additive binary classification model equipped w… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 33 pages, 2 figures. Comments welcome

  10. arXiv:2209.14738  [pdf, other

    stat.ML cs.LG

    Optimal Stopping with Gaussian Processes

    Authors: Kshama Dwarakanath, Danial Dervovic, Peyman Tavallali, Svitlana S Vyetrenko, Tucker Balch

    Abstract: We propose a novel group of Gaussian Process based algorithms for fast approximate optimal stopping of time series with specific applications to financial markets. We show that structural properties commonly exhibited by financial time series (e.g., the tendency to mean-revert) allow the use of Gaussian and Deep Gaussian Process models that further enable us to analytically evaluate optimal stoppi… ▽ More

    Submitted 7 October, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  11. arXiv:2203.08019  [pdf, other

    cs.LG cs.AI math.OC

    Optimal Admission Control for Multiclass Queues with Time-Varying Arrival Rates via State Abstraction

    Authors: Marc Rigter, Danial Dervovic, Parisa Hassanzadeh, Jason Long, Parisa Zehtabi, Daniele Magazzeni

    Abstract: We consider a novel queuing problem where the decision-maker must choose to accept or reject randomly arriving tasks into a no buffer queue which are processed by $N$ identical servers. Each task has a price, which is a positive real number, and a class. Each class of task has a different price distribution and service rate, and arrives according to an inhomogenous Poisson process. The objective i… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 7+1 pages main text, 16 pages supplementary material, accepted to AAAI 2022

  12. Counterfactual Shapley Additive Explanations

    Authors: Emanuele Albini, Jason Long, Danial Dervovic, Daniele Magazzeni

    Abstract: Feature attributions are a common paradigm for model explanations due to their simplicity in assigning a single numeric score for each input feature to a model. In the actionable recourse setting, wherein the goal of the explanations is to improve outcomes for model consumers, it is often unclear how feature attributions should be correctly used. With this work, we aim to strengthen and clarify th… ▽ More

    Submitted 16 May, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted at FAccT '22 (2022 ACM Conference on Fairness, Accountability, and Transparency)

    ACM Class: I.2; I.5; H.5

  13. arXiv:2110.02403  [pdf, other

    cs.LG cs.AI

    Tradeoffs in Streaming Binary Classification under Limited Inspection Resources

    Authors: Parisa Hassanzadeh, Danial Dervovic, Samuel Assefa, Prashant Reddy, Manuela Veloso

    Abstract: Institutions are increasingly relying on machine learning models to identify and alert on abnormal events, such as fraud, cyber attacks and system failures. These alerts often need to be manually investigated by specialists. Given the operational cost of manual inspections, the suspicious events are selected by alerting systems with carefully designed thresholds. In this paper, we consider an imba… ▽ More

    Submitted 29 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: To appear in Proceedings of the ACM International Conference on AI in Finance (ICAIF '21)

  14. arXiv:2106.15212  [pdf, other

    cs.LG cs.AI cs.CC

    Counterfactual Explanations for Arbitrary Regression Models

    Authors: Thomas Spooner, Danial Dervovic, Jason Long, Jon Shepard, Jiahao Chen, Daniele Magazzeni

    Abstract: We present a new method for counterfactual explanations (CFEs) based on Bayesian optimisation that applies to both classification and regression models. Our method is a globally convergent search algorithm with support for arbitrary regression models and constraints like feature sparsity and actionable recourse, and furthermore can answer multiple counterfactual questions in parallel while learnin… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: 20 pages, 5 figures, 3 tables

  15. arXiv:2106.04944  [pdf, other

    cs.AI cs.LG stat.ML

    Non-Parametric Stochastic Sequential Assignment With Random Arrival Times

    Authors: Danial Dervovic, Parisa Hassanzadeh, Samuel Assefa, Prashant Reddy

    Abstract: We consider a problem wherein jobs arrive at random times and assume random values. Upon each job arrival, the decision-maker must decide immediately whether or not to accept the job and gain the value on offer as a reward, with the constraint that they may only accept at most $n$ jobs over some reference time period. The decision-maker only has access to $M$ independent realisations of the job ar… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted to IJCAI '21, full version with Supplementary Material

    Journal ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence Main Track. Pages 4214-4220. 2021

  16. arXiv:2105.12893  [pdf, other

    stat.ME cs.CE cs.LG

    Calibrating Over-Parametrized Simulation Models: A Framework via Eligibility Set

    Authors: Yuanlu Bai, Tucker Balch, Haoxian Chen, Danial Dervovic, Henry Lam, Svitlana Vyetrenko

    Abstract: Stochastic simulation aims to compute output performance for complex models that lack analytical tractability. To ensure accurate prediction, the model needs to be calibrated and validated against real data. Conventional methods approach these tasks by assessing the model-data match via simple hypothesis tests or distance minimization in an ad hoc fashion, but they can encounter challenges arising… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  17. arXiv:1912.04941  [pdf, other

    q-fin.TR cs.MA

    Get Real: Realism Metrics for Robust Limit Order Book Market Simulations

    Authors: Svitlana Vyetrenko, David Byrd, Nick Petosa, Mahmoud Mahfouz, Danial Dervovic, Manuela Veloso, Tucker Hybinette Balch

    Abstract: Machine learning (especially reinforcement learning) methods for trading are increasingly reliant on simulation for agent training and testing. Furthermore, simulation is important for validation of hand-coded trading strategies and for testing hypotheses about market structure. A challenge, however, concerns the robustness of policies validated in simulation because the simulations lack fidelity.… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Journal ref: NeurIPS 2019 Workshop on Robust AI in Financial Services: Data, Fairness, Explainability, Trustworthiness, and Privacy

  18. arXiv:1802.09844  [pdf, other

    cs.DM math.CO

    Constructing graphs with limited resources

    Authors: Danial Dervovic, Avinash Mocherla, Simone Severini

    Abstract: We discuss the amount of physical resources required to construct a given graph, where vertices are added sequentially. We naturally identify information -- distinct into instructions and memory -- and randomness as resources. Not surprisingly, we show that, in this framework, threshold graphs are the simplest possible graphs, since the construction of threshold graphs requires a single bit of ins… ▽ More

    Submitted 27 February, 2018; originally announced February 2018.

    Comments: 16 pages, 1 figure, comments welcome

  19. arXiv:1802.08227  [pdf, other

    quant-ph cs.DS math.NA

    Quantum linear systems algorithms: a primer

    Authors: Danial Dervovic, Mark Herbster, Peter Mountney, Simone Severini, Naïri Usher, Leonard Wossnig

    Abstract: The Harrow-Hassidim-Lloyd (HHL) quantum algorithm for sampling from the solution of a linear system provides an exponential speed-up over its classical counterpart. The problem of solving a system of linear equations has a wide scope of applications, and thus HHL constitutes an important algorithmic primitive. In these notes, we present the HHL algorithm and its improved versions in detail, includ… ▽ More

    Submitted 22 February, 2018; originally announced February 2018.

    Comments: 55 pages, 5 figures, comments welcome

  20. arXiv:1707.05179   

    math.CO cs.DS

    Weak Modular Product of Bipartite Graphs, Bicliques and Isomorphism

    Authors: Danial Dervovic, Simone Severini

    Abstract: A 1978 theorem of Kozen states that two graphs on $n$ vertices are isomorphic if and only if there is a clique of size $n$ in the weak modular product between the two graphs. Restricting to bipartite graphs and considering complete bipartite subgraphs (bicliques) therein, we study the combinatorics of the weak modular product. We identify cases where isomorphism is tractable using this approach, w… ▽ More

    Submitted 27 September, 2018; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: Algorithm 1 (IvBE) is irreparably flawed. Moreover, Theorem 2, concerning perfection of weak modular products of balanced, bipartite graphs is incorrect. Thank you to an anonymous reviewer for pointing out these flaws in the paper. We have now enumerated all perfect product graphs in the work at arXiv:1809.09939