
Showing 1–50 of 114 results for author: Doshi-Velez, F

  1. arXiv:2406.08636  [pdf, other]

    cs.LG

    Towards Integrating Personal Knowledge into Test-Time Predictions

    Authors: Isaac Lage, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric outcomes may know nothing about a patient's social support system, and social support may look different for different patients. In this work, we introduce the problem…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.00116  [pdf, other]

    cs.HC cs.LG

    A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

    Authors: Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez

    Abstract: Existing user studies suggest that different tasks may require explanations with different properties. However, user studies are expensive. In this paper, we introduce a generalizable, cost-effective method for identifying task-relevant explanation properties in silico, which can guide the design of more expensive user studies. We use our approach to identify relevant proxies for three example tas…

    Submitted 31 May, 2024; originally announced June 2024.

  3. arXiv:2404.14660  [pdf, ps, other]

    cs.CY cs.AI

    AI Procurement Checklists: Revisiting Implementation in the Age of AI Governance

    Authors: Tom Zick, Mason Kortz, David Eaves, Finale Doshi-Velez

    Abstract: Public sector use of AI has been quietly on the rise for the past decade, but only recently have efforts to regulate it entered the cultural zeitgeist. While simple to articulate, promoting ethical and effective roll outs of AI systems in government is a notoriously elusive task. On the one hand there are hard-to-address pitfalls associated with AI-based tools, including concerns about bias toward…

    Submitted 22 April, 2024; originally announced April 2024.

  4. arXiv:2403.08941  [pdf, other]

    stat.ML cs.LG

    Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders

    Authors: Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez

    Abstract: Inference for Variational Autoencoders (VAEs) consists of learning two models: (1) a generative model, which transforms a simple distribution over a latent space into the distribution over observed data, and (2) an inference model, which approximates the posterior of the latent codes given data. The two components are learned jointly via a lower bound to the generative model's log marginal likelih…

    Submitted 12 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted at the Workshop at the 6th Symposium on Advances in Approximate Bayesian Inference (AABI) 2024

  5. arXiv:2402.17003  [pdf, other]

    cs.LG cs.AI cs.CY

    Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Iris Yan, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Online reinforcement learning (RL) algorithms offer great potential for personalizing treatment for participants in clinical trials. However, deploying an online, autonomous algorithm in the high-stakes healthcare setting makes quality control and data quality especially difficult to achieve. This paper proposes algorithm fidelity as a critical requirement for deploying online RL algorithms in cli…

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2402.12737  [pdf, other]

    cs.LG

    Guarantee Regions for Local Explanations

    Authors: Marton Havasi, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Interpretability methods that utilise local surrogate models (e.g. LIME) are very good at describing the behaviour of the predictive model at a point of interest, but they are not guaranteed to extrapolate to the local region surrounding the point. However, overfitting to the local curvature of the predictive model and malicious tampering can significantly limit extrapolation. We propose an anchor…

    Submitted 20 February, 2024; originally announced February 2024.

  7. arXiv:2402.03110  [pdf, other]

    cs.LG cs.AI

    Non-Stationary Latent Auto-Regressive Bandits

    Authors: Anna L. Trella, Walter Dempsey, Finale Doshi-Velez, Susan A. Murphy

    Abstract: We consider the stochastic multi-armed bandit problem with non-stationary rewards. We present a novel formulation of non-stationarity in the environment where changes in the mean reward of the arms over time are due to some unknown, latent, auto-regressive (AR) state of order $k$. We call this new environment the latent AR bandit. Different forms of the latent AR bandit appear in many real-world s…

    Submitted 5 February, 2024; originally announced February 2024.

  8. arXiv:2401.16419  [pdf, other]

    cs.LG stat.ML

    Semi-parametric Expert Bayesian Network Learning with Gaussian Processes and Horseshoe Priors

    Authors: Yidou Weng, Finale Doshi-Velez

    Abstract: This paper proposes a model learning Semi-parametric relationships in an Expert Bayesian Network (SEBN) with linear parameter and structure constraints. We use Gaussian Processes and a Horseshoe prior to introduce minimal nonlinear components. To prioritize modifying the expert graph over adding new edges, we optimize differential Horseshoe scales. In real-world datasets with unknown truth, we gen…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages, 4 figures, AAAI-2024 workshops

  9. arXiv:2401.14923  [pdf, other]

    cs.AI cs.LG

    Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

    Authors: Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Many important behavior changes are frictionful; they require individuals to expend effort over a long period with little immediate gratification. Here, an artificial intelligence (AI) agent can provide personalized interventions to help individuals stick to their goals. In these settings, the AI agent must personalize rapidly (before the individual disengages) and interpretably, to help us unders…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: In AAMAS 2024

  10. arXiv:2312.09983  [pdf, other]

    cs.LG cs.AI stat.ML

    Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

    Authors: Lauren H. Cooke, Harvey Klyne, Edwin Zhang, Cassidy Laidlaw, Milind Tambe, Finale Doshi-Velez

    Abstract: Inverse reinforcement learning (IRL) is computationally challenging, with common approaches requiring the solution of multiple reinforcement learning (RL) sub-problems. This work motivates the use of potential-based reward shaping to reduce the computational burden of each RL sub-problem. This work serves as a proof-of-concept and we hope will inspire future developments towards computationally ef…

    Submitted 18 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

  11. arXiv:2309.11443  [pdf, other]

    cs.CV cs.LG

    Signature Activation: A Sparse Signal View for Holistic Saliency

    Authors: Jose Roberto Tello Ayala, Akl C. Fahed, Weiwei Pan, Eugene V. Pomerantsev, Patrick T. Ellinor, Anthony Philippakis, Finale Doshi-Velez

    Abstract: The adoption of machine learning in healthcare calls for model transparency and explainability. In this work, we introduce Signature Activation, a saliency method that generates holistic and class-agnostic explanations for Convolutional Neural Network (CNN) outputs. Our method exploits the fact that certain kinds of medical images, such as angiograms, have clear foreground and background objects.…

    Submitted 20 September, 2023; originally announced September 2023.

  12. arXiv:2309.00254  [pdf, other]

    cs.LG cs.CL cs.CR

    Why do universal adversarial attacks work on large language models?: Geometry might be the answer

    Authors: Varshini Subhash, Anna Bialas, Weiwei Pan, Finale Doshi-Velez

    Abstract: Transformer based large language models with emergent capabilities are becoming increasingly ubiquitous in society. However, the task of understanding and interpreting their internal workings, in the context of adversarial attacks, remains largely unsolved. Gradient-based universal adversarial attacks have been shown to be highly effective on large language models and potentially dangerous due to…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 2nd AdvML Frontiers Workshop at 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 2023

  13. arXiv:2308.05075  [pdf, other]

    cs.LG

    Bayesian Inverse Transition Learning for Offline Settings

    Authors: Leo Benac, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Offline Reinforcement learning is commonly used for sequential decision-making in domains such as healthcare and education, where the rewards are known and the transition dynamics $T$ must be estimated on the basis of batch data. A key challenge for all tasks is how to learn a reliable estimate of the transition dynamics $T$ that produce near-optimal policies that are safe enough so that they neve…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 8 pages, 1 plots, 2 tables

  14. arXiv:2308.01420  [pdf, other]

    cs.CL cs.LG

    SAP-sLDA: An Interpretable Interface for Exploring Unstructured Text

    Authors: Charumathi Badrinath, Weiwei Pan, Finale Doshi-Velez

    Abstract: A common way to explore text corpora is through low-dimensional projections of the documents, where one hopes that thematically similar documents will be clustered together in the projected space. However, popular algorithms for dimensionality reduction of text corpora, like Latent Dirichlet Allocation (LDA), often produce projections that do not capture human notions of document similarity. We pr…

    Submitted 28 July, 2023; originally announced August 2023.

  15. arXiv:2307.08169  [pdf, other]

    cs.LG cs.HC

    Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning

    Authors: L. L. Ankile, B. S. Ham, K. Mao, E. Shin, S. Swaroop, F. Doshi-Velez, W. Pan

    Abstract: When assisting human users in reinforcement learning (RL), we can represent users as RL agents and study key parameters, called \emph{user traits}, to inform intervention design. We study the relationship between user behaviors (policy classes) and user traits. Given an environment, we introduce an intuitive tool for studying the breakdown of "user types": broad sets of traits that result in the s…

    Submitted 16 July, 2023; originally announced July 2023.

  16. arXiv:2307.06541  [pdf, other]

    cs.LG cs.AI

    On the Effective Horizon of Inverse Reinforcement Learning

    Authors: Yiqing Xu, Finale Doshi-Velez, David Hsu

    Abstract: Inverse reinforcement learning (IRL) algorithms often rely on (forward) reinforcement learning or planning over a given time horizon to compute an approximately optimal policy for a hypothesized reward function and then match this policy with expert demonstrations. The time horizon plays a critical role in determining both the accuracy of reward estimate and the computational efficiency of IRL alg…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 9 pages, under review

  17. arXiv:2306.12609  [pdf, other]

    cs.AI cs.CY

    Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities

    Authors: Xudong Shen, Hannah Brown, Jiashu Tao, Martin Strobel, Yao Tong, Akshay Narayan, Harold Soh, Finale Doshi-Velez

    Abstract: There is increasing attention being given to how to regulate AI systems. As governing bodies grapple with what values to encapsulate into regulation, we consider the technical half of the question: To what extent can AI experts vet an AI system for adherence to regulatory requirements? We investigate this question through the lens of two public sector procurement checklists, identifying what we ca…

    Submitted 27 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: scheduled for publication in the Communications of the ACM, titled "Directions of Technical Innovation for Regulatable AI Systems"

  18. arXiv:2306.11208  [pdf, other]

    cs.LG cs.AI stat.ML

    The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

    Authors: Sarah Rathnam, Sonali Parbhoo, Weiwei Pan, Susan A. Murphy, Finale Doshi-Velez

    Abstract: Discount regularization, using a shorter planning horizon when calculating the optimal policy, is a popular choice to restrict planning to a less complex set of policies when estimating an MDP from sparse or noisy data (Jiang et al., 2015). It is commonly understood that discount regularization functions by de-emphasizing or ignoring delayed effects. In this paper, we reveal an alternate view of d…

    Submitted 19 June, 2023; originally announced June 2023.

  19. Accuracy-Time Tradeoffs in AI-Assisted Decision Making under Time Pressure

    Authors: Siddharth Swaroop, Zana Buçinca, Krzysztof Z. Gajos, Finale Doshi-Velez

    Abstract: In settings where users both need high accuracy and are time-pressured, such as doctors working in emergency rooms, we want to provide AI assistance that both increases decision accuracy and reduces decision-making time. Current literature focusses on how users interact with AI assistance when there is no time pressure, finding that different AI assistances have different benefits: some can reduce…

    Submitted 11 February, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  20. arXiv:2305.01738  [pdf, other]

    cs.LG cs.AI

    Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare

    Authors: Shengpu Tang, Maggie Makar, Michael W. Sjoding, Finale Doshi-Velez, Jenna Wiens

    Abstract: Many reinforcement learning (RL) applications have combinatorial action spaces, where each action is a composition of sub-actions. A standard RL approach ignores this inherent factorization structure, resulting in a potential failure to make meaningful inferences about rarely observed sub-action combinations; this is particularly problematic for offline settings, where data may be limited. In this…

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 30 pages, 18 figures, 2 tables. NeurIPS 2022. Code available at https://github.com/MLD3/OfflineRL_FactoredActions

  21. arXiv:2304.03365  [pdf, other]

    cs.LG cs.AI

    Decision-Focused Model-based Reinforcement Learning for Reward Transfer

    Authors: Abhishek Sharma, Sonali Parbhoo, Omer Gottesman, Finale Doshi-Velez

    Abstract: Decision-focused (DF) model-based reinforcement learning has recently been introduced as a powerful algorithm that can focus on learning the MDP dynamics that are most relevant for obtaining high returns. While this approach increases the agent's performance by directly optimizing the reward, it does so by learning less accurate dynamics from a maximum likelihood perspective. We demonstrate that w…

    Submitted 1 January, 2024; v1 submitted 6 April, 2023; originally announced April 2023.

  22. arXiv:2212.00863  [pdf, other]

    cs.LG cs.AI

    Modeling Mobile Health Users as Reinforcement Learning Agents

    Authors: Eura Shin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Mobile health (mHealth) technologies empower patients to adopt/maintain healthy behaviors in their daily lives, by providing interventions (e.g. push notifications) tailored to the user's needs. In these settings, without intervention, human decision making may be impaired (e.g. valuing near term pleasure over own long term goals). In this work, we formalize this relationship with a framework in w…

    Submitted 1 December, 2022; originally announced December 2022.

  23. arXiv:2211.09184  [pdf, other]

    stat.ML cs.LG

    An Empirical Analysis of the Advantages of Finite- v.s. Infinite-Width Bayesian Neural Networks

    Authors: Jiayu Yao, Yaniv Yacoby, Beau Coker, Weiwei Pan, Finale Doshi-Velez

    Abstract: Comparing Bayesian neural networks (BNNs) with different widths is challenging because, as the width increases, multiple model properties change simultaneously, and, inference in the finite-width case is intractable. In this work, we empirically compare finite- and infinite-width BNNs, and provide quantitative and qualitative explanations for their performance difference. We find that when the mod…

    Submitted 28 November, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  24. arXiv:2211.07719  [pdf, other]

    cs.LG cs.HC

    (When) Are Contrastive Explanations of Reinforcement Learning Helpful?

    Authors: Sanjana Narayanan, Isaac Lage, Finale Doshi-Velez

    Abstract: Global explanations of a reinforcement learning (RL) agent's expected behavior can make it safer to deploy. However, such explanations are often difficult to understand because of the complicated nature of many RL policies. Effective human explanations are often contrastive, referencing a known contrast (policy) to reduce redundancy. At the same time, these explanations also require the additional…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2022 workshop on Human in the Loop Learning

  25. arXiv:2211.05667  [pdf, ps, other]

    cs.LG

    What Makes a Good Explanation?: A Harmonized View of Properties of Explanations

    Authors: Zixi Chen, Varshini Subhash, Marton Havasi, Weiwei Pan, Finale Doshi-Velez

    Abstract: Interpretability provides a means for humans to verify aspects of machine learning (ML) models and empower human+ML teaming in situations where the task cannot be fully automated. Different contexts require explanations with different properties. For example, the kind of explanation required to determine if an early cardiac arrest warning system is ready to be integrated into a care setting is ver…

    Submitted 12 July, 2024; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Short version accepted at NeurIPS 2022 workshops on Progress and Challenges in Building Trustworthy Embodied AI and Trustworthy and Socially Responsible Machine Learning

  26. arXiv:2210.15767  [pdf]

    cs.AI

    Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report

    Authors: Michael L. Littman, Ifeoma Ajunwa, Guy Berger, Craig Boutilier, Morgan Currie, Finale Doshi-Velez, Gillian Hadfield, Michael C. Horowitz, Charles Isbell, Hiroaki Kitano, Karen Levy, Terah Lyons, Melanie Mitchell, Julie Shah, Steven Sloman, Shannon Vallor, Toby Walsh

    Abstract: In September 2021, the "One Hundred Year Study on Artificial Intelligence" project (AI100) issued the second report of its planned long-term periodic assessment of artificial intelligence (AI) and its impact on society. It was written by a panel of 17 study authors, each of whom is deeply rooted in AI research, chaired by Michael Littman of Brown University. The report, entitled "Gathering Strengt…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 82 pages, https://ai100.stanford.edu/gathering-strength-gathering-storms-one-hundred-year-study-artificial-intelligence-ai100-2021-study

  27. Towards Robust Off-Policy Evaluation via Human Inputs

    Authors: Harvineet Singh, Shalmali Joshi, Finale Doshi-Velez, Himabindu Lakkaraju

    Abstract: Off-policy Evaluation (OPE) methods are crucial tools for evaluating policies in high-stakes domains such as healthcare, where direct deployment is often infeasible, unethical, or expensive. When deployment environments are expected to undergo changes (that is, dataset shifts), it is important for OPE methods to perform robust evaluation of the policies amidst such changes. Existing approaches con…

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 10 pages, 5 figures, 1 table. Appeared at AIES '22: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society. Expanded version of arXiv:2103.15933

  28. arXiv:2208.07406  [pdf, other]

    cs.AI cs.LG

    Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Dental disease is one of the most common chronic diseases despite being largely preventable. However, professional advice on optimal oral hygiene practices is often forgotten or abandoned by patients. Therefore patients may benefit from timely and personalized encouragement to engage in oral self-care behaviors. In this paper, we develop an online reinforcement learning (RL) algorithm for use in o…

    Submitted 14 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

  29. arXiv:2208.01705  [pdf, other]

    cs.LG

    Success of Uncertainty-Aware Deep Models Depends on Data Manifold Geometry

    Authors: Mark Penrod, Harrison Termotto, Varshini Reddy, Jiayu Yao, Finale Doshi-Velez, Weiwei Pan

    Abstract: For responsible decision making in safety-critical settings, machine learning models must effectively detect and process edge-case data. Although existing works show that predictive uncertainty is useful for these tasks, it is not evident from literature which uncertainty-aware models are best suited for a given dataset. Thus, we compare six uncertainty-aware deep learning models on a set of edge-…

    Submitted 5 August, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    ACM Class: I.2.6

    Journal ref: International Conference on Machine Learning. PMLR 162 (2022)

  30. arXiv:2208.00250  [pdf, other]

    cs.LG cs.AI

    A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

    Authors: Kelly W. Zhang, Omer Gottesman, Finale Doshi-Velez

    Abstract: In the reinforcement learning literature, there are many algorithms developed for either Contextual Bandit (CB) or Markov Decision Processes (MDP) environments. However, when deploying reinforcement learning algorithms in the real world, even with domain expertise, it is often difficult to know whether it is appropriate to treat a sequential decision making problem as a CB or an MDP. In other word…

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: Challenges of Real-World Reinforcement Learning 2020 (NeurIPS Workshop)

  31. arXiv:2207.06269  [pdf, other]

    cs.LG

    Policy Optimization with Sparse Global Contrastive Explanations

    Authors: Jiayu Yao, Sonali Parbhoo, Weiwei Pan, Finale Doshi-Velez

    Abstract: We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of keeping that gl…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at IMLH Workshop, ICML 2022

  32. arXiv:2206.10847  [pdf, other]

    cs.AI cs.HC

    Connecting Algorithmic Research and Usage Contexts: A Perspective of Contextualized Evaluation for Explainable AI

    Authors: Q. Vera Liao, Yunfeng Zhang, Ronny Luss, Finale Doshi-Velez, Amit Dhurandhar

    Abstract: Recent years have seen a surge of interest in the field of explainable AI (XAI), with a plethora of algorithms proposed in the literature. However, a lack of consensus on how to evaluate XAI hinders the advancement of the field. We highlight that XAI is not a monolithic set of technologies -- researchers and practitioners have begun to leverage XAI algorithms to build XAI systems that serve differ…

    Submitted 20 September, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Forthcoming for AAAI HCOMP 2022

  33. Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Online reinforcement learning (RL) algorithms are increasingly used to personalize digital interventions in the fields of mobile health and online education. Common challenges in designing and testing an RL algorithm in these settings include ensuring the RL algorithm can learn and run stably under real-time constraints, and accounting for the complexity of the environment, e.g., a lack of accurat…

    Submitted 18 August, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  34. arXiv:2204.03208  [pdf, other]

    cs.IR cs.CL cs.LG stat.ML

    A Joint Learning Approach for Semi-supervised Neural Topic Modeling

    Authors: Jeffrey Chiu, Rajat Mittal, Neehal Tumma, Abhishek Sharma, Finale Doshi-Velez

    Abstract: Topic models are some of the most popular ways to represent textual data in an interpretable manner. Recently, advances in deep generative models, specifically auto-encoding variational Bayes (AEVB), have led to the introduction of unsupervised neural topic models, which leverage deep generative models as opposed to traditional statistics-based topic models. We extend upon these neural topic mode…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: To appear in the 6th ACL Workshop on Structured Prediction for NLP (SPNLP)

  35. arXiv:2202.11670  [pdf, other]

    cs.LG stat.ML

    Wide Mean-Field Bayesian Neural Networks Ignore the Data

    Authors: Beau Coker, Wessel P. Bruinsma, David R. Burt, Weiwei Pan, Finale Doshi-Velez

    Abstract: Bayesian neural networks (BNNs) combine the expressive power of deep learning with the advantages of Bayesian formalism. In recent years, the analysis of wide, deep BNNs has provided theoretical insight into their priors and posteriors. However, we have no analogous insight into their posteriors under approximate inference. In this work, we show that mean-field variational inference entirely fails…

    Submitted 23 February, 2022; originally announced February 2022.

  36. arXiv:2201.08262  [pdf, other]

    cs.LG stat.ML

    Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making

    Authors: Sonali Parbhoo, Shalmali Joshi, Finale Doshi-Velez

    Abstract: Assessing the effects of a policy based on observational data from a different policy is a common problem across several high-stake decision-making domains, and several off-policy evaluation (OPE) techniques have been proposed. However, these methods largely formulate OPE as a problem disassociated from the process used to generate the data (i.e. structural assumptions in the form of a causal grap…

    Submitted 20 January, 2022; originally announced January 2022.

  37. arXiv:2111.14272  [pdf, other]

    cs.LG cs.AI stat.ME

    Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation

    Authors: Ramtin Keramati, Omer Gottesman, Leo Anthony Celi, Finale Doshi-Velez, Emma Brunskill

    Abstract: Off-policy policy evaluation methods for sequential decision making can be used to help identify if a proposed decision policy is better than a current baseline policy. However, a new decision policy may be better than a baseline policy for some individuals but not others. This has motivated a push towards personalization and accurate per-state estimates of heterogeneous treatment effects (HTEs).…

    Submitted 28 November, 2021; originally announced November 2021.

  38. arXiv:2110.13221  [pdf, other]

    cs.LG cs.AI stat.ML

    On Learning Prediction-Focused Mixtures

    Authors: Abhishek Sharma, Catherine Zeng, Sanjana Narayanan, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Probabilistic models help us encode latent structures that both model the data and are ideally also useful for specific downstream tasks. Among these, mixture models and their time-series counterparts, hidden Markov models, identify discrete components in the data. In this work, we focus on a constrained capacity setting, where we want to learn a model with relatively few components (e.g. for inte…

    Submitted 27 October, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

  39. arXiv:2109.11043  [pdf, other]

    cs.LG

    Learning Predictive and Interpretable Timeseries Summaries from ICU Data

    Authors: Nari Johnson, Sonali Parbhoo, Andrew Slavin Ross, Finale Doshi-Velez

    Abstract: Machine learning models that utilize patient data across time (rather than just the most recent measurements) have increased performance for many risk stratification tasks in the intensive care unit. However, many of these models and their learned representations are complex and therefore difficult for clinicians to interpret, creating challenges for validation. Our work proposes a new procedure t…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: 10 pages, 3 figures, AMIA 2021 Annual Symposium

  40. arXiv:2109.08134  [pdf, other]

    cs.LG stat.ML

    Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning

    Authors: Sarah Rathnam, Susan A. Murphy, Finale Doshi-Velez

    Abstract: In batch reinforcement learning, there can be poorly explored state-action pairs resulting in poorly learned, inaccurate models and poorly performing associated policies. Various regularization methods can mitigate the problem of learning overly-complex models in Markov decision processes (MDPs), however they operate in technically and intuitively distinct ways and lack a common form in which to c…

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: ICML Workshop on Reinforcement Learning Theory 2021

  41. arXiv:2109.06312  [pdf, other]

    cs.LG stat.ML

    Learning-to-defer for sequential medical decision-making under uncertainty

    Authors: Shalmali Joshi, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Learning-to-defer is a framework to automatically defer decision-making to a human expert when ML-based decisions are deemed unreliable. Existing learning-to-defer frameworks are not designed for sequential settings. That is, they defer at every instance independently, based on immediate predictions, while ignoring the potential long-term impact of these interventions. As a result, existing framew…

    Submitted 5 December, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  42. arXiv:2109.06310  [pdf, other]

    cs.LG stat.ML

    State Relevance for Off-Policy Evaluation

    Authors: Simon P. Shen, Yecheng Jason Ma, Omer Gottesman, Finale Doshi-Velez

    Abstract: Importance sampling-based estimators for off-policy evaluation (OPE) are valued for their simplicity, unbiasedness, and reliance on relatively few assumptions. However, the variance of these estimators is often high, especially when trajectories are of different lengths. In this work, we introduce Omitting-States-Irrelevant-to-Return Importance Sampling (OSIRIS), an estimator which reduces varianc…

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: ICML 2021

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:9537-9546, 2021

  43. arXiv:2107.09949  [pdf, other]

    cs.LG stat.ML

    Online structural kernel selection for mobile health

    Authors: Eura Shin, Pedja Klasnja, Susan Murphy, Finale Doshi-Velez

    Abstract: Motivated by the need for efficient and personalized learning in mobile health, we investigate the problem of online kernel selection for Gaussian Process regression in the multi-task setting. We propose a novel generative process on the kernel composition for this purpose. Our method demonstrates that trajectories of kernel evolutions can be transferred between users to improve learning and that…

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Workshop paper in ICML IMLH 2021

  44. arXiv:2106.13314  [pdf, other]

    cs.LG cs.AI

    Promises and Pitfalls of Black-Box Concept Learning Models

    Authors: Anita Mahinpei, Justin Clark, Isaac Lage, Finale Doshi-Velez, Weiwei Pan

    Abstract: Machine learning models that incorporate concept learning as an intermediate step in their decision making process can match the performance of black-box predictive models while retaining the ability to explain outcomes in human understandable terms. However, we demonstrate that the concept representations learned by these models encode information beyond the pre-defined concepts, and that natural…

    Submitted 24 June, 2021; originally announced June 2021.

  45. arXiv:2106.07052  [pdf, other]

    cs.LG stat.ML

    Wide Mean-Field Variational Bayesian Neural Networks Ignore the Data

    Authors: Beau Coker, Weiwei Pan, Finale Doshi-Velez

    Abstract: Variational inference enables approximate posterior inference of the highly over-parameterized neural networks that are popular in modern machine learning. Unfortunately, such posteriors are known to exhibit various pathological behaviors. We prove that as the number of hidden units in a single-layer Bayesian neural network tends to infinity, the function-space posterior mean under mean-field vari…

    Submitted 13 June, 2021; originally announced June 2021.

  46. arXiv:2106.03279  [pdf, other]

    cs.LG

    Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning

    Authors: Kai Wang, Sanket Shah, Haipeng Chen, Andrew Perrault, Finale Doshi-Velez, Milind Tambe

    Abstract: In the predict-then-optimize framework, the objective is to train a predictive model, mapping from environment features to parameters of an optimization problem, which maximizes decision quality when the optimization is subsequently solved. Recent work on decision-focused learning shows that embedding the optimization problem in the training pipeline can improve decision quality and help generaliz…

    Submitted 16 July, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

  47. arXiv:2103.15933  [pdf, other]

    cs.LG stat.ML

    Learning Under Adversarial and Interventional Shifts

    Authors: Harvineet Singh, Shalmali Joshi, Finale Doshi-Velez, Himabindu Lakkaraju

    Abstract: Machine learning models are often trained on data from one distribution and deployed on others. So it becomes important to design models that are robust to distribution shifts. Most of the existing work focuses on optimizing for either adversarial shifts or interventional shifts. Adversarial methods lack expressivity in representing plausible shifts as they consider shifts to joint distributions i…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: 19 pages including 5 pages appendix, 6 figures, 2 tables. Preliminary version presented at Causal Discovery & Causality-Inspired Machine Learning Workshop 2020

  48. arXiv:2102.05185  [pdf, other]

    cs.LG cs.AI

    Benchmarks, Algorithms, and Metrics for Hierarchical Disentanglement

    Authors: Andrew Slavin Ross, Finale Doshi-Velez

    Abstract: In representation learning, there has been recent interest in developing algorithms to disentangle the ground-truth generative factors behind a dataset, and metrics to quantify how fully this occurs. However, these algorithms and metrics often assume that both representations and ground-truth factors are flat, continuous, and factorized, whereas many real-world generative processes involve rich hi…

    Submitted 8 April, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: ICML 2021 paper, fixed incorrect version upload

  49. arXiv:2102.01264  [pdf, other]

    cs.LG cs.AI cs.HC

    Evaluating the Interpretability of Generative Models by Interactive Reconstruction

    Authors: Andrew Slavin Ross, Nina Chen, Elisa Zhao Hang, Elena L. Glassman, Finale Doshi-Velez

    Abstract: For machine learning models to be most useful in numerous sociotechnical systems, many have argued that they must be human-interpretable. However, despite increasing interest in interpretability, there remains no firm consensus on how to measure it. This is especially true in representation learning, where interpretability research has focused on "disentanglement" measures only applicable to synth…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: CHI 2021 accepted paper

  50. Designing AI for Trust and Collaboration in Time-Constrained Medical Decisions: A Sociotechnical Lens

    Authors: Maia Jacobs, Jeffrey He, Melanie F. Pradier, Barbara Lam, Andrew C. Ahn, Thomas H. McCoy, Roy H. Perlis, Finale Doshi-Velez, Krzysztof Z. Gajos

    Abstract: Major depressive disorder is a debilitating disease affecting 264 million people worldwide. While many antidepressant medications are available, few clinical guidelines support choosing among them. Decision support tools (DSTs) embodying machine learning models may help improve the treatment selection process, but often fail in clinical practice due to poor system integration. We use an iterativ…

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: To appear in ACM CHI Conference on Human Factors in Computing Systems (CHI '21), May 8-13, 2021, Yokohama, Japan