Skip to main content

Showing 1–50 of 84 results for author: Varshney, K R

  1. arXiv:2403.12805  [pdf, other

    cs.AI cs.CL

    Contextual Moral Value Alignment Through Context-Based Aggregation

    Authors: Pierre Dognin, Jesus Rios, Ronny Luss, Inkit Padhi, Matthew D Riemer, Miao Liu, Prasanna Sattigeri, Manish Nagireddy, Kush R. Varshney, Djallel Bouneffouf

    Abstract: Developing value-aligned AI agents is a complex undertaking and an ongoing challenge in the field of AI. Specifically within the domain of Large Language Models (LLMs), the capability to consolidate multiple independently trained dialogue agents, each aligned with a distinct moral value, into a unified system that can adapt to and be aligned with multiple moral values is of paramount importance. I… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  2. arXiv:2403.10638  [pdf, other

    cs.LG cs.CY stat.ML

    A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food

    Authors: Conor M. Artman, Aditya Mate, Ezinne Nwankwo, Aliza Heching, Tsuyoshi Idé, Jiří Navrátil, Karthikeyan Shanmugam, Wei Sun, Kush R. Varshney, Lauri Goldkind, Gidi Kroch, Jaclyn Sawyer, Ian Watson

    Abstract: We developed a common algorithmic solution addressing the problem of resource-constrained outreach encountered by social change organizations with different missions and operations: Breaking Ground -- an organization that helps individuals experiencing homelessness in New York transition to permanent housing and Leket -- the national food bank of Israel that rescues food from farms and elsewhere t… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  3. arXiv:2403.09704  [pdf, other

    cs.CL cs.AI cs.LG

    Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations

    Authors: Swapnaja Achintalwar, Ioana Baldini, Djallel Bouneffouf, Joan Byamugisha, Maria Chang, Pierre Dognin, Eitan Farchi, Ndivhuwo Makondo, Aleksandra Mojsilovic, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Inkit Padhi, Orna Raz, Jesus Rios, Prasanna Sattigeri, Moninder Singh, Siphiwe Thwala, Rosario A. Uceda-Sosa, Kush R. Varshney

    Abstract: The alignment of large language models is usually done by model providers to add or control behaviors that are common or universally understood across use cases and contexts. In contrast, in this article, we present an approach and architecture that empowers application developers to tune a model to their particular values, social norms, laws and other regulations, and orchestrate between potentia… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures

  4. arXiv:2403.06009  [pdf, other

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  5. arXiv:2402.08787  [pdf, other

    cs.LG cs.CL

    Rethinking Machine Unlearning for Large Language Models

    Authors: Sijia Liu, Yuanshun Yao, Jinghan Jia, Stephen Casper, Nathalie Baracaldo, Peter Hase, Yuguang Yao, Chris Yuhao Liu, Xiaojun Xu, Hang Li, Kush R. Varshney, Mohit Bansal, Sanmi Koyejo, Yang Liu

    Abstract: We explore machine unlearning (MU) in the domain of large language models (LLMs), referred to as LLM unlearning. This initiative aims to eliminate undesirable data influence (e.g., sensitive or illegal information) and the associated model capabilities, while maintaining the integrity of essential knowledge generation and not affecting causally unrelated information. We envision LLM unlearning bec… ▽ More

    Submitted 14 July, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2401.14523  [pdf, ps, other

    cs.CY cs.AI cs.CL

    Empathy and the Right to Be an Exception: What LLMs Can and Cannot Do

    Authors: William Kidder, Jason D'Cruz, Kush R. Varshney

    Abstract: Advances in the performance of large language models (LLMs) have led some researchers to propose the emergence of theory of mind (ToM) in artificial intelligence (AI). LLMs can attribute beliefs, desires, intentions, and emotions, and they will improve in their accuracy. Rather than employing the characteristically human method of empathy, they learn to attribute mental states by recognizing lingu… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  7. arXiv:2309.05030  [pdf, other

    cs.CY cs.AI stat.ML

    Decolonial AI Alignment: Openness, Viśe\d{s}a-Dharma, and Including Excluded Knowledges

    Authors: Kush R. Varshney

    Abstract: Prior work has explicated the coloniality of artificial intelligence (AI) development and deployment through mechanisms such as extractivism, automation, sociological essentialism, surveillance, and containment. However, that work has not engaged much with alignment: teaching behaviors to a large language model (LLM) in line with desired values, and has not considered a mechanism that arises withi… ▽ More

    Submitted 2 May, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

  8. arXiv:2305.12620  [pdf, other

    cs.CL

    Keeping Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models

    Authors: Ioana Baldini, Chhavi Yadav, Payel Das, Kush R. Varshney

    Abstract: Auditing unwanted social bias in language models (LMs) is inherently hard due to the multidisciplinary nature of the work. In addition, the rapid evolution of LMs can make benchmarks irrelevant in no time. Bias auditing is further complicated by LM brittleness: when a presumably biased outcome is observed, is it due to model bias or model brittleness? We propose enlisting the models themselves to… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

  9. arXiv:2304.00416  [pdf, other

    cs.AI cs.CL cs.CY cs.HC cs.LG

    Towards Healthy AI: Large Language Models Need Therapists Too

    Authors: Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Kush R. Varshney

    Abstract: Recent advances in large language models (LLMs) have led to the development of powerful AI chatbots capable of engaging in natural and human-like conversations. However, these chatbots can be potentially harmful, exhibiting manipulative, gaslighting, and narcissistic behaviors. We define Healthy AI to be safe, trustworthy and ethical. To create healthy AI systems, we present the SafeguardGPT frame… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

  10. arXiv:2302.09190  [pdf, other

    cs.LG cs.CY

    Function Composition in Trustworthy Machine Learning: Implementation Choices, Insights, and Questions

    Authors: Manish Nagireddy, Moninder Singh, Samuel C. Hoffman, Evaline Ju, Karthikeyan Natesan Ramamurthy, Kush R. Varshney

    Abstract: Ensuring trustworthiness in machine learning (ML) models is a multi-dimensional task. In addition to the traditional notion of predictive performance, other notions such as privacy, fairness, robustness to distribution shift, adversarial robustness, interpretability, explainability, and uncertainty quantification are important considerations to evaluate and improve (if deficient). However, these s… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  11. arXiv:2212.06803  [pdf, other

    cs.LG cs.CY stat.ML

    Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting

    Authors: Prasanna Sattigeri, Soumya Ghosh, Inkit Padhi, Pierre Dognin, Kush R. Varshney

    Abstract: In consequential decision-making applications, mitigating unwanted biases in machine learning models that yield systematic disadvantage to members of groups delineated by sensitive attributes such as race and gender is one key intervention to strive for equity. Focusing on demographic parity and equality of opportunity, in this paper we propose an algorithm that improves the fairness of a pre-trai… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted at Neurips 2022

  12. arXiv:2211.01498  [pdf, other

    cs.LG stat.ML

    On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

    Authors: Dennis Wei, Rahul Nair, Amit Dhurandhar, Kush R. Varshney, Elizabeth M. Daly, Moninder Singh

    Abstract: Interpretable and explainable machine learning has seen a recent surge of interest. We focus on safety as a key motivation behind the surge and make the relationship between interpretability and safety more quantitative. Toward assessing safety, we introduce the concept of maximum deviation via an optimization problem to find the largest deviation of a supervised learning model from a reference mo… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: Published at NeurIPS 2022

  13. arXiv:2210.06475  [pdf, other

    cs.LG cs.CL

    Equi-Tuning: Group Equivariant Fine-Tuning of Pretrained Models

    Authors: Sourya Basu, Prasanna Sattigeri, Karthikeyan Natesan Ramamurthy, Vijil Chenthamarakshan, Kush R. Varshney, Lav R. Varshney, Payel Das

    Abstract: We introduce equi-tuning, a novel fine-tuning method that transforms (potentially non-equivariant) pretrained models into group equivariant models while incurring minimum $L_2$ loss between the feature representations of the pretrained and the equivariant models. Large pretrained models can be equi-tuned for different groups to satisfy the needs of various downstream tasks. Equi-tuned models benef… ▽ More

    Submitted 4 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: AAAI 2023

  14. arXiv:2208.10451  [pdf, other

    cs.LG cs.CY stat.ML

    Minimax AUC Fairness: Efficient Algorithm with Provable Convergence

    Authors: Zhenhuan Yang, Yan Lok Ko, Kush R. Varshney, Yiming Ying

    Abstract: The use of machine learning models in consequential decision making often exacerbates societal inequity, in particular yielding disparate impact on members of marginalized groups defined by race and gender. The area under the ROC curve (AUC) is widely used to evaluate the performance of a scoring function in machine learning, but is studied in algorithmic fairness less than other performance metri… ▽ More

    Submitted 28 November, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

  15. arXiv:2208.01305  [pdf, other

    cs.CY

    Humble Machines: Attending to the Underappreciated Costs of Misplaced Distrust

    Authors: Bran Knowles, Jason D'Cruz, John T. Richards, Kush R. Varshney

    Abstract: It is curious that AI increasingly outperforms human decision makers, yet much of the public distrusts AI to make decisions affecting their lives. In this paper we explore a novel theory that may explain one reason for this. We propose that public distrust of AI is a moral consequence of designing systems that prioritize reduction of costs of false positives over less tangible costs of false negat… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    ACM Class: K.4.m

  16. arXiv:2201.09046  [pdf, other

    cs.LG cs.CR

    Differentially Private SGDA for Minimax Problems

    Authors: Zhenhuan Yang, Shu Hu, Yunwen Lei, Kush R. Varshney, Siwei Lyu, Yiming Ying

    Abstract: Stochastic gradient descent ascent (SGDA) and its variants have been the workhorse for solving minimax problems. However, in contrast to the well-studied stochastic gradient descent (SGD) with differential privacy (DP) constraints, there is little work on understanding the generalization (utility) of SGDA with DP constraints. In this paper, we use the algorithmic stability approach to establish th… ▽ More

    Submitted 29 July, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: To appear in UAI 2022

  17. arXiv:2110.10790  [pdf, other

    cs.AI cs.HC

    Human-Centered Explainable AI (XAI): From Algorithms to User Experiences

    Authors: Q. Vera Liao, Kush R. Varshney

    Abstract: In recent years, the field of explainable AI (XAI) has produced a vast collection of algorithms, providing a useful toolbox for researchers and practitioners to build XAI applications. With the rich application opportunities, explainability is believed to have moved beyond a demand by data scientists or researchers to comprehend the models they develop, to an essential requirement for people to tr… ▽ More

    Submitted 19 April, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: draft for a book chapter

  18. arXiv:2109.14653  [pdf, other

    cs.LG cs.CY

    An Empirical Study of Accuracy, Fairness, Explainability, Distributional Robustness, and Adversarial Robustness

    Authors: Moninder Singh, Gevorg Ghalachyan, Kush R. Varshney, Reginald E. Bryant

    Abstract: To ensure trust in AI models, it is becoming increasingly apparent that evaluation of models must be extended beyond traditional performance metrics, like accuracy, to other dimensions, such as fairness, explainability, adversarial robustness, and distribution shift. We describe an empirical study to evaluate multiple model types on various metrics along these dimensions on several datasets. Our r… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Journal ref: presented at the 2021 KDD Workshop on Measures and Best Practices for Responsible AI

  19. arXiv:2109.12151  [pdf, other

    cs.LG cs.AI

    AI Explainability 360: Impact and Design

    Authors: Vijay Arya, Rachel K. E. Bellamy, Pin-Yu Chen, Amit Dhurandhar, Michael Hind, Samuel C. Hoffman, Stephanie Houde, Q. Vera Liao, Ronny Luss, Aleksandra Mojsilovic, Sami Mourad, Pablo Pedemonte, Ramya Raghavendra, John Richards, Prasanna Sattigeri, Karthikeyan Shanmugam, Moninder Singh, Kush R. Varshney, Dennis Wei, Yunfeng Zhang

    Abstract: As artificial intelligence and machine learning algorithms become increasingly prevalent in society, multiple stakeholders are calling for these algorithms to provide explanations. At the same time, these stakeholders, whether they be affected citizens, government regulators, domain experts, or system developers, have different explanation needs. To address these needs, in 2019, we created AI Expl… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:1909.03012

    Journal ref: IAAI 2022

  20. arXiv:2108.08077  [pdf, other

    q-bio.QM cs.LG

    Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences With Attention

    Authors: Kahini Wadhawan, Payel Das, Barbara A. Han, Ilya R. Fischhoff, Adrian C. Castellanos, Arvind Varsani, Kush R. Varshney

    Abstract: Current methods for viral discovery target evolutionarily conserved proteins that accurately identify virus families but remain unable to distinguish the zoonotic potential of newly discovered viruses. Here, we apply an attention-enhanced long-short-term memory (LSTM) deep neural net classifier to a highly conserved viral protein target to predict zoonotic potential across betacoronaviruses. The c… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: 11 pages, 8 figures, 1 table, accepted at ICLR 2021 workshop Machine learning for preventing and combating pandemics

  21. arXiv:2106.09502  [pdf, other

    cs.CL cs.LG

    Biomedical Interpretable Entity Representations

    Authors: Diego Garcia-Olano, Yasumasa Onoe, Ioana Baldini, Joydeep Ghosh, Byron C. Wallace, Kush R. Varshney

    Abstract: Pre-trained language models induce dense entity representations that offer strong performance on entity-centric NLP tasks, but such representations are not immediately interpretable. This can be a barrier to model uptake in important domains such as biomedicine. There has been recent work on general interpretable representation learning (Onoe and Durrett, 2020), but these domain-agnostic represent… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted into Findings of ACL-IJCNLP 2021

  22. arXiv:2106.01410  [pdf, other

    cs.AI

    Uncertainty Quantification 360: A Holistic Toolkit for Quantifying and Communicating the Uncertainty of AI

    Authors: Soumya Ghosh, Q. Vera Liao, Karthikeyan Natesan Ramamurthy, Jiri Navratil, Prasanna Sattigeri, Kush R. Varshney, Yunfeng Zhang

    Abstract: In this paper, we describe an open source Python toolkit named Uncertainty Quantification 360 (UQ360) for the uncertainty quantification of AI models. The goal of this toolkit is twofold: first, to provide a broad range of capabilities to streamline as well as foster the common practices of quantifying, evaluating, improving, and communicating uncertainty in the AI application development lifecycl… ▽ More

    Submitted 3 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Added references

  23. arXiv:2104.04633  [pdf, other

    cs.CY

    Automated Meta-Analysis: A Causal Learning Perspective

    Authors: Lu Cheng, Dmitriy A. Katz-Rogozhnikov, Kush R. Varshney, Ioana Baldini

    Abstract: Meta-analysis is a systematic approach for understanding a phenomenon by analyzing the results of many previously published experimental studies. It is central to deriving conclusions about the summary effect of treatments and interventions in medicine, poverty alleviation, and other applications with social impact. Unfortunately, meta-analysis involves great human effort, rendering a process that… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: 11 pages, 6 figures

  24. arXiv:2102.02279  [pdf, other

    cs.CY

    Insiders and Outsiders in Research on Machine Learning and Society

    Authors: Yu Tao, Kush R. Varshney

    Abstract: A subset of machine learning research intersects with societal issues, including fairness, accountability and transparency, as well as the use of machine learning for social good. In this work, we analyze the scholars contributing to this research at the intersection of machine learning and society through the lens of the sociology of science. By analyzing the authorship of all machine learning pa… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

  25. Disparate Impact Diminishes Consumer Trust Even for Advantaged Users

    Authors: Tim Draws, Zoltán Szlávik, Benjamin Timmermans, Nava Tintarev, Kush R. Varshney, Michael Hind

    Abstract: Systems aiming to aid consumers in their decision-making (e.g., by implementing persuasive techniques) are more likely to be effective when consumers trust them. However, recent research has demonstrated that the machine learning algorithms that often underlie such technology can act unfairly towards specific groups (e.g., by making more favorable predictions for men than for women). An undesired… ▽ More

    Submitted 5 July, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

    Journal ref: Persuasive Technology, Cham, 2021, p. 135-149

  26. arXiv:2101.02032  [pdf, other

    cs.CY cs.AI

    Socially Responsible AI Algorithms: Issues, Purposes, and Challenges

    Authors: Lu Cheng, Kush R. Varshney, Huan Liu

    Abstract: In the current era, people and society have grown increasingly reliant on artificial intelligence (AI) technologies. AI has the potential to drive us towards a future in which all of humanity flourishes. It also comes with substantial risks for oppression and calamity. Discussions about whether we should (re)trust AI have repeatedly emerged in recent years and in many quarters, including industry,… ▽ More

    Submitted 21 August, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: 45 pages, 8 figures

    Journal ref: Journal of Artificial Intelligence Research 71 (2021) 1137-1181

  27. arXiv:2012.12141  [pdf, other

    cs.LG stat.ML

    Learning to Initialize Gradient Descent Using Gradient Descent

    Authors: Kartik Ahuja, Amit Dhurandhar, Kush R. Varshney

    Abstract: Non-convex optimization problems are challenging to solve; the success and computational expense of a gradient descent algorithm or variant depend heavily on the initialization strategy. Often, either random initialization is used or initialization rules are carefully designed by exploiting the nature of the problem class. As a simple alternative to hand-crafted initialization rules, we propose an… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

  28. arXiv:2010.16412  [pdf, other

    cs.LG stat.ML

    Empirical or Invariant Risk Minimization? A Sample Complexity Perspective

    Authors: Kartik Ahuja, Jun Wang, Amit Dhurandhar, Karthikeyan Shanmugam, Kush R. Varshney

    Abstract: Recently, invariant risk minimization (IRM) was proposed as a promising solution to address out-of-distribution (OOD) generalization. However, it is unclear when IRM should be preferred over the widely-employed empirical risk minimization (ERM) framework. In this work, we analyze both these frameworks from the perspective of sample complexity, thus taking a firm step towards answering this importa… ▽ More

    Submitted 19 August, 2022; v1 submitted 30 October, 2020; originally announced October 2020.

  29. arXiv:2010.07938  [pdf, other

    cs.HC cs.LG

    Deciding Fast and Slow: The Role of Cognitive Biases in AI-assisted Decision-making

    Authors: Charvi Rastogi, Yunfeng Zhang, Dennis Wei, Kush R. Varshney, Amit Dhurandhar, Richard Tomsett

    Abstract: Several strands of research have aimed to bridge the gap between artificial intelligence (AI) and human decision-makers in AI-assisted decision-making, where humans are the consumers of AI model predictions and the ultimate decision-makers in high-stakes applications. However, people's perception and understanding are often distorted by their cognitive biases, such as confirmation bias, anchoring… ▽ More

    Submitted 4 April, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 22 pages, 4 figures

  30. arXiv:2006.11356  [pdf, ps, other

    cs.CY cs.CR

    Trust and Transparency in Contact Tracing Applications

    Authors: Stacy Hobson, Michael Hind, Aleksandra Mojsilovic, Kush R. Varshney

    Abstract: The global outbreak of COVID-19 has led to focus on efforts to manage and mitigate the continued spread of the disease. One of these efforts include the use of contact tracing to identify people who are at-risk of developing the disease through exposure to an infected person. Historically, contact tracing has been primarily manual but given the exponential spread of the virus that causes COVID-19,… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 9 pages

  31. arXiv:2006.06053  [pdf, other

    cs.LG cs.CY cs.DB stat.ML

    Causal Feature Selection for Algorithmic Fairness

    Authors: Sainyam Galhotra, Karthikeyan Shanmugam, Prasanna Sattigeri, Kush R. Varshney

    Abstract: The use of machine learning (ML) in high-stakes societal decisions has encouraged the consideration of fairness throughout the ML lifecycle. Although data integration is one of the primary steps to generate high quality training data, most of the fairness literature ignores this stage. In this work, we consider fairness in the integration component of data management, aiming to identify features t… ▽ More

    Submitted 31 March, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Full version of the paper at SIGMOD 2022

  32. arXiv:2002.04692  [pdf, other

    cs.LG stat.ML

    Invariant Risk Minimization Games

    Authors: Kartik Ahuja, Karthikeyan Shanmugam, Kush R. Varshney, Amit Dhurandhar

    Abstract: The standard risk minimization paradigm of machine learning is brittle when operating in environments whose test distributions are different from the training distribution due to spurious correlations. Training on data from many environments and finding invariant predictors reduces the effect of spurious features by concentrating models on features that have a causal relationship with the outcome.… ▽ More

    Submitted 18 March, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  33. Joint Optimization of AI Fairness and Utility: A Human-Centered Approach

    Authors: Yunfeng Zhang, Rachel K. E. Bellamy, Kush R. Varshney

    Abstract: Today, AI is increasingly being used in many high-stakes decision-making applications in which fairness is an important concern. Already, there are many examples of AI being biased and making questionable and unfair decisions. The AI research community has proposed many methods to measure and mitigate unwanted biases, but few of them involve inputs from human policy makers. We argue that because d… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: To appear in AIES 2020 proceedings

  34. arXiv:1911.08293  [pdf, ps, other

    cs.CY cs.HC

    Experiences with Improving the Transparency of AI Models and Services

    Authors: Michael Hind, Stephanie Houde, Jacquelyn Martino, Aleksandra Mojsilovic, David Piorkowski, John Richards, Kush R. Varshney

    Abstract: AI models and services are used in a growing number of highstakes areas, resulting in a need for increased transparency. Consistent with this, several proposals for higher quality and more consistent documentation of AI data, models, and systems have emerged. Little is known, however, about the needs of those who would produce or consume these new forms of documentation. Through semi-structured de… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

  35. arXiv:1911.07819  [pdf, other

    cs.CL cs.LG stat.ML

    Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies

    Authors: Shivashankar Subramanian, Ioana Baldini, Sushma Ravichandran, Dmitriy A. Katz-Rogozhnikov, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Kush R. Varshney, Annmarie Wang, Pradeep Mangalath, Laura B. Kleiman

    Abstract: More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the effica… ▽ More

    Submitted 5 December, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

  36. arXiv:1911.03674  [pdf, other

    cs.LG cs.CR stat.ML

    Preservation of Anomalous Subgroups On Machine Learning Transformed Data

    Authors: Samuel C. Maina, Reginald E. Bryant, William O. Goal, Robert-Florian Samoilescu, Kush R. Varshney, Komminist Weldemariam

    Abstract: In this paper, we investigate the effect of machine learning based anonymization on anomalous subgroup preservation. In particular, we train a binary classifier to discover the most anomalous subgroup in a dataset by maximizing the bias between the group's predicted odds ratio from the model and observed odds ratio from the data. We then perform anonymization using a variational autoencoder (VAE)… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: 5 pages, 3 figures, 2 tables, submitted to icassp 2019

  37. arXiv:1910.13983  [pdf, other

    cs.LG cs.CY stat.ML

    DADI: Dynamic Discovery of Fair Information with Adversarial Reinforcement Learning

    Authors: Michiel A. Bakker, Duy Patrick Tu, Humberto Riverón Valdés, Krishna P. Gummadi, Kush R. Varshney, Adrian Weller, Alex Pentland

    Abstract: We introduce a framework for dynamic adversarial discovery of information (DADI), motivated by a scenario where information (a feature set) is used by third parties with unknown objectives. We train a reinforcement learning agent to sequentially acquire a subset of the information while balancing accuracy and fairness of predictors downstream. Based on the set of already acquired features, the age… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 HCML Workshop

  38. arXiv:1910.13268  [pdf, other

    cs.CV cs.CY stat.ML

    Estimating Skin Tone and Effects on Classification Performance in Dermatology Datasets

    Authors: Newton M. Kinyanjui, Timothy Odonga, Celia Cintas, Noel C. F. Codella, Rameswar Panda, Prasanna Sattigeri, Kush R. Varshney

    Abstract: Recent advances in computer vision and deep learning have led to breakthroughs in the development of automated skin image analysis. In particular, skin cancer classification models have achieved performance higher than trained expert dermatologists. However, no attempt has been made to evaluate the consistency in performance of machine learning models across populations with varying skin tones. In… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 Workshop on Fair ML for Health

  39. arXiv:1910.07870  [pdf, other

    stat.ML cs.CY cs.IT cs.LG

    Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing

    Authors: Sanghamitra Dutta, Dennis Wei, Hazar Yueksel, Pin-Yu Chen, Sijia Liu, Kush R. Varshney

    Abstract: A trade-off between accuracy and fairness is almost taken as a given in the existing literature on fairness in machine learning. Yet, it is not preordained that accuracy should decrease with increased fairness. Novel to this work, we examine fair classification through the lens of mismatched hypothesis testing: trying to find a classifier that distinguishes between two ideal distributions when giv… ▽ More

    Submitted 10 December, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: This paper appears in the Proceedings of the 37th International Conference on Machine Learning, pp. 2803--2813, 2020

  40. arXiv:1909.03486  [pdf, other

    cs.CY cs.AI cs.HC

    How Data Scientists Work Together With Domain Experts in Scientific Collaborations: To Find The Right Answer Or To Ask The Right Question?

    Authors: Yaoli Mao, Dakuo Wang, Michael Muller, Kush R. Varshney, Ioana Baldini, Casey Dugan, AleksandraMojsilović

    Abstract: In recent years there has been an increasing trend in which data scientists and domain experts work together to tackle complex scientific questions. However, such collaborations often face challenges. In this paper, we aim to decipher this collaboration complexity through a semi-structured interview study with 22 interviewees from teams of bio-medical scientists collaborating with data scientists.… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.

  41. arXiv:1909.03012  [pdf, other

    cs.AI cs.CV cs.HC stat.ML

    One Explanation Does Not Fit All: A Toolkit and Taxonomy of AI Explainability Techniques

    Authors: Vijay Arya, Rachel K. E. Bellamy, Pin-Yu Chen, Amit Dhurandhar, Michael Hind, Samuel C. Hoffman, Stephanie Houde, Q. Vera Liao, Ronny Luss, Aleksandra Mojsilović, Sami Mourad, Pablo Pedemonte, Ramya Raghavendra, John Richards, Prasanna Sattigeri, Karthikeyan Shanmugam, Moninder Singh, Kush R. Varshney, Dennis Wei, Yunfeng Zhang

    Abstract: As artificial intelligence and machine learning algorithms make further inroads into society, calls are increasing from multiple stakeholders for these algorithms to explain their outputs. At the same time, these stakeholders, whether they be affected citizens, government regulators, domain experts, or system developers, present different requirements for explanations. Toward addressing these need… ▽ More

    Submitted 14 September, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

  42. arXiv:1907.04138  [pdf, other

    cs.LG stat.ML

    Characterization of Overlap in Observational Studies

    Authors: Michael Oberst, Fredrik D. Johansson, Dennis Wei, Tian Gao, Gabriel Brat, David Sontag, Kush R. Varshney

    Abstract: Overlap between treatment groups is required for non-parametric estimation of causal effects. If a subgroup of subjects always receives the same intervention, we cannot estimate the effect of intervention changes on that subgroup without further assumptions. When overlap does not hold globally, characterizing local regions of overlap can inform the relevance of causal conclusions for new subjects,… ▽ More

    Submitted 3 June, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: To appear at AISTATS 2020

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:788-798, 2020

  43. arXiv:1906.02299  [pdf, other

    cs.LG cs.AI stat.ML

    Teaching AI to Explain its Decisions Using Embeddings and Multi-Task Learning

    Authors: Noel C. F. Codella, Michael Hind, Karthikeyan Natesan Ramamurthy, Murray Campbell, Amit Dhurandhar, Kush R. Varshney, Dennis Wei, Aleksandra Mojsilović

    Abstract: Using machine learning in high-stakes applications often requires predictions to be accompanied by explanations comprehensible to the domain user, who has ultimate responsibility for decisions and outcomes. Recently, a new framework for providing explanations, called TED, has been proposed to provide meaningful explanations for predictions. This framework augments training data to include explanat… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: presented at 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1805.11648

  44. arXiv:1905.11519  [pdf, other

    cs.CY

    Open Platforms for Artificial Intelligence for Social Good: Common Patterns as a Pathway to True Impact

    Authors: Kush R. Varshney, Aleksandra Mojsilovic

    Abstract: The AI for social good movement has now reached a state in which a large number of one-off demonstrations have illustrated that partnerships of AI practitioners and social change organizations are possible and can address problems faced in sustainable development. In this paper, we discuss how moving from demonstrations to true impact on humanity will require a different course of action, namely o… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: appearing at the 2019 ICML AI for Social Good Workshop

  45. Interpretable Subgroup Discovery in Treatment Effect Estimation with Application to Opioid Prescribing Guidelines

    Authors: Chirag Nagpal, Dennis Wei, Bhanukiran Vinzamuri, Monica Shekhar, Sara E. Berger, Subhro Das, Kush R. Varshney

    Abstract: The dearth of prescribing guidelines for physicians is one key driver of the current opioid epidemic in the United States. In this work, we analyze medical and pharmaceutical claims data to draw insights on characteristics of patients who are more prone to adverse outcomes after an initial synthetic opioid prescription. Toward this end, we propose a generative model that allows discovery from obse… ▽ More

    Submitted 4 March, 2020; v1 submitted 8 May, 2019; originally announced May 2019.

    Journal ref: First ACM Conference on Health, Inference and Learning (CHIL) 2020

  46. arXiv:1812.06135  [pdf, other

    cs.LG cs.CY stat.ML

    Bias Mitigation Post-processing for Individual and Group Fairness

    Authors: Pranay K. Lohia, Karthikeyan Natesan Ramamurthy, Manish Bhide, Diptikalyan Saha, Kush R. Varshney, Ruchir Puri

    Abstract: Whereas previous post-processing approaches for increasing the fairness of predictions of biased classifiers address only group fairness, we propose a method for increasing both individual and group fairness. Our novel framework includes an individual bias detector used to prioritize data samples in a bias mitigation algorithm aiming to improve the group fairness measure of disparate impact. We sh… ▽ More

    Submitted 14 December, 2018; originally announced December 2018.

    Comments: 5 pages, 4 figures

  47. arXiv:1812.00099  [pdf, other

    cs.CV cs.CY stat.ML

    Understanding Unequal Gender Classification Accuracy from Face Images

    Authors: Vidya Muthukumar, Tejaswini Pedapati, Nalini Ratha, Prasanna Sattigeri, Chai-Wah Wu, Brian Kingsbury, Abhishek Kumar, Samuel Thomas, Aleksandra Mojsilovic, Kush R. Varshney

    Abstract: Recent work shows unequal performance of commercial face classification services in the gender classification task across intersectional groups defined by skin type and gender. Accuracy on dark-skinned females is significantly worse than on any other group. In this paper, we conduct several analyses to try to uncover the reason for this gap. The main finding, perhaps surprisingly, is that skin typ… ▽ More

    Submitted 30 November, 2018; originally announced December 2018.

  48. arXiv:1811.04896  [pdf, other

    cs.AI

    TED: Teaching AI to Explain its Decisions

    Authors: Michael Hind, Dennis Wei, Murray Campbell, Noel C. F. Codella, Amit Dhurandhar, Aleksandra Mojsilović, Karthikeyan Natesan Ramamurthy, Kush R. Varshney

    Abstract: Artificial intelligence systems are being increasingly deployed due to their potential to increase the efficiency, scale, consistency, fairness, and accuracy of decisions. However, as many of these systems are opaque in their operation, there is a growing demand for such systems to provide explanations for their decisions. Conventional approaches to this problem attempt to expose or discover the i… ▽ More

    Submitted 15 June, 2019; v1 submitted 12 November, 2018; originally announced November 2018.

    Comments: This article leverages some content from arXiv:1805.11648; presented at ACM/AAAI AIES'19

  49. arXiv:1811.01299  [pdf, other

    cs.CL cs.AI cs.CY

    SimplerVoice: A Key Message & Visual Description Generator System for Illiteracy

    Authors: Minh N. B. Nguyen, Samuel Thomas, Anne E. Gattiker, Sujatha Kashyap, Kush R. Varshney

    Abstract: We introduce SimplerVoice: a key message and visual description generator system to help low-literate adults navigate the information-dense world with confidence, on their own. SimplerVoice can automatically generate sensible sentences describing an unknown object, extract semantic meanings of the object usage in the form of a query string, then, represent the string as multiple types of visual gu… ▽ More

    Submitted 3 November, 2018; originally announced November 2018.

    Journal ref: Data For Good Exchange 2018

  50. arXiv:1810.11126  [pdf, other

    cs.DC

    Promoting Distributed Trust in Machine Learning and Computational Simulation via a Blockchain Network

    Authors: Nelson Kibichii Bore, Ravi Kiran Raman, Isaac M. Markus, Sekou L. Remy, Oliver Bent, Michael Hind, Eleftheria K. Pissadaki, Biplav Srivastava, Roman Vaculin, Kush R. Varshney, Komminist Weldemariam

    Abstract: Policy decisions are increasingly dependent on the outcomes of simulations and/or machine learning models. The ability to share and interact with these outcomes is relevant across multiple fields and is especially critical in the disease modeling community where models are often only accessible and workable to the researchers that generate them. This work presents a blockchain-enabled system that… ▽ More

    Submitted 25 October, 2018; originally announced October 2018.