skip to main content
research-article
Open access

Diversity and Inclusion Metrics in Subset Selection

Published: 07 February 2020 Publication History
  • Get Citation Alerts
  • Abstract

    The ethical concept of fairness has recently been applied in machine learning (ML) settings to describe a wide range of constraints and objectives. When considering the relevance of ethical concepts to subset selection problems, the concepts of diversity and inclusion are additionally applicable in order to create outputs that account for social power and access differentials. We introduce metrics based on these concepts, which can be applied together, separately, and in tandem with additional fairness constraints. Results from human subject experiments lend support to the proposed criteria. Social choice methods can additionally be leveraged to aggregate and choose preferable sets, and we detail how these may be applied.

    References

    [1]
    Nima Anari, Shayan Oveis Gharan, and Alireza Rezaei. 2016. Monte Carlo Markov chain algorithms for sampling strongly Rayleigh distributions and determinantal point processes. In Conference on Learning Theory. 103--115.
    [2]
    Abolfazl Asudeh, Zhongjun Jin, and HV Jagadish. 2019. Assessing and remedying coverage for a given dataset. In 2019 IEEE 35th International Conference on Data Engineering (ICDE). IEEE, 554--565.
    [3]
    Solon Barocas and Andrew D Selbst. 2016. Big data's disparate impact. Calif. L. Rev. 104 (2016), 671.
    [4]
    Andrés Baselga, Alberto Jiménez-Valverde, and Gilles Niccolini. 2007. A multiplesite similarity measure independent of richness. Biology Letters 3, 6 (2007), 642--645.
    [5]
    Joyce M Bell and Douglas Hartmann. 2007. Diversity in everyday discourse: The cultural ambiguities and consequences of "happy talk". American Sociological Review 72, 6 (2007), 895--914.
    [6]
    Ellen Berrey. 2015. The Enigma of Diversity: The Language of Race and the Limits of Racial Justice. University of Chicago Press.
    [7]
    Ioannis Caragiannis, David Kurokawa, Hervé Moulin, Ariel D Procaccia, Nisarg Shah, and Junxing Wang. 2019. The unreasonable fairness of maximum Nash welfare. ACM Transactions on Economics and Computation (TEAC) 7, 3 (2019), 12.
    [8]
    L Elisa Celis, Amit Deshpande, Tarun Kathuria, and Nisheeth K Vishnoi. 2016. How to be fair and diverse? arXiv preprint arXiv:1610.07183 (2016).
    [9]
    L. Elisa Celis, Damian Straszak, and Nisheeth K. Vishnoi. 2017. Ranking with Fairness Constraints. CoRR abs/1704.06840 (2017). arXiv:1704.06840 http://arxiv. org/abs/1704.06840
    [10]
    Sapna Cheryan, Victoria C Plaut, Caitlin Handron, and Lauren Hudson. 2013. The stereotypical computer scientist: Gendered media representations as a barrier to inclusion for women. Sex roles 69, 1--2 (2013), 58--71.
    [11]
    Amit Deshpande and Luis Rademacher. 2010. Efficient volume sampling for row/column subset selection. In 2010 IEEE 51st Annual Symposium on Foundations of Computer Science. IEEE, 329--338.
    [12]
    Frank Dobbin and Alexandra Kalev. 2016. Why Diversity Programs Fail and What Works Better. Harvard Business Review 94, 7--8 (2016), 52--60.
    [13]
    Marina Drosou, HV Jagadish, Evaggelia Pitoura, and Julia Stoyanovich. 2017. Diversity in big data: A review. Big data 5, 2 (2017), 73--84.
    [14]
    Cynthia Dwork, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Richard Zemel. 2012. Fairness through awareness. In Proceedings of the 3rd innovations in theoretical computer science conference. ACM, 214--226.
    [15]
    David G Embrick. 2011. The diversity ideology in the business world: A new oppression for a new age. Critical sociology 37, 5 (2011), 541--556.
    [16]
    Thomas B. Fitzpatrick. 1988. The Validity and Practicality of Sun-Reactive Skin Types I Through VI. JAMA Dermatology 124, 6 (06 1988), 869--871. https: //doi.org/10.1001/archderm.1988.01670060015008
    [17]
    Boqing Gong, Wei-Lun Chao, Kristen Grauman, and Fei Sha. 2014. Diverse sequential subset selection for supervised video summarization. In Advances in Neural Information Processing Systems. 2069--2077.
    [18]
    Lou Jost et al. 2009. Mismeasuring biological diversity: response to Hoffmann and Hoffmann (2008). Ecological Economics 68, 4 (2009), 925--928.
    [19]
    Alexandra Kalev, Frank Dobbin, and Erin Kelly. 2006. Best practices or best guesses? Assessing the efficacy of corporate affirmative action and diversity policies. American sociological review 71, 4 (2006), 589--617.
    [20]
    Alex Kulesza, Ben Taskar, et al. 2012. Determinantal point processes for machine learning. Foundations and Trends® in Machine Learning 5, 2--3 (2012), 123--286.
    [21]
    Pierre Legendre, Daniel Borcard, and Pedro R Peres-Neto. 2008. Analyzing or explaining beta diversity? Comment. Ecology 89, 11 (2008), 3238--3244.
    [22]
    Hui Lin and Jeff A Bilmes. 2012. Learning mixtures of submodular shells with application to document summarization. arXiv preprint arXiv:1210.4871 (2012).
    [23]
    Robert H MacArthur. 1965. Patterns of species diversity. Biological reviews 40, 4 (1965), 510--533.
    [24]
    John Stuart Mill. 2016. Utilitarianism. In Seven masterpieces of philosophy. Routledge, 337--383.
    [25]
    Michal E Mor-Barak and David A Cherin. 1998. A tool to expand organizational understanding ofworkforce diversity: Exploring a measure of inclusion-exclusion. Administration in Social Work 22, 1 (1998), 47--64.
    [26]
    Anthony Paradiso. 2017. Diversity is Being Asked to the Party. Inclusion is Being Asked to Dance. #SHRMDIV. The Society for Human Resource Management (SHRM) Blog (2017). https://blog.shrm.org/blog/diversity-is-being-asked-to-theparty- inclusion-is-being-asked-to-dance-shr
    [27]
    Lisa H. Pelled, Gerald E Ledford, Jr, and Susan A. Mohrman. 1999. Demographic dissimilarity and workplace inclusion. Journal of Management studies 36, 7 (1999), 1013--1031.
    [28]
    John Rawls. 1974. Some reasons for the maximin criterion. The American Economic Review 64, 2 (1974), 141--146.
    [29]
    Quinetta M Roberson. 2006. Disentangling the meanings of diversity and inclusion in organizations. Group & Organization Management 31, 2 (2006), 212--236.
    [30]
    Samira Samadi, Uthaipon Tantipongpipat, Jamie H Morgenstern, Mohit Singh, and Santosh Vempala. 2018. The price of fair PCA: One extra dimension. In Advances in Neural Information Processing Systems. 10976--10987.
    [31]
    Lynn M Shore, Amy E Randel, Beth G Chung, Michelle A Dean, Karen Holcombe Ehrhart, and Gangaram Singh. 2011. Inclusion and diversity in work groups: A review and model for future research. Journal of management 37, 4 (2011), 1262--1289.
    [32]
    Ashudeep Singh and Thorsten Joachims. 2017. Equality of opportunity in rankings. In Workshop on Prioritizing Online Content (WPOC) at NIPS.
    [33]
    Tonie Snell. 2017. Tokenism: The Result of Diversity Without Inclusion. Medium (2017). https://medium.com/@TonieSnell/tokenism-the-result-of-diversitywithout- inclusion-460061db1eb6
    [34]
    Hanna Tuomisto. 2011. Commentary: do we have a consistent terminology for species diversity? Yes, if we choose to use it. Oecologia 167, 4 (2011), 903--911.
    [35]
    Robert Harding Whittaker. 1960. Vegetation of the Siskiyou mountains, Oregon and California. Ecological monographs 30, 3 (1960), 279--338.
    [36]
    Ke Yang and Julia Stoyanovich. 2017. Measuring fairness in ranked outputs. In Proceedings of the 29th International Conference on Scientific and Statistical Database Management. ACM, 22.
    [37]
    Tao Zhou, Zoltán Kuscsik, Jian-Guo Liu, Matú? Medo, Joseph Rushton Wakeling, and Yi-Cheng Zhang. 2010. Solving the apparent diversity-accuracy dilemma of recommender systems. Proceedings of the National Academy of Sciences 107, 10 (2010), 4511--4515.

    Cited By

    View all
    • (2024)Co-ML: Collaborative Machine Learning Model Building for Developing Dataset Design PracticesACM Transactions on Computing Education10.1145/364155224:2(1-37)Online publication date: 16-Apr-2024
    • (2024)Diversity of What? On the Different Conceptualizations of Diversity in Recommender SystemsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3658926(573-584)Online publication date: 3-Jun-2024
    • (2024)Partiality and Misconception: Investigating Cultural Representativeness in Text-to-Image ModelsProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642877(1-25)Online publication date: 11-May-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    AIES '20: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society
    February 2020
    439 pages
    ISBN:9781450371100
    DOI:10.1145/3375627
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 February 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. diversity and inclusion
    2. machine learning fairness
    3. subset selection

    Qualifiers

    • Research-article

    Conference

    AIES '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 61 of 162 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1,020
    • Downloads (Last 6 weeks)88

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Co-ML: Collaborative Machine Learning Model Building for Developing Dataset Design PracticesACM Transactions on Computing Education10.1145/364155224:2(1-37)Online publication date: 16-Apr-2024
    • (2024)Diversity of What? On the Different Conceptualizations of Diversity in Recommender SystemsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3658926(573-584)Online publication date: 3-Jun-2024
    • (2024)Partiality and Misconception: Investigating Cultural Representativeness in Text-to-Image ModelsProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642877(1-25)Online publication date: 11-May-2024
    • (2024)Perceptions in Pixels: Analyzing Perceived Gender and Skin Tone in Real-world Image Search ResultsProceedings of the ACM on Web Conference 202410.1145/3589334.3645666(1249-1259)Online publication date: 13-May-2024
    • (2024)Human-AI Teaming: Following the IMOI FrameworkArtificial Intelligence in HCI10.1007/978-3-031-60611-3_27(387-406)Online publication date: 29-Jun-2024
    • (2023)Stable biasProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668580(56338-56351)Online publication date: 10-Dec-2023
    • (2023)Consensus and subjectivity of skin tone annotation for ML fairnessProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667442(30319-30348)Online publication date: 10-Dec-2023
    • (2023)Ethical Considerations of Using ChatGPT in Health CareJournal of Medical Internet Research10.2196/4800925(e48009)Online publication date: 11-Aug-2023
    • (2023)Community-Engaged School District Design: A Stream-Based ApproachSSRN Electronic Journal10.2139/ssrn.4610313Online publication date: 2023
    • (2023)Gathering as Design Process: Physical Prototyping for Culturally Sustaining Computational TechnologiesProceedings of the 2023 Symposium on Learning, Design and Technology10.1145/3594781.3594802(107-113)Online publication date: 23-Jun-2023
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media