research-article

Open access

Diversity and Inclusion Metrics in Subset Selection

Authors:

Margaret Mitchell,

Nyalleng Moorosi,

Ben Hutchinson,

Timnit Gebru, and

Jamie MorgensternAuthors Info & Claims

AIES '20: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society

February 2020

Pages 117 - 123

https://doi.org/10.1145/3375627.3375832

Published: 07 February 2020 Publication History

Abstract

The ethical concept of fairness has recently been applied in machine learning (ML) settings to describe a wide range of constraints and objectives. When considering the relevance of ethical concepts to subset selection problems, the concepts of diversity and inclusion are additionally applicable in order to create outputs that account for social power and access differentials. We introduce metrics based on these concepts, which can be applied together, separately, and in tandem with additional fairness constraints. Results from human subject experiments lend support to the proposed criteria. Social choice methods can additionally be leveraged to aggregate and choose preferable sets, and we detail how these may be applied.

References

[1]

Nima Anari, Shayan Oveis Gharan, and Alireza Rezaei. 2016. Monte Carlo Markov chain algorithms for sampling strongly Rayleigh distributions and determinantal point processes. In Conference on Learning Theory. 103--115.

[2]

Abolfazl Asudeh, Zhongjun Jin, and HV Jagadish. 2019. Assessing and remedying coverage for a given dataset. In 2019 IEEE 35th International Conference on Data Engineering (ICDE). IEEE, 554--565.

[3]

Solon Barocas and Andrew D Selbst. 2016. Big data's disparate impact. Calif. L. Rev. 104 (2016), 671.

[4]

Andrés Baselga, Alberto Jiménez-Valverde, and Gilles Niccolini. 2007. A multiplesite similarity measure independent of richness. Biology Letters 3, 6 (2007), 642--645.

[5]

Joyce M Bell and Douglas Hartmann. 2007. Diversity in everyday discourse: The cultural ambiguities and consequences of "happy talk". American Sociological Review 72, 6 (2007), 895--914.

[6]

Ellen Berrey. 2015. The Enigma of Diversity: The Language of Race and the Limits of Racial Justice. University of Chicago Press.

[7]

Ioannis Caragiannis, David Kurokawa, Hervé Moulin, Ariel D Procaccia, Nisarg Shah, and Junxing Wang. 2019. The unreasonable fairness of maximum Nash welfare. ACM Transactions on Economics and Computation (TEAC) 7, 3 (2019), 12.

Digital Library

[8]

L Elisa Celis, Amit Deshpande, Tarun Kathuria, and Nisheeth K Vishnoi. 2016. How to be fair and diverse? arXiv preprint arXiv:1610.07183 (2016).

[9]

L. Elisa Celis, Damian Straszak, and Nisheeth K. Vishnoi. 2017. Ranking with Fairness Constraints. CoRR abs/1704.06840 (2017). arXiv:1704.06840 http://arxiv. org/abs/1704.06840

[10]

Sapna Cheryan, Victoria C Plaut, Caitlin Handron, and Lauren Hudson. 2013. The stereotypical computer scientist: Gendered media representations as a barrier to inclusion for women. Sex roles 69, 1--2 (2013), 58--71.

[11]

Amit Deshpande and Luis Rademacher. 2010. Efficient volume sampling for row/column subset selection. In 2010 IEEE 51st Annual Symposium on Foundations of Computer Science. IEEE, 329--338.

Digital Library

[12]

Frank Dobbin and Alexandra Kalev. 2016. Why Diversity Programs Fail and What Works Better. Harvard Business Review 94, 7--8 (2016), 52--60.

[13]

Marina Drosou, HV Jagadish, Evaggelia Pitoura, and Julia Stoyanovich. 2017. Diversity in big data: A review. Big data 5, 2 (2017), 73--84.

[14]

Cynthia Dwork, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Richard Zemel. 2012. Fairness through awareness. In Proceedings of the 3rd innovations in theoretical computer science conference. ACM, 214--226.

Digital Library

[15]

David G Embrick. 2011. The diversity ideology in the business world: A new oppression for a new age. Critical sociology 37, 5 (2011), 541--556.

[16]

Thomas B. Fitzpatrick. 1988. The Validity and Practicality of Sun-Reactive Skin Types I Through VI. JAMA Dermatology 124, 6 (06 1988), 869--871. https: //doi.org/10.1001/archderm.1988.01670060015008

[17]

Boqing Gong, Wei-Lun Chao, Kristen Grauman, and Fei Sha. 2014. Diverse sequential subset selection for supervised video summarization. In Advances in Neural Information Processing Systems. 2069--2077.

[18]

Lou Jost et al. 2009. Mismeasuring biological diversity: response to Hoffmann and Hoffmann (2008). Ecological Economics 68, 4 (2009), 925--928.

[19]

Alexandra Kalev, Frank Dobbin, and Erin Kelly. 2006. Best practices or best guesses? Assessing the efficacy of corporate affirmative action and diversity policies. American sociological review 71, 4 (2006), 589--617.

[20]

Alex Kulesza, Ben Taskar, et al. 2012. Determinantal point processes for machine learning. Foundations and Trends® in Machine Learning 5, 2--3 (2012), 123--286.

[21]

Pierre Legendre, Daniel Borcard, and Pedro R Peres-Neto. 2008. Analyzing or explaining beta diversity? Comment. Ecology 89, 11 (2008), 3238--3244.

[22]

Hui Lin and Jeff A Bilmes. 2012. Learning mixtures of submodular shells with application to document summarization. arXiv preprint arXiv:1210.4871 (2012).

[23]

Robert H MacArthur. 1965. Patterns of species diversity. Biological reviews 40, 4 (1965), 510--533.

[24]

John Stuart Mill. 2016. Utilitarianism. In Seven masterpieces of philosophy. Routledge, 337--383.

[25]

Michal E Mor-Barak and David A Cherin. 1998. A tool to expand organizational understanding ofworkforce diversity: Exploring a measure of inclusion-exclusion. Administration in Social Work 22, 1 (1998), 47--64.

[26]

Anthony Paradiso. 2017. Diversity is Being Asked to the Party. Inclusion is Being Asked to Dance. #SHRMDIV. The Society for Human Resource Management (SHRM) Blog (2017). https://blog.shrm.org/blog/diversity-is-being-asked-to-theparty- inclusion-is-being-asked-to-dance-shr

[27]

Lisa H. Pelled, Gerald E Ledford, Jr, and Susan A. Mohrman. 1999. Demographic dissimilarity and workplace inclusion. Journal of Management studies 36, 7 (1999), 1013--1031.

[28]

John Rawls. 1974. Some reasons for the maximin criterion. The American Economic Review 64, 2 (1974), 141--146.

[29]

Quinetta M Roberson. 2006. Disentangling the meanings of diversity and inclusion in organizations. Group & Organization Management 31, 2 (2006), 212--236.

[30]

Samira Samadi, Uthaipon Tantipongpipat, Jamie H Morgenstern, Mohit Singh, and Santosh Vempala. 2018. The price of fair PCA: One extra dimension. In Advances in Neural Information Processing Systems. 10976--10987.

[31]

Lynn M Shore, Amy E Randel, Beth G Chung, Michelle A Dean, Karen Holcombe Ehrhart, and Gangaram Singh. 2011. Inclusion and diversity in work groups: A review and model for future research. Journal of management 37, 4 (2011), 1262--1289.

[32]

Ashudeep Singh and Thorsten Joachims. 2017. Equality of opportunity in rankings. In Workshop on Prioritizing Online Content (WPOC) at NIPS.

[33]

Tonie Snell. 2017. Tokenism: The Result of Diversity Without Inclusion. Medium (2017). https://medium.com/@TonieSnell/tokenism-the-result-of-diversitywithout- inclusion-460061db1eb6

[34]

Hanna Tuomisto. 2011. Commentary: do we have a consistent terminology for species diversity? Yes, if we choose to use it. Oecologia 167, 4 (2011), 903--911.

[35]

Robert Harding Whittaker. 1960. Vegetation of the Siskiyou mountains, Oregon and California. Ecological monographs 30, 3 (1960), 279--338.

[36]

Ke Yang and Julia Stoyanovich. 2017. Measuring fairness in ranked outputs. In Proceedings of the 29th International Conference on Scientific and Statistical Database Management. ACM, 22.

Digital Library

[37]

Tao Zhou, Zoltán Kuscsik, Jian-Guo Liu, Matú? Medo, Joseph Rushton Wakeling, and Yi-Cheng Zhang. 2010. Solving the apparent diversity-accuracy dilemma of recommender systems. Proceedings of the National Academy of Sciences 107, 10 (2010), 4511--4515.

Cited By

Tseng TDavidson MMorales-Navarro LChen JDelaney VLeibowitz MBeason JShapiro R(2024)Co-ML: Collaborative Machine Learning Model Building for Developing Dataset Design PracticesACM Transactions on Computing Education10.1145/364155224:2(1-37)Online publication date: 16-Apr-2024
https://dl.acm.org/doi/10.1145/3641552
Vrijenhoek SDaniil SSandel JHollink L(2024)Diversity of What? On the Different Conceptualizations of Diversity in Recommender SystemsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3658926(573-584)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3630106.3658926
Zhang LLiao XYang ZGao BWang CYang QLi D(2024)Partiality and Misconception: Investigating Cultural Representativeness in Text-to-Image ModelsProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642877(1-25)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642877
Show More Cited By

Index Terms

Diversity and Inclusion Metrics in Subset Selection
1. Information systems
  1. Information retrieval
    1. Evaluation of retrieval results
    2. Retrieval models and ranking
      1. Information retrieval diversity

Recommendations

Dynamic Mutation Based Pareto Optimization for Subset Selection
Intelligent Computing Methodologies
Abstract
Subset selection that selects the best k variables from n variables is a fundamental problem in many areas. Pareto optimization for subset selection (called POSS) is a recently proposed approach for subset selection based on Pareto optimization ...
Read More
Accelerating differential evolution based on a subset-to-subset survivor selection operator

Differential evolution (DE) is one of the most powerful and effective evolutionary algorithms for solving global optimization problems. However, just like all other metaheuristics, DE also has some drawbacks, such as slow and/or premature convergence. ...
Read More
Maintaining Healthy Population Diversity Using Adaptive Crossover, Mutation, and Selection

This paper presents ACROMUSE, a novel genetic algorithm (GA) which adapts crossover, mutation, and selection parameters. ACROMUSEs objective is to create and maintain a diverse population of highly-fit (healthy) individuals, capable of adapting quickly ...
Read More

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

AIES '20: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society

February 2020

439 pages

ISBN:9781450371100

DOI:10.1145/3375627

General Chairs:
Annette Markham
Aarhus University | Loyola University
,
Julia Powles
University of Western Australia
,
Toby Walsh
TU Berlin | University of New South Wales | Data61
,
Anne L. Washington
New York University

Copyright © 2020 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

SIGAI: ACM Special Interest Group on Artificial Intelligence

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 February 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

AIES '20

Sponsor:

SIGAI

AIES '20: AAAI/ACM Conference on AI, Ethics, and Society

February 7 - 9, 2020

NY, New York, USA

Acceptance Rates

Overall Acceptance Rate 61 of 162 submissions, 38%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

46
Total Citations
View Citations
4,119
Total Downloads

Downloads (Last 12 months)1,020
Downloads (Last 6 weeks)88

Other Metrics

View Author Metrics

Citations

Cited By

Tseng TDavidson MMorales-Navarro LChen JDelaney VLeibowitz MBeason JShapiro R(2024)Co-ML: Collaborative Machine Learning Model Building for Developing Dataset Design PracticesACM Transactions on Computing Education10.1145/364155224:2(1-37)Online publication date: 16-Apr-2024
https://dl.acm.org/doi/10.1145/3641552
Vrijenhoek SDaniil SSandel JHollink L(2024)Diversity of What? On the Different Conceptualizations of Diversity in Recommender SystemsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3658926(573-584)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3630106.3658926
Zhang LLiao XYang ZGao BWang CYang QLi D(2024)Partiality and Misconception: Investigating Cultural Representativeness in Text-to-Image ModelsProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642877(1-25)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642877
Gleason JGhosh ARobertson RWilson CChua TNgo CKa-Wei Lee RKumar RLauw H(2024)Perceptions in Pixels: Analyzing Perceived Gender and Skin Tone in Real-world Image Search ResultsProceedings of the ACM on Web Conference 202410.1145/3589334.3645666(1249-1259)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589334.3645666
Kleanthous S(2024)Human-AI Teaming: Following the IMOI FrameworkArtificial Intelligence in HCI10.1007/978-3-031-60611-3_27(387-406)Online publication date: 29-Jun-2024
https://dl.acm.org/doi/10.1007/978-3-031-60611-3_27
Luccioni AAkiki CMitchell MJernite YOh ANaumann TGloberson ASaenko KHardt MLevine S(2023)Stable biasProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668580(56338-56351)Online publication date: 10-Dec-2023
https://dl.acm.org/doi/10.5555/3666122.3668580
Schumann COlanubi GWright AMonk EHeldreth CRicco SOh ANaumann TGloberson ASaenko KHardt MLevine S(2023)Consensus and subjectivity of skin tone annotation for ML fairnessProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667442(30319-30348)Online publication date: 10-Dec-2023
https://dl.acm.org/doi/10.5555/3666122.3667442
Wang CLiu SYang HGuo JWu YLiu J(2023)Ethical Considerations of Using ChatGPT in Health CareJournal of Medical Internet Research10.2196/4800925(e48009)Online publication date: 11-Aug-2023
https://doi.org/10.2196/48009
Ozel ASmilowitz KGoldstein L(2023)Community-Engaged School District Design: A Stream-Based ApproachSSRN Electronic Journal10.2139/ssrn.4610313Online publication date: 2023
https://doi.org/10.2139/ssrn.4610313
Litts BAlladin JTehee MCardona-Rivera R(2023)Gathering as Design Process: Physical Prototyping for Culturally Sustaining Computational TechnologiesProceedings of the 2023 Symposium on Learning, Design and Technology10.1145/3594781.3594802(107-113)Online publication date: 23-Jun-2023
https://dl.acm.org/doi/10.1145/3594781.3594802
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents