Skip to main content

Showing 1–16 of 16 results for author: Hagendorff, T

  1. arXiv:2402.08323  [pdf

    cs.CY cs.AI

    Mapping the Ethics of Generative AI: A Comprehensive Scoping Review

    Authors: Thilo Hagendorff

    Abstract: The advent of generative artificial intelligence and the widespread adoption of it in society engendered intensive debates about its ethical implications and risks. These risks often differ from those associated with traditional discriminative machine learning. To synthesize the recent discourse and map its normative concepts, we conducted a scoping review on the ethics of generative artificial in… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  2. Fairness Hacking: The Malicious Practice of Shrouding Unfairness in Algorithms

    Authors: Kristof Meding, Thilo Hagendorff

    Abstract: Fairness in machine learning (ML) is an ever-growing field of research due to the manifold potential for harm from algorithmic discrimination. To prevent such harm, a large body of literature develops new approaches to quantify fairness. Here, we investigate how one can divert the quantification of fairness by describing a practice we call "fairness hacking" for the purpose of shrouding unfairness… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  3. arXiv:2307.16513  [pdf

    cs.CL cs.AI cs.LG

    Deception Abilities Emerged in Large Language Models

    Authors: Thilo Hagendorff

    Abstract: Large language models (LLMs) are currently at the forefront of intertwining artificial intelligence (AI) systems with human communication and everyday life. Thus, aligning them with human values is of great importance. However, given the steady increase in reasoning abilities, future LLMs are under suspicion of becoming able to deceive human operators and utilizing this ability to bypass monitorin… ▽ More

    Submitted 2 February, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

  4. arXiv:2306.07622   

    cs.CL cs.AI cs.LG

    Human-Like Intuitive Behavior and Reasoning Biases Emerged in Language Models -- and Disappeared in GPT-4

    Authors: Thilo Hagendorff, Sarah Fabi

    Abstract: Large language models (LLMs) are currently at the forefront of intertwining AI systems with human communication and everyday life. Therefore, it is of great importance to evaluate their emerging abilities. In this study, we show that LLMs, most notably GPT-3, exhibit behavior that strikingly resembles human-like intuition -- and the cognitive errors that come with it. However, LLMs with higher cog… ▽ More

    Submitted 18 August, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Overlap with arXiv:2212.05206

  5. arXiv:2303.13988  [pdf

    cs.CL cs.AI

    Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

    Authors: Thilo Hagendorff

    Abstract: Large language models (LLMs) are currently at the forefront of intertwining AI systems with human communication and everyday life. Due to rapid technological advances and their extreme versatility, LLMs nowadays have millions of users and are at the cusp of being the main go-to technology for information retrieval, content generation, problem-solving, etc. Therefore, it is of great importance to t… ▽ More

    Submitted 8 July, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

  6. arXiv:2301.06859  [pdf

    cs.HC cs.AI cs.CL cs.LG

    Methodological reflections for AI alignment research using human feedback

    Authors: Thilo Hagendorff, Sarah Fabi

    Abstract: The field of artificial intelligence (AI) alignment aims to investigate whether AI technologies align with human interests and values and function in a safe and ethical manner. AI alignment is particularly relevant for large language models (LLMs), which have the potential to exhibit unintended behavior due to their ability to learn and adapt in ways that are difficult to predict. In this paper, w… ▽ More

    Submitted 22 December, 2022; originally announced January 2023.

  7. arXiv:2212.05206  [pdf

    cs.CL cs.AI cs.LG

    Thinking Fast and Slow in Large Language Models

    Authors: Thilo Hagendorff, Sarah Fabi, Michal Kosinski

    Abstract: Large language models (LLMs) are currently at the forefront of intertwining AI systems with human communication and everyday life. Therefore, it is of great importance to evaluate their emerging abilities. In this study, we show that LLMs like GPT-3 exhibit behavior that strikingly resembles human-like intuition - and the cognitive errors that come with it. However, LLMs with higher cognitive capa… ▽ More

    Submitted 2 August, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

  8. How to Assess Trustworthy AI in Practice

    Authors: Roberto V. Zicari, Julia Amann, Frédérick Bruneault, Megan Coffee, Boris Düdder, Eleanore Hickman, Alessio Gallucci, Thomas Krendl Gilbert, Thilo Hagendorff, Irmhild van Halem, Elisabeth Hildt, Sune Holm, Georgios Kararigas, Pedro Kringen, Vince I. Madai, Emilie Wiinblad Mathez, Jesmin Jahan Tithi, Dennis Vetter, Magnus Westerlund, Renee Wurth

    Abstract: This report is a methodological reflection on Z-Inspection$^{\small{\circledR}}$. Z-Inspection$^{\small{\circledR}}$ is a holistic process used to evaluate the trustworthiness of AI-based technologies at different stages of the AI lifecycle. It focuses, in particular, on the identification and discussion of ethical issues and tensions through the elaboration of socio-technical scenarios. It uses t… ▽ More

    Submitted 28 June, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: On behalf of the Z-Inspection$^{\small{\circledR}}$ initiative (2022)

  9. Why we need biased AI -- How including cognitive and ethical machine biases can enhance AI systems

    Authors: Sarah Fabi, Thilo Hagendorff

    Abstract: This paper stresses the importance of biases in the field of artificial intelligence (AI) in two regards. First, in order to foster efficient algorithmic decision-making in complex, unstable, and uncertain real-world environments, we argue for the structurewise implementation of human cognitive biases in learning algorithms. Secondly, we argue that in order to achieve ethical machine behavior, fil… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  10. arXiv:2202.10848  [pdf

    cs.LG cs.AI cs.CY

    Speciesist bias in AI -- How AI applications perpetuate discrimination and unfair outcomes against animals

    Authors: Thilo Hagendorff, Leonie Bossert, Tse Yip Fai, Peter Singer

    Abstract: Massive efforts are made to reduce biases in both data and algorithms in order to render AI applications fair. These efforts are propelled by various high-profile cases where biased algorithmic decision-making caused harm to women, people of color, minorities, etc. However, the AI fairness field still succumbs to a blind spot, namely its insensitivity to discrimination against animals. This paper… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  11. arXiv:2011.12750  [pdf

    cs.CY cs.AI cs.LG

    AI virtues -- The missing link in putting AI ethics into practice

    Authors: Thilo Hagendorff

    Abstract: Several seminal ethics initiatives have stipulated sets of principles and standards for good technology development in the AI sector. However, widespread criticism has pointed out a lack of practical realization of these principles. Following that, AI ethics underwent a practical turn, but without deviating from the principled approach and the many shortcomings associated with it. This paper propo… ▽ More

    Submitted 18 February, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  12. arXiv:2008.11463  [pdf

    cs.CY cs.AI cs.LG

    Ethical behavior in humans and machines -- Evaluating training data quality for beneficial machine learning

    Authors: Thilo Hagendorff

    Abstract: Machine behavior that is based on learning algorithms can be significantly influenced by the exposure to data of different qualities. Up to now, those qualities are solely measured in technical terms, but not in ethical ones, despite the significant role of training and annotation data in supervised machine learning. This is the first study to fill this gap by describing new dimensions of data qua… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: 30 pages, 1 figure

  13. Ethical Considerations and Statistical Analysis of Industry Involvement in Machine Learning Research

    Authors: Thilo Hagendorff, Kristof Meding

    Abstract: Industry involvement in the machine learning (ML) community seems to be increasing. However, the quantitative scale and ethical implications of this influence are rather unknown. For this purpose, we have not only carried out an informed ethical analysis of the field, but have inspected all papers of the main ML conferences NeurIPS, CVPR, and ICML of the last 5 years - almost 11,000 papers in tota… ▽ More

    Submitted 19 October, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

  14. arXiv:1911.08603  [pdf

    cs.LG cs.AI cs.CY stat.ML

    Forbidden knowledge in machine learning -- Reflections on the limits of research and publication

    Authors: Thilo Hagendorff

    Abstract: Certain research strands can yield "forbidden knowledge". This term refers to knowledge that is considered too sensitive, dangerous or taboo to be produced or shared. Discourses about such publication restrictions are already entrenched in scientific fields like IT security, synthetic biology or nuclear physics research. This paper makes the case for transferring this discourse to machine learning… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

  15. arXiv:1907.03848  [pdf

    cs.CY cs.AI cs.LG

    Artificial Intelligence Governance and Ethics: Global Perspectives

    Authors: Angela Daly, Thilo Hagendorff, Li Hui, Monique Mann, Vidushi Marda, Ben Wagner, Wei Wang, Saskia Witteborn

    Abstract: Artificial intelligence (AI) is a technology which is increasingly being utilised in society and the economy worldwide, and its implementation is planned to become more prevalent in coming years. AI is increasingly being embedded in our lives, supplementing our pervasive use of digital technologies. But this is being accompanied by disquiet over problematic and dangerous implementations of AI, or… ▽ More

    Submitted 28 June, 2019; originally announced July 2019.

  16. arXiv:1903.03425  [pdf

    cs.AI cs.CY cs.LG stat.ML

    The Ethics of AI Ethics -- An Evaluation of Guidelines

    Authors: Thilo Hagendorff

    Abstract: Current advances in research, development and application of artificial intelligence (AI) systems have yielded a far-reaching discourse on AI ethics. In consequence, a number of ethics guidelines have been released in recent years. These guidelines comprise normative principles and recommendations aimed to harness the "disruptive" potentials of new AI technologies. Designed as a comprehensive eval… ▽ More

    Submitted 11 October, 2019; v1 submitted 28 February, 2019; originally announced March 2019.

    Comments: 16 pages, 1 table

    Journal ref: Minds & Machines, 2020