-
Challenging the Human-in-the-loop in Algorithmic Decision-making
Authors:
Sebastian Tschiatschek,
Eugenia Stamboliev,
Timoth ee Schmude,
Mark Coeckelbergh,
Laura Koesten
Abstract:
We discuss the role of humans in algorithmic decision-making (ADM) for socially relevant problems from a technical and philosophical perspective. In particular, we illustrate tensions arising from diverse expectations, values, and constraints by and on the humans involved. To this end, we assume that a strategic decision-maker (SDM) introduces ADM to optimize strategic and societal goals while the…
▽ More
We discuss the role of humans in algorithmic decision-making (ADM) for socially relevant problems from a technical and philosophical perspective. In particular, we illustrate tensions arising from diverse expectations, values, and constraints by and on the humans involved. To this end, we assume that a strategic decision-maker (SDM) introduces ADM to optimize strategic and societal goals while the algorithms' recommended actions are overseen by a practical decision-maker (PDM) - a specific human-in-the-loop - who makes the final decisions. While the PDM is typically assumed to be a corrective, it can counteract the realization of the SDM's desired goals and societal values not least because of a misalignment of these values and unmet information needs of the PDM. This has significant implications for the distribution of power between the stakeholders in ADM, their constraints, and information needs. In particular, we emphasize the overseeing PDM's role as a potential political and ethical decision maker, who acts expected to balance strategic, value-driven objectives and on-the-ground individual decisions and constraints. We demonstrate empirically, on a machine learning benchmark dataset, the significant impact an overseeing PDM's decisions can have even if the PDM is constrained to performing only a limited amount of actions differing from the algorithms' recommendations. To ensure that the SDM's intended values are realized, the PDM needs to be provided with appropriate information conveyed through tailored explanations and its role must be characterized clearly. Our findings emphasize the need for an in-depth discussion of the role and power of the PDM and challenge the often-taken view that just including a human-in-the-loop in ADM ensures the 'correct' and 'ethical' functioning of the system.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Tensions between Preference and Performance: Designing for Visual Exploration of Multi-frequency Medical Network Data
Authors:
Christian Knoll,
Laura Koesten,
Isotta Rigoni,
Serge Vulliémoz,
Torsten Möller
Abstract:
The analysis of complex high-dimensional data is a common task in many domains, resulting in bespoke visual exploration tools. Expectations and practices of domain experts as users do not always align with visualization theory. In this paper, we report on a design study in the medical domain where we developed two high-fidelity prototypes encoding EEG-derived brain network data with different type…
▽ More
The analysis of complex high-dimensional data is a common task in many domains, resulting in bespoke visual exploration tools. Expectations and practices of domain experts as users do not always align with visualization theory. In this paper, we report on a design study in the medical domain where we developed two high-fidelity prototypes encoding EEG-derived brain network data with different types of visualizations. We evaluate these prototypes regarding effectiveness, efficiency, and preference with two groups: participants with domain knowledge (domain experts in medical research) and those without domain knowledge, both groups having little or no visualization experience. A requirement analysis and study of low-fidelity prototypes revealed a strong preference for a novel and aesthetically pleasing visualization design, as opposed to a design that is considered more optimal based on visualization theory. Our study highlights the pros and cons of both approaches, discussing trade-offs between task-specific measurements and subjective preference. While the aesthetically pleasing and novel low-fidelity prototype was favored, the results of our evaluation show that, in most cases, this was not reflected in participants' performance or subjective preference for the high-fidelity prototypes.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Information That Matters: Exploring Information Needs of People Affected by Algorithmic Decisions
Authors:
Timothée Schmude,
Laura Koesten,
Torsten Möller,
Sebastian Tschiatschek
Abstract:
Explanations of AI systems rarely address the information needs of people affected by algorithmic decision-making (ADM). This gap between conveyed information and information that matters to affected stakeholders can impede understanding and adherence to regulatory frameworks such as the AI Act. To address this gap, we present the "XAI Novice Question Bank": A catalog of affected stakeholders' inf…
▽ More
Explanations of AI systems rarely address the information needs of people affected by algorithmic decision-making (ADM). This gap between conveyed information and information that matters to affected stakeholders can impede understanding and adherence to regulatory frameworks such as the AI Act. To address this gap, we present the "XAI Novice Question Bank": A catalog of affected stakeholders' information needs in two ADM use cases (employment prediction and health monitoring), covering the categories data, system context, system usage, and system specifications. Information needs were gathered in an interview study where participants received explanations in response to their inquiries. Participants further reported their understanding and decision confidence, showing that while confidence tended to increase after receiving explanations, participants also met understanding challenges, such as being unable to tell why their understanding felt incomplete. Explanations further influenced participants' perceptions of the systems' risks and benefits, which they confirmed or changed depending on the use case. When risks were perceived as high, participants expressed particular interest in explanations about intention, such as why and to what end a system was put in place. With this work, we aim to support the inclusion of affected stakeholders into explainability by contributing an overview of information and challenges relevant to them when deciding on the adoption of ADM systems. We close by summarizing our findings in a list of six key implications that inform the design of future explanations for affected stakeholder audiences.
△ Less
Submitted 29 January, 2024; v1 submitted 24 January, 2024;
originally announced January 2024.
-
Data journeys in popular science: Producing climate change and COVID-19 data visualizations at Scientific American
Authors:
Kathleen Gregory,
Laura Koesten,
Regina Schuster,
Torsten Möller,
Sarah Davies
Abstract:
Vast amounts of (open) data are increasingly used to make arguments about crisis topics such as climate change and global pandemics. Data visualizations are central to bringing these viewpoints to broader publics. However, visualizations often conceal the many contexts involved in their production, ranging from decisions made in research labs about collecting and sharing data to choices made in ed…
▽ More
Vast amounts of (open) data are increasingly used to make arguments about crisis topics such as climate change and global pandemics. Data visualizations are central to bringing these viewpoints to broader publics. However, visualizations often conceal the many contexts involved in their production, ranging from decisions made in research labs about collecting and sharing data to choices made in editorial rooms about which data stories to tell. In this paper, we examine how data visualizations about climate change and COVID-19 are produced in popular science magazines, using Scientific American, an established English-language popular science magazine, as a case study. To do this, we apply the analytical concept of data journeys (Leonelli, 2020) in a mixed methods study that centers on interviews with Scientific American staff and is supplemented by a visualization analysis of selected charts. In particular, we discuss the affordances of working with open data, the role of collaborative data practices, and how the magazine works to counter misinformation and increase transparency. This work provides an empirical contribution by providing insight into the data (visualization) practices of science communicators and demonstrating how the concept of data journeys can be used as an analytical framework.
△ Less
Submitted 27 March, 2024; v1 submitted 27 October, 2023;
originally announced October 2023.
-
Subjective visualization experiences: impact of visual design and experimental design
Authors:
Laura Koesten,
Drew Dimmery,
Michael Gleicher,
Torsten Möller
Abstract:
In contrast to objectively measurable aspects (such as accuracy, reading speed, or memorability), the subjective experience of visualizations has only recently gained importance, and we have less experience how to measure it. We explore how subjective experience is affected by chart design using multiple experimental methods. We measure the effects of changes in color, orientation, and source anno…
▽ More
In contrast to objectively measurable aspects (such as accuracy, reading speed, or memorability), the subjective experience of visualizations has only recently gained importance, and we have less experience how to measure it. We explore how subjective experience is affected by chart design using multiple experimental methods. We measure the effects of changes in color, orientation, and source annotation on the perceived readability and trustworthiness of simple bar charts. Three different experimental designs (single image rating, forced choice comparison, and semi-structured interviews) provide similar but different results. We find that these subjective experiences are different from what prior work on objective dimensions would predict. Seemingly inconsequential choices, like orientation, have large effects for some methods, indicating that study design alters decision-making strategies. Next to insights into the effect of chart design, we provide methodological insights, such as a suggested need to carefully isolate individual elements in charts to study subjective experiences.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
The Gulf of Interpretation: From Chart to Message and Back Again
Authors:
Christian Knoll,
Torsten Möller,
Kathleen Gregory,
Laura Koesten
Abstract:
Charts are used to communicate data visually, but designing an effective chart that a broad set of people can understand is challenging. Usually, we do not know whether a chart's intended message aligns with the message readers perceive. In this mixed-methods study, we investigate how data journalists encode data and how a broad audience engages with, experiences, and understands these data visual…
▽ More
Charts are used to communicate data visually, but designing an effective chart that a broad set of people can understand is challenging. Usually, we do not know whether a chart's intended message aligns with the message readers perceive. In this mixed-methods study, we investigate how data journalists encode data and how a broad audience engages with, experiences, and understands these data visualizations. We conducted a series of workshops and interviews with school students, university students, job seekers, designers, and senior citizens to collect perceived messages and subjective feedback on a sample of eight real-world charts. We analyzed these messages and compared them to the intended message of the chart producer. Four of the collected messages from consumers were then provided to data journalists (including the ones that created the original charts) as a starting point to re-design the charts accordingly. The results from our work underline the difficulty of complex charts such as stacked bar charts and Sankey diagrams. Consumers are often overwhelmed with the amount of data provided and are easily confused with terms (as text) not well known. Chart producers tend to be faithful with data but are willing to abstract further when asked to transport particular messages visually. There are strong conventions on how to visually encode particular information that might not be to the benefit of many consumers.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Who is the Audience? Designing Casual Data Visualizations for the 'General Public'
Authors:
Regina Schuster,
Laura Koesten,
Torsten Möller,
Kathleen Gregory
Abstract:
Casual data visualizations play a vital role in communicating data to lay audiences. Despite this, little is known about how data visualization practitioners make design decisions based on their envisioned target audiences using different media channels. We draw on the findings of a semi-structured interview study to explore how data visualization practitioners working in various settings conceptu…
▽ More
Casual data visualizations play a vital role in communicating data to lay audiences. Despite this, little is known about how data visualization practitioners make design decisions based on their envisioned target audiences using different media channels. We draw on the findings of a semi-structured interview study to explore how data visualization practitioners working in various settings conceptualize and design for lay audiences and how they evaluate their visualization designs. Our findings suggest that practitioners often use broad definitions of their target audience, yet they stress the importance of 'knowing the readers' for their design decisions. At the same time, commonly used evaluation and feedback mechanisms do not allow a deep knowledge of their readers but rely instead on tacit knowledge, simple usage metrics, or testing with colleagues. We conclude by calling for different forms of visualization evaluation that are feasible for practitioners to implement in their daily workflows.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
Applying Interdisciplinary Frameworks to Understand Algorithmic Decision-Making
Authors:
Timothée Schmude,
Laura Koesten,
Torsten Möller,
Sebastian Tschiatschek
Abstract:
We argue that explanations for "algorithmic decision-making" (ADM) systems can profit by adopting practices that are already used in the learning sciences. We shortly introduce the importance of explaining ADM systems, give a brief overview of approaches drawing from other disciplines to improve explanations, and present the results of our qualitative task-based study incorporating the "six facets…
▽ More
We argue that explanations for "algorithmic decision-making" (ADM) systems can profit by adopting practices that are already used in the learning sciences. We shortly introduce the importance of explaining ADM systems, give a brief overview of approaches drawing from other disciplines to improve explanations, and present the results of our qualitative task-based study incorporating the "six facets of understanding" framework. We close with questions guiding the discussion of how future studies can leverage an interdisciplinary approach.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
"The main message is that sustainability would help" -- Reflections on takeaway messages of climate change data visualizations
Authors:
Regina Schuster,
Laura Koesten,
Kathleen Gregory,
Torsten Möller
Abstract:
How do different audiences make sense of climate change data visualizations and what do they take away as a main message? To investigate this question, we are building on the results of a previous study, focusing on expert opinions regarding public climate change communication and the role of data visualizations. Hereby, we conducted semi-structured interviews with 17 experts in the fields of clim…
▽ More
How do different audiences make sense of climate change data visualizations and what do they take away as a main message? To investigate this question, we are building on the results of a previous study, focusing on expert opinions regarding public climate change communication and the role of data visualizations. Hereby, we conducted semi-structured interviews with 17 experts in the fields of climate change, science communication, or data visualization. We also interviewed six lay persons with no professional background in either of these areas. With this analysis, we aim to shed light on how lay audiences arrive at an understanding of climate change data visualizations and what they take away as a main message. For two exemplary data visualizations, we compare their takeaway messages with messages formulated by experts. Through a thematic analysis, we observe differences regarding the included contents, the length and abstraction of messages, and the sensemaking process between and among the participant groups.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
What is the message? Perspectives on Visual Data Communication
Authors:
Laura Koesten,
Kathleen Gregory,
Regina Schuster,
Christian Knoll,
Sarah Davies,
Torsten Möller
Abstract:
Data visualizations are used to communicate messages to diverse audiences. It is unclear whether interpretations of these visualizations match the messages their creators aim to convey. In a mixed-methods study, we investigate how data in the popular science magazine Scientific American are visually communicated and understood. We first analyze visualizations about climate change and pandemics pub…
▽ More
Data visualizations are used to communicate messages to diverse audiences. It is unclear whether interpretations of these visualizations match the messages their creators aim to convey. In a mixed-methods study, we investigate how data in the popular science magazine Scientific American are visually communicated and understood. We first analyze visualizations about climate change and pandemics published in the magazine over a fifty-year period. Acting as chart readers, we then interpret visualizations with and without textual elements, identifying takeaway messages and creating field notes. Finally, we compare a sample of our interpreted messages to the intended messages of chart producers, drawing on interviews conducted with magazine staff. These data allow us to explore understanding visualizations through three perspectives: that of the charts, visualization readers, and visualization producers. Building on our findings from a thematic analysis, we present in-depth insights into data visualization sensemaking, particularly regarding the role of messages and textual elements; we propose a message typology, and we consider more broadly how messages can be conceptualized and understood.
△ Less
Submitted 12 April, 2023;
originally announced April 2023.
-
Passionate Charts: Arguments for Empathetic Emotions in Data Vis
Authors:
Verena Ingrid Prantl,
Torsten Moeller,
Laura Koesten
Abstract:
Aristotle has considered the art of communication as a balance of logos, ethos, and pathos. While in science, logos (reason) and, recently also, ethos (morality) are discussed as aspects not to be neglected, pathos (feeling) is seen critically. In this work, we take a historical perspective on pathos and weigh the pros and cons of applying this rhetorical concept to the field of data visualization…
▽ More
Aristotle has considered the art of communication as a balance of logos, ethos, and pathos. While in science, logos (reason) and, recently also, ethos (morality) are discussed as aspects not to be neglected, pathos (feeling) is seen critically. In this work, we take a historical perspective on pathos and weigh the pros and cons of applying this rhetorical concept to the field of data visualizations. To better understand data, connecting it to the human way of thinking is imperative - appealing to emotions is one building block. The theoretical and empirical basis originates from different scientific fields, like social sciences, economics, and humanities. Tangible techniques to target empathetic emotions in data visualizations are introduced, as well as other rhetorical devices, such as interactivity and contextual framing, are highlighted. Researching these different approaches can provide new insights regarding the creation and influence of empathetic emotions in data visualizations.
△ Less
Submitted 26 June, 2023; v1 submitted 8 April, 2023;
originally announced April 2023.
-
On the Impact of Explanations on Understanding of Algorithmic Decision-Making
Authors:
Timothée Schmude,
Laura Koesten,
Torsten Möller,
Sebastian Tschiatschek
Abstract:
Ethical principles for algorithms are gaining importance as more and more stakeholders are affected by "high-risk" algorithmic decision-making (ADM) systems. Understanding how these systems work enables stakeholders to make informed decisions and to assess the systems' adherence to ethical values. Explanations are a promising way to create understanding, but current explainable artificial intellig…
▽ More
Ethical principles for algorithms are gaining importance as more and more stakeholders are affected by "high-risk" algorithmic decision-making (ADM) systems. Understanding how these systems work enables stakeholders to make informed decisions and to assess the systems' adherence to ethical values. Explanations are a promising way to create understanding, but current explainable artificial intelligence (XAI) research does not always consider existent theories on how understanding is formed and evaluated. In this work, we aim to contribute to a better understanding of understanding by conducting a qualitative task-based study with 30 participants, including users and affected stakeholders. We use three explanation modalities (textual, dialogue, and interactive) to explain a "high-risk" ADM system to participants and analyse their responses both inductively and deductively, using the "six facets of understanding" framework by Wiggins & McTighe. Our findings indicate that the "six facets" framework is a promising approach to analyse participants' thought processes in understanding, providing categories for both rational and emotional understanding. We further introduce the "dialogue" modality as a valid explanation approach to increase participant engagement and interaction with the "explainer", allowing for more insight into their understanding in the process. Our analysis further suggests that individuality in understanding affects participants' perceptions of algorithmic fairness, demonstrating the interdependence between understanding and ADM assessment that previous studies have outlined. We posit that drawing from theories on learning and understanding like the "six facets" and leveraging explanation modalities can guide XAI research to better suit explanations to learning processes of individuals and consequently enable their assessment of ethical values of ADM systems.
△ Less
Submitted 17 May, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
"Being Simple on Complex Issues" -- Accounts on Visual Data Communication about Climate Change
Authors:
Regina Schuster,
Kathleen Gregory,
Torsten Möller,
Laura Koesten
Abstract:
Data visualizations play a critical role in both communicating scientific evidence about climate change and in stimulating engagement and action. To investigate how visualizations can be better utilized to communicate the complexities of climate change to different audiences, we conducted interviews with 17 experts in the fields of climate change, data visualization, and science communication, as…
▽ More
Data visualizations play a critical role in both communicating scientific evidence about climate change and in stimulating engagement and action. To investigate how visualizations can be better utilized to communicate the complexities of climate change to different audiences, we conducted interviews with 17 experts in the fields of climate change, data visualization, and science communication, as well as with 12 laypersons. Besides questions about climate change communication and various aspects of data visualizations, we also asked participants to share what they think is the main takeaway message for two exemplary climate change data visualizations. Through a thematic analysis, we observe differences regarding the included contents, the length and abstraction of messages, and the sensemaking process between and among the participant groups. On average, experts formulated shorter and more abstract messages, often referring to higher-level conclusions rather than specific details. We use our findings to reflect on design decisions for creating more effective visualizations, particularly in news media sources geared toward lay audiences. We hereby discuss the adaption of contents according to the needs of the audience, the trade-off between simplification and accuracy, as well as techniques to make a visualization attractive.
△ Less
Submitted 6 February, 2024; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Talking datasets: Understanding data sensemaking behaviours
Authors:
Laura Koesten,
Kathleen Gregory,
Paul Groth,
Elena Simperl
Abstract:
The sharing and reuse of data are seen as critical to solving the most complex problems of today. Despite this potential, relatively little is known about a key step in data reuse: people's behaviours involved in data-centric sensemaking. We aim to address this gap by presenting a mixed-methods study combining in-depth interviews, a think-aloud task and a screen recording analysis with 31 research…
▽ More
The sharing and reuse of data are seen as critical to solving the most complex problems of today. Despite this potential, relatively little is known about a key step in data reuse: people's behaviours involved in data-centric sensemaking. We aim to address this gap by presenting a mixed-methods study combining in-depth interviews, a think-aloud task and a screen recording analysis with 31 researchers as they summarised and interacted with both familiar and unfamiliar data. We use our findings to identify and detail common activity patterns and necessary data attributes across three clusters of sensemaking activities: inspecting data, engaging with content, and placing data within broader contexts. We conclude by proposing design recommendations for tools and documentation practices which can be used to facilitate sensemaking and subsequent data reuse.
△ Less
Submitted 18 July, 2020; v1 submitted 20 November, 2019;
originally announced November 2019.
-
Dataset search: a survey
Authors:
Adriane Chapman,
Elena Simperl,
Laura Koesten,
George Konstantinidis,
Luis-Daniel Ibáñez-Gonzalez,
Emilia Kacprzak,
Paul Groth
Abstract:
Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway to encourage data sharing and reuse, from scientific publishers asking authors to submit data alongside manuscripts to data marketplaces, open data portals and data communities. Google recently beta released a search service for datasets, which allows users to discover data s…
▽ More
Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway to encourage data sharing and reuse, from scientific publishers asking authors to submit data alongside manuscripts to data marketplaces, open data portals and data communities. Google recently beta released a search service for datasets, which allows users to discover data stored in various online repositories via keyword queries. These developments foreshadow an emerging research field around dataset search or retrieval that broadly encompasses frameworks, methods and tools that help match a user data need against a collection of datasets. Here, we survey the state of the art of research and commercial systems in dataset retrieval. We identify what makes dataset search a research field in its own right, with unique challenges and methods and highlight open problems. We look at approaches and implementations from related areas dataset search is drawing upon, including information retrieval, databases, entity-centric and tabular search in order to identify possible paths to resolve these open problems as well as immediate next steps that will take the field forward.
△ Less
Submitted 3 January, 2019;
originally announced January 2019.
-
Everything you always wanted to know about a dataset: studies in data summarisation
Authors:
Laura Koesten,
Elena Simperl,
Emilia Kacprzak,
Tom Blount,
Jeni Tennison
Abstract:
Summarising data as text helps people make sense of it. It also improves data discovery, as search algorithms can match this text against keyword queries. In this paper, we explore the characteristics of text summaries of data in order to understand how meaningful summaries look like. We present two complementary studies: a data-search diary study with 69 students, which offers insight into the in…
▽ More
Summarising data as text helps people make sense of it. It also improves data discovery, as search algorithms can match this text against keyword queries. In this paper, we explore the characteristics of text summaries of data in order to understand how meaningful summaries look like. We present two complementary studies: a data-search diary study with 69 students, which offers insight into the information needs of people searching for data; and a summarisation study, with a lab and a crowdsourcing component with overall 80 data-literate participants, which produced summaries for 25 datasets. In each study we carried out a qualitative analysis to identify key themes and commonly mentioned dataset attributes, which people consider when searching and making sense of data. The results helped us design a template to create more meaningful textual representations of data, alongside guidelines for improving data-search experience overall.
△ Less
Submitted 23 October, 2018;
originally announced October 2018.
-
DATA:SEARCH'18 -- Searching Data on the Web
Authors:
Paul Groth,
Laura Koesten,
Philipp Mayr,
Maarten de Rijke,
Elena Simperl
Abstract:
This half day workshop explores challenges in data search, with a particular focus on data on the web. We want to stimulate an interdisciplinary discussion around how to improve the description, discovery, ranking and presentation of structured and semi-structured data, across data formats and domain applications. We welcome contributions describing algorithms and systems, as well as frameworks an…
▽ More
This half day workshop explores challenges in data search, with a particular focus on data on the web. We want to stimulate an interdisciplinary discussion around how to improve the description, discovery, ranking and presentation of structured and semi-structured data, across data formats and domain applications. We welcome contributions describing algorithms and systems, as well as frameworks and studies in human data interaction. The workshop aims to bring together communities interested in making the web of data more discoverable, easier to search and more user friendly.
△ Less
Submitted 30 May, 2018;
originally announced May 2018.