-
What We Augment When We Augment Visualizations: A Design Elicitation Study of How We Visually Express Data Relationships
Authors:
Grace Guo,
John Stasko,
Alex Endert
Abstract:
Visual augmentations are commonly added to charts and graphs in order to convey richer and more nuanced information about relationships in the data. However, many design spaces proposed for categorizing augmentations were defined in a top-down manner, based on expert heuristics or from surveys of published visualizations. Less well understood are user preferences and intuitions when designing augm…
▽ More
Visual augmentations are commonly added to charts and graphs in order to convey richer and more nuanced information about relationships in the data. However, many design spaces proposed for categorizing augmentations were defined in a top-down manner, based on expert heuristics or from surveys of published visualizations. Less well understood are user preferences and intuitions when designing augmentations. In this paper, we address the gap by conducting a design elicitation study, where study participants were asked to draw the different ways they would visually express the meaning of ten different prompts. We obtained 364 drawings from the study, and identified the emergent categories of augmentations used by participants. The contributions of this paper are: (i) a user-defined design space of visualization augmentations, (ii) a repository of hand drawn augmentations made by study participants, and (iii) a discussion of insights into participant considerations, and connections between our study and existing design guidelines.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Understanding and Measuring the Effects of Graphical Dimensions on Viewers' Perceived Chart Credibility
Authors:
Hayeong Song,
John Stasko
Abstract:
Journalists and visualization designers include visualizations in their articles and storytelling tools to deliver their message effectively. But design decisions they make to represent information, such as the graphical dimensions they choose and the viewer's familiarity with the content can impact the viewer's perceived credibility of charts. Especially in a context where little is known about s…
▽ More
Journalists and visualization designers include visualizations in their articles and storytelling tools to deliver their message effectively. But design decisions they make to represent information, such as the graphical dimensions they choose and the viewer's familiarity with the content can impact the viewer's perceived credibility of charts. Especially in a context where little is known about sources of online information. But there is little experimental evidence that designers can refer to make decisions. Hence, this work aims to study and measure the effects of graphical dimensions and people's familiarity with the content on viewers' perceived chart credibility. I plan to conduct a crowd-sourced study with three graphical dimensions conditions, which are traditional charts, text annotation, and infographics. Then I will test these conditions on two user groups, which are domain experts and non-experts. With these results, this work aims to provide chart guidelines for visual designers with experimental evidence.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
Experts prefer text but videos help novices: an analysis of the utility of multi-media content
Authors:
Hayeong Song,
Jennifer Healey,
Alexa Siu,
Curtis Wigington,
John Stasko
Abstract:
Multi-media increases engagement and is increasingly prevalent in online content including news, web blogs, and social media, however, it may not always be beneficial to users. To determine what types of media users actually wanted, we conducted an exploratory study where users got to choose their own media augmentation. Our findings showed that users desired different amounts and types of media d…
▽ More
Multi-media increases engagement and is increasingly prevalent in online content including news, web blogs, and social media, however, it may not always be beneficial to users. To determine what types of media users actually wanted, we conducted an exploratory study where users got to choose their own media augmentation. Our findings showed that users desired different amounts and types of media depending on their familiarity with the content. To further investigate this difference, we created two versions of a media augmented document, one designed for novices and one designed for experts. We evaluated these prototypes in a two-way between-subject study with 48 participants and found that while multi-media enhanced novice readers' perception of usability (p = .0100) and helped them with reading time (p = .0427), time on task (p= .0156), comprehension (p = .0161), experts largely ignored multi-media and primarily utilized text.
△ Less
Submitted 23 April, 2023;
originally announced April 2023.
-
Understanding people's needs in viewing diverse social opinions about controversial topics
Authors:
Hayeong Song,
Zhengyang Qi,
John Stasko,
Diyi Yang
Abstract:
Social media (i.e., Reddit) users are overloaded with people's opinions when viewing discourses about divisive topics. Traditional user interfaces in such media present those opinions in a linear structure, which can limit users in viewing diverse social opinions at scale. Prior work has recognized this limitation, that the linear structure can reinforce biases, where a certain point of view becom…
▽ More
Social media (i.e., Reddit) users are overloaded with people's opinions when viewing discourses about divisive topics. Traditional user interfaces in such media present those opinions in a linear structure, which can limit users in viewing diverse social opinions at scale. Prior work has recognized this limitation, that the linear structure can reinforce biases, where a certain point of view becomes widespread simply because many viewers seem to believe it. This limitation can make it difficult for users to have a truly conversational mode of mediated discussion. Thus, when designing a user interface for viewing people's opinions, we should consider ways to mitigate selective exposure to information and polarization of opinions. We conducted a needs-finding study with 11 Reddit users, who follow climate change threads and make posts and comments regularly. In the study, we aimed to understand key limitations in people viewing online controversial discourses and to extract design implications to address these problems. Our findings discuss potential future directions to address these problems.
△ Less
Submitted 23 April, 2023;
originally announced April 2023.
-
Facilitating Conversational Interaction in Natural Language Interfaces for Visualization
Authors:
Rishab Mitra,
Arpit Narechania,
Alex Endert,
John Stasko
Abstract:
Natural language (NL) toolkits enable visualization developers, who may not have a background in natural language processing (NLP), to create natural language interfaces (NLIs) for end-users to flexibly specify and interact with visualizations. However, these toolkits currently only support one-off utterances, with minimal capability to facilitate a multi-turn dialog between the user and the syste…
▽ More
Natural language (NL) toolkits enable visualization developers, who may not have a background in natural language processing (NLP), to create natural language interfaces (NLIs) for end-users to flexibly specify and interact with visualizations. However, these toolkits currently only support one-off utterances, with minimal capability to facilitate a multi-turn dialog between the user and the system. Developing NLIs with such conversational interaction capabilities remains a challenging task, requiring implementations of low-level NLP techniques to process a new query as an intent to follow-up on an older query. We extend an existing Python-based toolkit, NL4DV, that processes an NL query about a tabular dataset and returns an analytic specification containing data attributes, analytic tasks, and relevant visualizations, modeled as a JSON object. Specifically, NL4DV now enables developers to facilitate multiple simultaneous conversations about a dataset and resolve associated ambiguities, augmenting new conversational information into the output JSON object. We demonstrate these capabilities through three examples: (1) an NLI to learn aspects of the Vega-Lite grammar, (2) a mind mapping application to create free-flowing conversations, and (3) a chatbot to answer questions and resolve ambiguities.
△ Less
Submitted 12 August, 2022; v1 submitted 30 June, 2022;
originally announced July 2022.
-
Collecting and Characterizing Natural Language Utterances for Specifying Data Visualizations
Authors:
Arjun Srinivasan,
Nikhila Nyapathy,
Bongshin Lee,
Steven M. Drucker,
John Stasko
Abstract:
Natural language interfaces (NLIs) for data visualization are becoming increasingly popular both in academic research and in commercial software. Yet, there is a lack of empirical understanding of how people specify visualizations through natural language. To bridge this gap, we conducted an online study with 102 participants. We showed participants a series of ten visualizations for a given datas…
▽ More
Natural language interfaces (NLIs) for data visualization are becoming increasingly popular both in academic research and in commercial software. Yet, there is a lack of empirical understanding of how people specify visualizations through natural language. To bridge this gap, we conducted an online study with 102 participants. We showed participants a series of ten visualizations for a given dataset and asked them to provide utterances they would pose to generate the displayed charts. The curated list of utterances generated from the study is provided below. This corpus of utterances can be used to evaluate existing NLIs for data visualization as well as for creating new systems and models to generate visualizations from natural language utterances.
△ Less
Submitted 1 October, 2021;
originally announced October 2021.
-
Understanding the Effects of Visualizing Missing Values on Visual Data Exploration
Authors:
Hayeong Song,
Yu Fu,
Bahador Saket,
John Stasko
Abstract:
When performing data analysis, people often confront data sets containing missing values. We conducted an empirical study to understand the effects of visualizing those missing values on participants' decision-making processes while performing a visual data exploration task. More specifically, our study participants purchased a hypothetical portfolio of stocks based on a dataset where some stocks…
▽ More
When performing data analysis, people often confront data sets containing missing values. We conducted an empirical study to understand the effects of visualizing those missing values on participants' decision-making processes while performing a visual data exploration task. More specifically, our study participants purchased a hypothetical portfolio of stocks based on a dataset where some stocks had missing values for attributes such as PE ratio, beta, and EPS. The experiment used scatterplots to communicate the stock data. For one group of participants, stocks with missing values simply were not shown, while the second group saw such stocks depicted with estimated values as points with error bars. We measured participants' cognitive load involved in decision-making with data with missing values. Our results indicate that their decision-making workflow was different across two conditions.
△ Less
Submitted 17 September, 2021;
originally announced September 2021.
-
Causal Perception in Question-Answering Systems
Authors:
Po-Ming Law,
Leo Yu-Ho Lo,
Alex Endert,
John Stasko,
Huamin Qu
Abstract:
Root cause analysis is a common data analysis task. While question-answering systems enable people to easily articulate a why question (e.g., why students in Massachusetts have high ACT Math scores on average) and obtain an answer, these systems often produce questionable causal claims. To investigate how such claims might mislead users, we conducted two crowdsourced experiments to study the impac…
▽ More
Root cause analysis is a common data analysis task. While question-answering systems enable people to easily articulate a why question (e.g., why students in Massachusetts have high ACT Math scores on average) and obtain an answer, these systems often produce questionable causal claims. To investigate how such claims might mislead users, we conducted two crowdsourced experiments to study the impact of showing different information on user perceptions of a question-answering system. We found that in a system that occasionally provided unreasonable responses, showing a scatterplot increased the plausibility of unreasonable causal claims. Also, simply warning participants that correlation is not causation seemed to lead participants to accept reasonable causal claims more cautiously. We observed a strong tendency among participants to associate correlation with causation. Yet, the warning appeared to reduce the tendency. Grounded in the findings, we propose ways to reduce the illusion of causality when using question-answering systems.
△ Less
Submitted 6 January, 2021; v1 submitted 28 December, 2020;
originally announced December 2020.
-
Characterizing Automated Data Insights
Authors:
Po-Ming Law,
Alex Endert,
John Stasko
Abstract:
Many researchers have explored tools that aim to recommend data insights to users. These tools automatically communicate a rich diversity of data insights and offer such insights for many different purposes. However, there is a lack of structured understanding concerning what researchers of these tools mean by "insight" and what tasks in the analysis workflow these tools aim to support. We conduct…
▽ More
Many researchers have explored tools that aim to recommend data insights to users. These tools automatically communicate a rich diversity of data insights and offer such insights for many different purposes. However, there is a lack of structured understanding concerning what researchers of these tools mean by "insight" and what tasks in the analysis workflow these tools aim to support. We conducted a systematic review of existing systems that seek to recommend data insights. Grounded in the review, we propose 12 types of automated insights and four purposes of automating insights. We further discuss the design opportunities emerged from our analysis.
△ Less
Submitted 4 September, 2020; v1 submitted 29 August, 2020;
originally announced August 2020.
-
What are Data Insights to Professional Visualization Users?
Authors:
Po-Ming Law,
Alex Endert,
John Stasko
Abstract:
While many visualization researchers have attempted to define data insights, little is known about how visualization users perceive them. We interviewed 23 professional users of end-user visualization platforms (e.g., Tableau and Power BI) about their experiences with data insights. We report on seven characteristics of data insights based on interviewees' descriptions. Grounded in these character…
▽ More
While many visualization researchers have attempted to define data insights, little is known about how visualization users perceive them. We interviewed 23 professional users of end-user visualization platforms (e.g., Tableau and Power BI) about their experiences with data insights. We report on seven characteristics of data insights based on interviewees' descriptions. Grounded in these characteristics, we propose practical implications for creating tools that aim to automatically communicate data insights to users.
△ Less
Submitted 4 October, 2020; v1 submitted 29 August, 2020;
originally announced August 2020.
-
NL4DV: A Toolkit for Generating Analytic Specifications for Data Visualization from Natural Language Queries
Authors:
Arpit Narechania,
Arjun Srinivasan,
John Stasko
Abstract:
Natural language interfaces (NLIs) have shown great promise for visual data analysis, allowing people to flexibly specify and interact with visualizations. However, developing visualization NLIs remains a challenging task, requiring low-level implementation of natural language processing (NLP) techniques as well as knowledge of visual analytic tasks and visualization design. We present NL4DV, a to…
▽ More
Natural language interfaces (NLIs) have shown great promise for visual data analysis, allowing people to flexibly specify and interact with visualizations. However, developing visualization NLIs remains a challenging task, requiring low-level implementation of natural language processing (NLP) techniques as well as knowledge of visual analytic tasks and visualization design. We present NL4DV, a toolkit for natural language-driven data visualization. NL4DV is a Python package that takes as input a tabular dataset and a natural language query about that dataset. In response, the toolkit returns an analytic specification modeled as a JSON object containing data attributes, analytic tasks, and a list of Vega-Lite specifications relevant to the input query. In doing so, NL4DV aids visualization developers who may not have a background in NLP, enabling them to create new visualization NLIs or incorporate natural language input within their existing systems. We demonstrate NL4DV's usage and capabilities through four examples: 1) rendering visualizations using natural language in a Jupyter notebook, 2) developing a NLI to specify and edit Vega-Lite charts, 3) recreating data ambiguity widgets from the DataTone system, and 4) incorporating speech input to create a multimodal visualization system.
△ Less
Submitted 23 November, 2020; v1 submitted 24 August, 2020;
originally announced August 2020.
-
Touch? Speech? or Touch and Speech? Investigating Multimodal Interaction for Visual Network Exploration and Analysis
Authors:
Ayshwarya Saktheeswaran,
Arjun Srinivasan,
John Stasko
Abstract:
Interaction plays a vital role during visual network exploration as users need to engage with both elements in the view (e.g., nodes, links) and interface controls (e.g., sliders, dropdown menus). Particularly as the size and complexity of a network grow, interactive displays supporting multimodal input (e.g., touch, speech, pen, gaze) exhibit the potential to facilitate fluid interaction during v…
▽ More
Interaction plays a vital role during visual network exploration as users need to engage with both elements in the view (e.g., nodes, links) and interface controls (e.g., sliders, dropdown menus). Particularly as the size and complexity of a network grow, interactive displays supporting multimodal input (e.g., touch, speech, pen, gaze) exhibit the potential to facilitate fluid interaction during visual network exploration and analysis. While multimodal interaction with network visualization seems like a promising idea, many open questions remain. For instance, do users actually prefer multimodal input over unimodal input, and if so, why? Does it enable them to interact more naturally, or does having multiple modes of input confuse users? To answer such questions, we conducted a qualitative user study in the context of a network visualization tool, comparing speech- and touch-based unimodal interfaces to a multimodal interface combining the two. Our results confirm that participants strongly prefer multimodal input over unimodal input attributing their preference to: 1) the freedom of expression, 2) the complementary nature of speech and touch, and 3) integrated interactions afforded by the combination of the two modalities. We also describe the interaction patterns participants employed to perform common network visualization operations and highlight themes for future multimodal network visualization systems to consider.
△ Less
Submitted 29 April, 2020;
originally announced April 2020.
-
Interweaving Multimodal Interaction with Flexible Unit Visualizations for Data Exploration
Authors:
Arjun Srinivasan,
Bongshin Lee,
John Stasko
Abstract:
Multimodal interfaces that combine direct manipulation and natural language have shown great promise for data visualization. Such multimodal interfaces allow people to stay in the flow of their visual exploration by leveraging the strengths of one modality to complement the weaknesses of others. In this work, we introduce an approach that interweaves multimodal interaction combining direct manipul…
▽ More
Multimodal interfaces that combine direct manipulation and natural language have shown great promise for data visualization. Such multimodal interfaces allow people to stay in the flow of their visual exploration by leveraging the strengths of one modality to complement the weaknesses of others. In this work, we introduce an approach that interweaves multimodal interaction combining direct manipulation and natural language with flexible unit visualizations. We employ the proposed approach in a proof-of-concept system, DataBreeze. Coupling pen, touch, and speech-based multimodal interaction with flexible unit visualizations, DataBreeze allows people to create and interact with both systematically bound (e.g., scatterplots, unit column charts) and manually customized views, enabling a novel visual data exploration experience. We describe our design process along with DataBreeze's interface and interactions, delineating specific aspects of the design that empower the synergistic use of multiple modalities. We also present a preliminary user study with DataBreeze, highlighting the data exploration patterns that participants employed. Finally, reflecting on our design process and preliminary user study, we discuss future research directions.
△ Less
Submitted 22 April, 2020;
originally announced April 2020.
-
Critical Reflections on Visualization Authoring Systems
Authors:
Arvind Satyanarayan,
Bongshin Lee,
Donghao Ren,
Jeffrey Heer,
John Stasko,
John Thompson,
Matthew Brehmer,
Zhicheng Liu
Abstract:
An emerging generation of visualization authoring systems support expressive information visualization without textual programming. As they vary in their visualization models, system architectures, and user interfaces, it is challenging to directly compare these systems using traditional evaluative methods. Recognizing the value of contextualizing our decisions in the broader design space, we pres…
▽ More
An emerging generation of visualization authoring systems support expressive information visualization without textual programming. As they vary in their visualization models, system architectures, and user interfaces, it is challenging to directly compare these systems using traditional evaluative methods. Recognizing the value of contextualizing our decisions in the broader design space, we present critical reflections on three systems we developed -- Lyra, Data Illustrator, and Charticulator. This paper surfaces knowledge that would have been daunting within the constituent papers of these three systems. We compare and contrast their (previously unmentioned) limitations and trade-offs between expressivity and learnability. We also reflect on common assumptions that we made during the development of our systems, thereby informing future research directions in visualization authoring systems.
△ Less
Submitted 31 July, 2019;
originally announced July 2019.
-
A User-based Visual Analytics Workflow for Exploratory Model Analysis
Authors:
Dylan Cashman,
Shah Rukh Humayoun,
Florian Heimerl,
Kendall Park,
Subhajit Das,
John Thompson,
Bahador Saket,
Abigail Mosca,
John Stasko,
Alex Endert,
Michael Gleicher,
Remco Chang
Abstract:
Many visual analytics systems allow users to interact with machine learning models towards the goals of data exploration and insight generation on a given dataset. However, in some situations, insights may be less important than the production of an accurate predictive model for future use. In that case, users are more interested in generating of diverse and robust predictive models, verifying the…
▽ More
Many visual analytics systems allow users to interact with machine learning models towards the goals of data exploration and insight generation on a given dataset. However, in some situations, insights may be less important than the production of an accurate predictive model for future use. In that case, users are more interested in generating of diverse and robust predictive models, verifying their performance on holdout data, and selecting the most suitable model for their usage scenario. In this paper, we consider the concept of Exploratory Model Analysis (EMA), which is defined as the process of discovering and selecting relevant models that can be used to make predictions on a data source. We delineate the differences between EMA and the well-known term exploratory data analysis in terms of the desired outcome of the analytic process: insights into the data or a set of deployable models. The contributions of this work are a visual analytics system workflow for EMA, a user study, and two use cases validating the effectiveness of the workflow. We found that our system workflow enabled users to generate complex models, to assess them for various qualities, and to select the most relevant model for their task.
△ Less
Submitted 29 July, 2019; v1 submitted 27 September, 2018;
originally announced September 2018.
-
Touching Data: A Discoverability-based Evaluation of a Visualization Interface for Tablet Computers
Authors:
Ramik Sadana,
Meeshu Agnihotri,
John Stasko
Abstract:
While a number of touch-based visualization systems have appeared in recent years, relatively little work has been done to evaluate these systems. The prevailing methods compare these systems to desktop-class applications or utilize traditional training-based usability studies. We argue that existing studies, while useful, fail to address a key aspect of mobile application usage - initial impressi…
▽ More
While a number of touch-based visualization systems have appeared in recent years, relatively little work has been done to evaluate these systems. The prevailing methods compare these systems to desktop-class applications or utilize traditional training-based usability studies. We argue that existing studies, while useful, fail to address a key aspect of mobile application usage - initial impression and discoverability-driven usability. Over the past few years, we have developed a tablet-based visualization system, Tangere, for analyzing tabular data in a multiple coordinated view configuration. This article describes a discoverability-based user study of Tangere in which the system is compared to a commercially available visualization system for tablets - Tableau's Vizable. The study highlights aspects of each system's design that resonate with study participants, and we reflect upon those findings to identify design principles for future tablet-based data visualization systems.
△ Less
Submitted 15 June, 2018;
originally announced June 2018.
-
Interactive Browsing and Navigation in Relational Databases
Authors:
Minsuk Kahng,
Shamkant B. Navathe,
John T. Stasko,
Duen Horng Chau
Abstract:
Although researchers have devoted considerable attention to helping database users formulate queries, many users still find it challenging to specify queries that involve joining tables. To help users construct join queries for exploring relational databases, we propose ETable, a novel presentation data model that provides users with a presentation-level interactive view. This view compactly prese…
▽ More
Although researchers have devoted considerable attention to helping database users formulate queries, many users still find it challenging to specify queries that involve joining tables. To help users construct join queries for exploring relational databases, we propose ETable, a novel presentation data model that provides users with a presentation-level interactive view. This view compactly presents one-to-many and many-to-many relationships within a single enriched table by allowing a cell to contain a set of entity references. Users can directly interact with this enriched table to incrementally construct complex queries and navigate databases on a conceptual entity-relationship level. In a user study, participants performed a range of database querying tasks faster with ETable than with a commercial graphical query builder. Subjective feedback about ETable was also positive. All participants found that ETable was easier to learn and helpful for exploring databases.
△ Less
Submitted 20 August, 2016; v1 submitted 7 March, 2016;
originally announced March 2016.