Showing 1–23 of 23 results for author: Hecht, B

  1. A Canary in the AI Coal Mine: American Jews May Be Disproportionately Harmed by Intellectual Property Dispossession in Large Language Model Training

    Authors: Heila Precel, Allison McDonald, Brent Hecht, Nicholas Vincent

    Abstract: Systemic property dispossession from minority groups has often been carried out in the name of technological progress. In this paper, we identify evidence that the current paradigm of large language models (LLMs) likely continues this long history. Examining common LLM training datasets, we find that a disproportionate amount of content authored by Jewish Americans is used for training without the…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Preprint, to appear in CHI 2024 proceedings

  2. arXiv:2403.12388  [pdf, other]

    cs.IR cs.AI

    Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models

    Authors: Ying-Chun Lin, Jennifer Neville, Jack W. Stokes, Longqi Yang, Tara Safavi, Mengting Wan, Scott Counts, Siddharth Suri, Reid Andersen, Xiaofeng Xu, Deepak Gupta, Sujay Kumar Jauhar, Xia Song, Georg Buscher, Saurabh Tiwary, Brent Hecht, Jaime Teevan

    Abstract: Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featur…

    Submitted 8 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  3. The Dimensions of Data Labor: A Road Map for Researchers, Activists, and Policymakers to Empower Data Producers

    Authors: Hanlin Li, Nicholas Vincent, Stevie Chancellor, Brent Hecht

    Abstract: Many recent technological advances (e.g. ChatGPT and search engines) are possible only because of massive amounts of user-generated data produced through user interactions with computing systems or scraped from the web (e.g. behavior logs, user-generated content, and artwork). However, data producers have little say in what data is captured, how it is used, or who it benefits. Organizations with t…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: To appear at the 2023 ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT)

  4. arXiv:2207.04049  [pdf, other]

    cs.LG cs.AI

    Learning Causal Effects on Hypergraphs

    Authors: Jing Ma, Mengting Wan, Longqi Yang, Jundong Li, Brent Hecht, Jaime Teevan

    Abstract: Hypergraphs provide an effective abstraction for modeling multi-way group interactions among nodes, where each hyperedge can connect any number of nodes. Different from most existing studies which leverage statistical dependencies, we study hypergraphs from the perspective of causality. Specifically, in this paper, we focus on the problem of individual treatment effect (ITE) estimation on hypergra…

    Submitted 7 July, 2022; originally announced July 2022.

  5. arXiv:2205.14529  [pdf]

    cs.HC

    All That's Happening behind the Scenes: Putting the Spotlight on Volunteer Moderator Labor in Reddit

    Authors: Hanlin Li, Brent Hecht, Stevie Chancellor

    Abstract: Online volunteers are an uncompensated yet valuable labor force for many social platforms. For example, volunteer content moderators perform a vast amount of labor to maintain online communities. However, as social platforms like Reddit favor revenue generation and user engagement, moderators are under-supported to manage the expansion of online communities. To preserve these online communities, d…

    Submitted 5 June, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: This is a preprint. The paper will be presented at the 2022 International Conference on Web and Social Media (ICWSM'22)

  6. arXiv:2205.14528  [pdf]

    cs.HC

    Measuring the Monetary Value of Online Volunteer Work

    Authors: Hanlin Li, Brent Hecht, Stevie Chancellor

    Abstract: Online volunteers are a crucial labor force that keeps many for-profit systems afloat (e.g. social media platforms and online review sites). Despite their substantial role in upholding highly valuable technological systems, online volunteers have no way of knowing the value of their work. This paper uses content moderation as a case study and measures its monetary value to make apparent volunteer…

    Submitted 5 June, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: This is a preprint. The paper will be presented at the 2022 International Conference on Web and Social Media (ICWSM'22)

  7. arXiv:2112.09544  [pdf]

    cs.CY

    It's Time to Do Something: Mitigating the Negative Impacts of Computing Through a Change to the Peer Review Process

    Authors: Brent Hecht, Lauren Wilcox, Jeffrey P. Bigham, Johannes Schöning, Ehsan Hoque, Jason Ernst, Yonatan Bisk, Luigi De Russis, Lana Yarosh, Bushra Anjum, Danish Contractor, Cathy Wu

    Abstract: The computing research community needs to work much harder to address the downsides of our innovations. Between the erosion of privacy, threats to democracy, and automation's effect on employment (among many other issues), we can no longer simply assume that our research will have a net positive impact on the world. While bending the arc of computing innovation towards societal benefit may at firs…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: First published on the ACM Future of Computing Academy blog on March 29, 2018. This is the archival version

  8. Learning to Represent Human Motives for Goal-directed Web Browsing

    Authors: Jyun-Yu Jiang, Chia-Jung Lee, Longqi Yang, Bahareh Sarrafzadeh, Brent Hecht, Jaime Teevan

    Abstract: Motives or goals are recognized in psychology literature as the most fundamental drive that explains and predicts why people do what they do, including when they browse the web. Although providing enormous value, these higher-ordered goals are often unobserved, and little is known about how to leverage such goals to assist people's browsing activities. This paper proposes to take a new approach to…

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: Accepted by RecSys 2021

  9. Large Scale Analysis of Multitasking Behavior During Remote Meetings

    Authors: Hancheng Cao, Chia-Jung Lee, Shamsi Iqbal, Mary Czerwinski, Priscilla Wong, Sean Rintel, Brent Hecht, Jaime Teevan, Longqi Yang

    Abstract: Virtual meetings are critical for remote work because of the need for synchronous collaboration in the absence of in-person interactions. In-meeting multitasking is closely linked to people's productivity and wellbeing. However, we currently have limited understanding of multitasking in remote meetings and its potential impact. In this paper, we present what we believe is the most comprehensive st…

    Submitted 28 January, 2021; originally announced January 2021.

    Comments: In ACM CHI 2021

  10. arXiv:2012.09995  [pdf, other]

    cs.CY

    Data Leverage: A Framework for Empowering the Public in its Relationship with Technology Companies

    Authors: Nicholas Vincent, Hanlin Li, Nicole Tilly, Stevie Chancellor, Brent Hecht

    Abstract: Many powerful computing technologies rely on implicit and explicit data contributions from the public. This dependency suggests a potential source of leverage for the public in its relationship with technology companies: by reducing, stopping, redirecting, or otherwise manipulating data contributions, the public can reduce the effectiveness of many lucrative technologies. In this paper, we synthes…

    Submitted 17 February, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: This is a preprint. The paper will be presented at the 2021 Conference on Fairness, Accountability, and Transparency (FAccT 2021)

  11. Behavioral Use Licensing for Responsible AI

    Authors: Danish Contractor, Daniel McDuff, Julia Haines, Jenny Lee, Christopher Hines, Brent Hecht, Nicholas Vincent, Hanlin Li

    Abstract: With the growing reliance on artificial intelligence (AI) for many different applications, the sharing of code, data, and models is important to ensure the replicability and democratization of scientific knowledge. Many high-profile academic publishing venues expect code and models to be submitted and released with papers. Furthermore, developers often want to release these assets to encourage dev…

    Submitted 20 October, 2022; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Paper published at ACM FAccT 2022

  12. arXiv:2007.15584  [pdf]

    cs.CY cs.HC cs.SE

    How Work From Home Affects Collaboration: A Large-Scale Study of Information Workers in a Natural Experiment During COVID-19

    Authors: Longqi Yang, Sonia Jaffe, David Holtz, Siddharth Suri, Shilpi Sinha, Jeffrey Weston, Connor Joyce, Neha Shah, Kevin Sherman, CJ Lee, Brent Hecht, Jaime Teevan

    Abstract: The COVID-19 pandemic has had a wide-ranging impact on information workers such as higher stress levels, increased workloads, new workstreams, and more caregiving responsibilities during lockdown. COVID-19 also caused the overwhelming majority of information workers to rapidly shift to working from home (WFH). The central question this work addresses is: can we isolate the effects of WFH on inform…

    Submitted 30 July, 2020; originally announced July 2020.

    Journal ref: Nature Human Behaviour (2021)

  13. arXiv:2006.03196  [pdf, other]

    cs.HC

    Towards Better Driver Safety: Empowering Personal Navigation Technologies with Road Safety Awareness

    Authors: Runsheng Xu, Shibo Zhang, Yue Zhao, Peixi Xiong, Allen Yilun Lin, Brent Hecht, Jiaqi Ma

    Abstract: Recent research has found that navigation systems usually assume that all roads are equally safe, directing drivers to dangerous routes, which led to catastrophic consequences. To address this problem, this paper aims to begin the process of adding road safety awareness to navigation systems. To do so, we first created a definition for road safety that navigation systems can easily understand by a…

    Submitted 5 December, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: Submitted to Autonomous Intelligent System Journal

  14. arXiv:2004.10265  [pdf]

    cs.CY cs.IR

    A Deeper Investigation of the Importance of Wikipedia Links to the Success of Search Engines

    Authors: Nicholas Vincent, Brent Hecht

    Abstract: A growing body of work has highlighted the important role that Wikipedia's volunteer-created content plays in helping search engines achieve their core goal of addressing the information needs of millions of people. In this paper, we report the results of an investigation into the incidence of Wikipedia links in search engine results pages (SERPs). Our results extend prior work by considering thre…

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: This is a pre-print of a paper accepted to the non-archival track of the WikiWorkshop at the Web Conference 2020

  15. arXiv:1912.00757  [pdf]

    cs.CY

    Mapping the Potential and Pitfalls of "Data Dividends" as a Means of Sharing the Profits of Artificial Intelligence

    Authors: Nicholas Vincent, Yichun Li, Renee Zha, Brent Hecht

    Abstract: Identifying strategies to more broadly distribute the economic winnings of AI technologies is a growing priority in HCI and other fields. One idea gaining prominence centers on "data dividends", or sharing the profits of AI technologies with the people who generated the data on which these technologies rely. Despite the rapidly growing discussion around data dividends - including backing by promin…

    Submitted 18 November, 2019; originally announced December 2019.

    Comments: This is a working draft. It has not been peer-reviewed and is intended for internal discussion in the computing community

  16. arXiv:1908.10954  [pdf]

    cs.HC cs.CY cs.SI

    Not at Home on the Range: Peer Production and the Urban/Rural Divide

    Authors: Isaac Johnson, Allen Yilun Lin, Toby Jia-Jun Li, Andrew Hall, Aaron Halfaker, Johannes Schöning, Brent Hecht

    Abstract: Wikipedia articles about places, OpenStreetMap features, and other forms of peer-produced content have become critical sources of geographic knowledge for humans and intelligent technologies. In this paper, we explore the effectiveness of the peer production model across the rural/urban divide, a divide that has been shown to be an important factor in many online social systems. We find that in bo…

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: 10 pages, published at CHI'16

    ACM Class: H.5.m

    Journal ref: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems

  17. arXiv:1906.08576  [pdf]

    cs.CY

    Measuring the Importance of User-Generated Content to Search Engines

    Authors: Nicholas Vincent, Isaac Johnson, Patrick Sheehan, Brent Hecht

    Abstract: Search engines are some of the most popular and profitable intelligent technologies in existence. Recent research, however, has suggested that search engines may be surprisingly dependent on user-created content like Wikipedia articles to address user information needs. In this paper, we perform a rigorous audit of the extent to which Google leverages Wikipedia and other user-generated content to…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: This version includes a bibliography entry that was missing from the first version of the text due to a processing error. This is a preprint of a paper accepted at ICWSM 2019. Please cite that version instead

  18. Pharos: improving navigation instructions on smartwatches by including global landmarks

    Authors: N. Wenig, D. Wenig, S. Ernst, R. Malaka, B. Hecht, J. Schöning

    Abstract: Landmark-based navigation systems have proven benefits relative to traditional turn-by-turn systems that use street names and distances. However, one obstacle to the implementation of landmark-based navigation systems is the complex challenge of selecting salient local landmarks at each decision point for each user. In this paper, we present Pharos, a novel system that extends turn-by-turn navigat…

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: MobileHCI 2017 Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services

  19. The Tower of Babel Meets Web 2.0: User-Generated Content and its Applications in a Multilingual Context

    Authors: B. Hecht, D. Gergle

    Abstract: This study explores language's fragmenting effect on user-generated content by examining the diversity of knowledge representations across 25 different Wikipedia language editions. This diversity is measured at two levels: the concepts that are included in each edition and the ways in which these concepts are described. We demonstrate that the diversity present is greater than has been presumed in…

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: CHI 2010 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

  20. SubwayPS: Towards Enabling Smartphone Positioning in Underground Public Transportation Systems

    Authors: T. Stockx, B. Hecht, J. Schöning

    Abstract: Thanks to rapid advances in technologies like GPS and Wi-Fi positioning, smartphone users are able to determine their location almost everywhere they go. This is not true, however, of people who are traveling in underground public transportation networks, one of the few types of high-traffic areas where smartphones do not have access to accurate position information. In this paper, we introduce th…

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems 2014 (ACM SIGSPATIAL 2014)

  21. Helping Computers Understand Geographically-Bound Activity Restrictions

    Authors: M. Soll, P. Naumann, J. Schöning, P. Samsonov, B. Hecht

    Abstract: The lack of certain types of geographic data prevents the development of location-aware technologies in a number of important domains. One such type of "unmapped" geographic data is space usage rules (SURs), which are defined as geographically-bound activity restrictions (e.g. "no dogs", "no smoking", "no fishing", "no skateboarding"). Researchers in the area of human-computer interaction have rec…

    Submitted 2 April, 2019; originally announced April 2019.

    Journal ref: Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 2016)

  22. Improving Interaction with Virtual Globes through Spatial Thinking: Helping Users Ask "Why?"

    Authors: J. Schöning, B. Hecht, M. Raubal, A. Krüger, M. Marsh, M. Rohs

    Abstract: Virtual globes have progressed from little-known technology to broadly popular software in a mere few years. We investigated this phenomenon through a survey and discovered that, while virtual globes are en vogue, their use is restricted to a small set of tasks so simple that they do not involve any spatial thinking. Spatial thinking requires that users ask "what is where" and "why"; the most comm…

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: Proceedings of the International Conference on Intelligent User Interfaces (IUI 2008)

  23. The Geography of Pokémon GO: Beneficial and Problematic Effects on Places and Movement

    Authors: Ashley Colley, Jacob Thebault-Spieker, Allen Yilun Lin, Donald Degraen, Benjamin Fischman, Jonna Häkkilä, Kate Kuehl, Valentina Nisi, Nuno Jardim Nunes, Nina Wenig, Dirk Wenig, Brent Hecht, Johannes Schöning

    Abstract: The widespread popularity of Pokémon GO presents the first opportunity to observe the geographic effects of location-based gaming at scale. This paper reports the results of a mixed methods study of the geography of Pokémon GO that includes a five-country field survey of 375 Pokémon GO players and a large scale geostatistical analysis of game elements. Focusing on the key geographic themes of plac…

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: This version of the paper contains a fix for a reference issue that appeared in the original version. Proceedings of the 35th Annual ACM Conference on Human Factors in Computing Systems (CHI 2017)

    ACM Class: H.5.m