Skip to main content

Showing 1–13 of 13 results for author: Halfaker, A

  1. arXiv:2406.08453  [pdf, other

    cs.HC

    ORES-Inspect: A technology probe for machine learning audits on enwiki

    Authors: Zachary Levonian, Lauren Hagen, Lu Li, Jada Lilleboe, Solvejg Wastvedt, Aaron Halfaker, Loren Terveen

    Abstract: Auditing the machine learning (ML) models used on Wikipedia is important for ensuring that vandalism-detection processes remain fair and effective. However, conducting audits is challenging because stakeholders have diverse priorities and assembling evidence for a model's [in]efficacy is technically complex. We designed an interface to enable editors to learn about and audit the performance of the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Wiki Workshop 2024

    ACM Class: K.4.2

  2. Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia

    Authors: Tzu-Sheng Kuo, Aaron Halfaker, Zirui Cheng, Jiwoo Kim, Meng-Hsin Wu, Tongshuang Wu, Kenneth Holstein, Haiyi Zhu

    Abstract: AI tools are increasingly deployed in community contexts. However, datasets used to evaluate AI are typically created by developers and annotators outside a given community, which can yield misleading conclusions about AI performance. How might we empower communities to drive the intentional design and curation of evaluation datasets for AI that impacts them? We investigate this question on Wikipe… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems (CHI '24)

  3. arXiv:2307.15793  [pdf, other

    cs.HC cs.AI cs.IR

    Summaries, Highlights, and Action items: Design, implementation and evaluation of an LLM-powered meeting recap system

    Authors: Sumit Asthana, Sagih Hilleli, Pengcheng He, Aaron Halfaker

    Abstract: Meetings play a critical infrastructural role in the coordination of work. In recent years, due to shift to hybrid and remote work, more meetings are moving to online Computer Mediated Spaces. This has led to new problems (e.g. more time spent in less engaging meetings) and new opportunities (e.g. automated transcription/captioning and recap support). Recent advances in large language models (LLMs… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: in review for CSCW 23

  4. arXiv:2212.09968  [pdf, other

    cs.CL

    On Improving Summarization Factual Consistency from Natural Language Feedback

    Authors: Yixin Liu, Budhaditya Deb, Milagro Teruel, Aaron Halfaker, Dragomir Radev, Ahmed H. Awadallah

    Abstract: Despite the recent progress in language generation models, their outputs may not always meet user expectations. In this work, we study whether informational feedback in natural language can be leveraged to improve generation quality and user preference alignment. To this end, we consider factual consistency in summarization, the quality that the summary should only contain information supported by… ▽ More

    Submitted 16 October, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: ACL 2023 Camera Ready, GitHub Repo: https://github.com/microsoft/DeFacto

  5. Automatically Labeling Low Quality Content on Wikipedia by Leveraging Patterns in Editing Behaviors

    Authors: Sumit Asthana, Sabrina Tobar Thommel, Aaron Lee Halfaker, Nikola Banovic

    Abstract: Wikipedia articles aim to be definitive sources of encyclopedic content. Yet, only 0.6% of Wikipedia articles have high quality according to its quality scale due to insufficient number of Wikipedia editors and enormous number of articles. Supervised Machine Learning (ML) quality improvement approaches that can automatically identify and fix content issues rely on manual labels of individual Wikip… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

  6. arXiv:2006.03121  [pdf, other

    cs.CY cs.HC cs.LG cs.SI

    Effects of algorithmic flagging on fairness: quasi-experimental evidence from Wikipedia

    Authors: Nathan TeBlunthuis, Benjamin Mako Hill, Aaron Halfaker

    Abstract: Online community moderators often rely on social signals such as whether or not a user has an account or a profile page as clues that users may cause problems. Reliance on these clues can lead to overprofiling bias when moderators focus on these signals but overlook the misbehavior of others. We propose that algorithmic flagging systems deployed to improve the efficiency of moderation work can als… ▽ More

    Submitted 5 April, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: 27 pages, 11 figures, ACM CSCW

    ACM Class: K.4.3

    Journal ref: Proc. ACM Hum.-Comput. Interact. 5, CSCW1, Article 56 (April 2021), 27 pages

  7. Keeping Community in the Loop: Understanding Wikipedia Stakeholder Values for Machine Learning-Based Systems

    Authors: C. Estelle Smith, Bowen Yu, Anjali Srivastava, Aaron Halfaker, Loren Terveen, Haiyi Zhu

    Abstract: On Wikipedia, sophisticated algorithmic tools are used to assess the quality of edits and take corrective actions. However, algorithms can fail to solve the problems they were designed for if they conflict with the values of communities who use them. In this study, we take a Value-Sensitive Algorithm Design approach to understanding a community-created and -maintained machine learning-based algori… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 10 pages, 1 table, accepted paper to CHI 2020 conference

  8. arXiv:1909.05189  [pdf, other

    cs.HC cs.CY cs.LG

    ORES: Lowering Barriers with Participatory Machine Learning in Wikipedia

    Authors: Aaron Halfaker, R. Stuart Geiger

    Abstract: Algorithmic systems---from rule-based bots to machine learning classifiers---have a long history of supporting the essential work of content moderation and other curation work in peer production projects. From counter-vandalism to task routing, basic machine prediction has allowed open knowledge projects like Wikipedia to scale to the largest encyclopedia in the world, while maintaining quality an… ▽ More

    Submitted 20 August, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: 29 pages + 3 pages appendix. Currently under review

  9. arXiv:1908.10954  [pdf

    cs.HC cs.CY cs.SI

    Not at Home on the Range: Peer Production and the Urban/Rural Divide

    Authors: Isaac Johnson, Allen Yilun Lin, Toby Jia-Jun Li, Andrew Hall, Aaron Halfaker, Johannes Schöning, Brent Hecht

    Abstract: Wikipedia articles about places, OpenStreetMap features, and other forms of peer-produced content have become critical sources of geographic knowledge for humans and intelligent technologies. In this paper, we explore the effectiveness of the peer production model across the rural/urban divide, a divide that has been shown to be an important factor in many online social systems. We find that in bo… ▽ More

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: 10 pages, published on CHI'16

    ACM Class: H.5.m

    Journal ref: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems

  10. arXiv:1907.05131  [pdf, other

    cs.LG cs.HC

    PreCall: A Visual Interface for Threshold Optimization in ML Model Selection

    Authors: Christoph Kinkeldey, Claudia Müller-Birn, Tom Gülenman, Jesse Josua Benjamin, Aaron Halfaker

    Abstract: Machine learning systems are ubiquitous in various kinds of digital applications and have a huge impact on our everyday life. But a lack of explainability and interpretability of such systems hinders meaningful participation by people, especially by those without a technical background. Interactive visual interfaces (e.g., providing means for manipulating parameters in the user interface) can help… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: HCML Perspectives Workshop at CHI 2019, May 04, 2019, Glasgow

  11. arXiv:1810.07273  [pdf

    cs.CY cs.HC cs.SI

    Operationalizing Conflict and Cooperation between Automated Software Agents in Wikipedia: A Replication and Expansion of 'Even Good Bots Fight'

    Authors: R. Stuart Geiger, Aaron Halfaker

    Abstract: This paper replicates, extends, and refutes conclusions made in a study published in PLoS ONE ("Even Good Bots Fight"), which claimed to identify substantial levels of conflict between automated software agents (or bots) in Wikipedia using purely quantitative methods. By applying an integrative mixed-methods approach drawing on trace ethnography, we place these alleged cases of bot-bot conflict in… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: 33 pages. In ACM CSCW 2018

    Journal ref: Proc ACM on Human Computer Interaction. 1(2), Article 49. CSCW 2018

  12. Building automated vandalism detection tools for Wikidata

    Authors: Amir Sarabadani, Aaron Halfaker, Dario Taraborelli

    Abstract: Wikidata, like Wikipedia, is a knowledge base that anyone can edit. This open collaboration model is powerful in that it reduces barriers to participation and allows a large number of people to contribute. However, it exposes the knowledge base to the risk of vandalism and low-quality contributions. In this work, we build on past work detecting vandalism in Wikipedia to detect vandalism in Wikidat… ▽ More

    Submitted 10 March, 2017; originally announced March 2017.

  13. arXiv:1411.2878  [pdf, other

    cs.HC cs.SI

    User Session Identification Based on Strong Regularities in Inter-activity Time

    Authors: Aaron Halfaker, Os Keyes, Daniel Kluver, Jacob Thebault-Spieker, Tien Nguyen, Kenneth Shores, Anuradha Uduwage, Morten Warncke-Wang

    Abstract: Session identification is a common strategy used to develop metrics for web analytics and behavioral analyses of user-facing systems. Past work has argued that session identification strategies based on an inactivity threshold is inherently arbitrary or advocated that thresholds be set at about 30 minutes. In this work, we demonstrate a strong regularity in the temporal rhythms of user initiated e… ▽ More

    Submitted 4 August, 2019; v1 submitted 11 November, 2014; originally announced November 2014.

    Comments: 9 pages, 5 figures, 1 table

    ACM Class: H.1.1