Skip to main content

Showing 1–9 of 9 results for author: Herman, B

  1. arXiv:2405.16820  [pdf, other

    cs.LG cs.AI cs.CY cs.HC

    Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings

    Authors: Robert Wolfe, Isaac Slaughter, Bin Han, Bingbing Wen, Yiwei Yang, Lucas Rosenblatt, Bernease Herman, Eva Brown, Zening Qu, Nic Weber, Bill Howe

    Abstract: The rapid proliferation of generative AI has raised questions about the competitiveness of lower-parameter, locally tunable, open-weight models relative to high-parameter, API-guarded, closed-weight models in terms of performance, domain adaptation, cost, and generalization. Centering under-resourced yet risk-intolerant settings in government, research, and healthcare, we see for-profit closed-wei… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted at the ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2024

  2. arXiv:2208.12700  [pdf, other

    cs.CR cs.CY

    Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy

    Authors: Lucas Rosenblatt, Bernease Herman, Anastasia Holovenko, Wonkwon Lee, Joshua Loftus, Elizabeth McKinnie, Taras Rumezhak, Andrii Stadnik, Bill Howe, Julia Stoyanovich

    Abstract: Differential privacy (DP) data synthesizers support public release of sensitive information, offering theoretical guarantees for privacy but limited evidence of utility in practical settings. Utility is typically measured as the error on representative proxy tasks, such as descriptive statistics, accuracy of trained classifiers, or performance over a query workload. The ability for these results t… ▽ More

    Submitted 31 May, 2023; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: Preprint. 14 pages

  3. arXiv:2205.11473  [pdf, other

    cs.LG cs.AI stat.ML

    Rethinking Streaming Machine Learning Evaluation

    Authors: Shreya Shankar, Bernease Herman, Aditya G. Parameswaran

    Abstract: While most work on evaluating machine learning (ML) models focuses on computing accuracy on batches of data, tracking accuracy alone in a streaming setting (i.e., unbounded, timestamp-ordered datasets) fails to appropriately identify when models are performing unexpectedly. In this position paper, we discuss how the nature of streaming ML problems introduces new real-world challenges (e.g., delaye… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: ML Evaluation Standards Workshop (ICLR 2022)

  4. arXiv:2010.08859  [pdf, other

    cs.HC

    Printmaking, Puzzles, and Studio Closets: Using Artistic Metaphors to Reimagine the User Interface for Designing Immersive Visualizations

    Authors: Bridger Herman, Francesca Samsel, Annie Bares, Seth Johnson, Greg Abram, Daniel F. Keefe

    Abstract: We, as a society, need artists to help us interpret and explain science, but what does an artist's studio look like when today's science is built upon the language of large, increasingly complex data? This paper presents a data visualization design interface that lifts the barriers for artists to engage with actively studied, 3D multivariate datasets. To accomplish this, the interface must weave t… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.

  5. arXiv:1912.02943  [pdf, other

    cs.CY cs.AI cs.HC cs.LG cs.SI

    An Algorithmic Equity Toolkit for Technology Audits by Community Advocates and Activists

    Authors: Michael Katell, Meg Young, Bernease Herman, Dharma Dailey, Aaron Tam, Vivian Guetler, Corinne Binz, Daniella Raz, P. M. Krafft

    Abstract: A wave of recent scholarship documenting the discriminatory harms of algorithmic systems has spurred widespread interest in algorithmic accountability and regulation. Yet effective accountability and regulation is stymied by a persistent lack of resources supporting public understanding of algorithms and artificial intelligence. Through interactions with a US-based civil rights organization and th… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  6. Artifact-Based Rendering: Harnessing Natural and Traditional Visual Media for More Expressive and Engaging 3D Visualizations

    Authors: Seth Johnson, Francesca Samsel, Gregory Abram, Daniel Olson, Andrew J. Solis, Bridger Herman, Phillip J. Wolfram, Christophe Lenglet, Daniel F. Keefe

    Abstract: We introduce Artifact-Based Rendering (ABR), a framework of tools, algorithms, and processes that makes it possible to produce real, data-driven 3D scientific visualizations with a visual language derived entirely from colors, lines, textures, and forms created using traditional physical media or found in nature. A theory and process for ABR is presented to address three current needs: (i) designi… ▽ More

    Submitted 15 October, 2019; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: Published in IEEE VIS 2019, 9 pages of content with 2 pages of references, 12 figures

  7. arXiv:1711.07414   

    cs.AI cs.LG stat.ML

    The Promise and Peril of Human Evaluation for Model Interpretability

    Authors: Bernease Herman

    Abstract: Transparency, user trust, and human comprehension are popular ethical motivations for interpretable machine learning. In support of these goals, researchers evaluate model explanation performance using humans and real world applications. This alone presents a challenge in many areas of artificial intelligence. In this position paper, we propose a distinction between descriptive and persuasive expl… ▽ More

    Submitted 30 October, 2019; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017 Symposium on Interpretable Machine Learning. I'm not happy with the writing and presentation of these ideas and hope to submit an updated and extended version in 2020

  8. arXiv:1710.08874  [pdf, other

    cs.CY

    Synthetic Data for Social Good

    Authors: Bill Howe, Julia Stoyanovich, Haoyue Ping, Bernease Herman, Matt Gee

    Abstract: Data for good implies unfettered access to data. But data owners must be conservative about how, when, and why they share data or risk violating the trust of the people they aim to help, losing their funding, or breaking the law. Data sharing agreements can help prevent privacy violations, but require a level of specificity that is premature during preliminary discussions, and can take over a year… ▽ More

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: Presented at the Data For Good Exchange 2017

  9. arXiv:1710.02447  [pdf, other

    cs.CY

    Data science for urban equity: Making gentrification an accessible topic for data scientists, policymakers, and the community

    Authors: Bernease Herman, Gundula Proksch, Rachel Berney, Hillary Dawkins, Jacob Kovacs, Yahui Ma, Jacob Rich, Amanda Tan

    Abstract: The University of Washington eScience Institute runs an annual Data Science for Social Good (DSSG) program that selects four projects each year to train students from a wide range of disciplines while helping community members execute social good projects, often with an urban focus. We present observations and deliberations of one such project, the DSSG 2017 'Equitable Futures' project, which in… ▽ More

    Submitted 6 October, 2017; originally announced October 2017.

    Comments: Presented at the Data For Good Exchange 2017