Skip to main content

Showing 1–50 of 99 results for author: Potts, C

  1. arXiv:2407.10930  [pdf, other

    cs.CL cs.AI cs.LG

    Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together

    Authors: Dilara Soylu, Christopher Potts, Omar Khattab

    Abstract: Natural Language Processing (NLP) systems are increasingly taking the form of multi-stage pipelines involving multiple distinct language models (LMs) and prompting strategies. Here we address the question of how to fine-tune such systems to improve their performance. We cast this as a problem of optimizing the underlying LM weights and the prompting strategies together, and consider a challenging… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2406.11706  [pdf, other

    cs.IR cs.CL cs.LG

    Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels

    Authors: Jasper Xian, Saron Samuel, Faraz Khoubsirat, Ronak Pradeep, Md Arafat Sultan, Radu Florian, Salim Roukos, Avirup Sil, Christopher Potts, Omar Khattab

    Abstract: We develop a method for training small-scale (under 100M parameter) neural information retrieval models with as few as 10 gold relevance labels. The method depends on generating synthetic queries for documents using a language model (LM), and the key step is that we automatically optimize the LM prompt that is used to generate these queries based on training quality. In experiments with the BIRCO… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.11695  [pdf, other

    cs.CL cs.AI cs.LG

    Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs

    Authors: Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab

    Abstract: Language Model Programs, i.e. sophisticated pipelines of modular language model (LM) calls, are increasingly advancing NLP tasks, but they require crafting prompts that are jointly effective for all modules. We study prompt optimization for LM programs, i.e. how to update these prompts to maximize a downstream metric without access to module-level labels or gradients. To make this tractable, we fa… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Krista and Michael contributed equally to this work

  4. arXiv:2406.09458  [pdf, other

    cs.CV cs.AI cs.CL

    Updating CLIP to Prefer Descriptions Over Captions

    Authors: Amir Zur, Elisa Kreiss, Karel D'Oosterlinck, Christopher Potts, Atticus Geiger

    Abstract: Although CLIPScore is a powerful generic metric that captures the similarity between a text and an image, it fails to distinguish between a caption that is meant to complement the information in an image and a description that is meant to replace an image entirely, e.g., for accessibility. We address this shortcoming by updating the CLIP model with the Concadia dataset to assign higher scores to d… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2405.16039  [pdf, other

    cs.LG cs.AI cs.NE

    MoEUT: Mixture-of-Experts Universal Transformers

    Authors: Róbert Csordás, Kazuki Irie, Jürgen Schmidhuber, Christopher Potts, Christopher D. Manning

    Abstract: Previous work on Universal Transformers (UTs) has demonstrated the importance of parameter sharing across layers. By allowing recurrence in depth, UTs have advantages over standard Transformers in learning compositional generalizations, but layer-sharing comes with a practical limitation of parameter-compute ratio: it drastically reduces the parameter count compared to the non-shared model with th… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2404.03592  [pdf, other

    cs.CL cs.AI cs.LG

    ReFT: Representation Finetuning for Language Models

    Authors: Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts

    Abstract: Parameter-efficient finetuning (PEFT) methods seek to adapt large neural models via updates to a small number of weights. However, much prior interpretability work has shown that representations encode rich semantic information, suggesting that editing representations might be a more powerful alternative. We pursue this hypothesis by developing a family of Representation Finetuning (ReFT) methods.… ▽ More

    Submitted 22 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: preprint

  7. arXiv:2404.01268  [pdf, other

    cs.CL cs.AI cs.DL cs.LG cs.SI

    Mapping the Increasing Use of LLMs in Scientific Papers

    Authors: Weixin Liang, Yaohui Zhang, Zhengxuan Wu, Haley Lepp, Wenlong Ji, Xuandong Zhao, Hancheng Cao, Sheng Liu, Siyu He, Zhi Huang, Diyi Yang, Christopher Potts, Christopher D Manning, James Y. Zou

    Abstract: Scientific publishing lays the foundation of science by disseminating research findings, fostering collaboration, encouraging reproducibility, and ensuring that scientific knowledge is accessible, verifiable, and built upon over time. Recently, there has been immense speculation about how many people are using large language models (LLMs) like ChatGPT in their academic writing, and to what extent… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  8. arXiv:2403.18038  [pdf

    cs.CV

    TGGLinesPlus: A robust topological graph-guided computer vision algorithm for line detection from images

    Authors: Liping Yang, Joshua Driscol, Ming Gong, Shujie Wang, Catherine G. Potts

    Abstract: Line detection is a classic and essential problem in image processing, computer vision and machine intelligence. Line detection has many important applications, including image vectorization (e.g., document recognition and art design), indoor mapping, and important societal challenges (e.g., sea ice fracture line extraction from satellite imagery). Many line detection algorithms and methods have b… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Our TGGLinesPlus Python implementation is open source. 27 pages, 8 figures and 4 tables

  9. arXiv:2403.07809  [pdf, other

    cs.LG cs.CL

    pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

    Authors: Zhengxuan Wu, Atticus Geiger, Aryaman Arora, Jing Huang, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts

    Abstract: Interventions on model-internal states are fundamental operations in many areas of AI, including model editing, steering, robustness, and interpretability. To facilitate such research, we introduce $\textbf{pyvene}$, an open-source Python library that supports customizable interventions on a range of different PyTorch modules. $\textbf{pyvene}$ supports complex intervention schemes with an intuiti… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

  10. arXiv:2402.17700  [pdf, other

    cs.CL cs.LG

    RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations

    Authors: Jing Huang, Zhengxuan Wu, Christopher Potts, Mor Geva, Atticus Geiger

    Abstract: Individual neurons participate in the representation of multiple high-level concepts. To what extent can different interpretability methods successfully disentangle these roles? To help address this question, we introduce RAVEL (Resolving Attribute-Value Entanglements in Language Models), a dataset that enables tightly controlled, quantitative comparisons between a variety of existing interpretabi… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  11. arXiv:2402.15002  [pdf, other

    cs.CL cs.CV

    CommVQA: Situating Visual Question Answering in Communicative Contexts

    Authors: Nandita Shankar Naik, Christopher Potts, Elisa Kreiss

    Abstract: Current visual question answering (VQA) models tend to be trained and evaluated on image-question pairs in isolation. However, the questions people ask are dependent on their informational needs and prior knowledge about the image content. To evaluate how situating images within naturalistic contexts shapes visual questions, we introduce CommVQA, a VQA dataset consisting of images, image descripti… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  12. arXiv:2402.12560  [pdf, other

    cs.CL cs.AI

    CausalGym: Benchmarking causal interpretability methods on linguistic tasks

    Authors: Aryaman Arora, Dan Jurafsky, Christopher Potts

    Abstract: Language models (LMs) have proven to be powerful tools for psycholinguistic research, but most prior work has focused on purely behavioural measures (e.g., surprisal comparisons). At the same time, research in model interpretability has begun to illuminate the abstract causal mechanisms shaping LM behavior. To help bring these strands of research closer together, we introduce CausalGym. We adapt a… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 9 pages main text, 26 pages total

    ACM Class: I.2.7

  13. arXiv:2402.07411  [pdf, other

    cs.LG

    Potential-Based Reward Shaping For Intrinsic Motivation

    Authors: Grant C. Forbes, Nitish Gupta, Leonardo Villalobos-Arias, Colin M. Potts, Arnav Jhala, David L. Roberts

    Abstract: Recently there has been a proliferation of intrinsic motivation (IM) reward-shaping methods to learn in complex and sparse-reward environments. These methods can often inadvertently change the set of optimal policies in an environment, leading to suboptimal behavior. Previous work on mitigating the risks of reward shaping, particularly through potential-based reward shaping (PBRS), has not been ap… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Extended version of paper appearing in AAMAS 2024

    ACM Class: I.2.6

  14. arXiv:2401.12631  [pdf, other

    cs.LG cs.AI cs.CL

    A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments

    Authors: Zhengxuan Wu, Atticus Geiger, Jing Huang, Aryaman Arora, Thomas Icard, Christopher Potts, Noah D. Goodman

    Abstract: We respond to the recent paper by Makelov et al. (2023), which reviews subspace interchange intervention methods like distributed alignment search (DAS; Geiger et al. 2023) and claims that these methods potentially cause "interpretability illusions". We first review Makelov et al. (2023)'s technical notion of what an "interpretability illusion" is, and then we show that even intuitive and desirabl… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 20 pages, 14 figures

  15. arXiv:2401.12178  [pdf, other

    cs.CL cs.AI

    In-Context Learning for Extreme Multi-Label Classification

    Authors: Karel D'Oosterlinck, Omar Khattab, François Remy, Thomas Demeester, Chris Develder, Christopher Potts

    Abstract: Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt. We propose a general program, $\texttt{Infer--Retrieve--Rank}$, that defines multi-step interactions between LMs and… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  16. arXiv:2401.06416  [pdf, other

    cs.CL cs.AI cs.LG

    Mission: Impossible Language Models

    Authors: Julie Kallini, Isabel Papadimitriou, Richard Futrell, Kyle Mahowald, Christopher Potts

    Abstract: Chomsky and others have very directly claimed that large language models (LLMs) are equally capable of learning languages that are possible and impossible for humans to learn. However, there is very little published experimental evidence to support such a claim. Here, we develop a set of synthetic impossible languages of differing complexity, each designed by systematically altering English data w… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  17. arXiv:2401.05300  [pdf, other

    cs.CL cs.AI

    I am a Strange Dataset: Metalinguistic Tests for Language Models

    Authors: Tristan Thrush, Jared Moore, Miguel Monares, Christopher Potts, Douwe Kiela

    Abstract: Statements involving metalinguistic self-reference ("This paper has six sections.") are prevalent in many domains. Can large language models (LLMs) handle such language? In this paper, we present "I am a Strange Dataset", a new dataset for addressing this question. There are two subtasks: generation and verification. In generation, models continue statements like "The penultimate word in this sent… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  18. arXiv:2401.03590  [pdf, other

    cs.CL

    Building Efficient and Effective OpenQA Systems for Low-Resource Languages

    Authors: Emrah Budur, Rıza Özçelik, Dilara Soylu, Omar Khattab, Tunga Güngör, Christopher Potts

    Abstract: Question answering (QA) is the task of answering questions posed in natural language with free-form natural language answers extracted from a given passage. In the OpenQA variant, only a question text is given, and the system must retrieve relevant passages from an unstructured knowledge source and use them to provide answers, which is the case in the mainstream QA systems on the Web. QA systems c… ▽ More

    Submitted 4 June, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  19. arXiv:2312.13382  [pdf, ps, other

    cs.CL cs.AI cs.PL

    DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines

    Authors: Arnav Singhvi, Manish Shetty, Shangyin Tan, Christopher Potts, Koushik Sen, Matei Zaharia, Omar Khattab

    Abstract: Chaining language model (LM) calls as composable modules is fueling a new way of programming, but ensuring LMs adhere to important constraints requires heuristic "prompt engineering". We introduce LM Assertions, a programming construct for expressing computational constraints that LMs should satisfy. We integrate our constructs into the recent DSPy programming model for LMs, and present new strate… ▽ More

    Submitted 2 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Arnav*, Manish*, Shangyin* contributed equally to this work

  20. arXiv:2312.05231  [pdf, other

    cs.LG

    Modeling Risk in Reinforcement Learning: A Literature Mapping

    Authors: Leonardo Villalobos-Arias, Derek Martin, Abhijeet Krishnan, Madeleine Gagné, Colin M. Potts, Arnav Jhala

    Abstract: Safe reinforcement learning deals with mitigating or avoiding unsafe situations by reinforcement learning (RL) agents. Safe RL approaches are based on specific risk representations for particular problems or domains. In order to analyze agent behaviors, compare safe RL approaches, and effectively transfer techniques between application domains, it is necessary to understand the types of risk speci… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 36 pages, 8 figures, Submitted to Artificial Intelligence Reviews

  21. arXiv:2311.11518  [pdf, other

    cs.CL cs.LG

    Multi-teacher Distillation for Multilingual Spelling Correction

    Authors: Jingfen Zhang, Xuan Guo, Sravan Bodapati, Christopher Potts

    Abstract: Accurate spelling correction is a critical step in modern search interfaces, especially in an era of mobile devices and speech-to-text interfaces. For services that are deployed around the world, this poses a significant challenge for multilingual NLP: spelling errors need to be caught and corrected in all languages, and even in queries that use multiple languages. In this paper, we tackle this ch… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

  22. arXiv:2311.10905  [pdf, other

    cs.CL cs.AI

    Flexible Model Interpretability through Natural Language Model Editing

    Authors: Karel D'Oosterlinck, Thomas Demeester, Chris Develder, Christopher Potts

    Abstract: Model interpretability and model editing are crucial goals in the age of large language models. Interestingly, there exists a link between these two goals: if a method is able to systematically edit model behavior with regard to a human concept of interest, this editor method can help make internal representations more interpretable by pointing towards relevant representations and systematically m… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Extended Abstract -- work in progress. BlackboxNLP2023

  23. arXiv:2311.09476  [pdf, other

    cs.CL cs.AI cs.IR

    ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems

    Authors: Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia

    Abstract: Evaluating retrieval-augmented generation (RAG) systems traditionally relies on hand annotations for input queries, passages to retrieve, and responses to generate. We introduce ARES, an Automated RAG Evaluation System, for evaluating RAG systems along the dimensions of context relevance, answer faithfulness, and answer relevance. By creating its own synthetic training data, ARES finetunes lightwe… ▽ More

    Submitted 31 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  24. arXiv:2310.06165  [pdf, other

    cs.CL cs.AI

    CAW-coref: Conjunction-Aware Word-level Coreference Resolution

    Authors: Karel D'Oosterlinck, Semere Kiros Bitew, Brandon Papineau, Christopher Potts, Thomas Demeester, Chris Develder

    Abstract: State-of-the-art coreference resolutions systems depend on multiple LLM calls per document and are thus prohibitively expensive for many use cases (e.g., information extraction with large corpora). The leading word-level coreference system (WL-coref) attains 96.6% of these SOTA systems' performance while being much more efficient. In this work, we identify a routine yet important failure case of W… ▽ More

    Submitted 19 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted at CRAC 2023

  25. arXiv:2310.03714  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

    Authors: Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts

    Abstract: The ML community is rapidly exploring techniques for prompting language models (LMs) and for stacking them into pipelines that solve complex tasks. Unfortunately, existing LM pipelines are typically implemented using hard-coded "prompt templates", i.e. lengthy strings discovered via trial and error. Toward a more systematic approach for developing and optimizing LM pipelines, we introduce DSPy, a… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  26. arXiv:2309.11710  [pdf, other

    cs.CL cs.CV

    ContextRef: Evaluating Referenceless Metrics For Image Description Generation

    Authors: Elisa Kreiss, Eric Zelikman, Christopher Potts, Nick Haber

    Abstract: Referenceless metrics (e.g., CLIPScore) use pretrained vision--language models to assess image descriptions directly without costly ground-truth reference texts. Such methods can facilitate rapid progress, but only if they truly align with human preference judgments. In this paper, we introduce ContextRef, a benchmark for assessing referenceless metrics for such alignment. ContextRef has two compo… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  27. arXiv:2309.10312  [pdf, other

    cs.CL

    Rigorously Assessing Natural Language Explanations of Neurons

    Authors: Jing Huang, Atticus Geiger, Karel D'Oosterlinck, Zhengxuan Wu, Christopher Potts

    Abstract: Natural language is an appealing medium for explaining how large language models process and store information, but evaluating the faithfulness of such explanations is challenging. To help address this, we develop two modes of evaluation for natural language explanations that claim individual neurons represent a concept in a text input. In the observational mode, we evaluate claims that a neuron… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  28. arXiv:2307.15745  [pdf, other

    cs.CL cs.CV

    Context-VQA: Towards Context-Aware and Purposeful Visual Question Answering

    Authors: Nandita Naik, Christopher Potts, Elisa Kreiss

    Abstract: Visual question answering (VQA) has the potential to make the Internet more accessible in an interactive way, allowing people who cannot see images to ask questions about them. However, multiple studies have shown that people who are blind or have low-vision prefer image explanations that incorporate the context in which an image appears, yet current VQA datasets focus on images in isolation. We a… ▽ More

    Submitted 30 August, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Proceedings of ICCV 2023 Workshop on Closing the Loop Between Vision and Language

  29. arXiv:2306.11670  [pdf, other

    cs.LG cs.AI

    GIO: Gradient Information Optimization for Training Dataset Selection

    Authors: Dante Everaert, Christopher Potts

    Abstract: It is often advantageous to train models on a subset of the available train examples, because the examples are of variable quality or because one would like to train with fewer examples, without sacrificing performance. We present Gradient Information Optimization (GIO), a scalable, task-agnostic approach to this data selection problem that requires only a small set of (unlabeled) examples represe… ▽ More

    Submitted 11 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  30. ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning

    Authors: Jingyuan Selena She, Christopher Potts, Samuel R. Bowman, Atticus Geiger

    Abstract: A number of recent benchmarks seek to assess how well models handle natural language negation. However, these benchmarks lack the controlled example paradigms that would allow us to infer whether a model had learned how negation morphemes semantically scope. To fill these analytical gaps, we present the Scoped Negation NLI (ScoNe-NLI) benchmark, which contains contrast sets of six examples with up… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  31. arXiv:2305.14795  [pdf, other

    cs.CL

    MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

    Authors: Zexuan Zhong, Zhengxuan Wu, Christopher D. Manning, Christopher Potts, Danqi Chen

    Abstract: The information stored in large language models (LLMs) falls out of date quickly, and retraining from scratch is often not an option. This has recently given rise to a range of techniques for injecting new facts through updating model weights. Current evaluation paradigms are extremely limited, mainly validating the recall of edited facts, but changing one fact should cause rippling changes to the… ▽ More

    Submitted 29 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023. Our code and datasets are available at https://github.com/princeton-nlp/MQuAKE

  32. arXiv:2305.13395  [pdf, other

    cs.CL

    BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance

    Authors: Karel D'Oosterlinck, François Remy, Johannes Deleu, Thomas Demeester, Chris Develder, Klim Zaporojets, Aneiss Ghodsi, Simon Ellershaw, Jack Collins, Christopher Potts

    Abstract: Timely and accurate extraction of Adverse Drug Events (ADE) from biomedical literature is paramount for public safety, but involves slow and costly manual labor. We set out to improve drug safety monitoring (pharmacovigilance, PV) through the use of Natural Language Processing (NLP). We introduce BioDEX, a large-scale resource for Biomedical adverse Drug Event Extraction, rooted in the historical… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 28 pages. EMNLP Findings 2023

  33. arXiv:2305.09038  [pdf

    cs.HC

    Characterizing Image Accessibility on Wikipedia across Languages

    Authors: Elisa Kreiss, Krishna Srinivasan, Tiziano Piccardi, Jesus Adolfo Hermosillo, Cynthia Bennett, Michael S. Bernstein, Meredith Ringel Morris, Christopher Potts

    Abstract: We make a first attempt to characterize image accessibility on Wikipedia across languages, present new experimental results that can inform efforts to assess description quality, and offer some strategies to improve Wikipedia's image accessibility.

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: Presented at Wiki Workshop 2023

  34. arXiv:2305.08809  [pdf, other

    cs.CL

    Interpretability at Scale: Identifying Causal Mechanisms in Alpaca

    Authors: Zhengxuan Wu, Atticus Geiger, Thomas Icard, Christopher Potts, Noah D. Goodman

    Abstract: Obtaining human-interpretable explanations of large, general-purpose language models is an urgent goal for AI safety. However, it is just as important that our interpretability methods are faithful to the causal dynamics underlying model behavior and able to robustly generalize to unseen inputs. Distributed Alignment Search (DAS) is a powerful gradient descent method grounded in a theory of causal… ▽ More

    Submitted 6 February, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 with Author Corrections

  35. arXiv:2303.13716  [pdf, other

    cs.CL

    ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation

    Authors: Zhengxuan Wu, Christopher D. Manning, Christopher Potts

    Abstract: Compositional generalization benchmarks for semantic parsing seek to assess whether models can accurately compute meanings for novel sentences, but operationalize this in terms of logical form (LF) prediction. This raises the concern that semantically irrelevant details of the chosen LFs could shape model performance. We argue that this concern is realized for the COGS benchmark. COGS poses genera… ▽ More

    Submitted 23 January, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: TACL 2023

  36. arXiv:2303.02536  [pdf, other

    cs.AI

    Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations

    Authors: Atticus Geiger, Zhengxuan Wu, Christopher Potts, Thomas Icard, Noah D. Goodman

    Abstract: Causal abstraction is a promising theoretical framework for explainable artificial intelligence that defines when an interpretable high-level causal model is a faithful simplification of a low-level deep learning system. However, existing causal abstraction methods have two major limitations: they require a brute-force search over alignments between the high-level model and the low-level one, and… ▽ More

    Submitted 21 February, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

  37. arXiv:2303.00807  [pdf, other

    cs.IR cs.CL

    UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

    Authors: Jon Saad-Falcon, Omar Khattab, Keshav Santhanam, Radu Florian, Martin Franz, Salim Roukos, Avirup Sil, Md Arafat Sultan, Christopher Potts

    Abstract: Many information retrieval tasks require large labeled datasets for fine-tuning. However, such datasets are often unavailable, and their utility for real-world applications can diminish quickly due to domain shifts. To address this challenge, we develop and motivate a method for using large language models (LLMs) to generate large numbers of synthetic queries cheaply. The method begins by generati… ▽ More

    Submitted 13 October, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Long Paper at Empirical Methods in Natural Language Processing (EMNLP) 2023

  38. arXiv:2301.04709  [pdf, ps, other

    cs.AI

    Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability

    Authors: Atticus Geiger, Duligur Ibeling, Amir Zur, Maheep Chaudhary, Sonakshi Chauhan, Jing Huang, Aryaman Arora, Zhengxuan Wu, Noah Goodman, Christopher Potts, Thomas Icard

    Abstract: Causal abstraction provides a theoretical foundation for mechanistic interpretability, the field concerned with providing intelligible algorithms that are faithful simplifications of the known, but opaque low-level details of black box AI models. Our contributions are (1) generalizing the theory of causal abstraction from mechanism replacement (i.e., hard and soft interventions) to arbitrary mecha… ▽ More

    Submitted 7 July, 2024; v1 submitted 11 January, 2023; originally announced January 2023.

  39. arXiv:2212.14024  [pdf, other

    cs.CL cs.IR

    Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

    Authors: Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, Matei Zaharia

    Abstract: Retrieval-augmented in-context learning has emerged as a powerful approach for addressing knowledge-intensive tasks using frozen language models (LM) and retrieval models (RM). Existing work has combined these in simple "retrieve-then-read" pipelines in which the RM retrieves passages that are inserted into the LM prompt. To begin to fully realize the potential of frozen LMs and RMs, we propose De… ▽ More

    Submitted 23 January, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

  40. arXiv:2212.09897  [pdf, other

    cs.CL

    Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

    Authors: Jing Huang, Zhengxuan Wu, Kyle Mahowald, Christopher Potts

    Abstract: Language tasks involving character-level manipulations (e.g., spelling corrections, arithmetic operations, word games) are challenging for models operating on subword units. To address this, we develop a causal intervention framework to learn robust and interpretable character representations inside subword-based language models. Our method treats each character as a typed variable in a causal mod… ▽ More

    Submitted 19 December, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Findings of the Association for Computational Linguistics: ACL 2023

  41. arXiv:2212.09867  [pdf, other

    cs.CL

    Detecting Contradictory COVID-19 Drug Efficacy Claims from Biomedical Literature

    Authors: Daniel N. Sosa, Malavika Suresh, Christopher Potts, Russ B. Altman

    Abstract: The COVID-19 pandemic created a deluge of questionable and contradictory scientific claims about drug efficacy -- an "infodemic" with lasting consequences for science and society. In this work, we argue that NLP models can help domain experts distill and understand the literature in this complex, high-stakes area. Our task is to automatically identify contradictory claims about COVID-19 drug effic… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  42. arXiv:2212.01340  [pdf, other

    cs.IR cs.CL

    Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

    Authors: Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts

    Abstract: Neural information retrieval (IR) systems have progressed rapidly in recent years, in large part due to the release of publicly available benchmarking tasks. Unfortunately, some dimensions of this progress are illusory: the majority of the popular IR benchmarks today focus exclusively on downstream task accuracy and thus conceal the costs incurred by systems that trade away efficiency for quality.… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  43. arXiv:2210.09492  [pdf, other

    cs.CL

    Systematicity in GPT-3's Interpretation of Novel English Noun Compounds

    Authors: Siyan Li, Riley Carlson, Christopher Potts

    Abstract: Levin et al. (2019) show experimentally that the interpretations of novel English noun compounds (e.g., stew skillet), while not fully compositional, are highly predictable based on whether the modifier and head refer to artifacts or natural kinds. Is the large language model GPT-3 governed by the same interpretive principles? To address this question, we first compare Levin et al.'s experimental… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  44. arXiv:2209.14279  [pdf, other

    cs.CL

    Causal Proxy Models for Concept-Based Model Explanations

    Authors: Zhengxuan Wu, Karel D'Oosterlinck, Atticus Geiger, Amir Zur, Christopher Potts

    Abstract: Explainability methods for NLP systems encounter a version of the fundamental problem of causal inference: for a given ground-truth input text, we never truly observe the counterfactual texts necessary for isolating the causal effects of model representations on outputs. In response, many explainability methods make no use of counterfactual texts, assuming they will be unavailable. In this paper,… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 23 pages

  45. arXiv:2209.08141  [pdf, other

    cs.CL cs.AI cs.LG

    Psychologically-informed chain-of-thought prompts for metaphor understanding in large language models

    Authors: Ben Prystawski, Paul Thibodeau, Christopher Potts, Noah D. Goodman

    Abstract: Probabilistic models of language understanding are valuable tools for investigating human language use. However, they need to be hand-designed for a particular domain. In contrast, large language models (LLMs) are trained on text that spans a wide array of domains, but they lack the structure and interpretability of probabilistic models. In this paper, we use chain-of-thought prompts to introduce… ▽ More

    Submitted 19 May, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 7 pages, 1 figure

  46. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  47. arXiv:2205.14140  [pdf, other

    cs.CL

    CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior

    Authors: Eldar David Abraham, Karel D'Oosterlinck, Amir Feder, Yair Ori Gat, Atticus Geiger, Christopher Potts, Roi Reichart, Zhengxuan Wu

    Abstract: The increasing size and complexity of modern ML systems has improved their predictive capabilities but made their behavior harder to explain. Many techniques for model explanation have been developed in response, but we lack clear criteria for assessing these techniques. In this paper, we cast model explanation as the causal inference problem of estimating causal effects of real-world concepts on… ▽ More

    Submitted 12 October, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS 2022

  48. arXiv:2205.10646  [pdf, other

    cs.CL

    Context Matters for Image Descriptions for Accessibility: Challenges for Referenceless Evaluation Metrics

    Authors: Elisa Kreiss, Cynthia Bennett, Shayan Hooshmand, Eric Zelikman, Meredith Ringel Morris, Christopher Potts

    Abstract: Few images on the Web receive alt-text descriptions that would make them accessible to blind and low vision (BLV) users. Image-based NLG systems have progressed to the point where they can begin to address this persistent societal problem, but these systems will not be fully successful unless we evaluate them on metrics that guide their development correctly. Here, we argue against current referen… ▽ More

    Submitted 27 October, 2022; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: Proceedings of EMNLP 2022

  49. arXiv:2205.09707  [pdf, other

    cs.IR cs.CL

    PLAID: An Efficient Engine for Late Interaction Retrieval

    Authors: Keshav Santhanam, Omar Khattab, Christopher Potts, Matei Zaharia

    Abstract: Pre-trained language models are increasingly important components across multiple information retrieval (IR) paradigms. Late interaction, introduced with the ColBERT model and recently refined in ColBERTv2, is a popular paradigm that holds state-of-the-art status across many benchmarks. To dramatically speed up the search latency of late interaction, we introduce the Performance-optimized Late Int… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Preprint. Omar and Keshav contributed equally to this work

  50. arXiv:2205.09172  [pdf, other

    cs.CL

    Color Overmodification Emerges from Data-Driven Learning and Pragmatic Reasoning

    Authors: Fei Fang, Kunal Sinha, Noah D. Goodman, Christopher Potts, Elisa Kreiss

    Abstract: Speakers' referential expressions often depart from communicative ideals in ways that help illuminate the nature of pragmatic language use. Patterns of overmodification, in which a speaker uses a modifier that is redundant given their communicative goal, have proven especially informative in this regard. It seems likely that these patterns are shaped by the environment a speaker is exposed to in c… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Proceedings of the Annual Meeting of the Cognitive Science Society (2022)