Skip to main content

Showing 1–17 of 17 results for author: Vig, J

  1. arXiv:2309.15337  [pdf, other

    cs.CL cs.HC

    Beyond the Chat: Executable and Verifiable Text-Editing with LLMs

    Authors: Philippe Laban, Jesse Vig, Marti A. Hearst, Caiming Xiong, Chien-Sheng Wu

    Abstract: Conversational interfaces powered by Large Language Models (LLMs) have recently become a popular way to obtain feedback during document editing. However, standard chat-based conversational interfaces do not support transparency and verifiability of the editing changes that they suggest. To give the author more agency when editing with an LLM, we present InkSync, an editing interface that suggests… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  2. arXiv:2309.03450  [pdf, other

    cs.CL cs.AI cs.LG

    XGen-7B Technical Report

    Authors: Erik Nijkamp, Tian Xie, Hiroaki Hayashi, Bo Pang, Congying Xia, Chen Xing, Jesse Vig, Semih Yavuz, Philippe Laban, Ben Krause, Senthil Purushwalkam, Tong Niu, Wojciech Kryściński, Lidiya Murakhovs'ka, Prafulla Kumar Choubey, Alex Fabbri, Ye Liu, Rui Meng, Lifu Tu, Meghana Bhat, Chien-Sheng Wu, Silvio Savarese, Yingbo Zhou, Shafiq Joty, Caiming Xiong

    Abstract: Large Language Models (LLMs) have become ubiquitous across various domains, transforming the way we interact with information and conduct research. However, most high-performing LLMs remain confined behind proprietary walls, hindering scientific progress. Most open-source LLMs, on the other hand, are limited in their ability to support longer sequence lengths, which is a key requirement for many t… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  3. arXiv:2306.01150  [pdf, other

    cs.CL cs.AI

    Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

    Authors: Fan Yin, Jesse Vig, Philippe Laban, Shafiq Joty, Caiming Xiong, Chien-Sheng Jason Wu

    Abstract: Large language models (LLMs) have shown impressive performance in following natural language instructions to solve unseen tasks. However, it remains unclear whether models truly understand task definitions and whether the human-written definitions are optimal. In this paper, we systematically study the role of task definitions in instruction learning. We first conduct an ablation analysis informed… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ACL 2023, camera-ready; 10 pages

  4. arXiv:2305.19204  [pdf, other

    cs.CL

    SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages

    Authors: Philippe Laban, Jesse Vig, Wojciech Kryscinski, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu

    Abstract: Text simplification research has mostly focused on sentence-level simplification, even though many desirable edits - such as adding relevant background information or reordering content - may require document-level context. Prior work has also predominantly framed simplification as a single-step, input-to-output task, only implicitly modeling the fine-grained, span-level edits that elucidate the s… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: ACL 2023, Long Paper

  5. arXiv:2211.06196  [pdf, other

    cs.CL

    Improving Factual Consistency in Summarization with Compression-Based Post-Editing

    Authors: Alexander R. Fabbri, Prafulla Kumar Choubey, Jesse Vig, Chien-Sheng Wu, Caiming Xiong

    Abstract: State-of-the-art summarization models still struggle to be factually consistent with the input text. A model-agnostic way to address this problem is post-editing the generated summaries. However, existing approaches typically fail to remove entity errors if a suitable input entity replacement is not available or may insert erroneous content. In our work, we focus on removing extrinsic entity error… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  6. arXiv:2205.02894  [pdf, other

    cs.HC cs.AI cs.CL

    Interactive Model Cards: A Human-Centered Approach to Model Documentation

    Authors: Anamaria Crisan, Margaret Drouhard, Jesse Vig, Nazneen Rajani

    Abstract: Deep learning models for natural language processing (NLP) are increasingly adopted and deployed by analysts without formal training in NLP or machine learning (ML). However, the documentation intended to convey the model's details and appropriate use is tailored primarily to individuals with ML or NLP expertise. To address this gap, we conduct a design inquiry into interactive model cards, which… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: To appear at ACM FAccT'22

    MSC Class: 68T01

  7. iSEA: An Interactive Pipeline for Semantic Error Analysis of NLP Models

    Authors: Jun Yuan, Jesse Vig, Nazneen Rajani

    Abstract: Error analysis in NLP models is essential to successful model development and deployment. One common approach for diagnosing errors is to identify subpopulations in the dataset where the model produces the most errors. However, existing approaches typically define subpopulations based on pre-defined features, which requires users to form hypotheses of errors in advance. To complement these approac… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted at IUI 2022, 11 pages, 6 figures

  8. arXiv:2112.07637  [pdf, other

    cs.CL

    Exploring Neural Models for Query-Focused Summarization

    Authors: Jesse Vig, Alexander R. Fabbri, Wojciech Kryściński, Chien-Sheng Wu, Wenhao Liu

    Abstract: Query-focused summarization (QFS) aims to produce summaries that answer particular questions of interest, enabling greater user control and personalization. While recently released datasets, such as QMSum or AQuaMuSe, facilitate research efforts in QFS, the field lacks a comprehensive study of the broad space of applicable modeling methods. In this paper we conduct a systematic exploration of neur… ▽ More

    Submitted 26 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Findings of NAACL 2022

  9. arXiv:2110.07166  [pdf, other

    cs.CL

    CaPE: Contrastive Parameter Ensembling for Reducing Hallucination in Abstractive Summarization

    Authors: Prafulla Kumar Choubey, Alexander R. Fabbri, Jesse Vig, Chien-Sheng Wu, Wenhao Liu, Nazneen Fatema Rajani

    Abstract: Hallucination is a known issue for neural abstractive summarization models. Recent work suggests that the degree of hallucination may depend on errors in the training data. In this work, we propose a new method called Contrastive Parameter Ensembling (CaPE) to use training data more effectively, utilizing variations in noise in training samples to reduce hallucination. We first select clean and no… ▽ More

    Submitted 20 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  10. arXiv:2104.07605  [pdf, other

    cs.CL

    SummVis: Interactive Visual Analysis of Models, Data, and Evaluation for Text Summarization

    Authors: Jesse Vig, Wojciech Kryściński, Karan Goel, Nazneen Fatema Rajani

    Abstract: Novel neural architectures, training strategies, and the availability of large-scale corpora haven been the driving force behind recent progress in abstractive text summarization. However, due to the black-box nature of neural models, uninformative evaluation metrics, and scarce tooling for model and data analysis, the true performance and failure modes of summarization models remain largely unkno… ▽ More

    Submitted 26 July, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to ACL 2021 System Demonstrations

  11. arXiv:2101.04840  [pdf, other

    cs.CL cs.AI cs.LG

    Robustness Gym: Unifying the NLP Evaluation Landscape

    Authors: Karan Goel, Nazneen Rajani, Jesse Vig, Samson Tan, Jason Wu, Stephan Zheng, Caiming Xiong, Mohit Bansal, Christopher Ré

    Abstract: Despite impressive performance on standard benchmarks, deep neural networks are often brittle when deployed in real-world systems. Consequently, recent research has focused on testing the robustness of such models, resulting in a diverse set of evaluation methodologies ranging from adversarial attacks to rule-based data transformations. In this work, we identify challenges with evaluating NLP syst… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 34 pages, 8 figures, 6 tables

  12. arXiv:2012.00195  [pdf, other

    cs.LG q-bio.BM

    Profile Prediction: An Alignment-Based Pre-Training Task for Protein Sequence Models

    Authors: Pascal Sturmfels, Jesse Vig, Ali Madani, Nazneen Fatema Rajani

    Abstract: For protein sequence datasets, unlabeled data has greatly outpaced labeled data due to the high cost of wet-lab characterization. Recent deep-learning approaches to protein prediction have shown that pre-training on unlabeled data can yield useful representations for downstream tasks. However, the optimal pre-training strategy remains an open question. Instead of strictly borrowing from natural la… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

  13. arXiv:2006.15222  [pdf, other

    cs.CL cs.LG q-bio.BM

    BERTology Meets Biology: Interpreting Attention in Protein Language Models

    Authors: Jesse Vig, Ali Madani, Lav R. Varshney, Caiming Xiong, Richard Socher, Nazneen Fatema Rajani

    Abstract: Transformer architectures have proven to learn useful representations for protein classification and generation tasks. However, these representations present challenges in interpretability. In this work, we demonstrate a set of methods for analyzing protein Transformer models through the lens of attention. We show that attention: (1) captures the folding structure of proteins, connecting amino aci… ▽ More

    Submitted 28 March, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: To appear in ICLR 2021

    ACM Class: I.2

  14. arXiv:2004.12265  [pdf, other

    cs.CL

    Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias

    Authors: Jesse Vig, Sebastian Gehrmann, Yonatan Belinkov, Sharon Qian, Daniel Nevo, Simas Sakenis, Jason Huang, Yaron Singer, Stuart Shieber

    Abstract: Common methods for interpreting neural models in natural language processing typically examine either their structure or their behavior, but not both. We propose a methodology grounded in the theory of causal mediation analysis for interpreting which parts of a model are causally implicated in its behavior. It enables us to analyze the mechanisms by which information flows from input to output thr… ▽ More

    Submitted 22 November, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    Comments: Expanded version

    MSC Class: 68T50 ACM Class: I.2.7

  15. arXiv:1906.05714  [pdf, other

    cs.HC cs.CL cs.LG

    A Multiscale Visualization of Attention in the Transformer Model

    Authors: Jesse Vig

    Abstract: The Transformer is a sequence model that forgoes traditional recurrent architectures in favor of a fully attention-based approach. Besides improving performance, an advantage of using attention is that it can also help to interpret a model by showing how the model assigns weight to different input elements. However, the multi-layer, multi-head attention mechanism in the Transformer model can be di… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: To appear in ACL 2019 (System Demonstrations). arXiv admin note: substantial text overlap with arXiv:1904.02679

  16. arXiv:1906.04284  [pdf, other

    cs.CL cs.LG stat.ML

    Analyzing the Structure of Attention in a Transformer Language Model

    Authors: Jesse Vig, Yonatan Belinkov

    Abstract: The Transformer is a fully attention-based alternative to recurrent networks that has achieved state-of-the-art results across a range of NLP tasks. In this paper, we analyze the structure of attention in a Transformer language model, the GPT-2 small pretrained model. We visualize attention for individual instances and analyze the interaction between attention and syntax over a large corpus. We fi… ▽ More

    Submitted 18 June, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: To appear in ACL BlackboxNLP workshop

  17. arXiv:1904.02679  [pdf, other

    cs.HC cs.LG stat.ML

    Visualizing Attention in Transformer-Based Language Representation Models

    Authors: Jesse Vig

    Abstract: We present an open-source tool for visualizing multi-head self-attention in Transformer-based language representation models. The tool extends earlier work by visualizing attention at three levels of granularity: the attention-head level, the model level, and the neuron level. We describe how each of these views can help to interpret the model, and we demonstrate the tool on the BERT model and the… ▽ More

    Submitted 11 April, 2019; v1 submitted 4 April, 2019; originally announced April 2019.