Skip to main content

Showing 1–16 of 16 results for author: Flanigan, J

  1. arXiv:2406.07933  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Model Unlearning via Embedding-Corrupted Prompts

    Authors: Chris Yuhao Liu, Yaxuan Wang, Jeffrey Flanigan, Yang Liu

    Abstract: Large language models (LLMs) have advanced to encompass extensive knowledge across diverse domains. Yet controlling what a large language model should not know is important for ensuring alignment and thus safe use. However, accurately and efficiently unlearning knowledge from an LLM remains challenging due to the potential collateral damage caused by the fuzzy boundary between retention and forget… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 55 pages, 4 figures, 66 tables

  2. arXiv:2404.15219  [pdf, other

    cs.CL

    The Power of the Noisy Channel: Unsupervised End-to-End Task-Oriented Dialogue with LLMs

    Authors: Brendan King, Jeffrey Flanigan

    Abstract: Training task-oriented dialogue systems typically requires turn-level annotations for interacting with their APIs: e.g. a dialogue state and the system actions taken at each step. These annotations can be costly to produce, error-prone, and require both domain and annotation expertise. With advances in LLMs, we hypothesize unlabelled data and a schema definition are sufficient for building a worki… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 Pages, 7 Figures

  3. arXiv:2404.10297  [pdf, other

    cs.CL cs.AI

    Future Language Modeling from Temporal Document History

    Authors: Changmao Li, Jeffrey Flanigan

    Abstract: Predicting the future is of great interest across many aspects of human activity. Businesses are interested in future trends, traders are interested in future stock prices, and companies are highly interested in future technological breakthroughs. While there are many automated systems for predicting future numerical data, such as weather, stock prices, and demand for products, there is relatively… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted by ICLR 2024

    ACM Class: I.2.7

  4. arXiv:2312.16337  [pdf, other

    cs.CL

    Task Contamination: Language Models May Not Be Few-Shot Anymore

    Authors: Changmao Li, Jeffrey Flanigan

    Abstract: Large language models (LLMs) offer impressive performance in various zero-shot and few-shot tasks. However, their success in zero-shot and few-shot settings may be affected by task contamination, a potential limitation that has not been thoroughly examined. This paper investigates how zero-shot and few-shot performance of LLMs has changed chronologically over time. Utilizing GPT-3 series models an… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

    ACM Class: I.2.7

  5. arXiv:2312.03951  [pdf, other

    cs.LG stat.ML

    Understanding the Role of Optimization in Double Descent

    Authors: Chris Yuhao Liu, Jeffrey Flanigan

    Abstract: The phenomenon of model-wise double descent, where the test error peaks and then reduces as the model size increases, is an interesting topic that has attracted the attention of researchers due to the striking observed gap between theory and practice \citep{Belkin2018ReconcilingMM}. Additionally, while double descent has been observed in various tasks and architectures, the peak of double descent… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: NeurIPS Workshop 2023 Optimization for Machine Learning

  6. arXiv:2310.07957  [pdf, other

    cs.CL cs.AI

    A New Approach Towards Autoformalization

    Authors: Nilay Patel, Rahul Saha, Jeffrey Flanigan

    Abstract: Verifying mathematical proofs is difficult, but can be automated with the assistance of a computer. Autoformalization is the task of automatically translating natural language mathematics into a formal language that can be verified by a program. This is a challenging task, and especially for higher-level mathematics found in research papers. Research paper mathematics requires large amounts of bac… ▽ More

    Submitted 9 July, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  7. arXiv:2309.13748  [pdf, other

    cs.CL

    Does the "most sinfully decadent cake ever" taste good? Answering Yes/No Questions from Figurative Contexts

    Authors: Geetanjali Rakshit, Jeffrey Flanigan

    Abstract: Figurative language is commonplace in natural language, and while making communication memorable and creative, can be difficult to understand. In this work, we investigate the robustness of Question Answering (QA) models on figurative text. Yes/no questions, in particular, are a useful probe of figurative language understanding capabilities of large language models. We propose FigurativeQA, a set… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted at RANLP 2023

  8. arXiv:2307.01453  [pdf, other

    cs.CL

    Diverse Retrieval-Augmented In-Context Learning for Dialogue State Tracking

    Authors: Brendan King, Jeffrey Flanigan

    Abstract: There has been significant interest in zero and few-shot learning for dialogue state tracking (DST) due to the high cost of collecting and annotating task-oriented dialogues. Recent work has demonstrated that in-context learning requires very little data and zero parameter updates, and even outperforms trained methods in the few-shot setting (Hu et al. 2022). We propose RefPyDST, which advances th… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 14 pages, 2 figures, to appear in Findings of the ACL 2023

  9. arXiv:2302.12944  [pdf, other

    cs.CL cs.AI

    Dependency Dialogue Acts -- Annotation Scheme and Case Study

    Authors: Jon Z. Cai, Brendan King, Margaret Perkoff, Shiran Dudy, Jie Cao, Marie Grace, Natalia Wojarnik, Ananya Ganesh, James H. Martin, Martha Palmer, Marilyn Walker, Jeffrey Flanigan

    Abstract: In this paper, we introduce Dependency Dialogue Acts (DDA), a novel framework for capturing the structure of speaker-intentions in multi-party dialogues. DDA combines and adapts features from existing dialogue annotation frameworks, and emphasizes the multi-relational response structure of dialogues in addition to the dialogue acts and rhetorical relations. It represents the functional, discourse,… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: The 13th International Workshop on Spoken Dialogue Systems Technology

    Journal ref: The 13th International Workshop on Spoken Dialogue Systems Technology 2023

  10. arXiv:2212.08565  [pdf, other

    cs.CL

    Automatic Identification of Motivation for Code-Switching in Speech Transcripts

    Authors: Ritu Belani, Jeffrey Flanigan

    Abstract: Code-switching, or switching between languages, occurs for many reasons and has important linguistic, sociological, and cultural implications. Multilingual speakers code-switch for a variety of purposes, such as expressing emotions, borrowing terms, making jokes, introducing a new topic, etc. The reason for code-switching may be quite useful for analysis, but is not readily apparent. To remedy thi… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  11. arXiv:2207.06960  [pdf, other

    cs.CL cs.AI cs.LG

    Forming Trees with Treeformers

    Authors: Nilay Patel, Jeffrey Flanigan

    Abstract: Human language is known to exhibit a nested, hierarchical structure, allowing us to form complex sentences out of smaller pieces. However, many state-of-the-art neural networks models such as Transformers have no explicit hierarchical structure in its architecture -- that is, they don't have an inductive bias toward hierarchical structure. Additionally, Transformers are known to perform poorly on… ▽ More

    Submitted 10 July, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted to RANLP 2023

  12. arXiv:2112.08513  [pdf, other

    cs.CL

    DocAMR: Multi-Sentence AMR Representation and Evaluation

    Authors: Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O'Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramón Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider

    Abstract: Despite extensive research on parsing of English sentences into Abstraction Meaning Representation (AMR) graphs, which are compared to gold graphs via the Smatch metric, full-document parsing into a unified graph representation lacks well-defined representation and evaluation. Taking advantage of a super-sentential level of coreference annotation from previous work, we introduce a simple algorithm… ▽ More

    Submitted 6 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    MSC Class: I.2.7

  13. arXiv:2105.10023  [pdf, other

    cs.CL

    ASQ: Automatically Generating Question-Answer Pairs using AMRs

    Authors: Geetanjali Rakshit, Jeffrey Flanigan

    Abstract: We introduce ASQ, a tool to automatically mine questions and answers from a sentence using the Abstract Meaning Representation (AMR). Previous work has used question-answer pairs to specify the predicate-argument structure of a sentence using natural language, which does not require linguistic expertise or training, and created datasets such as QA-SRL and QAMR, for which the question-answer pair a… ▽ More

    Submitted 20 August, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

  14. arXiv:2011.10683  [pdf, other

    cs.CL

    Athena: Constructing Dialogues Dynamically with Discourse Constraints

    Authors: Vrindavan Harrison, Juraj Juraska, Wen Cui, Lena Reed, Kevin K. Bowden, Jiaqi Wu, Brian Schwarzmann, Abteen Ebrahimi, Rishi Rajasekaran, Nikhil Varghese, Max Wechsler-Azen, Steve Whittaker, Jeffrey Flanigan, Marilyn Walker

    Abstract: This report describes Athena, a dialogue system for spoken conversation on popular topics and current events. We develop a flexible topic-agnostic approach to dialogue management that dynamically configures dialogue based on general principles of entity and topic coherence. Athena's dialogue manager uses a contract-based method where discourse constraints are dispatched to clusters of response gen… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: 3rd Proceedings of Alexa Prize (Alexa Prize 2019)

  15. arXiv:1905.06939  [pdf, other

    cs.CL cs.LG

    The Materials Science Procedural Text Corpus: Annotating Materials Synthesis Procedures with Shallow Semantic Structures

    Authors: Sheshera Mysore, Zach Jensen, Edward Kim, Kevin Huang, Haw-Shiuan Chang, Emma Strubell, Jeffrey Flanigan, Andrew McCallum, Elsa Olivetti

    Abstract: Materials science literature contains millions of materials synthesis procedures described in unstructured natural language text. Large-scale analysis of these synthesis procedures would facilitate deeper scientific understanding of materials synthesis and enable automated synthesis planning. Such analysis requires extracting structured representations of synthesis procedures from the raw text as… ▽ More

    Submitted 13 July, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted as a long paper at the Linguistic Annotation Workshop (LAW) at ACL 2019

  16. arXiv:1805.10399  [pdf, other

    cs.CL

    Toward Abstractive Summarization Using Semantic Representations

    Authors: Fei Liu, Jeffrey Flanigan, Sam Thomson, Norman Sadeh, Noah A. Smith

    Abstract: We present a novel abstractive summarization framework that draws on the recent development of a treebank for the Abstract Meaning Representation (AMR). In this framework, the source text is parsed to a set of AMR graphs, the graphs are transformed into a summary graph, and then text is generated from the summary graph. We focus on the graph-to-graph transformation that reduces the source semantic… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: 10 pages