Skip to main content

Showing 1–50 of 114 results for author: Klein, D

  1. arXiv:2407.00640  [pdf, other

    cs.CE

    Physics-augmented neural networks for constitutive modeling of hyperelastic geometrically exact beams

    Authors: Jasper O. Schommartz, Dominik K. Klein, Juan C. Alzate Cobo, Oliver Weeger

    Abstract: We present neural network-based constitutive models for hyperelastic geometrically exact beams. The proposed models are physics-augmented, i.e., formulated to fulfill important mechanical conditions by construction. Strains and curvatures of the beam are used as input for feed-forward neural networks that represent the effective hyperelastic beam potential. Forces and moments are then received as… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.18872  [pdf, other

    cs.CL

    Efficacy of Language Model Self-Play in Non-Zero-Sum Games

    Authors: Austen Liao, Nicholas Tomlin, Dan Klein

    Abstract: Game-playing agents like AlphaGo have achieved superhuman performance through self-play, which is theoretically guaranteed to yield optimal policies in competitive games. However, most language tasks are partially or fully cooperative, so it is an open question whether techniques like self-play can effectively be used to improve language models. We empirically investigate this question in a negoti… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.08818  [pdf, other

    cs.CL cs.CY

    Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

    Authors: Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein

    Abstract: We present a large-scale study of linguistic bias exhibited by ChatGPT covering ten dialects of English (Standard American English, Standard British English, and eight widely spoken non-"standard" varieties from around the world). We prompted GPT-3.5 Turbo and GPT-4 with text by native speakers of each variety and analyzed the responses via detailed linguistic feature annotation and native speaker… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.04024  [pdf, other

    cs.CL cs.AI

    American Sign Language Handshapes Reflect Pressures for Communicative Efficiency

    Authors: Kayo Yin, Terry Regier, Dan Klein

    Abstract: Communicative efficiency is a key topic in linguistics and cognitive psychology, with many studies demonstrating how the pressure to communicate with minimal effort guides the form of natural language. However, this phenomenon is rarely explored in signed languages. This paper shows how handshapes in American Sign Language (ASL) reflect these efficiency pressures and provides new evidence of commu… ▽ More

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  5. arXiv:2405.15784  [pdf, other

    cs.IR cs.AI cs.CL

    CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

    Authors: Yizhou Chi, Jessy Lin, Kevin Lin, Dan Klein

    Abstract: Users often make ambiguous requests that require clarification. We study the problem of asking clarification questions in an information retrieval setting, where systems often face ambiguous search queries and it is challenging to turn the uncertainty in the retrieval model into a natural language question. We present CLARINET, a system that asks informative clarification questions by choosing que… ▽ More

    Submitted 28 April, 2024; originally announced May 2024.

  6. arXiv:2405.05860  [pdf, other

    cs.LG cs.CL cs.CY

    The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels

    Authors: Eve Fleisig, Su Lin Blodgett, Dan Klein, Zeerak Talat

    Abstract: Longstanding data labeling practices in machine learning involve collecting and aggregating labels from multiple annotators. But what should we do when annotators disagree? Though annotator disagreement has long been seen as a problem to minimize, new perspectivist approaches challenge this assumption by treating disagreement as a valuable source of information. In this position paper, we examine… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  7. arXiv:2405.03689  [pdf, other

    cs.CV cs.CL

    Pose Priors from Language Models

    Authors: Sanjay Subramanian, Evonne Ng, Lea Müller, Dan Klein, Shiry Ginosar, Trevor Darrell

    Abstract: We present a zero-shot pose optimization method that enforces accurate physical contact constraints when estimating the 3D pose of humans. Our central insight is that since language is often used to describe physical interaction, large pretrained text-based models can act as priors on pose estimation. We can thus leverage this insight to improve pose estimation by converting natural language des… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  8. arXiv:2405.00144  [pdf

    cs.HC

    Greater benefits of deep learning-based computer-aided detection systems for finding small signals in 3D volumetric medical images

    Authors: Devi Klein, Srijita Karmakar, Aditya Jonnalagadda, Craig K. Abbey, Miguel P. Eckstein

    Abstract: Purpose: Radiologists are tasked with visually scrutinizing large amounts of data produced by 3D volumetric imaging modalities. Small signals can go unnoticed during the 3d search because they are hard to detect in the visual periphery. Recent advances in machine learning and computer vision have led to effective computer-aided detection (CADe) support systems with the potential to mitigate percep… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  9. arXiv:2404.05966  [pdf, other

    cs.CL cs.AI

    THOUGHTSCULPT: Reasoning with Intermediate Revision and Search

    Authors: Yizhou Chi, Kevin Yang, Dan Klein

    Abstract: We present THOUGHTSCULPT, a general reasoning and search method for tasks with outputs that can be decomposed into components. THOUGHTSCULPT explores a search tree of potential solutions using Monte Carlo Tree Search (MCTS), building solutions one action at a time and evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action s… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Code and data available at https://github.com/cyzus/thoughtsculpt

  10. arXiv:2402.11782  [pdf, other

    cs.CL cs.LG

    What Evidence Do Language Models Find Convincing?

    Authors: Alexander Wan, Eric Wallace, Dan Klein

    Abstract: Retrieval-augmented language models are being increasingly tasked with subjective, contentious, and conflicting queries such as "is aspartame linked to cancer". To resolve these ambiguous queries, one must search through a large range of websites and consider "which, if any, of this evidence do I find convincing?". In this work, we study how LLMs answer this question. In particular, we construct C… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  11. arXiv:2402.08309  [pdf, other

    cs.LG cs.CL cs.CR

    Prompted Contextual Vectors for Spear-Phishing Detection

    Authors: Daniel Nahmias, Gal Engelberg, Dan Klein, Asaf Shabtai

    Abstract: Spear-phishing attacks present a significant security challenge, with large language models (LLMs) escalating the threat by generating convincing emails and facilitating target reconnaissance. To address this, we propose a detection approach based on a novel document vectorization method that utilizes an ensemble of LLMs to create representation vectors. By prompting LLMs to reason and respond to… ▽ More

    Submitted 14 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    ACM Class: I.2.7

  12. arXiv:2402.07007  [pdf, other

    cs.CE

    Nonlinear electro-elastic finite element analysis with neural network constitutive models

    Authors: Dominik K. Klein, Rogelio Ortigosa, Jesús Martínez-Frutos, Oliver Weeger

    Abstract: In the present work, the applicability of physics-augmented neural network (PANN) constitutive models for complex electro-elastic finite element analysis is demonstrated. For the investigations, PANN models for electro-elastic material behavior at finite deformations are calibrated to different synthetically generated datasets, including an analytical isotropic potential, a homogenised rank-one la… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  13. arXiv:2311.15100  [pdf, other

    cs.CV cs.AI cs.LG

    Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation

    Authors: Luca Eyring, Dominik Klein, Théo Uscidda, Giovanni Palla, Niki Kilbertus, Zeynep Akata, Fabian Theis

    Abstract: In optimal transport (OT), a Monge map is known as a mapping that transports a source distribution to a target distribution in the most cost-efficient way. Recently, multiple neural estimators for Monge maps have been developed and applied in diverse unpaired domain translation tasks, e.g. in single-cell biology and computer vision. However, the classic OT framework enforces mass conservation, whi… ▽ More

    Submitted 11 March, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: ICLR 2024

  14. arXiv:2311.06694  [pdf, other

    cs.CL cs.AI cs.CV cs.RO

    Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding

    Authors: Chancharik Mitra, Abrar Anwar, Rodolfo Corona, Dan Klein, Trevor Darrell, Jesse Thomason

    Abstract: When connecting objects and their language referents in an embodied 3D environment, it is important to note that: (1) an object can be better characterized by leveraging comparative information between itself and other objects, and (2) an object's appearance can vary with camera position. As such, we present the Multi-view Approach to Grounding in Context (MAGiC), which selects an object referent… ▽ More

    Submitted 6 April, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Journal ref: North American Chapter of the Association for Computational Linguistics (NAACL), 2024

  15. arXiv:2311.04459  [pdf, other

    cs.CL cs.AI

    Improving Pacing in Long-Form Story Planning

    Authors: Yichen Wang, Kevin Yang, Xiaoming Liu, Dan Klein

    Abstract: Existing LLM-based systems for writing long-form stories or story outlines frequently suffer from unnatural pacing, whether glossing over important events or over-elaborating on insignificant details, resulting in a jarring experience for the reader. We propose a CONCrete Outline ConTrol (CONCOCT) system to improve pacing when automatically generating story outlines. We first train a concreteness… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: EMNLP Findings 2023

  16. arXiv:2311.02802  [pdf, other

    cs.CL cs.AI

    Incorporating Worker Perspectives into MTurk Annotation Practices for NLP

    Authors: Olivia Huang, Eve Fleisig, Dan Klein

    Abstract: Current practices regarding data collection for natural language processing on Amazon Mechanical Turk (MTurk) often rely on a combination of studies on data quality and heuristics shared among NLP researchers. However, without considering the perspectives of MTurk workers, these approaches are susceptible to issues regarding workers' rights and poor response quality. We conducted a critical litera… ▽ More

    Submitted 15 November, 2023; v1 submitted 5 November, 2023; originally announced November 2023.

  17. arXiv:2310.09254  [pdf, other

    stat.ML cs.LG

    Entropic (Gromov) Wasserstein Flow Matching with GENOT

    Authors: Dominik Klein, Théo Uscidda, Fabian Theis, Marco Cuturi

    Abstract: Optimal transport (OT) theory has reshaped the field of generative modeling: Combined with neural networks, recent \textit{Neural OT} (N-OT) solvers use OT as an inductive bias, to focus on ``thrifty'' mappings that minimize average displacement costs. This core principle has fueled the successful application of N-OT solvers to high-stakes scientific challenges, notably single-cell genomics. N-OT… ▽ More

    Submitted 12 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  18. arXiv:2308.10897  [pdf, other

    cs.CV

    Can Language Models Learn to Listen?

    Authors: Evonne Ng, Sanjay Subramanian, Dan Klein, Angjoo Kanazawa, Trevor Darrell, Shiry Ginosar

    Abstract: We present a framework for generating appropriate facial responses from a listener in dyadic social interactions based on the speaker's words. Given an input transcription of the speaker's words with their timestamps, our approach autoregressively predicts a response of a listener: a sequence of listener facial gestures, quantized using a VQ-VAE. Since gesture is a language component, we propose t… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: ICCV 2023; Project page: https://people.eecs.berkeley.edu/~evonne_ng/projects/text2listen/

  19. arXiv:2308.01399  [pdf, other

    cs.CL cs.AI cs.LG

    Learning to Model the World with Language

    Authors: Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca Dragan

    Abstract: To interact with humans and act in the world, agents need to understand the range of language that people use and relate it to the visual world. While current agents can learn to execute simple language instructions, we aim to build agents that leverage diverse language -- language like "this button turns on the TV" or "I put the bowls away" -- that conveys general knowledge, describes the state o… ▽ More

    Submitted 31 May, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: ICML 2024. Website: https://dynalang.github.io/

  20. arXiv:2307.12950  [pdf, other

    cs.CL cs.AI

    RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment

    Authors: Kevin Yang, Dan Klein, Asli Celikyilmaz, Nanyun Peng, Yuandong Tian

    Abstract: We propose Reinforcement Learning from Contrastive Distillation (RLCD), a method for aligning language models to follow principles expressed in natural language (e.g., to be more harmless) without using human feedback. RLCD creates preference pairs from two contrasting model outputs, one using a positive prompt designed to encourage following the given principles, and one using a negative prompt d… ▽ More

    Submitted 16 March, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: ICLR 2024

  21. arXiv:2307.03463  [pdf, ps, other

    cs.CE

    Parametrised polyconvex hyperelasticity with physics-augmented neural networks

    Authors: Dominik K. Klein, Fabian J. Roth, Iman Valizadeh, Oliver Weeger

    Abstract: In the present work, neural networks are applied to formulate parametrised hyperelastic constitutive models. The models fulfill all common mechanical conditions of hyperelasticity by construction. In particular, partially input-convex neural network (pICNN) architectures are applied based on feed-forward neural networks. Receiving two different sets of input arguments, pICNNs are convex in one of… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  22. arXiv:2307.03214  [pdf, other

    cs.CL

    PREADD: Prefix-Adaptive Decoding for Controlled Text Generation

    Authors: Jonathan Pei, Kevin Yang, Dan Klein

    Abstract: We propose Prefix-Adaptive Decoding (PREADD), a flexible method for controlled text generation. Unlike existing methods that use auxiliary expert models to control for attributes, PREADD does not require an external model, instead relying on linearly combining output logits from multiple prompts. Specifically, PREADD contrasts the output logits generated using a raw prompt against those generated… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: ACL Findings 2023

  23. arXiv:2306.09866  [pdf, ps, other

    cs.CE

    Advanced discretization techniques for hyperelastic physics-augmented neural networks

    Authors: Marlon Franke, Dominik K. Klein, Oliver Weeger, Peter Betsch

    Abstract: In the present work, advanced spatial and temporal discretization techniques are tailored to hyperelastic physics-augmented neural networks, i.e., neural network based constitutive models which fulfill all relevant mechanical conditions of hyperelasticity by construction. The framework takes into account the structure of neural network-based constitutive models, in particular, that their derivativ… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  24. arXiv:2306.05392  [pdf, other

    cs.CL

    Modular Visual Question Answering via Code Generation

    Authors: Sanjay Subramanian, Medhini Narasimhan, Kushal Khangaonkar, Kevin Yang, Arsha Nagrani, Cordelia Schmid, Andy Zeng, Trevor Darrell, Dan Klein

    Abstract: We present a framework that formulates visual question answering as modular code generation. In contrast to prior work on modular approaches to VQA, our approach requires no additional training and relies on pre-trained language models (LMs), visual models pre-trained on image-caption pairs, and fifty VQA examples used for in-context learning. The generated Python programs invoke and compose the o… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  25. arXiv:2306.01691  [pdf, other

    cs.GR cs.HC

    The Influence of Variable Frame Timing on First-Person Gaming

    Authors: Devi Klein, Josef Spjut, Ben Boudaoud, Joohwan Kim

    Abstract: Variable frame timing (VFT), or changes in the time intervals between discrete frame images displayed to users, deviates from our traditional conceptualization of frame rate in which all frame times are equal. With the advent of variable refresh rate (VRR) monitor technologies, gamers experience VFT at the display. VRR, coupled with increased display refresh rates and high-end hardware, enables sm… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 12 pages, 9 figures

  26. arXiv:2306.01058  [pdf, other

    cs.CL

    Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents

    Authors: Catherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey, Kyle Lo

    Abstract: Recent work has shown that infusing layout features into language models (LMs) improves processing of visually-rich documents such as scientific papers. Layout-infused LMs are often evaluated on documents with familiar layout features (e.g., papers from the same publisher), but in practice models encounter documents with unfamiliar distributions of layout features, such as new combinations of text… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: To appear in ACL Findings 2023

  27. arXiv:2305.15053  [pdf, other

    cs.CL cs.IR

    Decomposing Complex Queries for Tip-of-the-tongue Retrieval

    Authors: Kevin Lin, Kyle Lo, Joseph E. Gonzalez, Dan Klein

    Abstract: When re-finding items, users who forget or are uncertain about identifying details often rely on creative strategies for expressing their information needs -- complex queries that describe content elements (e.g., book characters or events), information beyond the document text (e.g., descriptions of book covers), or personal context (e.g., when they read a book). This retrieval setting, called tip… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  28. arXiv:2305.15047  [pdf, other

    cs.CL cs.AI

    Ghostbuster: Detecting Text Ghostwritten by Large Language Models

    Authors: Vivek Verma, Eve Fleisig, Nicholas Tomlin, Dan Klein

    Abstract: We introduce Ghostbuster, a state-of-the-art system for detecting AI-generated text. Our method works by passing documents through a series of weaker language models, running a structured search over possible combinations of their features, and then training a classifier on the selected features to predict whether documents are AI-generated. Crucially, Ghostbuster does not require access to token… ▽ More

    Submitted 5 April, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NAACL 2024

  29. arXiv:2305.14735  [pdf, other

    cs.CL cs.AI cs.LG

    Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection

    Authors: Vyoma Raman, Eve Fleisig, Dan Klein

    Abstract: The impact of AI models on marginalized communities has traditionally been measured by identifying performance differences between specified demographic subgroups. Though this approach aims to center vulnerable groups, it risks obscuring patterns of harm faced by intersectional subgroups or shared across multiple groups. To address this, we draw on theories of marginalization from disability studi… ▽ More

    Submitted 1 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  30. arXiv:2305.12084  [pdf, other

    cs.CL

    Revisiting Entropy Rate Constancy in Text

    Authors: Vivek Verma, Nicholas Tomlin, Dan Klein

    Abstract: The uniform information density (UID) hypothesis states that humans tend to distribute information roughly evenly across an utterance or discourse. Early evidence in support of the UID hypothesis came from Genzel & Charniak (2002), which proposed an entropy rate constancy principle based on the probability of English text under n-gram language models. We re-evaluate the claims of Genzel & Charniak… ▽ More

    Submitted 17 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  31. arXiv:2305.06626  [pdf, other

    cs.CL cs.AI

    When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks

    Authors: Eve Fleisig, Rediet Abebe, Dan Klein

    Abstract: Though majority vote among annotators is typically used for ground truth labels in natural language processing, annotator disagreement in tasks such as hate speech detection may reflect differences in opinion across groups, not noise. Thus, a crucial problem in hate speech detection is determining whether a statement is offensive to the demographic group that it targets, when that group may consti… ▽ More

    Submitted 17 March, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

  32. arXiv:2305.00944  [pdf, other

    cs.CL cs.CR cs.LG

    Poisoning Language Models During Instruction Tuning

    Authors: Alexander Wan, Eric Wallace, Sheng Shen, Dan Klein

    Abstract: Instruction-tuned LMs such as ChatGPT, FLAN, and InstructGPT are finetuned on datasets that contain user-submitted examples, e.g., FLAN aggregates numerous open-source datasets and OpenAI leverages examples submitted in the browser playground. In this work, we show that adversaries can contribute poison examples to these datasets, allowing them to manipulate model predictions whenever a desired tr… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  33. arXiv:2302.14233  [pdf, other

    cs.CL cs.AI cs.LG

    Goal Driven Discovery of Distributional Differences via Language Descriptions

    Authors: Ruiqi Zhong, Peter Zhang, Steve Li, Jinwoo Ahn, Dan Klein, Jacob Steinhardt

    Abstract: Mining large corpora can generate useful discoveries but is time-consuming for humans. We formulate a new task, D5, that automatically discovers differences between two large corpora in a goal-driven way. The task input is a problem comprising a research goal "$\textit{comparing the side effects of drug A and drug B}$" and a corpus pair (two large collections of patients' self-reported reactions a… ▽ More

    Submitted 24 October, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  34. Neural networks meet hyperelasticity: A guide to enforcing physics

    Authors: Lennart Linden, Dominik K. Klein, Karl A. Kalina, Jörg Brummund, Oliver Weeger, Markus Kästner

    Abstract: In the present work, a hyperelastic constitutive model based on neural networks is proposed which fulfills all common constitutive conditions by construction, and in particular, is applicable to compressible material behavior. Using different sets of invariants as inputs, a hyperelastic potential is formulated as a convex neural network, thus fulfilling symmetry of the stress tensor, objectivity,… ▽ More

    Submitted 6 July, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Journal ref: Journal of the Mechanics and Physics of Solids (2023)

  35. Towards an Ontology-Driven Approach for Process-Aware Risk Propagation

    Authors: Gal Engelberg, Mattia Fumagalli, Adrian Kuboszek, Dan Klein, Pnina Soffer, Giancarlo Guizzardi

    Abstract: The rapid development of cyber-physical systems creates an increasing demand for a general approach to risk, especially considering how physical and digital components affect the processes of the system itself. In risk analytics and management, risk propagation is a central technique, which allows the calculation of the cascading effect of risk within a system and supports risk mitigation activiti… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  36. arXiv:2212.10564  [pdf, other

    cs.CL cs.AI cs.LG

    Re-evaluating the Need for Multimodal Signals in Unsupervised Grammar Induction

    Authors: Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein

    Abstract: Are multimodal inputs necessary for grammar induction? Recent work has shown that multimodal training inputs can improve grammar induction. However, these improvements are based on comparisons to weak text-only baselines that were trained on relatively little textual data. To determine whether multimodal inputs are needed in regimes with large amounts of textual training data, we design a stronger… ▽ More

    Submitted 12 April, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: NAACL Findings 2024

  37. arXiv:2212.10077  [pdf, other

    cs.CL cs.AI

    DOC: Improving Long Story Coherence With Detailed Outline Control

    Authors: Kevin Yang, Dan Klein, Nanyun Peng, Yuandong Tian

    Abstract: We propose the Detailed Outline Control (DOC) framework for improving long-range plot coherence when automatically generating several-thousand-word-long stories. DOC consists of two complementary components: a detailed outliner and a detailed controller. The detailed outliner creates a more detailed, hierarchically structured outline, shifting creative burden from the main drafting procedure to th… ▽ More

    Submitted 14 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  38. arXiv:2212.03827  [pdf, other

    cs.CL cs.AI cs.LG

    Discovering Latent Knowledge in Language Models Without Supervision

    Authors: Collin Burns, Haotian Ye, Dan Klein, Jacob Steinhardt

    Abstract: Existing techniques for training language models can be misaligned with the truth: if we train models with imitation learning, they may reproduce errors that humans make; if we train them to generate text that humans rate highly, they may output errors that human evaluators can't detect. We propose circumventing this issue by directly finding latent knowledge inside the internal activations of a l… ▽ More

    Submitted 2 March, 2024; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: ICLR 2023

  39. arXiv:2211.08684  [pdf, other

    cs.CL

    Neural Unsupervised Reconstruction of Protolanguage Word Forms

    Authors: Andre He, Nicholas Tomlin, Dan Klein

    Abstract: We present a state-of-the-art neural approach to the unsupervised reconstruction of ancient word forms. Previous work in this domain used expectation-maximization to predict simple phonological changes between ancient word forms and their cognates in modern languages. We extend this work with neural models that can capture more complicated phonological and morphological changes. At the same time,… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  40. arXiv:2210.06774  [pdf, other

    cs.CL cs.AI

    Re3: Generating Longer Stories With Recursive Reprompting and Revision

    Authors: Kevin Yang, Yuandong Tian, Nanyun Peng, Dan Klein

    Abstract: We consider the problem of automatically generating longer stories of over two thousand words. Compared to prior work on shorter stories, long-range plot coherence and relevance are more central challenges here. We propose the Recursive Reprompting and Revision framework (Re3) to address these challenges by (a) prompting a general-purpose language model to construct a structured overarching plan,… ▽ More

    Submitted 21 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  41. arXiv:2209.15189  [pdf, other

    cs.CL cs.AI

    Learning by Distilling Context

    Authors: Charlie Snell, Dan Klein, Ruiqi Zhong

    Abstract: Language models significantly benefit from context tokens, such as prompts or scratchpads. They perform better when prompted with informative instructions, and they acquire new reasoning capabilities by generating a scratch-pad before predicting the final answers. However, they do not \textit{internalize} these performance gains, which disappear when the context tokens are gone. Our work proposes… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  42. arXiv:2209.07800  [pdf, other

    cs.CL

    The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction and Constrained Decoding

    Authors: Hao Fang, Anusha Balakrishnan, Harsh Jhamtani, John Bufe, Jean Crawford, Jayant Krishnamurthy, Adam Pauls, Jason Eisner, Jacob Andreas, Dan Klein

    Abstract: In a real-world dialogue system, generated text must be truthful and informative while remaining fluent and adhering to a prescribed style. Satisfying these constraints simultaneously is difficult for the two predominant paradigms in language generation: neural language modeling and rule-based generation. We describe a hybrid architecture for dialogue response generation that combines the strength… ▽ More

    Submitted 26 May, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Findings of ACL 2023

  43. arXiv:2209.01506  [pdf

    cs.LG cs.AI

    Neural Networks for Chess

    Authors: Dominik Klein

    Abstract: AlphaZero, Leela Chess Zero and Stockfish NNUE revolutionized Computer Chess. This book gives a complete introduction into the technical inner workings of such engines. The book is split into four main chapters -- excluding chapter 1 (introduction) and chapter 6 (conclusion): Chapter 2 introduces neural networks and covers all the basic building blocks that are used to build deep networks such as… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

  44. Finite electro-elasticity with physics-augmented neural networks

    Authors: Dominik K. Klein, Rogelio Ortigosa, Jesús Martínez-Frutos, Oliver Weeger

    Abstract: In the present work, a machine learning based constitutive model for electro-mechanically coupled material behavior at finite deformations is proposed. Using different sets of invariants as inputs, an internal energy density is formulated as a convex neural network. In this way, the model fulfills the polyconvexity condition which ensures material stability, as well as thermodynamic consistency, o… ▽ More

    Submitted 27 August, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

  45. arXiv:2205.12422  [pdf, other

    cs.CL cs.AI cs.PL

    Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL

    Authors: Ruiqi Zhong, Charlie Snell, Dan Klein, Jason Eisner

    Abstract: Can non-programmers annotate natural language utterances with complex programs that represent their meaning? We introduce APEL, a framework in which non-programmers select among candidate programs generated by a seed semantic parser (e.g., Codex). Since they cannot understand the candidate programs, we ask them to select indirectly by examining the programs' input-ouput examples. For each utteranc… ▽ More

    Submitted 23 October, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

  46. arXiv:2205.09710  [pdf, other

    cs.CL cs.AI cs.CV

    Voxel-informed Language Grounding

    Authors: Rodolfo Corona, Shizhan Zhu, Dan Klein, Trevor Darrell

    Abstract: Natural language applied to natural 2D images describes a fundamentally 3D world. We present the Voxel-informed Language Grounder (VLG), a language grounding model that leverages 3D geometric information in the form of voxel maps derived from the visual input using a volumetric reconstruction model. We show that VLG significantly improves grounding accuracy on SNARE, an object reference game task.… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  47. arXiv:2205.09665  [pdf, other

    cs.CL

    Automated Crossword Solving

    Authors: Eric Wallace, Nicholas Tomlin, Albert Xu, Kevin Yang, Eshaan Pathak, Matthew Ginsberg, Dan Klein

    Abstract: We present the Berkeley Crossword Solver, a state-of-the-art approach for automatically solving crossword puzzles. Our system works by generating answer candidates for each crossword clue using neural question answering models and then combines loopy belief propagation with local search to find full puzzle solutions. Compared to existing approaches, our system improves exact puzzle accuracy from 7… ▽ More

    Submitted 3 July, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  48. arXiv:2204.07531  [pdf, other

    cs.CL cs.AI

    Understanding Game-Playing Agents with Natural Language Annotations

    Authors: Nicholas Tomlin, Andre He, Dan Klein

    Abstract: We present a new dataset containing 10K human-annotated games of Go and show how these natural language annotations can be used as a tool for model interpretability. Given a board state and its associated comment, our approach uses linear probing to predict mentions of domain-specific terms (e.g., ko, atari) from the intermediate state representations of game-playing agents like AlphaGo Zero. We f… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

  49. arXiv:2204.02515  [pdf, other

    cs.CL cs.AI

    Inferring Rewards from Language in Context

    Authors: Jessy Lin, Daniel Fried, Dan Klein, Anca Dragan

    Abstract: In classic instruction following, language like "I'd like the JetBlue flight" maps to actions (e.g., selecting that flight). However, language also conveys information about a user's underlying reward function (e.g., a general preference for JetBlue), which can allow a model to carry out desirable actions in new contexts. We present a model that infers rewards from language pragmatically: reasonin… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: ACL 2022. Code and dataset: https://github.com/jlin816/rewards-from-language

  50. arXiv:2201.12323  [pdf, other

    cs.CL cs.AI cs.LG

    Describing Differences between Text Distributions with Natural Language

    Authors: Ruiqi Zhong, Charlie Snell, Dan Klein, Jacob Steinhardt

    Abstract: How do two distributions of texts differ? Humans are slow at answering this, since discovering patterns might require tediously reading through hundreds of samples. We propose to automatically summarize the differences by "learning a natural language hypothesis": given two distributions $D_{0}$ and $D_{1}$, we search for a description that is more often true for $D_{1}$, e.g., "is military-related… ▽ More

    Submitted 18 May, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: International Conference on Machine Learning, 2022