Showing 1–16 of 16 results for author: Valvoda, J

  1. arXiv:2406.10203  [pdf, other]

    cs.CL

    A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors

    Authors: Naaman Tan, Josef Valvoda, Anej Svete, Tianyu Liu, Yanxia Qin, Kan Min-Yen, Ryan Cotterell

    Abstract: The relationship between the quality of a string and its probability $p(\boldsymbol{y})$ under a language model has been influential in the development of techniques to build good text generation systems. For example, several decoding algorithms have been motivated to manipulate $p(\boldsymbol{y})$ to produce higher-quality text. In this work, we examine the probability–quality relationship in la…

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.04289  [pdf, other]

    cs.CL

    What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages

    Authors: Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak, Isabelle Augenstein, Eleanor Chodroff, Ryan Cotterell

    Abstract: What can large language models learn? By definition, language models (LM) are distributions over strings. Therefore, an intuitive way of addressing the above question is to formalize it as a matter of learnability of classes of distributions over strings. While prior work in this direction focused on assessing the theoretical limits, in contrast, we seek to understand the empirical learnability. U…

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  3. arXiv:2403.16852  [pdf, other]

    cs.CL cs.AI

    Towards Explainability in Legal Outcome Prediction Models

    Authors: Josef Valvoda, Ryan Cotterell

    Abstract: Current legal outcome prediction models - a staple of legal NLP - do not explain their reasoning. However, to employ these models in the real world, human legal actors need to be able to understand the model's decisions. In the case of common law, legal practitioners reason towards the outcome of a case by referring to past case law, known as precedent. We contend that precedent is, therefore, a n…

    Submitted 15 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2312.00584  [pdf, other]

    cs.CL cs.AI

    The Ethics of Automating Legal Actors

    Authors: Josef Valvoda, Alec Thompson, Ryan Cotterell, Simone Teufel

    Abstract: The introduction of large public legal datasets has brought about a renaissance in legal NLP. Many of these datasets are comprised of legal judgements - the product of judges deciding cases. This fact, together with the way machine learning works, means that several legal NLP models are models of judges. While some have argued for the automation of judges, in this position piece, we argue that aut…

    Submitted 1 December, 2023; originally announced December 2023.

  5. arXiv:2211.06420  [pdf, other]

    cs.CL cs.LG

    The Architectural Bottleneck Principle

    Authors: Tiago Pimentel, Josef Valvoda, Niklas Stoehr, Ryan Cotterell

    Abstract: In this paper, we seek to measure how much information a component in a neural network could extract from the representations fed into it. Our work stands in contrast to prior probing work, most of which investigates how much information a model's representations contain. This shift in perspective leads us to propose a new principle for probing, the architectural bottleneck principle: In order to…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022. Tiago Pimentel and Josef Valvoda contributed equally to this work. Code available in https://github.com/rycolab/attentional-probe

  6. arXiv:2210.03971  [pdf, other]

    cs.LG stat.AP

    An Ordinal Latent Variable Model of Conflict Intensity

    Authors: Niklas Stoehr, Lucas Torroba Hennigen, Josef Valvoda, Robert West, Ryan Cotterell, Aaron Schein

    Abstract: Measuring the intensity of events is crucial for monitoring and tracking armed conflict. Advances in automated event extraction have yielded massive data sets of "who did what to whom" micro-records that enable data-driven approaches to monitoring conflict. The Goldstein scale is a widely-used expert-based measure that scores events on a conflictual-cooperative scale. It is based only on the actio…

    Submitted 4 June, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Long Paper at ACL 2023

  7. arXiv:2209.11068  [pdf, other]

    cs.CL

    Prompting for a conversation: How to control a dialog model?

    Authors: Josef Valvoda, Yimai Fang, David Vandyke

    Abstract: Dialog modelling faces a difficult trade-off. Models are trained on a large amount of text, yet their responses need to be limited to a desired scope and style of a dialog agent. Because the datasets used to achieve the former contain language that is not compatible with the latter, pre-trained dialog models are fine-tuned on smaller curated datasets. However, the fine-tuning process robs them of…

    Submitted 22 September, 2022; originally announced September 2022.

  8. arXiv:2208.08225  [pdf, other]

    cs.CY cs.CL

    On the Role of Negative Precedent in Legal Outcome Prediction

    Authors: Josef Valvoda, Ryan Cotterell, Simone Teufel

    Abstract: Every legal case sets a precedent by developing the law in one of the following two ways. It either expands its scope, in which case it sets positive precedent, or it narrows it, in which case it sets negative precedent. Legal outcome prediction, the prediction of positive outcome, is an increasingly popular task in AI. In contrast, we turn our focus to negative outcomes here, and introduce a new…

    Submitted 6 October, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  9. arXiv:2208.08195  [pdf, other]

    cs.CL

    Benchmarking Compositionality with Formal Languages

    Authors: Josef Valvoda, Naomi Saphra, Jonathan Rawski, Adina Williams, Ryan Cotterell

    Abstract: Recombining known primitive concepts into larger novel combinations is a quintessentially human cognitive capability. Whether large neural models in NLP can acquire this ability while learning from data is an open question. In this paper, we investigate this problem from the perspective of formal languages. We use deterministic finite-state transducers to make an unbounded number of datasets with…

    Submitted 1 August, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: Published at COLING 2022. This version fixes a mistake in Figure 4 and adds a clarifying note in teal. Code is available at https://github.com/valvoda/neuralTransducer

  10. arXiv:2205.03608  [pdf, other]

    cs.CL

    UniMorph 4.0: Universal Morphology

    Authors: Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay , et al. (71 additional authors not shown)

    Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This pa…

    Submitted 19 June, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: LREC 2022; The first two authors made equal contributions

  11. arXiv:2111.04158  [pdf, other]

    cs.CL cs.AI

    A Word on Machine Ethics: A Response to Jiang et al. (2021)

    Authors: Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams

    Abstract: Ethics is one of the longest standing intellectual endeavors of humanity. In recent years, the fields of AI and NLP have attempted to wrangle with how learning systems that interact with humans should be constrained to behave ethically. One proposal in this vein is the construction of morality models that can take in arbitrary text and output a moral judgment about the situation described. In this…

    Submitted 7 November, 2021; originally announced November 2021.

    Comments: 11 pages, 2 figures, submitting soon to ACL Rolling Review

  12. arXiv:2104.12133  [pdf, other]

    cs.CY

    What About the Precedent: An Information-Theoretic Analysis of Common Law

    Authors: Josef Valvoda, Tiago Pimentel, Niklas Stoehr, Ryan Cotterell, Simone Teufel

    Abstract: In common law, the outcome of a new case is determined mostly by precedent cases, rather than by existing statutes. However, how exactly does the precedent influence the outcome of a new case? Answering this question is crucial for guaranteeing fair and consistent judicial decision-making. We are the first to approach this question computationally by comparing two longstanding jurisprudential view…

    Submitted 25 April, 2021; originally announced April 2021.

  13. arXiv:2011.06306  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Analyzing Neural Discourse Coherence Models

    Authors: Youmna Farag, Josef Valvoda, Helen Yannakoudakis, Ted Briscoe

    Abstract: In this work, we systematically investigate how well current models of coherence can capture aspects of text implicated in discourse organisation. We devise two datasets of various linguistic alterations that undermine coherence and test model sensitivity to changes in syntax and semantics. We furthermore probe discourse embedding space and examine the knowledge that is encoded in representations…

    Submitted 12 November, 2020; originally announced November 2020.

    Journal ref: CODI workshop at EMNLP 2020

  14. arXiv:2006.11572  [pdf, other]

    cs.CL

    SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection

    Authors: Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J. Mielke, Shijie Wu, Edoardo Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff , et al. (3 additional authors not shown)

    Abstract: A broad goal in natural language processing (NLP) is to develop a system that has the capacity to process any natural language. Most systems, however, are developed using data from just one language such as English. The SIGMORPHON 2020 shared task on morphological reinflection aims to investigate systems' ability to generalize across typologically distinct languages, many of which are low resource…

    Submitted 14 July, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: 39 pages, SIGMORPHON

  15. arXiv:2005.01641  [pdf, other]

    cs.CL

    A Tale of a Probe and a Parser

    Authors: Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams, Ryan Cotterell

    Abstract: Measuring what linguistic information is encoded in neural models of language has become popular in NLP. Researchers approach this enterprise by training "probes" - supervised models designed to extract linguistic structure from another model's output. One such probe is the structural probe (Hewitt and Manning, 2019), designed to quantify the extent to which syntactic information is encoded in con…

    Submitted 12 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

  16. arXiv:2004.03061  [pdf, other]

    cs.CL cs.LG

    Information-Theoretic Probing for Linguistic Structure

    Authors: Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams, Ryan Cotterell

    Abstract: The success of neural networks on a diverse set of NLP tasks has led researchers to question how much these networks actually "know" about natural language. Probes are a natural way of assessing this. When probing, a researcher chooses a linguistic task and trains a supervised model to predict annotations in that linguistic task from the network's learned representations. If the probe does well,…

    Submitted 22 May, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted for publication at ACL 2020. This is the camera ready version. Code available in https://github.com/rycolab/info-theoretic-probing