Showing 1–25 of 25 results for author: Wilcox, E

  1. arXiv:2405.09605  [pdf, other]

    cs.CL cs.AI cs.LG

    Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

    Authors: Anna A. Ivanova, Aalok Sathe, Benjamin Lipkin, Unnathi Kumar, Setayesh Radkani, Thomas H. Clark, Carina Kauf, Jennifer Hu, R. T. Pramod, Gabriel Grand, Vivian Paulun, Maria Ryskina, Ekin Akyürek, Ethan Wilcox, Nafisa Rashid, Leshem Choshen, Roger Levy, Evelina Fedorenko, Joshua Tenenbaum, Jacob Andreas

    Abstract: The ability to build and leverage world models is essential for a general-purpose AI agent. Testing such capabilities is hard, in part because the building blocks of world models are ill-defined. We present Elements of World Knowledge (EWOK), a framework for evaluating world modeling in language models by testing their ability to use knowledge of a concept to match a target text with a plausible/i…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 21 pages (11 main), 7 figures. Authors Anna Ivanova, Aalok Sathe, Benjamin Lipkin contributed equally

  2. arXiv:2404.06214  [pdf, other]

    cs.CL

    [Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

    Authors: Leshem Choshen, Ryan Cotterell, Michael Y. Hu, Tal Linzen, Aaron Mueller, Candace Ross, Alex Warstadt, Ethan Wilcox, Adina Williams, Chengxu Zhuang

    Abstract: After last year's successful BabyLM Challenge, the competition will be hosted again in 2024/2025. The overarching goals of the challenge remain the same; however, some of the competition rules will be different. The big changes for this year's competition are as follows: First, we replace the loose track with a paper track, which allows (for example) non-model-based submissions, novel cognitively-…

    Submitted 9 April, 2024; originally announced April 2024.

  3. arXiv:2312.03897  [pdf, other]

    cs.CL

    Revisiting the Optimality of Word Lengths

    Authors: Tiago Pimentel, Clara Meister, Ethan Gotlieb Wilcox, Kyle Mahowald, Ryan Cotterell

    Abstract: Zipf (1935) posited that wordforms are optimized to minimize utterances' communicative costs. Under the assumption that cost is given by an utterance's length, he supported this claim by showing that words' lengths are inversely correlated with their frequencies. Communicative cost, however, can be operationalized in different ways. Piantadosi et al. (2011) claim that cost should be measured as th…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Published at EMNLP 2023

  4. arXiv:2312.02931  [pdf, other]

    cs.CL cs.AI

    WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words

    Authors: Lukas Wolf, Greta Tuckute, Klemen Kotar, Eghbal Hosseini, Tamar Regev, Ethan Wilcox, Alex Warstadt

    Abstract: Training on multiple modalities of input can augment the capabilities of a language model. Here, we ask whether such a training regime can improve the quality and efficiency of these systems as well. We focus on text--audio and introduce WhisBERT, which is inspired by the text--image approach of FLAVA (Singh et al., 2022). In accordance with BabyLM guidelines (Warstadt et al., 2023), we pretrain W…

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Published at the BabyLM Challenge, a shared task co-sponsored by CMCL 2023 and CoNLL 2023, hosted by EMNLP 2023

  5. arXiv:2311.17233  [pdf, other]

    cs.CL cs.AI cs.IT cs.LG

    Quantifying the redundancy between prosody and text

    Authors: Lukas Wolf, Tiago Pimentel, Evelina Fedorenko, Ryan Cotterell, Alex Warstadt, Ethan Wilcox, Tamar Regev

    Abstract: Prosody -- the suprasegmental component of speech, including pitch, loudness, and tempo -- carries critical aspects of meaning. However, the relationship between the information conveyed by prosody vs. by the words themselves remains poorly understood. We use large language models (LLMs) to estimate how much information is redundant between prosody and the words themselves. Using a large spoken co…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Published at The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP)

  6. arXiv:2310.07822  [pdf, other]

    cs.RO

    Body-mounted MR-conditional Robot for Minimally Invasive Liver Intervention

    Authors: Zhefeng Huang, Anthony L. Gunderman, Samuel E. Wilcox, Saikat Sengupta, Jay Shah, Aiming Lu, David Woodrum, Yue Chen

    Abstract: MR-guided microwave ablation (MWA) has proven effective in treating hepatocellular carcinoma (HCC) with small-sized tumors, but the state-of-the-art technique suffers from sub-optimal workflow due to speed and accuracy of needle placement. This paper presents a compact body-mounted MR-conditional robot that can operate in closed-bore MR scanners for accurate needle guidance. The robotic platform c…

    Submitted 25 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 10 figures

  7. arXiv:2307.03749  [pdf, other]

    cs.CL

    On the Efficacy of Sampling Adapters

    Authors: Clara Meister, Tiago Pimentel, Luca Malagutti, Ethan G. Wilcox, Ryan Cotterell

    Abstract: Sampling is a common strategy for generating text from probabilistic models, yet standard ancestral sampling often results in text that is incoherent or ungrammatical. To alleviate this issue, various modifications to a model's sampling distribution, such as nucleus or top-k sampling, have been introduced and are now ubiquitously used in language generation systems. We propose a unified framework…

    Submitted 5 January, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: ACL 2023 Main Conference Proceedings

  8. arXiv:2307.03667  [pdf, other]

    cs.CL

    Testing the Predictions of Surprisal Theory in 11 Languages

    Authors: Ethan Gotlieb Wilcox, Tiago Pimentel, Clara Meister, Ryan Cotterell, Roger P. Levy

    Abstract: A fundamental result in psycholinguistics is that less predictable words take a longer time to process. One theoretical explanation for this finding is Surprisal Theory (Hale, 2001; Levy, 2008), which quantifies a word's predictability as its surprisal, i.e. its negative log-probability given a context. While evidence supporting the predictions of Surprisal Theory have been replicated widely, most…

    Submitted 10 July, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: This is a pre-MIT Press publication version of the paper

  9. arXiv:2304.14293  [pdf, other]

    cs.CL cs.AI cs.LG

    Controlled Text Generation with Natural Language Instructions

    Authors: Wangchunshu Zhou, Yuchen Eleanor Jiang, Ethan Wilcox, Ryan Cotterell, Mrinmaya Sachan

    Abstract: Large language models generate fluent texts and can follow natural language instructions to solve a wide range of tasks without task-specific training. Nevertheless, it is notoriously difficult to control their generation to satisfy the various constraints required by different applications. In this work, we present InstructCTG, a controlled text generation framework that incorporates different co…

    Submitted 8 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: ICML 2023

  10. arXiv:2301.11796  [pdf, other]

    cs.CL

    Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

    Authors: Alex Warstadt, Leshem Choshen, Aaron Mueller, Adina Williams, Ethan Wilcox, Chengxu Zhuang

    Abstract: We present the call for papers for the BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus. This shared task is intended for participants with an interest in small scale language modeling, human language acquisition, low-resource NLP, and cognitive modeling. In partnership with CoNLL and CMCL, we provide a platform for approaches to pretraining with a limited-size…

    Submitted 27 January, 2023; originally announced January 2023.

  11. arXiv:2211.14301  [pdf, other]

    cs.CL

    On the Effect of Anticipation on Reading Times

    Authors: Tiago Pimentel, Clara Meister, Ethan G. Wilcox, Roger Levy, Ryan Cotterell

    Abstract: Over the past two decades, numerous studies have demonstrated how less predictable (i.e., higher surprisal) words take more time to read. In general, these studies have implicitly assumed the reading process is purely responsive: Readers observe a new word and allocate time to process it as required. We argue that prior results are also compatible with a reading process that is at least partially…

    Submitted 14 July, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: This is a pre-MIT Press publication version of the paper. Code is available in https://github.com/rycolab/anticipation-on-reading-times

  12. arXiv:2202.07023  [pdf, other]

    cs.CL

    Exhaustivity and anti-exhaustivity in the RSA framework: Testing the effect of prior beliefs

    Authors: Alexandre Cremers, Ethan G. Wilcox, Benjamin Spector

    Abstract: During communication, the interpretation of utterances is sensitive to a listener's probabilistic prior beliefs, something which is captured by one currently influential model of pragmatics, the Rational Speech Act (RSA) framework. In this paper we focus on cases when this sensitivity to priors leads to counterintuitive predictions of the framework. Our domain of interest is exhaustivity effects,…

    Submitted 14 February, 2022; originally announced February 2022.

  13. arXiv:2106.03232  [pdf, other]

    cs.CL

    A Targeted Assessment of Incremental Processing in Neural Language Models and Humans

    Authors: Ethan Gotlieb Wilcox, Pranali Vani, Roger P. Levy

    Abstract: We present a targeted, scaled-up comparison of incremental processing in humans and neural language models by collecting by-word reaction time data for sixteen different syntactic test suites across a range of structural phenomena. Human reaction time data comes from a novel online experimental paradigm called the Interpolated Maze task. We compare human reaction times to by-word probabilities for…

    Submitted 25 October, 2023; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Published in the proceedings of ACL 2021

  14. arXiv:2011.02417  [pdf, other]

    cs.CL cs.LG

    Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization

    Authors: Tristan Thrush, Ethan Wilcox, Roger Levy

    Abstract: Previous studies investigating the syntactic abilities of deep learning models have not targeted the relationship between the strength of the grammatical generalization and the amount of evidence to which the model is exposed during training. We address this issue by deploying a novel word-learning paradigm to test BERT's few-shot learning capabilities for two aspects of English verbs: alternation…

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted to BlackboxNLP 2020

  15. arXiv:2010.05725  [pdf, other]

    cs.CL

    Structural Supervision Improves Few-Shot Learning and Syntactic Generalization in Neural Language Models

    Authors: Ethan Wilcox, Peng Qian, Richard Futrell, Ryosuke Kohita, Roger Levy, Miguel Ballesteros

    Abstract: Humans can learn structural properties about a word from minimal experience, and deploy their learned syntactic representations uniformly in different grammatical contexts. We assess the ability of modern neural language models to reproduce this behavior in English and evaluate the effect of structural supervision on learning outcomes. First, we assess few-shot learning capabilities by developing…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: To appear at EMNLP 2020

  16. arXiv:2006.01912  [pdf, other]

    cs.CL

    On the Predictive Power of Neural Language Models for Human Real-Time Comprehension Behavior

    Authors: Ethan Gotlieb Wilcox, Jon Gauthier, Jennifer Hu, Peng Qian, Roger Levy

    Abstract: Human reading behavior is tuned to the statistics of natural language: the time it takes human subjects to read a word can be predicted from estimates of the word's probability in context. However, it remains an open question what computational architecture best characterizes the expectations deployed in real time by humans that determine the behavioral signatures of reading. Here we test over two…

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: To Appear at CogSci 2020

  17. arXiv:2005.03692  [pdf, other]

    cs.CL

    A Systematic Assessment of Syntactic Generalization in Neural Language Models

    Authors: Jennifer Hu, Jon Gauthier, Peng Qian, Ethan Wilcox, Roger P. Levy

    Abstract: While state-of-the-art neural network models continue to achieve lower perplexity scores on language modeling benchmarks, it remains unknown whether optimizing for broad-coverage predictive performance leads to human-like syntactic knowledge. Furthermore, existing work has not provided a clear picture about the model properties required to produce proper syntactic generalizations. We present a sys…

    Submitted 22 May, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: To appear in the Proceedings of the Association for Computational Linguistics (ACL 2020)

  18. arXiv:1909.04625  [pdf, other]

    cs.CL cs.AI cs.LG

    Representation of Constituents in Neural Language Models: Coordination Phrase as a Case Study

    Authors: Aixiu An, Peng Qian, Ethan Wilcox, Roger Levy

    Abstract: Neural language models have achieved state-of-the-art performances on many NLP tasks, and recently have been shown to learn a number of hierarchically-sensitive syntactic dependencies between individual words. However, equally important for language processing is the ability to combine words into phrasal constituents, and use constituent-level features to drive downstream expectations. Here we inv…

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: To appear at EMNLP 2019

  19. arXiv:1906.04068  [pdf, other]

    cs.CL

    Hierarchical Representation in Neural Language Models: Suppression and Recovery of Expectations

    Authors: Ethan Wilcox, Roger Levy, Richard Futrell

    Abstract: Deep learning sequence models have led to a marked increase in performance for a range of Natural Language Processing tasks, but it remains an open question whether they are able to induce proper hierarchical generalizations for representing natural language from linear input alone. Work using artificial languages as training input has shown that LSTMs are capable of inducing the stack-like data s…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Proceedings of BlackboxNLP 2019, ACL, Florence, Italy

  20. arXiv:1905.10431  [pdf, other]

    cs.CL

    What Syntactic Structures Block Dependencies in RNN Language Models?

    Authors: Ethan Wilcox, Roger Levy, Richard Futrell

    Abstract: Recurrent Neural Networks (RNNs) trained on a language modeling task have been shown to acquire a number of non-local grammatical dependencies with some success. Here, we provide new evidence that RNN language models are sensitive to hierarchical syntactic structure by investigating the filler--gap dependency and constraints on it, known as syntactic islands. Previous work is inconclusive about wh…

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: To Appear at the 41st Annual Meeting of the Cognitive Science Society, Montreal, Canada, July 2019

  21. arXiv:1903.03260  [pdf, other]

    cs.CL

    Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State

    Authors: Richard Futrell, Ethan Wilcox, Takashi Morita, Peng Qian, Miguel Ballesteros, Roger Levy

    Abstract: We deploy the methods of controlled psycholinguistic experimentation to shed light on the extent to which the behavior of neural network language models reflects incremental representations of syntactic state. To do so, we examine model behavior on artificial sentences containing a variety of syntactically complex structures. We test four models: two publicly available LSTM sequence models of Engl…

    Submitted 7 March, 2019; originally announced March 2019.

    Comments: Accepted to NAACL 2019. Not yet edited into the camera-ready version

  22. arXiv:1903.00943  [pdf, other]

    cs.CL

    Structural Supervision Improves Learning of Non-Local Grammatical Dependencies

    Authors: Ethan Wilcox, Peng Qian, Richard Futrell, Miguel Ballesteros, Roger Levy

    Abstract: State-of-the-art LSTM language models trained on large corpora learn sequential contingencies in impressive detail and have been shown to acquire a number of non-local grammatical dependencies with some success. Here we investigate whether supervision with hierarchical structure enhances learning of a range of grammatical dependencies, a question that has previously been addressed only for subject…

    Submitted 6 April, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: To appear: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

  23. arXiv:1809.01329  [pdf, other]

    cs.CL

    RNNs as psycholinguistic subjects: Syntactic state and grammatical dependency

    Authors: Richard Futrell, Ethan Wilcox, Takashi Morita, Roger Levy

    Abstract: Recurrent neural networks (RNNs) are the state of the art in sequence modeling for natural language. However, it remains poorly understood what grammatical characteristics of natural language they implicitly learn and represent as a consequence of optimizing the language modeling objective. Here we deploy the methods of controlled psycholinguistic experimentation to shed light on to what extent RN…

    Submitted 5 September, 2018; originally announced September 2018.

  24. arXiv:1809.00042  [pdf, other]

    cs.CL

    What do RNN Language Models Learn about Filler-Gap Dependencies?

    Authors: Ethan Wilcox, Roger Levy, Takashi Morita, Richard Futrell

    Abstract: RNN language models have achieved state-of-the-art perplexity results and have proven useful in a suite of NLP tasks, but it is as yet unclear what syntactic generalizations they learn. Here we investigate whether state-of-the-art RNN language models represent long-distance filler-gap dependencies and constraints on them. Examining RNN behavior on experimentally controlled sentences designed to ex…

    Submitted 31 August, 2018; originally announced September 2018.

    Comments: 9 pages, to appear in Proceedings of BlackboxNLP 2018

  25. arXiv:1704.04760  [pdf]

    cs.AR cs.LG cs.NE

    In-Datacenter Performance Analysis of a Tensor Processing Unit

    Authors: Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre-luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg, et al. (50 additional authors not shown)

    Abstract: Many architects believe that major improvements in cost-energy-performance must now come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor Processing Unit (TPU)---deployed in datacenters since 2015 that accelerates the inference phase of neural networks (NN). The heart of the TPU is a 65,536 8-bit MAC matrix multiply unit that offers a peak throughput of 92 TeraOp…

    Submitted 16 April, 2017; originally announced April 2017.

    Comments: 17 pages, 11 figures, 8 tables. To appear at the 44th International Symposium on Computer Architecture (ISCA), Toronto, Canada, June 24-28, 2017