Skip to main content

Showing 1–15 of 15 results for author: Poliak, A

  1. arXiv:2306.16722  [pdf, other

    cs.CL cs.AI

    Evaluating Paraphrastic Robustness in Textual Entailment Models

    Authors: Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha, Benjamin Van Durme, Adam Poliak

    Abstract: We present PaRTE, a collection of 1,126 pairs of Recognizing Textual Entailment (RTE) examples to evaluate whether models are robust to paraphrasing. We posit that if RTE models understand language, their predictions should be consistent across inputs that share the same meaning. We use the evaluation set to determine if RTE models' predictions change when examples are paraphrased. In our experime… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  2. arXiv:2204.11742  [pdf, other

    cs.CL cs.SI

    Discovering changes in birthing narratives during COVID-19

    Authors: Daphna Spira, Noreen Mayat, Caitlin Dreisbach, Adam Poliak

    Abstract: We investigate whether, and if so how, birthing narratives written by new parents on Reddit changed during COVID-19. Our results indicate that the presence of family members significantly decreased and themes related to induced labor significantly increased in the narratives during COVID-19. Our work builds upon recent research that analyze how new parents use Reddit to describe their birthing exp… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Presented at the Fifth Widening NLP Workshop (WiNLP) @ EMNLP (2021)

  3. arXiv:2106.01195  [pdf, ps, other

    cs.CL cs.AI

    Figurative Language in Recognizing Textual Entailment

    Authors: Tuhin Chakrabarty, Debanjan Ghosh, Adam Poliak, Smaranda Muresan

    Abstract: We introduce a collection of recognizing textual entailment (RTE) datasets focused on figurative language. We leverage five existing datasets annotated for a variety of figurative language -- simile, metaphor, and irony -- and frame them into over 12,500 RTE examples.We evaluate how well state-of-the-art models trained on popular RTE datasets capture different aspects of figurative language. Our r… ▽ More

    Submitted 3 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: ACL 2021 (Findings)

  4. arXiv:2104.05501  [pdf, other

    cs.CL cs.SI

    Fine-Tuning Transformers for Identifying Self-Reporting Potential Cases and Symptoms of COVID-19 in Tweets

    Authors: Max Fleming, Priyanka Dondeti, Caitlin N. Dreisbach, Adam Poliak

    Abstract: We describe our straight-forward approach for Tasks 5 and 6 of 2021 Social Media Mining for Health Applications (SMM4H) shared tasks. Our system is based on fine-tuning Distill- BERT on each task, as well as first fine-tuning the model on the other task. We explore how much fine-tuning is necessary for accurately classifying tweets as containing self-reported COVID-19 symptoms (Task 5) or whether… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Social Media Mining for Health Applications 2021 Shared Task

  5. arXiv:2010.03061  [pdf, ps, other

    cs.CL

    A Survey on Recognizing Textual Entailment as an NLP Evaluation

    Authors: Adam Poliak

    Abstract: Recognizing Textual Entailment (RTE) was proposed as a unified evaluation framework to compare semantic understanding of different NLP systems. In this survey paper, we provide an overview of different approaches for evaluating and understanding the reasoning capabilities of NLP systems. We then focus our discussion on RTE by highlighting prominent RTE datasets as well as advances in RTE dataset t… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 1st Workshop on Evaluation and Comparison for NLP systems (Eval4NLP) at EMNLP 2020; 18 pages

  6. arXiv:2004.04877  [pdf, other

    cs.CL

    Probing Neural Language Models for Human Tacit Assumptions

    Authors: Nathaniel Weir, Adam Poliak, Benjamin Van Durme

    Abstract: Humans carry stereotypic tacit assumptions (STAs) (Prince, 1978), or propositional beliefs about generic concepts. Such associations are crucial for understanding natural language. We construct a diagnostic set of word prediction prompts to evaluate whether recent neural contextualized language models trained on large text corpora capture STAs. Our prompts are based on human responses in a psychol… ▽ More

    Submitted 16 June, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: To be published in CogSci 2020

  7. arXiv:1909.03042  [pdf, other

    cs.CL

    Uncertain Natural Language Inference

    Authors: Tongfei Chen, Zhengping Jiang, Adam Poliak, Keisuke Sakaguchi, Benjamin Van Durme

    Abstract: We introduce Uncertain Natural Language Inference (UNLI), a refinement of Natural Language Inference (NLI) that shifts away from categorical labels, targeting instead the direct prediction of subjective probability assessments. We demonstrate the feasibility of collecting annotations for UNLI by relabeling a portion of the SNLI dataset under a probabilistic scale, where items even with the same ca… ▽ More

    Submitted 4 May, 2020; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: Accepted to ACL 2020

    ACM Class: I.2.7

  8. arXiv:1907.04389  [pdf, other

    cs.CL

    On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference

    Authors: Yonatan Belinkov, Adam Poliak, Stuart M. Shieber, Benjamin Van Durme, Alexander M. Rush

    Abstract: Popular Natural Language Inference (NLI) datasets have been shown to be tainted by hypothesis-only biases. Adversarial learning may help models ignore sensitive biases and spurious correlations in data. We evaluate whether adversarial learning can be used in NLI to encourage models to learn representations free of hypothesis-only biases. Our analyses indicate that the representations learned via a… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: StarSem 2019 - The Eighth Joint Conference on Lexical and Computational Semantics

  9. arXiv:1907.04380  [pdf, other

    cs.CL

    Don't Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference

    Authors: Yonatan Belinkov, Adam Poliak, Stuart M. Shieber, Benjamin Van Durme, Alexander M. Rush

    Abstract: Natural Language Inference (NLI) datasets often contain hypothesis-only biases---artifacts that allow models to achieve non-trivial performance without learning whether a premise entails a hypothesis. We propose two probabilistic methods to build models that are more robust to such biases and better transfer across datasets. In contrast to standard approaches to NLI, our methods predict the probab… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: ACL 2019

  10. arXiv:1905.06316  [pdf, other

    cs.CL

    What do you learn from context? Probing for sentence structure in contextualized word representations

    Authors: Ian Tenney, Patrick Xia, Berlin Chen, Alex Wang, Adam Poliak, R Thomas McCoy, Najoung Kim, Benjamin Van Durme, Samuel R. Bowman, Dipanjan Das, Ellie Pavlick

    Abstract: Contextualized representation models such as ELMo (Peters et al., 2018a) and BERT (Devlin et al., 2018) have recently achieved state-of-the-art results on a diverse array of downstream NLP tasks. Building on recent token-level probing work, we introduce a novel edge probing task design and construct a broad suite of sub-sentence tasks derived from the traditional structured NLP pipeline. We probe… ▽ More

    Submitted 15 May, 2019; originally announced May 2019.

    Comments: ICLR 2019 camera-ready version, 17 pages including appendices

  11. arXiv:1904.11544  [pdf, other

    cs.CL

    Probing What Different NLP Tasks Teach Machines about Function Word Comprehension

    Authors: Najoung Kim, Roma Patel, Adam Poliak, Alex Wang, Patrick Xia, R. Thomas McCoy, Ian Tenney, Alexis Ross, Tal Linzen, Benjamin Van Durme, Samuel R. Bowman, Ellie Pavlick

    Abstract: We introduce a set of nine challenge tasks that test for the understanding of function words. These tasks are created by structurally mutating sentences from existing datasets to target the comprehension of specific types of function words (e.g., prepositions, wh-words). Using these probing tasks, we explore the effects of various pretraining objectives for sentence encoders (e.g., language modeli… ▽ More

    Submitted 7 August, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

    Comments: Accepted to *SEM 2019 (revised submission). Corresponding authors: Najoung Kim (n.kim@jhu.edu), Ellie Pavlick (ellie_pavlick@brown.edu)

  12. arXiv:1805.01042  [pdf, other

    cs.CL

    Hypothesis Only Baselines in Natural Language Inference

    Authors: Adam Poliak, Jason Naradowsky, Aparajita Haldar, Rachel Rudinger, Benjamin Van Durme

    Abstract: We propose a hypothesis only baseline for diagnosing Natural Language Inference (NLI). Especially when an NLI dataset assumes inference is occurring based purely on the relationship between a context and a hypothesis, it follows that assessing entailment relations while ignoring the provided context is a degenerate solution. Yet, through experiments on ten distinct NLI datasets, we find that this… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

    Comments: Accepted at *SEM 2018 as long paper. 12 pages

  13. arXiv:1804.09779  [pdf, ps, other

    cs.CL

    On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference

    Authors: Adam Poliak, Yonatan Belinkov, James Glass, Benjamin Van Durme

    Abstract: We propose a process for investigating the extent to which sentence representations arising from neural machine translation (NMT) systems encode distinct semantic phenomena. We use these representations as features to train a natural language inference (NLI) classifier based on datasets recast from existing semantic annotations. In applying this process to a representative NMT system, we find its… ▽ More

    Submitted 6 May, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

    Comments: To be presented at NAACL 2018 - 11 pages

  14. arXiv:1804.08207  [pdf, ps, other

    cs.CL

    Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

    Authors: Adam Poliak, Aparajita Haldar, Rachel Rudinger, J. Edward Hu, Ellie Pavlick, Aaron Steven White, Benjamin Van Durme

    Abstract: We present a large-scale collection of diverse natural language inference (NLI) datasets that help provide insight into how well a sentence representation captures distinct types of reasoning. The collection results from recasting 13 existing datasets from 7 semantic phenomena into a common NLI structure, resulting in over half a million labeled context-hypothesis pairs in total. We refer to our c… ▽ More

    Submitted 29 August, 2018; v1 submitted 22 April, 2018; originally announced April 2018.

    Comments: To be presented at EMNLP 2018. 15 pages

  15. arXiv:1706.09562  [pdf, other

    cs.CL

    Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles

    Authors: Francis Ferraro, Adam Poliak, Ryan Cotterell, Benjamin Van Durme

    Abstract: We study how different frame annotations complement one another when learning continuous lexical semantics. We learn the representations from a tensorized skip-gram model that consistently encodes syntactic-semantic content better, with multiple 10% gains over baselines.

    Submitted 28 June, 2017; originally announced June 2017.

    Comments: Accepted at the Sixth Joint Conference on Lexical and Computational Semantics (*SEM). Association for Computational Linguistics, Vancouver, Canada. 2017