Skip to main content

Showing 1–50 of 51 results for author: Tsarfaty, R

  1. arXiv:2407.10626  [pdf, other

    cs.CL

    NoviCode: Generating Programs from Natural Language Utterances by Novices

    Authors: Asaf Achi Mordechai, Yoav Goldberg, Reut Tsarfaty

    Abstract: Current Text-to-Code models demonstrate impressive capabilities in generating executable code from natural language snippets. However, current studies focus on technical instructions and programmer-oriented language, and it is an open question whether these models can effectively translate natural language descriptions given by non-technical users and express complex goals, to an executable progra… ▽ More

    Submitted 16 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.00402  [pdf, other

    cs.CL cs.AI

    Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

    Authors: Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty

    Abstract: Improvements in language models' capabilities have pushed their applications towards longer contexts, making long-context evaluation and development an active research area. However, many disparate use-cases are grouped together under the umbrella term of "long-context", defined simply by the total length of the model's input, including - for example - Needle-in-a-Haystack tasks, book summarizatio… ▽ More

    Submitted 11 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2406.19967  [pdf, other

    cs.CL cs.AI

    Into the Unknown: Generating Geospatial Descriptions for New Environments

    Authors: Tzuf Paz-Argaman, John Palowitch, Sayali Kulkarni, Reut Tsarfaty, Jason Baldridge

    Abstract: Similar to vision-and-language navigation (VLN) tasks that focus on bridging the gap between vision and language for embodied navigation, the new Rendezvous (RVS) task requires reasoning over allocentric spatial relationships (independent of the observer's viewpoint) using non-sequential navigation instructions and maps. However, performance substantially drops in new environments with no training… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Journal ref: ACL 2024 Findings

  4. arXiv:2406.03897  [pdf, other

    cs.CL cs.AI

    HeSum: a Novel Dataset for Abstractive Text Summarization in Hebrew

    Authors: Tzuf Paz-Argaman, Itai Mondshine, Asaf Achi Mordechai, Reut Tsarfaty

    Abstract: While large language models (LLMs) excel in various natural language tasks in English, their performance in lower-resourced languages like Hebrew, especially for generative tasks such as abstractive summarization, remains unclear. The high morphological richness in Hebrew adds further challenges due to the ambiguity in sentence comprehension and the complexities in meaning construction. In this pa… ▽ More

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Journal ref: ACL 2024 Findings

  5. arXiv:2405.20967  [pdf, other

    cs.CL

    Superlatives in Context: Explicit and Implicit Domain Restrictions for Superlative Frames

    Authors: Valentina Pyatkin, Bonnie Webber, Ido Dagan, Reut Tsarfaty

    Abstract: Superlatives are used to single out elements with a maximal/minimal property. Semantically, superlatives perform a set comparison: something (or some things) has the min/max property out of a set. As such, superlatives provide an ideal phenomenon for studying implicit phenomena and discourse restrictions. While this comparison set is often not explicitly defined, its (implicit) restrictions can be… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 11 pages

  6. Do Pretrained Contextual Language Models Distinguish between Hebrew Homograph Analyses?

    Authors: Avi Shmidman, Cheyn Shmuel Shmidman, Dan Bareket, Moshe Koppel, Reut Tsarfaty

    Abstract: Semitic morphologically-rich languages (MRLs) are characterized by extreme word ambiguity. Because most vowels are omitted in standard texts, many of the words are homographs with multiple possible analyses, each with a different pronunciation and different morphosyntactic properties. This ambiguity goes beyond word-sense disambiguation (WSD), and may include token segmentation into multiple word… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Journal ref: In Proceedings of EACL 2023, 849-864 (2023)

  7. arXiv:2404.06283  [pdf, other

    cs.CL

    LLMs' Reading Comprehension Is Affected by Parametric Knowledge and Struggles with Hypothetical Statements

    Authors: Victoria Basmov, Yoav Goldberg, Reut Tsarfaty

    Abstract: The task of reading comprehension (RC), often implemented as context-based question answering (QA), provides a primary means to assess language models' natural language understanding (NLU) capabilities. Yet, when applied to large language models (LLMs) with extensive built-in world knowledge, this method can be deceptive. If the context aligns with the LLMs' internal knowledge, it is hard to disce… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  8. arXiv:2403.06970  [pdf, other

    cs.CL

    MRL Parsing Without Tears: The Case of Hebrew

    Authors: Shaltiel Shmidman, Avi Shmidman, Moshe Koppel, Reut Tsarfaty

    Abstract: Syntactic parsing remains a critical tool for relation extraction and information extraction, especially in resource-scarce languages where LLMs are lacking. Yet in morphologically rich languages (MRLs), where parsers need to identify multiple lexical units in each token, existing systems suffer in latency and setup complexity. Some use a pipeline to peel away the layers: first segmentation, then… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  9. arXiv:2403.06265  [pdf, other

    cs.CL cs.AI cs.LG

    Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance

    Authors: Omer Goldman, Avi Caciularu, Matan Eyal, Kris Cao, Idan Szpektor, Reut Tsarfaty

    Abstract: Despite it being the cornerstone of BPE, the most common tokenization algorithm, the importance of compression in the tokenization process is still unclear. In this paper, we argue for the theoretical importance of compression, that can be viewed as 0-gram language modeling where equal probability is assigned to all tokens. We also demonstrate the empirical importance of compression for downstream… ▽ More

    Submitted 22 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: EMNLP 2024, Findings

  10. arXiv:2403.04792  [pdf

    cs.CL cs.LG

    Breaking the Language Barrier: Can Direct Inference Outperform Pre-Translation in Multilingual LLM Applications?

    Authors: Yotam Intrator, Matan Halfon, Roman Goldenberg, Reut Tsarfaty, Matan Eyal, Ehud Rivlin, Yossi Matias, Natalia Aizenberg

    Abstract: Large language models hold significant promise in multilingual applications. However, inherent biases stemming from predominantly English-centric pre-training have led to the widespread practice of pre-translation, i.e., translating non-English inputs to English before inference, leading to complexity and information loss. This study re-evaluates the need for pre-translation in the context of PaLM… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  11. arXiv:2402.16364  [pdf, other

    cs.CL cs.LG cs.MM

    Where Do We Go from Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions

    Authors: Tzuf Paz-Argaman, Sayali Kulkarni, John Palowitch, Jason Baldridge, Reut Tsarfaty

    Abstract: When communicating routes in natural language, the concept of {\em acquired spatial knowledge} is crucial for geographic information retrieval (GIR) and in spatial cognitive research. However, NLP navigation studies often overlook the impact of such acquired knowledge on textual descriptions. Current navigation studies concentrate on egocentric local descriptions (e.g., `it will be on your right')… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  12. arXiv:2402.02564  [pdf, other

    cs.CL cs.AI cs.LG

    A Truly Joint Neural Architecture for Segmentation and Parsing

    Authors: Danit Yshaayahu Levi, Reut Tsarfaty

    Abstract: Contemporary multilingual dependency parsers can parse a diverse set of languages, but for Morphologically Rich Languages (MRLs), performance is attested to be lower than other languages. The key challenge is that, due to high morphological complexity and ambiguity of the space-delimited input tokens, the linguistic units that act as nodes in the tree are not known in advance. Pre-neural dependenc… ▽ More

    Submitted 2 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  13. arXiv:2401.01854  [pdf, other

    cs.CL cs.AI cs.LG

    Multilingual Instruction Tuning With Just a Pinch of Multilinguality

    Authors: Uri Shaham, Jonathan Herzig, Roee Aharoni, Idan Szpektor, Reut Tsarfaty, Matan Eyal

    Abstract: As instruction-tuned large language models (LLMs) gain global adoption, their ability to follow instructions in multiple languages becomes increasingly crucial. In this work, we investigate how multilinguality during instruction tuning of a multilingual LLM affects instruction-following across languages from the pre-training corpus. We first show that many languages transfer some instruction-follo… ▽ More

    Submitted 21 May, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: Findings of ACL 2024

  14. arXiv:2311.00658  [pdf, other

    cs.CL

    Explicit Morphological Knowledge Improves Pre-training of Language Models for Hebrew

    Authors: Eylon Gueta, Omer Goldman, Reut Tsarfaty

    Abstract: Pre-trained language models (PLMs) have shown remarkable successes in acquiring a wide range of linguistic knowledge, relying solely on self-supervised training on text streams. Nevertheless, the effectiveness of this language-agnostic approach has been frequently questioned for its sub-optimal performance when applied to morphologically-rich languages (MRLs). We investigate the hypothesis that in… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  15. arXiv:2310.18369  [pdf, other

    cs.CL cs.AI cs.CV

    Apollo: Zero-shot MultiModal Reasoning with Multiple Experts

    Authors: Daniela Ben-David, Tzuf Paz-Argaman, Reut Tsarfaty

    Abstract: We propose a modular framework that leverages the expertise of different foundation models over different modalities and domains in order to perform a single, complex, multi-modal task, without relying on prompt engineering or otherwise tailor-made multi-modal training. Our approach enables decentralized command execution and allows each model to both contribute and benefit from the expertise of t… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: GitHub: https://github.com/danielabd/Apollo-Cap

    ACM Class: I.2.7; I.5.4

  16. arXiv:2310.16329  [pdf, other

    cs.CL cs.AI cs.DB

    CoheSentia: A Novel Benchmark of Incremental versus Holistic Assessment of Coherence in Generated Texts

    Authors: Aviya Maimon, Reut Tsarfaty

    Abstract: Coherence is a linguistic term that refers to the relations between small textual units (sentences, propositions), which make the text logically consistent and meaningful to the reader. With the advances of generative foundational models in NLP, there is a pressing need to automatically assess the human-perceived coherence of automatically generated texts. Up until now, little work has been done o… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  17. arXiv:2310.15905  [pdf, other

    cs.CL cs.AI cs.LG

    Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces

    Authors: Tal Levy, Omer Goldman, Reut Tsarfaty

    Abstract: The ability to identify and control different kinds of linguistic information encoded in vector representations of words has many use cases, especially for explainability and bias removal. This is usually done via a set of simple classification tasks, termed probes, to evaluate the information encoded in the embedding space. However, the involvement of a trainable classifier leads to entanglement… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  18. arXiv:2310.00598  [pdf, other

    cs.CL cs.AI

    A Novel Computational and Modeling Foundation for Automatic Coherence Assessment

    Authors: Aviya Maimon, Reut Tsarfaty

    Abstract: Coherence is an essential property of well-written texts, that refers to the way textual units relate to one another. In the era of generative AI, coherence assessment is essential for many NLP tasks; summarization, generation, long-form question-answering, and more. However, in NLP {coherence} is an ill-defined notion, not having a formal definition or evaluation metrics, that would allow for lar… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  19. arXiv:2307.03319  [pdf, other

    cs.CL

    Covering Uncommon Ground: Gap-Focused Question Generation for Answer Assessment

    Authors: Roni Rabin, Alexandre Djerbetian, Roee Engelberg, Lidan Hackmon, Gal Elidan, Reut Tsarfaty, Amir Globerson

    Abstract: Human communication often involves information gaps between the interlocutors. For example, in an educational dialogue, a student often provides an answer that is incomplete, and there is a gap between this answer and the perfect one expected by the teacher. Successful dialogue then hinges on the teacher asking about this gap in an effective manner, thus creating a rich and interactive educational… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  20. arXiv:2307.00509  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    HeGeL: A Novel Dataset for Geo-Location from Hebrew Text

    Authors: Tzuf Paz-Argaman, Tal Bauman, Itai Mondshine, Itzhak Omer, Sagi Dalyot, Reut Tsarfaty

    Abstract: The task of textual geolocation - retrieving the coordinates of a place based on a free-form language description - calls for not only grounding but also natural language understanding and geospatial reasoning. Even though there are quite a few datasets in English used for geolocation, they are currently based on open-source data (Wikipedia and Twitter), where the location of the described place i… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Accepted for ACL findings 2023

  21. arXiv:2306.12581  [pdf, other

    cs.CL

    Morphological Inflection with Phonological Features

    Authors: David Guriel, Omer Goldman, Reut Tsarfaty

    Abstract: Recent years have brought great advances into solving morphological tasks, mostly due to powerful neural models applied to various tasks as (re)inflection and analysis. Yet, such morphological tasks cannot be considered solved, especially when little training data is available or when generalizing to previously unseen lemmas. This work explores effects on performance obtained through various ways… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: ACL 2023 main conference; 8 pages, 1 figure

  22. arXiv:2305.16740  [pdf, other

    cs.CL

    Conjunct Resolution in the Face of Verbal Omissions

    Authors: Royi Rassin, Yoav Goldberg, Reut Tsarfaty

    Abstract: Verbal omissions are complex syntactic phenomena in VP coordination structures. They occur when verbs and (some of) their arguments are omitted from subsequent clauses after being explicitly stated in an initial clause. Recovering these omitted elements is necessary for accurate interpretation of the sentence, and while humans easily and intuitively fill in the missing information, state-of-the-ar… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  23. arXiv:2305.14785  [pdf, other

    cs.CL cs.AI

    Simple Linguistic Inferences of Large Language Models (LLMs): Blind Spots and Blinds

    Authors: Victoria Basmov, Yoav Goldberg, Reut Tsarfaty

    Abstract: We evaluate LLMs' language understanding capacities on simple inference tasks that most humans find trivial. Specifically, we target (i) grammatically-specified entailments, (ii) premises with evidential adverbs of uncertainty, and (iii) monotonicity entailments. We design evaluation sets for these tasks and conduct experiments in both zero-shot and chain-of-thought setups, and with multiple promp… ▽ More

    Submitted 11 April, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  24. arXiv:2304.00815  [pdf, other

    cs.CL

    Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design

    Authors: Valentina Pyatkin, Frances Yung, Merel C. J. Scholman, Reut Tsarfaty, Ido Dagan, Vera Demberg

    Abstract: Disagreement in natural language annotation has mostly been studied from a perspective of biases introduced by the annotators and the annotation frameworks. Here, we propose to analyze another source of bias: task design bias, which has a particularly strong impact on crowdsourced linguistic annotations where natural language is used to elicit the interpretation of laymen annotators. For this purp… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted to TACL, pre-MIT Press publication version

  25. arXiv:2212.09682  [pdf, other

    cs.CL

    Multilingual Sequence-to-Sequence Models for Hebrew NLP

    Authors: Matan Eyal, Hila Noga, Roee Aharoni, Idan Szpektor, Reut Tsarfaty

    Abstract: Recent work attributes progress in NLP to large language models (LMs) with increased model size and large quantities of pretraining data. Despite this, current state-of-the-art LMs for Hebrew are both under-parameterized and under-trained compared to LMs in other languages. Additionally, previous work on pretrained Hebrew LMs focused on encoder-only models. While the encoder-only architecture is b… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  26. arXiv:2211.15199  [pdf

    cs.CL

    Large Pre-Trained Models with Extra-Large Vocabularies: A Contrastive Analysis of Hebrew BERT Models and a New One to Outperform Them All

    Authors: Eylon Gueta, Avi Shmidman, Shaltiel Shmidman, Cheyn Shmuel Shmidman, Joshua Guedalia, Moshe Koppel, Dan Bareket, Amit Seker, Reut Tsarfaty

    Abstract: We present a new pre-trained language model (PLM) for modern Hebrew, termed AlephBERTGimmel, which employs a much larger vocabulary (128K items) than standard Hebrew PLMs before. We perform a contrastive analysis of this model against all previous Hebrew PLMs (mBERT, heBERT, AlephBERT) and assess the effects of larger vocabularies on task performance. Our experiments show that larger vocabularies… ▽ More

    Submitted 15 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  27. arXiv:2211.07950  [pdf, other

    cs.CL

    Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs

    Authors: Kyle Richardson, Ronen Tamari, Oren Sultan, Reut Tsarfaty, Dafna Shahaf, Ashish Sabharwal

    Abstract: Can we teach natural language understanding models to track their beliefs through intermediate points in text? We propose a representation learning framework called breakpoint modeling that allows for learning of this type. Given any text encoder and data marked with intermediate states (breakpoints) along with corresponding textual queries viewed as true/false propositions (i.e., the candidate be… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  28. arXiv:2205.03608  [pdf, other

    cs.CL

    UniMorph 4.0: Universal Morphology

    Authors: Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay , et al. (71 additional authors not shown)

    Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This pa… ▽ More

    Submitted 19 June, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: LREC 2022; The first two authors made equal contributions

  29. arXiv:2204.04748  [pdf, other

    cs.CL

    Breaking Character: Are Subwords Good Enough for MRLs After All?

    Authors: Omri Keren, Tal Avinari, Reut Tsarfaty, Omer Levy

    Abstract: Large pretrained language models (PLMs) typically tokenize the input string into contiguous subwords before any pretraining or inference. However, previous studies have claimed that this form of subword tokenization is inadequate for processing morphologically-rich languages (MRLs). We revisit this hypothesis by pretraining a BERT-style masked language model over character sequences instead of wor… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

  30. arXiv:2203.10845  [pdf, other

    cs.CL

    Neural Token Segmentation for High Token-Internal Complexity

    Authors: Idan Brusilovsky, Reut Tsarfaty

    Abstract: Tokenizing raw texts into word units is an essential pre-processing step for critical tasks in the NLP pipeline such as tagging, parsing, named entity recognition, and more. For most languages, this tokenization step straightforward. However, for languages with high token-internal complexity, further token-to-word segmentation is required. Previous canonical segmentation studies were based on char… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  31. arXiv:2203.08527  [pdf, other

    cs.CL

    Morphological Reinflection with Multiple Arguments: An Extended Annotation schema and a Georgian Case Study

    Authors: David Guriel, Omer Goldman, Reut Tsarfaty

    Abstract: In recent years, a flurry of morphological datasets had emerged, most notably UniMorph, a multi-lingual repository of inflection tables. However, the flat structure of the current morphological annotation schema makes the treatment of some languages quirky, if not impossible, specifically in cases of polypersonal agreement, where verbs agree with multiple arguments using true affixes. In this pape… ▽ More

    Submitted 20 March, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  32. arXiv:2202.12832  [pdf, other

    cs.CL

    Morphology Without Borders: Clause-Level Morphology

    Authors: Omer Goldman, Reut Tsarfaty

    Abstract: Morphological tasks use large multi-lingual datasets that organize words into inflection tables, which then serve as training and evaluation data for various tasks. However, a closer inspection of these data reveals profound cross-linguistic inconsistencies, that arise from the lack of a clear linguistic and operational definition of what is a word, and that severely impair the universality of the… ▽ More

    Submitted 19 October, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: To appear on TACL

  33. arXiv:2112.00086  [pdf, other

    cs.CL cs.AI

    Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking

    Authors: Ronen Tamari, Kyle Richardson, Aviad Sar-Shalom, Noam Kahlon, Nelson Liu, Reut Tsarfaty, Dafna Shahaf

    Abstract: While neural language models often perform surprisingly well on natural language understanding (NLU) tasks, their strengths and limitations remain poorly understood. Controlled synthetic tasks are thus an increasingly important resource for diagnosing model behavior. In this work we focus on story understanding, a core competency for NLU systems. However, the main synthetic resource for story unde… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: Code and data will be made available at project page: https://tiny.one/8wjxwd7z

  34. arXiv:2109.12085  [pdf, other

    cs.CL

    Text-based NP Enrichment

    Authors: Yanai Elazar, Victoria Basmov, Yoav Goldberg, Reut Tsarfaty

    Abstract: Understanding the relations between entities denoted by NPs in a text is a critical part of human-like natural language understanding. However, only a fraction of such relations is covered by standard NLP tasks and benchmarks nowadays. In this work, we propose a novel task termed text-based NP enrichment (TNE), in which we aim to enrich each NP in a text with all the preposition-mediated relations… ▽ More

    Submitted 11 April, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted to the TACL journal, pre-MIT Press publication version

  35. arXiv:2109.04832  [pdf, other

    cs.CL

    Asking It All: Generating Contextualized Questions for any Semantic Role

    Authors: Valentina Pyatkin, Paul Roit, Julian Michael, Reut Tsarfaty, Yoav Goldberg, Ido Dagan

    Abstract: Asking questions about a situation is an inherent step towards understanding it. To this end, we introduce the task of role question generation, which, given a predicate mention and a passage, requires producing a set of questions asking about all possible semantic roles of the predicate. We develop a two-stage model for this task, which first produces a context-independent question prototype for… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted as a long paper to EMNLP 2021, Main Conference

  36. arXiv:2108.05682  [pdf, other

    cs.CL

    (Un)solving Morphological Inflection: Lemma Overlap Artificially Inflates Models' Performance

    Authors: Omer Goldman, David Guriel, Reut Tsarfaty

    Abstract: In the domain of Morphology, Inflection is a fundamental and important task that gained a lot of traction in recent years, mostly via SIGMORPHON's shared-tasks. With average accuracy above 0.9 over the scores of all languages, the task is considered mostly solved using relatively generic neural seq2seq models, even with little data provided. In this work, we propose to re-evaluate morphological in… ▽ More

    Submitted 20 March, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: ACL 2022

  37. arXiv:2106.14321  [pdf, other

    cs.CL

    Draw Me a Flower: Processing and Grounding Abstraction in Natural Language

    Authors: Royi Lachmy, Valentina Pyatkin, Avshalom Manevich, Reut Tsarfaty

    Abstract: Abstraction is a core tenet of human cognition and communication. When composing natural language instructions, humans naturally evoke abstraction to convey complex procedures in an efficient and concise way. Yet, interpreting and grounding abstraction expressed in NL has not yet been systematically studied in NLP, with no accepted benchmarks specifically eliciting abstraction in NL. In this work,… ▽ More

    Submitted 30 September, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted to the TACL journal. This is a pre-MIT Press publication version

  38. arXiv:2106.08037  [pdf, other

    cs.CL

    The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing

    Authors: Valentina Pyatkin, Shoval Sadde, Aynat Rubinstein, Paul Portner, Reut Tsarfaty

    Abstract: Modality is the linguistic ability to describe events with added information such as how desirable, plausible, or feasible they are. Modality is important for many NLP downstream tasks such as the detection of hedging, uncertainty, speculation, and more. Previous studies that address modality detection in NLP often restrict modal expressions to a closed syntactic class, and the modal sense labels… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  39. arXiv:2104.08512  [pdf, other

    cs.CL

    Minimal Supervision for Morphological Inflection

    Authors: Omer Goldman, Reut Tsarfaty

    Abstract: Neural models for the various flavours of morphological inflection tasks have proven to be extremely accurate given ample labeled data -- data that may be slow and costly to obtain. In this work we aim to overcome this annotation bottleneck by bootstrapping labeled data from a seed as little as {\em five} labeled paradigms, accompanied by a large bulk of unlabeled text. Our approach exploits diffe… ▽ More

    Submitted 12 October, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021

  40. arXiv:2104.04052  [pdf, other

    cs.CL

    AlephBERT:A Hebrew Large Pre-Trained Language Model to Start-off your Hebrew NLP Application With

    Authors: Amit Seker, Elron Bandel, Dan Bareket, Idan Brusilovsky, Refael Shaked Greenfeld, Reut Tsarfaty

    Abstract: Large Pre-trained Language Models (PLMs) have become ubiquitous in the development of language understanding technology and lie at the heart of many artificial intelligence advances. While advances reported for English using PLMs are unprecedented, reported advances using PLMs in Hebrew are few and far between. The problem is twofold. First, Hebrew resources available for training NLP models are n… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

  41. arXiv:2010.05971  [pdf, other

    cs.CL

    The Extraordinary Failure of Complement Coercion Crowdsourcing

    Authors: Yanai Elazar, Victoria Basmov, Shauli Ravfogel, Yoav Goldberg, Reut Tsarfaty

    Abstract: Crowdsourcing has eased and scaled up the collection of linguistic annotation in recent years. In this work, we follow known methodologies of collecting labeled data for the complement coercion phenomenon. These are constructions with an implied action -- e.g., "I started a new book I bought last week", where the implied action is reading. We aim to collect annotated data for this phenomenon by re… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Workshop on Insights from Negative Results in NLP, co-located with EMNLP 2020

  42. arXiv:2010.03276  [pdf, other

    cs.CL

    ZEST: Zero-shot Learning from Text Descriptions using Textual Similarity and Visual Summarization

    Authors: Tzuf Paz-Argaman, Yuval Atzmon, Gal Chechik, Reut Tsarfaty

    Abstract: We study the problem of recognizing visual entities from the textual descriptions of their classes. Specifically, given birds' images with free-text descriptions of their species, we learn to classify images of previously-unseen species based on specie descriptions. This setup has been studied in the vision community under the name zero-shot learning from text, focusing on learning to transfer kno… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 11 pages, Findings of EMNLP 2020

  43. arXiv:2010.02864  [pdf

    cs.CL

    A Novel Challenge Set for Hebrew Morphological Disambiguation and Diacritics Restoration

    Authors: Avi Shmidman, Joshua Guedalia, Shaltiel Shmidman, Moshe Koppel, Reut Tsarfaty

    Abstract: One of the primary tasks of morphological parsers is the disambiguation of homographs. Particularly difficult are cases of unbalanced ambiguity, where one of the possible analyses is far more frequent than the others. In such cases, there may not exist sufficient examples of the minority analyses in order to properly evaluate performance, nor to train effective classifiers. In this paper we addres… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Journal ref: Findings of EMNLP, 2020

  44. arXiv:2010.02815  [pdf, other

    cs.CL

    QADiscourse -- Discourse Relations as QA Pairs: Representation, Crowdsourcing and Baselines

    Authors: Valentina Pyatkin, Ayal Klein, Reut Tsarfaty, Ido Dagan

    Abstract: Discourse relations describe how two propositions relate to one another, and identifying them automatically is an integral part of natural language understanding. However, annotating discourse relations typically requires expert annotators. Recently, different semantic aspects of a sentence have been represented and crowd-sourced via question-and-answer (QA) pairs. This paper proposes a novel repr… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: To appear at EMNLP 2020

  45. Neural Modeling for Named Entities and Morphology (NEMO^2)

    Authors: Dan Bareket, Reut Tsarfaty

    Abstract: Named Entity Recognition (NER) is a fundamental NLP task, commonly formulated as classification over a sequence of tokens. Morphologically-Rich Languages (MRLs) pose a challenge to this basic formulation, as the boundaries of Named Entities do not necessarily coincide with token boundaries, rather, they respect morphological boundaries. To address NER in MRLs we then need to answer two fundamental… ▽ More

    Submitted 10 May, 2021; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Accepted to TACL. This is a pre-MIT Press publication version

  46. arXiv:2005.01330  [pdf, other

    cs.CL

    From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of Parsing Morphologically-Rich Languages (MRLs)?

    Authors: Reut Tsarfaty, Dan Bareket, Stav Klein, Amit Seker

    Abstract: It has been exactly a decade since the first establishment of SPMRL, a research initiative unifying multiple research efforts to address the peculiar challenges of Statistical Parsing for Morphologically-Rich Languages (MRLs).Here we reflect on parsing MRLs in that decade, highlight the solutions and lessons learned for the architectural, modeling and lexical challenges in the pre-neural era, and… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

  47. arXiv:2005.01306  [pdf, other

    cs.CL

    pyBART: Evidence-based Syntactic Transformations for IE

    Authors: Aryeh Tiktinsky, Yoav Goldberg, Reut Tsarfaty

    Abstract: Syntactic dependencies can be predicted with high accuracy, and are useful for both machine-learned and pattern-based information extraction tasks. However, their utility can be improved. These syntactic dependencies are designed to accurately reflect syntactic relations, and they do not make semantic relations explicit. Therefore, these representations lack many explicit connections between conte… ▽ More

    Submitted 4 June, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Accepted ACL2020 system demonstration paper

  48. arXiv:2004.02709  [pdf, other

    cs.CL

    Evaluating Models' Local Decision Boundaries via Contrast Sets

    Authors: Matt Gardner, Yoav Artzi, Victoria Basmova, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hanna Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, Ally Zhang , et al. (1 additional authors not shown)

    Abstract: Standard test sets for supervised learning evaluate in-distribution generalization. Unfortunately, when a dataset has systematic gaps (e.g., annotation artifacts), these evaluations are misleading: a model can learn simple decision rules that perform well on the test set but do not capture a dataset's intended capabilities. We propose a new annotation paradigm for NLP that helps to close systemati… ▽ More

    Submitted 1 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

  49. arXiv:2003.04567  [pdf, other

    cs.AI cs.CL cs.LG

    Ecological Semantics: Programming Environments for Situated Language Understanding

    Authors: Ronen Tamari, Gabriel Stanovsky, Dafna Shahaf, Reut Tsarfaty

    Abstract: Large-scale natural language understanding (NLU) systems have made impressive progress: they can be applied flexibly across a variety of tasks, and employ minimal structural assumptions. However, extensive empirical research has shown this to be a double-edged sword, coming at the cost of shallow understanding: inferior generalization, grounding and explainability. Grounded language learning appro… ▽ More

    Submitted 24 May, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: Camera ready for Bridging AI and Cognitive Science (BAICS) workshop at ICLR2020. For interactive demos, see https://eco-sem.github.io/

  50. arXiv:1909.08970  [pdf, other

    cs.CL

    RUN through the Streets: A New Dataset and Baseline Models for Realistic Urban Navigation

    Authors: Tzuf Paz-Argaman, Reut Tsarfaty

    Abstract: Following navigation instructions in natural language requires a composition of language, action, and knowledge of the environment. Knowledge of the environment may be provided via visual sensors or as a symbolic world representation referred to as a map. Here we introduce the Realistic Urban Navigation (RUN) task, aimed at interpreting navigation instructions based on a real, dense, urban map. Us… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: accepted to appear at the EMNLP 2019 conference