Showing 1–37 of 37 results for author: Shwartz, V

  1. arXiv:2407.00263  [pdf, other]

    cs.CL cs.AI cs.CV

    From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models

    Authors: Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, Eunjeong Hwang, Vered Shwartz

    Abstract: Despite recent advancements in vision-language models, their performance remains suboptimal on images from non-western cultures due to underrepresentation in training datasets. Various benchmarks have been proposed to test models' cultural inclusivity, but they have limited coverage of cultures and do not adequately assess cultural diversity across universal as well as culture-specific local conce…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Under peer review

  2. arXiv:2404.06664  [pdf, other]

    cs.CL cs.AI cs.HC

    CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

    Authors: Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, Yejin Choi

    Abstract: Frontier large language models (LLMs) are developed by researchers and practitioners with skewed cultural backgrounds and on datasets with skewed sources. However, LLMs' (lack of) multicultural knowledge cannot be effectively assessed with current methods for developing benchmarks. Existing multicultural evaluations primarily rely on expensive and restricted human annotations or potentially outdat…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint (under review)

  3. arXiv:2403.14895  [pdf, other]

    cs.CL cs.AI

    Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning

    Authors: Maksym Taranukhin, Vered Shwartz, Evangelos Milios

    Abstract: Social media platforms are rich sources of opinionated content. Stance detection allows the automatic extraction of users' opinions on various topics from such content. We focus on zero-shot stance detection, where the model's success relies on (a) having knowledge about the target topic; and (b) learning general reasoning strategies that can be employed for new topics. We present Stance Reasoner,…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to COLING 2024

  4. arXiv:2403.12678  [pdf, other]

    cs.CL cs.AI

    Empowering Air Travelers: A Chatbot for Canadian Air Passenger Rights

    Authors: Maksym Taranukhin, Sahithya Ravi, Gabor Lukacs, Evangelos Milios, Vered Shwartz

    Abstract: The Canadian air travel sector has seen a significant increase in flight delays, cancellations, and other issues concerning passenger rights. Recognizing this demand, we present a chatbot to assist passengers and educate them about their rights. Our system breaks a complex user input into simple queries which are used to retrieve information from a collection of documents detailing air travel regu…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: under review

  5. arXiv:2402.18113  [pdf, other]

    cs.CL cs.AI

    Small But Funny: A Feedback-Driven Approach to Humor Distillation

    Authors: Sahithya Ravi, Patrick Huber, Akshat Shrivastava, Aditya Sagar, Ahmed Aly, Vered Shwartz, Arash Einolghozati

    Abstract: The emergence of Large Language Models (LLMs) has brought to light promising language generation capabilities, particularly in performing tasks like complex reasoning and creative writing. Consequently, distillation through imitation of teacher responses has emerged as a popular technique to transfer knowledge from LLMs to more accessible, Small Language Models (SLMs). While this works well for si…

    Submitted 28 February, 2024; originally announced February 2024.

  6. arXiv:2311.01684  [pdf, other]

    cs.CL

    CASE: Commonsense-Augmented Score with an Expanded Answer Space

    Authors: Wenkai Chen, Sahithya Ravi, Vered Shwartz

    Abstract: LLMs have demonstrated impressive zero-shot performance on NLP tasks thanks to the knowledge they acquired in their training. In multiple-choice QA tasks, the LM probabilities are used as an imperfect measure of the plausibility of each answer choice. One of the major limitations of the basic score is that it treats all words as equally important. We propose CASE, a Commonsense-Augmented Score wit…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Findings of EMNLP 2023

  7. arXiv:2310.20072  [pdf, other]

    cs.CL cs.LG

    Automatic Evaluation of Generative Models with Instruction Tuning

    Authors: Shuhaib Mehri, Vered Shwartz

    Abstract: Automatic evaluation of natural language generation has long been an elusive goal in NLP. A recent paradigm fine-tunes pre-trained language models to emulate human judgements for a particular task and evaluation criterion. Inspired by the generalization ability of instruction-tuned models, we propose a learned metric based on instruction tuning. To test our approach, we collected HEAP, a dataset of…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 11 pages, 1 figure

  8. arXiv:2310.15383  [pdf, other]

    cs.CL

    GD-COMET: A Geo-Diverse Commonsense Inference Model

    Authors: Mehar Bhatia, Vered Shwartz

    Abstract: With the increasing integration of AI into everyday life, it's becoming crucial to design AI systems that serve users from diverse backgrounds by making them culturally aware. In this paper, we present GD-COMET, a geo-diverse version of the COMET commonsense inference model. GD-COMET goes beyond Western commonsense knowledge and is capable of generating inferences pertaining to a broad range of cu…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Main Conference

  9. arXiv:2305.14763  [pdf, other]

    cs.CL

    Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models

    Authors: Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz

    Abstract: The escalating debate on AI's capabilities warrants developing reliable metrics to assess machine "intelligence". Recently, many anecdotal examples were used to suggest that newer large language models (LLMs) like ChatGPT and GPT-4 exhibit Neural Theory-of-Mind (N-ToM); however, prior work reached conflicting conclusions regarding those abilities. We investigate the extent of LLMs' N-ToM through a…

    Submitted 24 May, 2023; originally announced May 2023.

  10. arXiv:2305.14617  [pdf, other]

    cs.CL cs.AI

    COMET-M: Reasoning about Multiple Events in Complex Sentences

    Authors: Sahithya Ravi, Raymond Ng, Vered Shwartz

    Abstract: Understanding the speaker's intended meaning often involves drawing commonsense inferences to reason about what is not stated explicitly. In multi-event sentences, it requires understanding the relationships between events based on contextual knowledge. We propose COMET-M (Multi-Event), an event-centric commonsense model capable of generating commonsense inferences for a target event within a comp…

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  11. arXiv:2305.13703  [pdf, other]

    cs.CL

    MemeCap: A Dataset for Captioning and Interpreting Memes

    Authors: EunJeong Hwang, Vered Shwartz

    Abstract: Memes are a widely popular tool for web users to express their thoughts using visual metaphors. Understanding memes requires recognizing and interpreting visual metaphors with respect to the text inside or around the meme, often while employing background knowledge and reasoning abilities. We present the task of meme captioning and release a new dataset, MemeCap. Our dataset contains 6.3K memes al…

    Submitted 23 May, 2023; originally announced May 2023.

  12. arXiv:2305.10568  [pdf, other]

    cs.CL

    From chocolate bunny to chocolate crocodile: Do Language Models Understand Noun Compounds?

    Authors: Jordan Coil, Vered Shwartz

    Abstract: Noun compound interpretation is the task of expressing a noun compound (e.g. chocolate bunny) in a free-text paraphrase that makes the relationship between the constituent nouns explicit (e.g. bunny-shaped chocolate). We propose modifications to the data and evaluation setup of the standard task (Hendrickx et al., 2013), and show that GPT-3 solves it almost perfectly. We then investigate the task…

    Submitted 24 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  13. arXiv:2302.09715  [pdf, other]

    cs.CL

    What happens before and after: Multi-Event Commonsense in Event Coreference Resolution

    Authors: Sahithya Ravi, Chris Tanner, Raymond Ng, Vered Shwartz

    Abstract: Event coreference models cluster event mentions pertaining to the same real-world event. Recent models rely on contextualized representations to recognize coreference among lexically or contextually similar mentions. However, models typically fail to leverage commonsense inferences, which is particularly limiting for resolving lexically-divergent mentions. We propose a model that extends event men…

    Submitted 21 February, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: Accepted to EACL 2023

  14. arXiv:2210.13626  [pdf, other]

    cs.CV cs.CL

    VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge

    Authors: Sahithya Ravi, Aditya Chinchure, Leonid Sigal, Renjie Liao, Vered Shwartz

    Abstract: There has been a growing interest in solving Visual Question Answering (VQA) tasks that require the model to reason beyond the content present in the image. In this work, we focus on questions that require commonsense reasoning. In contrast to previous methods which inject knowledge from static knowledge bases, we investigate the incorporation of contextualized knowledge using Commonsense Transfor…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023. For code and supplementary material, see https://github.com/aditya10/VLC-BERT

  15. arXiv:2109.06437  [pdf, other]

    cs.CL

    Uncovering Implicit Gender Bias in Narratives through Commonsense Inference

    Authors: Tenghao Huang, Faeze Brahman, Vered Shwartz, Snigdha Chaturvedi

    Abstract: Pre-trained language models learn socially harmful biases from their training corpora, and may repeat these biases when used for generation. We study gender biases associated with the protagonist in model-generated stories. Such biases may be expressed either explicitly ("women can't park") or implicitly (e.g. an unsolicited male character guides her into a parking space). We focus on implicit bia…

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted at Findings of EMNLP 2021

  16. arXiv:2109.00087  [pdf, other]

    cs.CL cs.LG

    It's not Rocket Science: Interpreting Figurative Language in Narratives

    Authors: Tuhin Chakrabarty, Yejin Choi, Vered Shwartz

    Abstract: Figurative language is ubiquitous in English. Yet, the vast majority of NLP research focuses on literal language. Existing text representations by design rely on compositionality, while figurative language is often non-compositional. In this paper, we study the interpretation of two non-compositional figurative languages (idioms and similes). We collected datasets of fictional narratives containin…

    Submitted 1 March, 2022; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: Accepted to TACL (to be presented at ACL 2022, Dublin)

  17. arXiv:2104.08315  [pdf, other]

    cs.CL

    Surface Form Competition: Why the Highest Probability Answer Isn't Always Right

    Authors: Ari Holtzman, Peter West, Vered Shwartz, Yejin Choi, Luke Zettlemoyer

    Abstract: Large language models have shown promising results in zero-shot settings (Brown et al., 2020; Radford et al., 2019). For example, they can perform multiple choice tasks simply by conditioning on a question and selecting the answer with the highest probability. However, ranking by string probability can be problematic due to surface form competition, wherein different surface forms compete for prob…

    Submitted 20 November, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

  18. arXiv:2012.08012  [pdf, other]

    cs.CL

    Learning to Rationalize for Nonmonotonic Reasoning with Distant Supervision

    Authors: Faeze Brahman, Vered Shwartz, Rachel Rudinger, Yejin Choi

    Abstract: The black-box nature of neural models has motivated a line of research that aims to generate natural language rationales to explain why a model made certain predictions. Such rationale generation models, to date, have been trained on dataset-specific crowdsourced rationales, but this approach is costly and is not generalizable to new tasks and domains. In this paper, we investigate the extent to w…

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: AAAI 2021

  19. arXiv:2011.00620  [pdf, other]

    cs.CL cs.AI

    Social Chemistry 101: Learning to Reason about Social and Moral Norms

    Authors: Maxwell Forbes, Jena D. Hwang, Vered Shwartz, Maarten Sap, Yejin Choi

    Abstract: Social norms -- the unspoken commonsense rules about acceptable social behavior -- are crucial in understanding the underlying causes and intents of people's actions in narratives. For example, underlying an action such as "wanting to call cops on my neighbors" are social norms that inform our conduct, such as "It is expected that you report crimes." We present Social Chemistry, a new conceptual…

    Submitted 16 August, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: Published at EMNLP 2020

  20. arXiv:2010.05906  [pdf, other]

    cs.CL cs.AI cs.LG

    Back to the Future: Unsupervised Backprop-based Decoding for Counterfactual and Abductive Commonsense Reasoning

    Authors: Lianhui Qin, Vered Shwartz, Peter West, Chandra Bhagavatula, Jena Hwang, Ronan Le Bras, Antoine Bosselut, Yejin Choi

    Abstract: Abductive and counterfactual reasoning, core abilities of everyday human cognition, require reasoning about what might have happened at time t, while conditioning on multiple contexts from the relative past and future. However, simultaneous incorporation of past and future contexts using generative language models (LMs) can be challenging, as they are trained either to condition only on the past c…

    Submitted 2 August, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  21. arXiv:2010.01486  [pdf, other]

    cs.CL cs.LG

    Paragraph-level Commonsense Transformers with Recurrent Memory

    Authors: Saadia Gabriel, Chandra Bhagavatula, Vered Shwartz, Ronan Le Bras, Maxwell Forbes, Yejin Choi

    Abstract: Human understanding of narrative texts requires making commonsense inferences beyond what is stated explicitly in the text. A recent model, COMET, can generate such implicit commonsense inferences along several dimensions such as pre- and post-conditions, motivations, and mental states of the participants. However, COMET was trained on commonsense inferences of short phrases, and is therefore disc…

    Submitted 2 February, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: AAAI 2021

  22. arXiv:2004.14979  [pdf, other]

    cs.CL

    Paraphrasing vs Coreferring: Two Sides of the Same Coin

    Authors: Yehudit Meged, Avi Caciularu, Vered Shwartz, Ido Dagan

    Abstract: We study the potential synergy between two different NLP tasks, both confronting predicate lexical variability: identifying predicate paraphrases, and event coreference resolution. First, we used annotations from an event coreference dataset as distant supervision to re-score heuristically-extracted predicate paraphrases. The new scoring gained more than 18 points in average precision upon their r…

    Submitted 9 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

  23. arXiv:2004.05483  [pdf, other]

    cs.CL

    Unsupervised Commonsense Question Answering with Self-Talk

    Authors: Vered Shwartz, Peter West, Ronan Le Bras, Chandra Bhagavatula, Yejin Choi

    Abstract: Natural language understanding involves reading between the lines with implicit background knowledge. Current systems either rely on pre-trained language models as the sole implicit source of world knowledge, or resort to external knowledge bases (KBs) to incorporate additional relevant knowledge. We propose an unsupervised framework based on self-talk as a novel alternative to multiple-choice com…

    Submitted 15 September, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

    Comments: EMNLP 2020

  24. arXiv:2004.03012  [pdf, other]

    cs.CL

    "You are grounded!": Latent Name Artifacts in Pre-trained Language Models

    Authors: Vered Shwartz, Rachel Rudinger, Oyvind Tafjord

    Abstract: Pre-trained language models (LMs) may perpetuate biases originating in their training corpus to downstream models. We focus on artifacts associated with the representation of given names (e.g., Donald), which, depending on the corpus, may be associated with specific entities, as indicated by next token prediction (e.g., Trump). While helpful in some contexts, grounding happens also in under-specif…

    Submitted 15 September, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: EMNLP 2020

  25. arXiv:1910.09302  [pdf, other]

    cs.CL

    Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets

    Authors: Ohad Rozen, Vered Shwartz, Roee Aharoni, Ido Dagan

    Abstract: Phenomenon-specific "adversarial" datasets have been recently designed to perform targeted stress-tests for particular inference types. Recent work (Liu et al., 2019a) proposed that such datasets can be utilized for training NLI and other types of models, often allowing to learn the phenomenon in focus and improve on the challenge dataset, indicating a "blind spot" in the original training data. Y…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: CoNLL 2019

  26. arXiv:1906.04772  [pdf, ps, other]

    cs.CL

    A Systematic Comparison of English Noun Compound Representations

    Authors: Vered Shwartz

    Abstract: Building meaningful representations of noun compounds is not trivial since many of them scarcely appear in the corpus. To that end, composition functions approximate the distributional representation of a noun compound by combining its constituent distributional vectors. In the more general case, phrase embeddings have been trained by minimizing the distance between the vectors representing paraph…

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: MWE workshop @ ACL 2019

  27. arXiv:1906.01753  [pdf, other]

    cs.CL

    Revisiting Joint Modeling of Cross-document Entity and Event Coreference Resolution

    Authors: Shany Barhom, Vered Shwartz, Alon Eirew, Michael Bugert, Nils Reimers, Ido Dagan

    Abstract: Recognizing coreferring events and entities across multiple texts is crucial for many NLP applications. Despite the task's importance, research focus was given mostly to within-document entity coreference, with rather little attention to the other variants. We propose a neural architecture for cross-document coreference resolution. Inspired by Lee et al. (2012), we jointly model entity and event co…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  28. arXiv:1902.10618  [pdf, other]

    cs.CL

    Still a Pain in the Neck: Evaluating Text Representations on Lexical Composition

    Authors: Vered Shwartz, Ido Dagan

    Abstract: Building meaningful phrase representations is challenging because phrase meanings are not simply the sum of their constituent meanings. Lexical composition can shift the meanings of the constituent words and introduce implicit information. We tested a broad range of textual representations for their capacity to address these issues. We found that as expected, contextualized word representations pe…

    Submitted 19 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: TACL 2019

  29. arXiv:1810.12686  [pdf, other]

    cs.CL

    Evaluating Text GANs as Language Models

    Authors: Guy Tevet, Gavriel Habib, Vered Shwartz, Jonathan Berant

    Abstract: Generative Adversarial Networks (GANs) are a promising approach for text generation that, unlike traditional language models (LM), does not suffer from the problem of "exposure bias". However, a major hurdle for understanding the potential of GANs for text generation is the lack of a clear evaluation metric. In this work, we propose to approximate the distribution of text generated by a GAN, whi…

    Submitted 24 March, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

  30. arXiv:1805.02442  [pdf, other]

    cs.CL

    Paraphrase to Explicate: Revealing Implicit Noun-Compound Relations

    Authors: Vered Shwartz, Ido Dagan

    Abstract: Revealing the implicit semantic relation between the constituents of a noun-compound is important for many NLP applications. It has been addressed in the literature either as a classification task to a set of pre-defined relations or by producing free text paraphrases explicating the relations. Most existing paraphrasing methods lack the ability to generalize, and have a hard time interpreting inf…

    Submitted 7 May, 2018; originally announced May 2018.

    Comments: Long paper at ACL 2018

  31. arXiv:1805.02266  [pdf, ps, other]

    cs.CL

    Breaking NLI Systems with Sentences that Require Simple Lexical Inferences

    Authors: Max Glockner, Vered Shwartz, Yoav Goldberg

    Abstract: We create a new NLI test set that shows the deficiency of state-of-the-art models in inferences that require lexical and world knowledge. The new examples are simpler than the SNLI test set, containing sentences that differ by at most one word from sentences in the training set. Yet, the performance on the new test set is substantially worse across systems trained on SNLI, demonstrating that these…

    Submitted 6 May, 2018; originally announced May 2018.

    Comments: 6 pages, short paper at ACL 2018

  32. arXiv:1804.08845  [pdf, ps, other]

    cs.CL

    Integrating Multiplicative Features into Supervised Distributional Methods for Lexical Entailment

    Authors: Tu Vu, Vered Shwartz

    Abstract: Supervised distributional methods are applied successfully in lexical entailment, but recent work questioned whether these methods actually learn a relation between two words. Specifically, Levy et al. (2015) claimed that linear classifiers learn only separate properties of each word. We suggest a cheap and easy way to boost the performance of these methods by integrating multiplicative features i…

    Submitted 24 April, 2018; originally announced April 2018.

    Comments: Accepted as a conference paper at *SEM 2018

  33. arXiv:1803.08073  [pdf, ps, other]

    cs.CL

    Olive Oil is Made of Olives, Baby Oil is Made for Babies: Interpreting Noun Compounds using Paraphrases in a Neural Model

    Authors: Vered Shwartz, Chris Waterson

    Abstract: Automatic interpretation of the relation between the constituents of a noun compound, e.g. olive oil (source) and baby oil (purpose) is an important task for many NLP applications. Recent approaches are typically based on either noun-compound representations or paraphrases. While the former has initially shown promising results, recent work suggests that the success stems from memorizing single pr…

    Submitted 21 March, 2018; originally announced March 2018.

    Comments: 7 pages, short paper at NAACL 2018

  34. arXiv:1612.04460  [pdf, ps, other]

    cs.CL

    Hypernyms under Siege: Linguistically-motivated Artillery for Hypernymy Detection

    Authors: Vered Shwartz, Enrico Santus, Dominik Schlechtweg

    Abstract: The fundamental role of hypernymy in NLP has motivated the development of many methods for the automatic identification of this relation, most of which rely on word distribution. We investigate an extensive number of such unsupervised measures, using several distributional semantic models that differ by context type and feature weighting. We analyze the performance of the different methods based o…

    Submitted 8 January, 2017; v1 submitted 13 December, 2016; originally announced December 2016.

    Comments: EACL 2017. 9 pages

  35. arXiv:1610.08694  [pdf, ps, other]

    cs.CL

    CogALex-V Shared Task: LexNET - Integrated Path-based and Distributional Method for the Identification of Semantic Relations

    Authors: Vered Shwartz, Ido Dagan

    Abstract: We present a submission to the CogALex 2016 shared task on the corpus-based identification of semantic relations, using LexNET (Shwartz and Dagan, 2016), an integrated path-based and distributional method for semantic relation classification. The reported results in the shared task bring this submission to the third place on subtask 1 (word relatedness), and the first place on subtask 2 (semantic…

    Submitted 1 November, 2016; v1 submitted 27 October, 2016; originally announced October 2016.

    Comments: 5 pages, accepted to the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V), in COLING 2016

  36. arXiv:1608.05014  [pdf, ps, other]

    cs.CL

    Path-based vs. Distributional Information in Recognizing Lexical Semantic Relations

    Authors: Vered Shwartz, Ido Dagan

    Abstract: Recognizing various semantic relations between terms is beneficial for many NLP tasks. While path-based and distributional information sources are considered complementary for this task, the superior results the latter showed recently suggested that the former's contribution might have become obsolete. We follow the recent success of an integrated neural method for hypernymy detection (Shwartz et…

    Submitted 2 November, 2016; v1 submitted 17 August, 2016; originally announced August 2016.

    Comments: 5 pages, accepted to the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V), in COLING 2016

  37. arXiv:1603.06076  [pdf, other]

    cs.CL

    Improving Hypernymy Detection with an Integrated Path-based and Distributional Method

    Authors: Vered Shwartz, Yoav Goldberg, Ido Dagan

    Abstract: Detecting hypernymy relations is a key task in NLP, which is addressed in the literature using two complementary approaches: distributional methods, whose supervised variants are the current best performers, and path-based methods, which received less research attention. We suggest an improved path-based algorithm, in which the dependency paths are encoded using a recurrent neural network, that ac…

    Submitted 7 June, 2016; v1 submitted 19 March, 2016; originally announced March 2016.

    Comments: ACL 2016