Skip to main content

Showing 1–29 of 29 results for author: Chakrabarty, T

  1. arXiv:2406.11012  [pdf, other

    cs.CL cs.AI

    Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game

    Authors: Prisha Samadarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan

    Abstract: The New York Times Connections game has emerged as a popular and challenging pursuit for word puzzle enthusiasts. We collect 200 Connections games to evaluate the performance of state-of-the-art large language models (LLMs) against expert and novice human players. Our results show that even the best-performing LLM, GPT-4o, which has otherwise shown impressive reasoning abilities on a wide variety… ▽ More

    Submitted 15 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2405.01474  [pdf, other

    cs.CL cs.AI cs.CV

    V-FLUTE: Visual Figurative Language Understanding with Textual Explanations

    Authors: Arkadiy Saakyan, Shreyas Kulkarni, Tuhin Chakrabarty, Smaranda Muresan

    Abstract: Large Vision-Language models (VLMs) have demonstrated strong reasoning capabilities in tasks requiring a fine-grained understanding of literal images and text, such as visual question-answering or visual entailment. However, there has been little exploration of these models' capabilities when presented with images and captions containing figurative phenomena such as metaphors or humor, the meaning… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  3. arXiv:2311.09066  [pdf, other

    cs.CL

    Identifying Self-Disclosures of Use, Misuse and Addiction in Community-based Social Media Posts

    Authors: Chenghao Yang, Tuhin Chakrabarty, Karli R Hochstatter, Melissa N Slavin, Nabila El-Bassel, Smaranda Muresan

    Abstract: In the last decade, the United States has lost more than 500,000 people from an overdose involving prescription and illicit opioids making it a national public health emergency (USDHHS, 2017). Medical practitioners require robust and timely tools that can effectively identify at-risk patients. Community-based social media platforms such as Reddit allow self-disclosure for users to discuss otherwis… ▽ More

    Submitted 13 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 Findings (Camera-Ready Version). Codes and Data are available at https://github.com/yangalan123/OpioidID

  4. arXiv:2310.19145  [pdf, other

    cs.CL cs.CV

    Learning to Follow Object-Centric Image Editing Instructions Faithfully

    Authors: Tuhin Chakrabarty, Kanishk Singh, Arkadiy Saakyan, Smaranda Muresan

    Abstract: Natural language instructions are a powerful interface for editing the outputs of text-to-image diffusion models. However, several challenges need to be addressed: 1) underspecification (the need to model the implicit meaning of instructions) 2) grounding (the need to localize where the edit has to be performed), 3) faithfulness (the need to preserve the elements of the image not affected by the e… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023 (Long paper)

  5. arXiv:2309.14556  [pdf, other

    cs.CL cs.AI cs.HC

    Art or Artifice? Large Language Models and the False Promise of Creativity

    Authors: Tuhin Chakrabarty, Philippe Laban, Divyansh Agarwal, Smaranda Muresan, Chien-Sheng Wu

    Abstract: Researchers have argued that large language models (LLMs) exhibit high-quality writing capabilities from blogs to stories. However, evaluating objectively the creativity of a piece of writing is challenging. Inspired by the Torrance Test of Creative Thinking (TTCT), which measures creativity as a process, we use the Consensual Assessment Technique [3] and propose the Torrance Test of Creative Writ… ▽ More

    Submitted 8 March, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: ACM CHI 2024

  6. arXiv:2309.12570  [pdf, other

    cs.HC cs.AI cs.CL cs.CY

    Creativity Support in the Age of Large Language Models: An Empirical Study Involving Emerging Writers

    Authors: Tuhin Chakrabarty, Vishakh Padmakumar, Faeze Brahman, Smaranda Muresan

    Abstract: The development of large language models (LLMs) capable of following instructions and engaging in conversational interactions sparked increased interest in their utilization across various support tools. We investigate the utility of modern LLMs in assisting professional writers via an empirical user study (n=30). The design of our collaborative writing interface is grounded in the cognitive proce… ▽ More

    Submitted 30 January, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  7. arXiv:2305.14724  [pdf, other

    cs.CL cs.AI cs.CV cs.HC

    I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors

    Authors: Tuhin Chakrabarty, Arkadiy Saakyan, Olivia Winn, Artemis Panagopoulou, Yue Yang, Marianna Apidianaki, Smaranda Muresan

    Abstract: Visual metaphors are powerful rhetorical devices used to persuade or communicate creative ideas through images. Similar to linguistic metaphors, they convey meaning implicitly through symbolism and juxtaposition of the symbols. We propose a new task of generating visual metaphors from linguistic metaphors. This is a challenging task for diffusion-based text-to-image models, such as DALL$\cdot$E 2,… ▽ More

    Submitted 14 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (Findings)

  8. arXiv:2301.09992  [pdf, other

    cs.CL

    Multitask Instruction-based Prompting for Fallacy Recognition

    Authors: Tariq Alhindi, Tuhin Chakrabarty, Elena Musi, Smaranda Muresan

    Abstract: Fallacies are used as seemingly valid arguments to support a position and persuade the audience about its validity. Recognizing fallacies is an intrinsically difficult task both for humans and machines. Moreover, a big challenge for computational models lies in the fact that fallacies are formulated differently across the datasets with differences in the input format (e.g., question-answer pair, s… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 8172 - 8187

    Journal ref: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 8172 - 8187

  9. arXiv:2210.13669  [pdf, other

    cs.CL

    Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing

    Authors: Tuhin Chakrabarty, Vishakh Padmakumar, He He

    Abstract: Recent work in training large language models (LLMs) to follow natural language instructions has opened up exciting opportunities for natural language interface design. Building on the prior success of LLMs in the realm of computer-assisted creativity, we aim to study if LLMs can improve the quality of user-generated content through collaboration. We present CoPoet, a collaborative poetry writing… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022

  10. arXiv:2210.11536  [pdf, other

    cs.CL

    CONSISTENT: Open-Ended Question Generation From News Articles

    Authors: Tuhin Chakrabarty, Justin Lewis, Smaranda Muresan

    Abstract: Recent work on question generation has largely focused on factoid questions such as who, what, where, when about basic facts. Generating open-ended why, how, what, etc. questions that require long-form answers have proven more difficult. To facilitate the generation of open-ended questions, we propose CONSISTENT, a new end-to-end system for generating open-ended questions that are answerable from… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022 Findings

  11. arXiv:2205.12404  [pdf, other

    cs.CL

    FLUTE: Figurative Language Understanding through Textual Explanations

    Authors: Tuhin Chakrabarty, Arkadiy Saakyan, Debanjan Ghosh, Smaranda Muresan

    Abstract: Figurative language understanding has been recently framed as a recognizing textual entailment (RTE) task (a.k.a. natural language inference, or NLI). However, similar to classical RTE/NLI datasets, the current benchmarks suffer from spurious correlations and annotation artifacts. To tackle this problem, work on NLI has built explanation-based datasets such as e-SNLI, allowing us to probe whether… ▽ More

    Submitted 14 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: EMNLP 2022 Main Conference (Long Paper)

  12. arXiv:2205.12393  [pdf, other

    cs.CL

    Fine-tuned Language Models are Continual Learners

    Authors: Thomas Scialom, Tuhin Chakrabarty, Smaranda Muresan

    Abstract: Recent work on large language models relies on the intuition that most natural language processing tasks can be described via natural language instructions. Language models trained on these instructions show strong zero-shot performance on several standard datasets. However, these models even though impressive still perform poorly on a wide range of tasks outside of their respective training and e… ▽ More

    Submitted 29 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  13. arXiv:2109.05358  [pdf, other

    cs.CL

    Implicit Premise Generation with Discourse-aware Commonsense Knowledge Models

    Authors: Tuhin Chakrabarty, Aadit Trivedi, Smaranda Muresan

    Abstract: Enthymemes are defined as arguments where a premise or conclusion is left implicit. We tackle the task of generating the implicit premise in an enthymeme, which requires not only an understanding of the stated conclusion and premise but also additional inferences that could depend on commonsense knowledge. The largest available dataset for enthymemes (Habernal et al., 2018) consists of 1.7k sample… ▽ More

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 Camera ready

  14. arXiv:2109.02972  [pdf, other

    cs.CL

    Don't Go Far Off: An Empirical Study on Neural Poetry Translation

    Authors: Tuhin Chakrabarty, Arkadiy Saakyan, Smaranda Muresan

    Abstract: Despite constant improvements in machine translation quality, automatic poetry translation remains a challenging problem due to the lack of open-sourced parallel poetic corpora, and to the intrinsic complexities involved in preserving the semantics, style, and figurative nature of poetry. We present an empirical investigation for poetry translation along several dimensions: 1) size and style of tr… ▽ More

    Submitted 10 September, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 Camera ready

  15. arXiv:2109.00087  [pdf, other

    cs.CL cs.LG

    It's not Rocket Science : Interpreting Figurative Language in Narratives

    Authors: Tuhin Chakrabarty, Yejin Choi, Vered Shwartz

    Abstract: Figurative language is ubiquitous in English. Yet, the vast majority of NLP research focuses on literal language. Existing text representations by design rely on compositionality, while figurative language is often non-compositional. In this paper, we study the interpretation of two non-compositional figurative languages (idioms and similes). We collected datasets of fictional narratives containin… ▽ More

    Submitted 1 March, 2022; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: Accepted to TACL ( To be presented at ACL 2022, Dublin)

  16. arXiv:2106.03794  [pdf, other

    cs.CL

    COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic

    Authors: Arkadiy Saakyan, Tuhin Chakrabarty, Smaranda Muresan

    Abstract: We introduce a FEVER-like dataset COVID-Fact of $4,086$ claims concerning the COVID-19 pandemic. The dataset contains claims, evidence for the claims, and contradictory claims refuted by the evidence. Unlike previous approaches, we automatically detect true claims and their source articles and then generate counter-claims using automatic methods rather than employing human annotators. Along with o… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: ACL 2021 Camera Ready

  17. arXiv:2106.01228  [pdf, other

    cs.CL

    Metaphor Generation with Conceptual Mappings

    Authors: Kevin Stowe, Tuhin Chakrabarty, Nanyun Peng, Smaranda Muresan, Iryna Gurevych

    Abstract: Generating metaphors is a difficult task as it requires understanding nuanced relationships between abstract concepts. In this paper, we aim to generate a metaphoric sentence given a literal expression by replacing relevant verbs. Guided by conceptual metaphor theory, we propose to control the generation process by encoding conceptual mappings between cognitive domains to generate meaningful metap… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: 13 pages, 3 figures, to be published in the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

    ACM Class: I.2.7

  18. arXiv:2106.01195  [pdf, ps, other

    cs.CL cs.AI

    Figurative Language in Recognizing Textual Entailment

    Authors: Tuhin Chakrabarty, Debanjan Ghosh, Adam Poliak, Smaranda Muresan

    Abstract: We introduce a collection of recognizing textual entailment (RTE) datasets focused on figurative language. We leverage five existing datasets annotated for a variety of figurative language -- simile, metaphor, and irony -- and frame them into over 12,500 RTE examples.We evaluate how well state-of-the-art models trained on popular RTE datasets capture different aspects of figurative language. Our r… ▽ More

    Submitted 3 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: ACL 2021 (Findings)

  19. arXiv:2103.06779  [pdf, other

    cs.CL

    MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding

    Authors: Tuhin Chakrabarty, Xurui Zhang, Smaranda Muresan, Nanyun Peng

    Abstract: Generating metaphors is a challenging task as it requires a proper understanding of abstract concepts, making connections between unrelated concepts, and deviating from the literal meaning. In this paper, we aim to generate a metaphoric sentence given a literal expression by replacing relevant verbs. Based on a theoretically-grounded connection between metaphors and symbols, we propose a method to… ▽ More

    Submitted 10 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: NAACL 2021

  20. arXiv:2103.06758  [pdf, other

    cs.CL cs.AI

    ENTRUST: Argument Reframing with Language Models and Entailment

    Authors: Tuhin Chakrabarty, Christopher Hidey, Smaranda Muresan

    Abstract: Framing involves the positive or negative presentation of an argument or issue depending on the audience and goal of the speaker (Entman 1983). Differences in lexical framing, the focus of our work, can have large effects on peoples' opinions and beliefs. To make progress towards reframing arguments for positive effects, we create a dataset and method for this task. We use a lexical resource for "… ▽ More

    Submitted 10 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: NAACL 2021

  21. arXiv:2102.02191  [pdf, other

    cs.CL

    DiSCoL: Toward Engaging Dialogue Systems through Conversational Line Guided Response Generation

    Authors: Sarik Ghazarian, Zixi Liu, Tuhin Chakrabarty, Xuezhe Ma, Aram Galstyan, Nanyun Peng

    Abstract: Having engaging and informative conversations with users is the utmost goal for open-domain conversational systems. Recent advances in transformer-based language models and their applications to dialogue systems have succeeded to generate fluent and human-like responses. However, they still lack control over the generation process towards producing contentful responses and achieving engaging conve… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

  22. arXiv:2009.09870  [pdf, other

    cs.CL cs.AI

    Content Planning for Neural Story Generation with Aristotelian Rescoring

    Authors: Seraphina Goldfarb-Tarrant, Tuhin Chakrabarty, Ralph Weischedel, Nanyun Peng

    Abstract: Long-form narrative text generated from large language models manages a fluent impersonation of human writing, but only at the local sentence level, and lacks structure or global cohesion. We posit that many of the problems of story generation can be addressed via high-quality content planning, and present a system that focuses on how to learn good plot structures to guide story generation. We uti… ▽ More

    Submitted 9 October, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: EMNLP 2020, 9 pages

  23. arXiv:2009.08942  [pdf, other

    cs.CL cs.LG

    Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile Generation

    Authors: Tuhin Chakrabarty, Smaranda Muresan, Nanyun Peng

    Abstract: Literary tropes, from poetry to stories, are at the crux of human imagination and communication. Figurative language such as a simile go beyond plain expressions to give readers new insights and inspirations. In this paper, we tackle the problem of simile generation. Generating a simile requires proper understanding for effective mapping of properties between two concepts. To this end, we first pr… ▽ More

    Submitted 3 October, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

    Comments: EMNLP 2020

  24. arXiv:2004.14677  [pdf, other

    cs.CL cs.AI

    AMPERSAND: Argument Mining for PERSuAsive oNline Discussions

    Authors: Tuhin Chakrabarty, Christopher Hidey, Smaranda Muresan, Kathy Mckeown, Alyssa Hwang

    Abstract: Argumentation is a type of discourse where speakers try to persuade their audience about the reasonableness of a claim by presenting supportive arguments. Most work in argument mining has focused on modeling arguments in monologues. We propose a computational model for argument mining in online persuasive discussion forums that brings together the micro-level (argument as product) and macro-level… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: EMNLP 2019

  25. arXiv:2004.13248  [pdf, other

    cs.CL cs.AI cs.LG

    $R^3$: Reverse, Retrieve, and Rank for Sarcasm Generation with Commonsense Knowledge

    Authors: Tuhin Chakrabarty, Debanjan Ghosh, Smaranda Muresan, Nanyun Peng

    Abstract: We propose an unsupervised approach for sarcasm generation based on a non-sarcastic input sentence. Our method employs a retrieve-and-edit framework to instantiate two major characteristics of sarcasm: reversal of valence and semantic incongruity with the context which could include shared commonsense or world knowledge between the speaker and the listener. While prior works on sarcasm generation… ▽ More

    Submitted 17 June, 2020; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: Accepted at the 2020 Annual Conference of the Association for Computational Linguistics (ACL)

  26. arXiv:2004.12864  [pdf, other

    cs.CL

    DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking

    Authors: Christopher Hidey, Tuhin Chakrabarty, Tariq Alhindi, Siddharth Varia, Kriste Krstovski, Mona Diab, Smaranda Muresan

    Abstract: The increased focus on misinformation has spurred development of data and systems for detecting the veracity of a claim as well as retrieving authoritative evidence. The Fact Extraction and VERification (FEVER) dataset provides such a resource for evaluating end-to-end fact-checking, requiring retrieval of evidence from Wikipedia to validate a veracity prediction. We show that current systems for… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  27. arXiv:2004.04938  [pdf, other

    cs.CL cs.AI

    Identifying Distributional Perspective Differences from Colingual Groups

    Authors: Yufei Tian, Tuhin Chakrabarty, Fred Morstatter, Nanyun Peng

    Abstract: Perspective differences exist among different cultures or languages. A lack of mutual understanding among different groups about their perspectives on specific values or events may lead to uninformed decisions or biased opinions. Automatically understanding the group perspectives can provide essential background for many downstream applications of natural language processing techniques. In this pa… ▽ More

    Submitted 12 April, 2021; v1 submitted 10 April, 2020; originally announced April 2020.

  28. arXiv:1905.07000  [pdf, other

    cs.CL

    IMHO Fine-Tuning Improves Claim Detection

    Authors: Tuhin Chakrabarty, Christopher Hidey, Kathleen McKeown

    Abstract: Claims are the central component of an argument. Detecting claims across different domains or data sets can often be challenging due to their varying conceptualization. We propose to alleviate this problem by fine tuning a language model using a Reddit corpus of 5.5 million opinionated claims. These claims are self-labeled by their authors using the internet acronyms IMO/IMHO (in my (humble) opini… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: NAACL 2019

  29. Context-Aware Attention for Understanding Twitter Abuse

    Authors: Tuhin Chakrabarty, Kilol Gupta

    Abstract: The original goal of any social media platform is to facilitate users to indulge in healthy and meaningful conversations. But more often than not, it has been found that it becomes an avenue for wanton attacks. We want to alleviate this issue and hence we try to provide a detailed analysis of how abusive behavior can be monitored in Twitter. The complexity of the natural language constructs makes… ▽ More

    Submitted 29 January, 2020; v1 submitted 23 September, 2018; originally announced September 2018.

    Comments: The full published version of this work is available at: \url{https://www.aclweb.org/anthology/W19-3508/}. Please use the version published in the ACL anthology for citation purposes

    Journal ref: Proc. 3rd Workshop on Abusive Language Online, pp. 70-79, 2019