Showing 1–8 of 8 results for author: Vacareanu, R

  1. arXiv:2405.00204  [pdf, other]

    cs.CL cs.AI

    General Purpose Verification for Chain of Thought Prompting

    Authors: Robert Vacareanu, Anurag Pratik, Evangelia Spiliopoulou, Zheng Qi, Giovanni Paolini, Neha Anna John, Jie Ma, Yassine Benajiba, Miguel Ballesteros

    Abstract: Many of the recent capabilities demonstrated by Large Language Models (LLMs) arise primarily from their ability to exploit contextual information. In this paper, we explore ways to improve reasoning capabilities of LLMs through (1) exploration of different chains of thought and (2) validation of the individual steps of the reasoning process. We propose three general principles that a model should…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 22 pages, preprint

  2. arXiv:2404.07544  [pdf, other]

    cs.CL cs.AI

    From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples

    Authors: Robert Vacareanu, Vlad-Andrei Negru, Vasile Suciu, Mihai Surdeanu

    Abstract: We analyze how well pre-trained large language models (e.g., Llama2, GPT-4, Claude 3, etc) can do linear and non-linear regression when given in-context examples, without any additional training or gradient updates. Our findings reveal that several large language models (e.g., GPT-4, Claude 3) are able to perform regression tasks with a performance rivaling (or even outperforming) that of traditio…

    Submitted 30 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 50 pages, 48 figures, preprint; Fixed typos

  3. arXiv:2404.04445  [pdf, ps, other]

    cs.CL cs.IR

    Towards Realistic Few-Shot Relation Extraction: A New Meta Dataset and Evaluation

    Authors: Fahmida Alam, Md Asiful Islam, Robert Vacareanu, Mihai Surdeanu

    Abstract: We introduce a meta dataset for few-shot relation extraction, which includes two datasets derived from existing supervised relation extraction datasets NYT29 (Takanobu et al., 2019; Nayak and Ng, 2020) and WIKIDATA (Sorokin and Gurevych, 2017) as well as a few-shot form of the TACRED dataset (Sabo et al., 2021). Importantly, all these few-shot datasets were generated under realistic assumptions su…

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2403.03305  [pdf, other]

    cs.CL cs.AI

    Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification

    Authors: Robert Vacareanu, Fahmida Alam, Md Asiful Islam, Haris Riaz, Mihai Surdeanu

    Abstract: This paper introduces a novel neuro-symbolic architecture for relation classification (RC) that combines rule-based methods with contemporary deep learning techniques. This approach capitalizes on the strengths of both paradigms: the adaptability of rule-based systems and the generalization power of neural networks. Our architecture consists of two components: a declarative rule-based model for tr…

    Submitted 5 March, 2024; originally announced March 2024.

  5. arXiv:2307.05034  [pdf, other]

    cs.CL

    Synthetic Dataset for Evaluating Complex Compositional Knowledge for Natural Language Inference

    Authors: Sushma Anand Akoju, Robert Vacareanu, Haris Riaz, Eduardo Blanco, Mihai Surdeanu

    Abstract: We introduce a synthetic dataset called Sentences Involving Complex Compositional Knowledge (SICCK) and a novel analysis that investigates the performance of Natural Language Inference (NLI) models to understand compositionality in logic. We produce 1,304 sentence pairs by modifying 15 examples from the SICK dataset (Marelli et al., 2014). To this end, we modify the original texts using a set of p…

    Submitted 11 July, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted to Natural Language Reasoning and Structured Explanations (NLRSE) Workshop, ACL 2023. For the dataset, please refer to https://github.com/clulab/releases/tree/master/acl2023-nlrse-sicck and https://github.com/sushmaakoju/natural-logic

  6. arXiv:2305.11979  [pdf, other]

    cs.CL

    A Weak Supervision Approach for Few-Shot Aspect Based Sentiment

    Authors: Robert Vacareanu, Siddharth Varia, Kishaloy Halder, Shuai Wang, Giovanni Paolini, Neha Anna John, Miguel Ballesteros, Smaranda Muresan

    Abstract: We explore how weak supervision on abundant unlabeled data can be leveraged to improve few-shot performance in aspect-based sentiment analysis (ABSA) tasks. We propose a pipeline approach to construct a noisy ABSA dataset, and we use it to adapt a pre-trained sequence-to-sequence model to the ABSA tasks. We test the resulting model on three widely used ABSA datasets, before and after fine-tuning.…

    Submitted 19 May, 2023; originally announced May 2023.

  7. arXiv:2210.06629  [pdf, other]

    cs.CL

    Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

    Authors: Siddharth Varia, Shuai Wang, Kishaloy Halder, Robert Vacareanu, Miguel Ballesteros, Yassine Benajiba, Neha Anna John, Rishita Anubhai, Smaranda Muresan, Dan Roth

    Abstract: Aspect-based Sentiment Analysis (ABSA) is a fine-grained sentiment analysis task which involves four elements from user-generated texts: aspect term, aspect category, opinion term, and sentiment polarity. Most computational approaches focus on some of the ABSA sub-tasks such as tuple (aspect term, sentiment polarity) or triplet (aspect term, opinion term, sentiment polarity) extraction using eithe…

    Submitted 11 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Camera ready copy for WASSA at ACL 2023

  8. arXiv:2202.00475  [pdf, ps, other]

    cs.CL cs.IR cs.LG

    From Examples to Rules: Neural Guided Rule Synthesis for Information Extraction

    Authors: Robert Vacareanu, Marco A. Valenzuela-Escarcega, George C. G. Barbosa, Rebecca Sharp, Mihai Surdeanu

    Abstract: While deep learning approaches to information extraction have had many successes, they can be difficult to augment or maintain as needs shift. Rule-based methods, on the other hand, can be more easily modified. However, crafting rules requires expertise in linguistics and the domain of interest, making it infeasible for most users. Here we attempt to combine the advantages of these two directions…

    Submitted 16 January, 2022; originally announced February 2022.