
Showing 1–3 of 3 results for author: Imbrasaite, V

  1. arXiv:2308.15299  [pdf, other]

    cs.CL

    TaskLAMA: Probing the Complex Task Understanding of Language Models

    Authors: Quan Yuan, Mehran Kazemi, Xin Xu, Isaac Noble, Vaiva Imbrasaite, Deepak Ramachandran

    Abstract: Structured Complex Task Decomposition (SCTD) is the problem of breaking down a complex real-world task (such as planning a wedding) into a directed acyclic graph over individual steps that contribute to achieving the task, with edges specifying temporal dependencies between them. SCTD is an important component of assistive planning tools, and a challenge for commonsense reasoning systems. We probe…

    Submitted 29 August, 2023; originally announced August 2023.

  2. arXiv:2306.07934  [pdf, other]

    cs.CL cs.AI cs.LG

    BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information

    Authors: Mehran Kazemi, Quan Yuan, Deepti Bhatia, Najoung Kim, Xin Xu, Vaiva Imbrasaite, Deepak Ramachandran

    Abstract: Automated reasoning with unstructured natural text is a key requirement for many potential applications of NLP and for developing robust AI systems. Recently, Language Models (LMs) have demonstrated complex reasoning capacities even without any finetuning. However, existing evaluation for automated reasoning assumes access to a consistent and coherent set of information over which models reason. W…

    Submitted 13 June, 2023; originally announced June 2023.

  3. arXiv:2305.14128  [pdf, other]

    cs.CL cs.AI

    Dr.ICL: Demonstration-Retrieved In-context Learning

    Authors: Man Luo, Xin Xu, Zhuyun Dai, Panupong Pasupat, Mehran Kazemi, Chitta Baral, Vaiva Imbrasaite, Vincent Y Zhao

    Abstract: In-context learning (ICL), teaching a large language model (LLM) to perform a task with few-shot demonstrations rather than adjusting the model parameters, has emerged as a strong paradigm for using LLMs. While early studies primarily used a fixed or random set of demonstrations for all test queries, recent research suggests that retrieving semantically similar demonstrations to the input from a p…

    Submitted 23 May, 2023; originally announced May 2023.