Skip to main content

Showing 1–4 of 4 results for author: Rothkopf, R

  1. arXiv:2406.11012  [pdf, other

    cs.CL cs.AI

    Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game

    Authors: Prisha Samadarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan

    Abstract: The New York Times Connections game has emerged as a popular and challenging pursuit for word puzzle enthusiasts. We collect 200 Connections games to evaluate the performance of state-of-the-art large language models (LLMs) against expert and novice human players. Our results show that even the best-performing LLM, GPT-4o, which has otherwise shown impressive reasoning abilities on a wide variety… ▽ More

    Submitted 15 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2406.07400  [pdf, other

    cs.LG cs.LO

    Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control

    Authors: William Murphy, Nikolaus Holzer, Nathan Koenig, Leyi Cui, Raven Rothkopf, Feitong Qiao, Mark Santolucito

    Abstract: Temporal logics are powerful tools that are widely used for the synthesis and verification of reactive systems. The recent progress on Large Language Models (LLMs) has the potential to make the process of writing such specifications more accessible. However, writing specifications in temporal logics remains challenging for all but the most expert users. A key question in using LLMs for temporal lo… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2402.17743  [pdf, other

    cs.PL

    Rose: Composable Autodiff for the Interactive Web

    Authors: Sam Estep, Wode Ni, Raven Rothkopf, Joshua Sunshine

    Abstract: Reverse-mode automatic differentiation (autodiff) has been popularized by deep learning, but its ability to compute gradients is also valuable for interactive use cases such as bidirectional computer-aided design, embedded physics simulations, visualizing causal inference, and more. Unfortunately, the web is ill-served by existing autodiff frameworks, which use autodiff strategies that perform poo… ▽ More

    Submitted 12 July, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  4. arXiv:2402.16905  [pdf, ps, other

    cs.AI cs.LG cs.LO

    Enforcing Temporal Constraints on Generative Agent Behavior with Reactive Synthesis

    Authors: Raven Rothkopf, Hannah Tongxin Zeng, Mark Santolucito

    Abstract: The surge in popularity of Large Language Models (LLMs) has opened doors for new approaches to the creation of interactive agents. However, managing the temporal behavior of such agents over the course of an interaction remains challenging. The stateful, long-term horizon and quantitative reasoning required for coherent agent behavior does not fit well into the LLM paradigm. We propose a combinati… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 22 pages