Showing 1–2 of 2 results for author: Saraf, J

  1. arXiv:2404.12926  [pdf, other]

    cs.AI

    MM-PhyRLHF: Reinforcement Learning Framework for Multimodal Physics Question-Answering

    Authors: Avinash Anand, Janak Kapuriya, Chhavi Kirtani, Apoorv Singh, Jay Saraf, Naman Lal, Jatin Kumar, Adarsh Raj Shivam, Astha Verma, Rajiv Ratn Shah, Roger Zimmermann

    Abstract: Recent advancements in LLMs have shown their significant potential in tasks like text summarization and generation. Yet, they often encounter difficulty while solving complex physics problems that require arithmetic calculation and a good understanding of concepts. Moreover, many physics problems include images that contain important details required to understand the problem's context. We propose…

    Submitted 19 April, 2024; originally announced April 2024.

  2. arXiv:2404.08704  [pdf, other]

    cs.CL cs.AI

    MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting

    Authors: Avinash Anand, Janak Kapuriya, Apoorv Singh, Jay Saraf, Naman Lal, Astha Verma, Rushali Gupta, Rajiv Shah

    Abstract: While Large Language Models (LLMs) can achieve human-level performance in various tasks, they continue to face challenges when it comes to effectively tackling multi-step physics reasoning tasks. To identify the shortcomings of existing models and facilitate further research in this area, we curated a novel dataset, MM-PhyQA, which comprises well-constructed, high school-level multimodal physics pr…

    Submitted 11 April, 2024; originally announced April 2024.