Skip to main content

Showing 1–9 of 9 results for author: Unhelkar, V

  1. arXiv:2404.16989  [pdf, other

    cs.LG cs.AI cs.RO

    IDIL: Imitation Learning of Intent-Driven Expert Behavior

    Authors: Sangwon Seo, Vaibhav Unhelkar

    Abstract: When faced with accomplishing a task, human experts exhibit intentional behavior. Their unique intents shape their plans and decisions, resulting in experts demonstrating diverse behaviors to accomplish the same task. Due to the uncertainties encountered in the real world and their bounded rationality, experts sometimes adjust their intents, which in turn influences their behaviors during task exe… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Extended version of an identically-titled paper accepted at AAMAS 2024

  2. arXiv:2312.12102  [pdf, other

    cs.AI cs.CV cs.HC cs.LG

    I-CEE: Tailoring Explanations of Image Classification Models to User Expertise

    Authors: Yao Rong, Peizhu Qian, Vaibhav Unhelkar, Enkelejda Kasneci

    Abstract: Effectively explaining decisions of black-box machine learning models is critical to responsible deployment of AI systems that rely on them. Recognizing their importance, the field of explainable AI (XAI) provides several techniques to generate these explanations. Yet, there is relatively little emphasis on the user (the explainee) in this growing body of work and most XAI techniques generate "one… ▽ More

    Submitted 10 January, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  3. arXiv:2312.10802  [pdf, other

    cs.LG cs.AI

    GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation

    Authors: Abhinav Jain, Vaibhav Unhelkar

    Abstract: Offline imitation learning (IL) refers to learning expert behavior solely from demonstrations, without any additional interaction with the environment. Despite significant advances in offline IL, existing techniques find it challenging to learn policies for long-horizon tasks and require significant re-training when task specifications change. Towards addressing these limitations, we present GO-DI… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Extended version of an identically-titled paper accepted at AAAI 2024

  4. arXiv:2303.00413  [pdf, other

    cs.AI cs.LG cs.MA

    Automated Task-Time Interventions to Improve Teamwork using Imitation Learning

    Authors: Sangwon Seo, Bing Han, Vaibhav Unhelkar

    Abstract: Effective human-human and human-autonomy teamwork is critical but often challenging to perfect. The challenge is particularly relevant in time-critical domains, such as healthcare and disaster response, where the time pressures can make coordination increasingly difficult to achieve and the consequences of imperfect coordination can be severe. To improve teamwork in these and other domains, we pre… ▽ More

    Submitted 2 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Extended version of an identically-titled paper accepted at AAMAS 2023

  5. arXiv:2210.11584  [pdf, other

    cs.AI cs.HC

    Towards Human-centered Explainable AI: A Survey of User Studies for Model Explanations

    Authors: Yao Rong, Tobias Leemann, Thai-trang Nguyen, Lisa Fiedler, Peizhu Qian, Vaibhav Unhelkar, Tina Seidel, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: Explainable AI (XAI) is widely viewed as a sine qua non for ever-expanding AI research. A better understanding of the needs of XAI users, as well as human-centered evaluations of explainable models are both a necessity and a challenge. In this paper, we explore how HCI and AI researchers conduct user studies in XAI applications based on a systematic literature review. After identifying and thoroug… ▽ More

    Submitted 19 December, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

  6. arXiv:2205.02959  [pdf, other

    cs.AI cs.LG cs.MA

    Semi-Supervised Imitation Learning of Team Policies from Suboptimal Demonstrations

    Authors: Sangwon Seo, Vaibhav V. Unhelkar

    Abstract: We present Bayesian Team Imitation Learner (BTIL), an imitation learning algorithm to model the behavior of teams performing sequential tasks in Markovian domains. In contrast to existing multi-agent imitation learning techniques, BTIL explicitly models and infers the time-varying mental states of team members, thereby enabling learning of decentralized team policies from demonstrations of subopti… ▽ More

    Submitted 19 September, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: Extended version of an identically-titled paper accepted at IJCAI 2022

  7. arXiv:2103.15171  [pdf, other

    cs.AI

    A Bayesian Approach to Identifying Representational Errors

    Authors: Ramya Ramakrishnan, Vaibhav Unhelkar, Ece Kamar, Julie Shah

    Abstract: Trained AI systems and expert decision makers can make errors that are often difficult to identify and understand. Determining the root cause for these errors can improve future decisions. This work presents Generative Error Model (GEM), a generative model for inferring representational errors based on observations of an actor's behavior (either simulated agent, robot, or human). The model conside… ▽ More

    Submitted 28 March, 2021; originally announced March 2021.

  8. arXiv:2102.08507  [pdf, other

    cs.AI cs.HC cs.LG cs.MA

    Towards an AI Coach to Infer Team Mental Model Alignment in Healthcare

    Authors: Sangwon Seo, Lauren R. Kennedy-Metz, Marco A. Zenati, Julie A. Shah, Roger D. Dias, Vaibhav V. Unhelkar

    Abstract: Shared mental models are critical to team success; however, in practice, team members may have misaligned models due to a variety of factors. In safety-critical domains (e.g., aviation, healthcare), lack of shared mental models can lead to preventable errors and harm. Towards the goal of mitigating such preventable errors, here, we present a Bayesian approach to infer misalignment in team members'… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: Submitted to the 2021 IEEE Conference on Cognitive and Computational Aspects of Situation Management (CogSIMA)

    MSC Class: 68T37; 62F15 (Primary) 90C40; 62M05; 62P10; 91C99 (Secondary) ACM Class: I.2.m; G.3; J.3

  9. arXiv:2011.08458  [pdf, other

    cs.RO

    Learning Dense Rewards for Contact-Rich Manipulation Tasks

    Authors: Zheng Wu, Wenzhao Lian, Vaibhav Unhelkar, Masayoshi Tomizuka, Stefan Schaal

    Abstract: Rewards play a crucial role in reinforcement learning. To arrive at the desired policy, the design of a suitable reward function often requires significant domain expertise as well as trial-and-error. Here, we aim to minimize the effort involved in designing reward functions for contact-rich manipulation tasks. In particular, we provide an approach capable of extracting dense reward functions algo… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: 8 pages, 5 figures