Skip to main content

Showing 1–9 of 9 results for author: Chawla, R

  1. arXiv:2405.15341  [pdf, other

    cs.AI cs.CV

    V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM

    Authors: Abdur Rahman, Rajat Chawla, Muskaan Kumar, Arkajit Datta, Adarsh Jha, Mukunda NS, Ishaan Bhola

    Abstract: In the rapidly evolving landscape of AI research and application, Multimodal Large Language Models (MLLMs) have emerged as a transformative force, adept at interpreting and integrating information from diverse modalities such as text, images, and Graphical User Interfaces (GUIs). Despite these advancements, the nuanced interaction and understanding of GUIs pose a significant challenge, limiting th… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2404.16048  [pdf, other

    cs.HC cs.AI

    GUIDE: Graphical User Interface Data for Execution

    Authors: Rajat Chawla, Adarsh Jha, Muskaan Kumar, Mukunda NS, Ishaan Bhola

    Abstract: In this paper, we introduce GUIDE, a novel dataset tailored for the advancement of Multimodal Large Language Model (MLLM) applications, particularly focusing on Robotic Process Automation (RPA) use cases. Our dataset encompasses diverse data from various websites including Apollo(62.67\%), Gmail(3.43\%), Calendar(10.98\%) and Canva(22.92\%). Each data entry includes an image, a task description, t… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 11 pages, 8 figures, 3 Tables and 1 Algorithm

  3. arXiv:2403.10171  [pdf

    cs.AI cs.CV

    AUTONODE: A Neuro-Graphic Self-Learnable Engine for Cognitive GUI Automation

    Authors: Arkajit Datta, Tushar Verma, Rajat Chawla, Mukunda N. S, Ishaan Bhola

    Abstract: In recent advancements within the domain of Large Language Models (LLMs), there has been a notable emergence of agents capable of addressing Robotic Process Automation (RPA) challenges through enhanced cognitive capabilities and sophisticated reasoning. This development heralds a new era of scalability and human-like adaptability in goal attainment. In this context, we introduce AUTONODE (Autonomo… ▽ More

    Submitted 27 May, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted in MIPR-2024

  4. arXiv:2403.08773  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    Veagle: Advancements in Multimodal Representation Learning

    Authors: Rajat Chawla, Arkajit Datta, Tushar Verma, Adarsh Jha, Anmol Gautam, Ayush Vatsal, Sukrit Chaterjee, Mukunda NS, Ishaan Bhola

    Abstract: Lately, researchers in artificial intelligence have been really interested in how language and vision come together, giving rise to the development of multimodal models that aim to seamlessly integrate textual and visual information. Multimodal models, an extension of Large Language Models (LLMs), have exhibited remarkable capabilities in addressing a diverse array of tasks, ranging from image cap… ▽ More

    Submitted 18 January, 2024; originally announced March 2024.

  5. From Information to Choice: A Critical Inquiry Into Visualization Tools for Decision Making

    Authors: Emre Oral, Ria Chawla, Michel Wijkstra, Narges Mahyar, Evanthia Dimara

    Abstract: In the face of complex decisions, people often engage in a three-stage process that spans from (1) exploring and analyzing pertinent information (intelligence); (2) generating and exploring alternative options (design); and ultimately culminating in (3) selecting the optimal decision by evaluating discerning criteria (choice). We can fairly assume that all good visualizations aid in the intelligen… ▽ More

    Submitted 2 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  6. arXiv:2305.18784  [pdf, ps, other

    cs.LG cs.DC cs.MA cs.SI stat.ML

    Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits

    Authors: Ronshee Chawla, Daniel Vial, Sanjay Shakkottai, R. Srikant

    Abstract: The study of collaborative multi-agent bandits has attracted significant attention recently. In light of this, we initiate the study of a new collaborative setting, consisting of $N$ agents such that each agent is learning one of $M$ stochastic multi-armed bandits to minimize their group cumulative regret. We develop decentralized algorithms which facilitate collaboration between the agents under… ▽ More

    Submitted 2 July, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: To appear in the proceedings of ICML 2023

  7. arXiv:2007.01442  [pdf, other

    cs.LG cs.DC cs.SI stat.ML

    Multi-Agent Low-Dimensional Linear Bandits

    Authors: Ronshee Chawla, Abishek Sankararaman, Sanjay Shakkottai

    Abstract: We study a multi-agent stochastic linear bandit with side information, parameterized by an unknown vector $θ^* \in \mathbb{R}^d$. The side information consists of a finite collection of low-dimensional subspaces, one of which contains $θ^*$. In our setting, agents can collaborate to reduce regret by sending recommendations across a communication graph connecting them. We present a novel decentrali… ▽ More

    Submitted 25 May, 2022; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: To appear in IEEE Transactions on Automatic Control

  8. arXiv:2001.05452  [pdf, other

    cs.LG cs.DC cs.NI cs.SI stat.ML

    The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits

    Authors: Ronshee Chawla, Abishek Sankararaman, Ayalvadi Ganesh, Sanjay Shakkottai

    Abstract: We consider a decentralized multi-agent Multi Armed Bandit (MAB) setup consisting of $N$ agents, solving the same MAB instance to minimize individual cumulative regret. In our model, agents collaborate by exchanging messages through pairwise gossip style communications on an arbitrary connected graph. We develop two novel algorithms, where each agent only plays from a subset of all the arms. Agent… ▽ More

    Submitted 2 July, 2024; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: To Appear in AISTATS 2020. The first two authors contributed equally

  9. arXiv:1312.0114  [pdf

    cs.DC cs.CR

    Extended Role Based Access Control with Blob Service on Cloud

    Authors: Mamoon Rashid, Er. Rishma Chawla

    Abstract: Role-based access control (RBAC) models have generated a great interest in the security community as a powerful and generalized approach to security management and ability to model organizational structure and their capability to reduce administrative expenses. In this paper, we highlight the drawbacks of latest developed RBAC models in terms of access control and authorization and later provide a… ▽ More

    Submitted 30 November, 2013; originally announced December 2013.

    Comments: 6 page and 1 figure