Showing 1–17 of 17 results for author: Raman, N

  1. arXiv:2406.00738  [pdf, other]

    cs.LG cs.AI cs.CY

    Global Rewards in Restless Multi-Armed Bandits

    Authors: Naveen Raman, Zheyuan Ryan Shi, Fei Fang

    Abstract: Restless multi-armed bandits (RMAB) extend multi-armed bandits so pulling an arm impacts future states. Despite the success of RMABs, a key limiting assumption is the separability of rewards into a sum across arms. We address this deficiency by proposing restless-multi-armed bandit with global rewards (RMAB-G), a generalization of RMABs to global non-separable rewards. To solve RMAB-G, we develop…

    Submitted 7 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 27 pages

  2. arXiv:2405.18217  [pdf, other]

    cs.LG

    Understanding Inter-Concept Relationships in Concept-Based Models

    Authors: Naveen Raman, Mateo Espinosa Zarlenga, Mateja Jamnik

    Abstract: Concept-based explainability methods provide insight into deep learning systems by constructing explanations using human-understandable concepts. While the literature on human reasoning demonstrates that we exploit relationships between concepts when solving tasks, it is unclear whether concept-based methods incorporate the rich structure of inter-concept relationships. We analyse the concept repr…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024

  3. arXiv:2404.06162  [pdf, other]

    cs.CL cs.AI cs.LG

    Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

    Authors: Tianyu Cao, Natraj Raman, Danial Dervovic, Chenhao Tan

    Abstract: As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study…

    Submitted 8 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  4. arXiv:2402.09552  [pdf, other]

    cs.CL econ.GN

    STEER: Assessing the Economic Rationality of Large Language Models

    Authors: Narun Raman, Taylor Lundy, Samuel Amouyal, Yoav Levine, Kevin Leyton-Brown, Moshe Tennenholtz

    Abstract: There is increasing interest in using LLMs as decision-making "agents." Doing so includes many degrees of freedom: which model should be used; how should it be prompted; should it be asked to introspect, conduct chain-of-thought reasoning, etc? Settling these questions -- and more broadly, determining whether an LLM agent is reliable enough to be trusted -- requires a methodology for assessing suc…

    Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2401.01259  [pdf, other]

    cs.LG cs.AI

    Do Concept Bottleneck Models Obey Locality?

    Authors: Naveen Raman, Mateo Espinosa Zarlenga, Juyeon Heo, Mateja Jamnik

    Abstract: Concept-based methods explain model predictions using human-understandable concepts. These models require accurate concept predictors, yet the faithfulness of existing concept predictors to their underlying concepts is unclear. In this paper, we investigate the faithfulness of Concept Bottleneck Models (CBMs), a popular family of concept-based architectures, by looking at whether they respect "loc…

    Submitted 28 May, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Previous version accepted at NeurIPS 23 XAI in Action Workshop

  6. arXiv:2401.00908  [pdf, other]

    cs.CL

    DocLLM: A layout-aware generative language model for multimodal document understanding

    Authors: Dongsheng Wang, Natraj Raman, Mathieu Sibue, Zhiqiang Ma, Petr Babkin, Simerjot Kaur, Yulong Pei, Armineh Nourbakhsh, Xiaomo Liu

    Abstract: Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities. The visual cues offered by their complex layouts play a crucial role in comprehending these documents effectively. In this paper, we present DocLLM, a lightweight extension to traditional large language models (LLMs…

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: 16 pages, 4 figures

  7. arXiv:2401.00081  [pdf, other]

    cs.LG q-fin.GN

    Synthetic Data Applications in Finance

    Authors: Vamsi K. Potluru, Daniel Borrajo, Andrea Coletta, Niccolò Dalmasso, Yousef El-Laham, Elizabeth Fons, Mohsen Ghassemi, Sriram Gopalakrishnan, Vikesh Gosai, Eleonora Kreačić, Ganapathy Mani, Saheed Obitayo, Deepak Paramanand, Natraj Raman, Mikhail Solonin, Srijan Sood, Svitlana Vyetrenko, Haibei Zhu, Manuela Veloso, Tucker Balch

    Abstract: Synthetic data has made tremendous strides in various commercial settings including finance, healthcare, and virtual reality. We present a broad overview of prototypical applications of synthetic data in the financial sector and in particular provide richer details for a few select ones. These cover a wide variety of data modalities including tabular, time-series, event-series, and unstructured ar…

    Submitted 20 March, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 50 pages, journal submission; updated 6 privacy levels

  8. arXiv:2312.10205  [pdf, other]

    cs.GT

    Pay to (Not) Play: Monetizing Impatience in Mobile Games

    Authors: Taylor Lundy, Narun Raman, Hu Fu, Kevin Leyton-Brown

    Abstract: Mobile gaming is a rapidly growing and incredibly profitable sector; having grown seven-fold over the past 10 years, it now grosses over $100 billion annually. This growth was due in large part to a shift in monetization strategies: rather than charging players an upfront cost ("pay-to-play"), games often request optional microtransactions throughout gameplay ("free-to-play"). We focus on a common…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 18 pages

  9. arXiv:2309.06550  [pdf, other]

    cs.CL cs.AI

    Synthetic Text Generation using Hypergraph Representations

    Authors: Natraj Raman, Sameena Shah

    Abstract: Generating synthetic variants of a document is often posed as text-to-text transformation. We propose an alternate LLM based method that first decomposes a document into semantic frames and then generates text using this interim sparse format. The frames are modeled using a hypergraph, which allows perturbing the frame contents in a principled manner. Specifically, new hyperedges are mined through…

    Submitted 2 December, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  10. arXiv:2303.12872  [pdf, other]

    cs.HC cs.AI cs.LG

    Human Uncertainty in Concept-Based AI Systems

    Authors: Katherine M. Collins, Matthew Barker, Mateo Espinosa Zarlenga, Naveen Raman, Umang Bhatt, Mateja Jamnik, Ilia Sucholutsky, Adrian Weller, Krishnamurthy Dvijotham

    Abstract: Placing a human in the loop may abate the risks of deploying AI systems in safety-critical settings (e.g., a clinician working with a medical AI system). However, mitigating risks arising from human error and uncertainty within such human-AI interactions is an important and understudied issue. In this work, we study human uncertainty in the context of concept-based models, a family of AI systems t…

    Submitted 22 March, 2023; originally announced March 2023.

  11. arXiv:2301.08833  [pdf, other]

    cs.LG cs.AI

    Bayesian Hierarchical Models for Counterfactual Estimation

    Authors: Natraj Raman, Daniele Magazzeni, Sameena Shah

    Abstract: Counterfactual explanations utilize feature perturbations to analyze the outcome of an original decision and recommend an actionable recourse. We argue that it is beneficial to provide several alternative explanations rather than a single point solution and propose a probabilistic paradigm to estimate a diverse set of counterfactuals. Specifically, we treat the perturbations as random variables en…

    Submitted 20 January, 2023; originally announced January 2023.

  12. arXiv:2211.00083  [pdf, other]

    cs.CL cs.AI cs.LG

    WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain

    Authors: Raj Sanjay Shah, Kunal Chawla, Dheeraj Eidnani, Agam Shah, Wendi Du, Sudheer Chava, Natraj Raman, Charese Smiley, Jiaao Chen, Diyi Yang

    Abstract: Pre-trained language models have shown impressive performance on a variety of tasks and domains. Previous research on financial language models usually employs a generic training scheme to train standard model architectures, without completely leveraging the richness of the financial data. We propose a novel domain specific Financial LANGuage model (FLANG) which uses financial keywords and phrases…

    Submitted 31 October, 2022; originally announced November 2022.

  13. arXiv:2201.03720  [pdf, other]

    cs.IR

    Structure and Semantics Preserving Document Representations

    Authors: Natraj Raman, Sameena Shah, Manuela Veloso

    Abstract: Retrieving relevant documents from a corpus is typically based on the semantic similarity between the document content and query text. The inclusion of structural relationship between documents can benefit the retrieval mechanism by addressing semantic gaps. However, incorporating these relationships requires tractable mechanisms that balance structure with semantics and take advantage of the prev…

    Submitted 1 April, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  14. arXiv:2112.10768  [pdf, other]

    cs.LG cs.AI cs.HC

    Improving Learning-to-Defer Algorithms Through Fine-Tuning

    Authors: Naveen Raman, Michael Yee

    Abstract: The ubiquity of AI leads to situations where humans and AI work together, creating the need for learning-to-defer algorithms that determine how to partition tasks between AI and humans. We work to improve learning-to-defer algorithms when paired with specific individuals by incorporating two fine-tuning algorithms and testing their efficacy using both synthetic and image datasets. We find that fin…

    Submitted 18 December, 2021; originally announced December 2021.

  15. Synthetic Document Generator for Annotation-free Layout Recognition

    Authors: Natraj Raman, Sameena Shah, Manuela Veloso

    Abstract: Analyzing the layout of a document to identify headers, sections, tables, figures etc. is critical to understanding its content. Deep learning based approaches for detecting the layout structure of document images have been promising. However, these methods require a large number of annotated examples during training, which are both expensive and time consuming to obtain. We describe here a synthe…

    Submitted 24 July, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

  16. arXiv:2110.03524  [pdf, other]

    cs.AI

    Data-Driven Methods for Balancing Fairness and Efficiency in Ride-Pooling

    Authors: Naveen Raman, Sanket Shah, John Dickerson

    Abstract: Rideshare and ride-pooling platforms use artificial intelligence-based matching algorithms to pair riders and drivers. However, these platforms can induce inequality either through an unequal income distribution or disparate treatment of riders. We investigate two methods to reduce forms of inequality in ride-pooling platforms: (1) incorporating fairness constraints into the objective function and…

    Submitted 7 October, 2021; originally announced October 2021.

  17. arXiv:2010.12681  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Robust Document Representations using Latent Topics and Metadata

    Authors: Natraj Raman, Armineh Nourbakhsh, Sameena Shah, Manuela Veloso

    Abstract: Task specific fine-tuning of a pre-trained neural language model using a custom softmax output layer is the de facto approach of late when dealing with document classification problems. This technique is not adequate when labeled examples are not available at training time and when the metadata artifacts in a document must be exploited. We address these challenges by generating document representa…

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 9 pages, 7 figures

    ACM Class: I.2.7; I.7.0