Skip to main content

Showing 1–50 of 230 results for author: Jain, P

  1. arXiv:2407.09719  [pdf, other

    cs.LG cs.AI

    MSEval: A Dataset for Material Selection in Conceptual Design to Evaluate Algorithmic Models

    Authors: Yash Patawari Jain, Daniele Grandi, Allin Groom, Brandon Cramer, Christopher McComb

    Abstract: Material selection plays a pivotal role in many industries, from manufacturing to construction. Material selection is usually carried out after several cycles of conceptual design, during which designers iteratively refine the design solution and the intended manufacturing approach. In design research, material selection is typically treated as an optimization problem with a single correct answer.… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.03695

  2. arXiv:2407.09506  [pdf, other

    cs.CL

    Integrating Large Language Models with Graph-based Reasoning for Conversational Question Answering

    Authors: Parag Jain, Mirella Lapata

    Abstract: We focus on a conversational question answering task which combines the challenges of understanding questions in context and reasoning over evidence gathered from heterogeneous sources like text, knowledge graphs, tables, and infoboxes. Our method utilizes a graph structured representation to aggregate information about a question and its context (i.e., the conversation so far and evidence retriev… ▽ More

    Submitted 14 June, 2024; originally announced July 2024.

  3. arXiv:2407.00361  [pdf, other

    cs.CL cs.AI

    From RAG to RICHES: Retrieval Interlaced with Sequence Generation

    Authors: Palak Jain, Livio Baldini Soares, Tom Kwiatkowski

    Abstract: We present RICHES, a novel approach that interleaves retrieval with sequence generation tasks. RICHES offers an alternative to conventional RAG systems by eliminating the need for separate retriever and generator. It retrieves documents by directly decoding their contents, constrained on the corpus. Unifying retrieval with generation allows us to adapt to diverse new tasks via prompting alone. RIC… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 18 pages, 3 figures, Preprint

  4. arXiv:2406.03458  [pdf, other

    cs.LG

    Distributional Adversarial Loss

    Authors: Saba Ahmadi, Siddharth Bhandari, Avrim Blum, Chen Dan, Prabhav Jain

    Abstract: A major challenge in defending against adversarial attacks is the enormous space of possible attacks that even a simple adversary might perform. To address this, prior work has proposed a variety of defenses that effectively reduce the size of this space. These include randomized smoothing methods that add noise to the input to take away some of the adversary's impact. Another approach is input di… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2405.15372  [pdf, other

    cs.DS cs.GT

    When far is better: The Chamberlin-Courant approach to obnoxious committee selection

    Authors: Sushmita Gupta, Tanmay Inamdar, Pallavi Jain, Daniel Lokshtanov, Fahad Panolan, Saket Saurabh

    Abstract: Classical work on metric space based committee selection problem interprets distance as ``near is better''. In this work, motivated by real-life situations, we interpret distance as ``far is better''. Formally stated, we initiate the study of ``obnoxious'' committee scoring rules when the voters' preferences are expressed via a metric space. To this end, we propose a model where large distances im… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2405.06467  [pdf, other

    cs.CV

    Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection

    Authors: Sushovan Jena, Vishwas Saini, Ujjwal Shaw, Pavitra Jain, Abhay Singh Raihal, Anoushka Banerjee, Sharad Joshi, Ananth Ganesh, Arnav Bhavsar

    Abstract: Unsupervised anomaly detection encompasses diverse applications in industrial settings where a high-throughput and precision is imperative. Early works were centered around one-class-one-model paradigm, which poses significant challenges in large-scale production environments. Knowledge-distillation based multi-class anomaly detection promises a low latency with a reasonably good performance but w… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages

    MSC Class: 68T07 ACM Class: I.2.10

  7. arXiv:2405.03695  [pdf, other

    cs.CL

    Evaluating Large Language Models for Material Selection

    Authors: Daniele Grandi, Yash Patawari Jain, Allin Groom, Brandon Cramer, Christopher McComb

    Abstract: Material selection is a crucial step in conceptual design due to its significant impact on the functionality, aesthetics, manufacturability, and sustainability impact of the final product. This study investigates the use of Large Language Models (LLMs) for material selection in the product design process and compares the performance of LLMs against expert choices for various design scenarios. By c… ▽ More

    Submitted 23 April, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2307.03109 by other authors

  8. arXiv:2404.09866  [pdf, other

    cs.SE

    Reimagining Self-Adaptation in the Age of Large Language Models

    Authors: Raghav Donakanti, Prakhar Jain, Shubham Kulkarni, Karthik Vaidhyanathan

    Abstract: Modern software systems are subjected to various types of uncertainties arising from context, environment, etc. To this end, self-adaptation techniques have been sought out as potential solutions. Although recent advances in self-adaptation through the use of ML techniques have demonstrated promising results, the capabilities are limited by constraints imposed by the ML techniques, such as the nee… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  9. arXiv:2403.20327  [pdf, other

    cs.CL cs.AI

    Gecko: Versatile Text Embeddings Distilled from Large Language Models

    Authors: Jinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, Blair Chen, Daniel Cer, Jeremy R. Cole, Kai Hui, Michael Boratko, Rajvi Kapadia, Wen Ding, Yi Luan, Sai Meher Karthik Duddu, Gustavo Hernandez Abrego, Weiqiang Shi, Nithi Gupta, Aditya Kusupati, Prateek Jain, Siddhartha Reddy Jonnalagadda, Ming-Wei Chang, Iftekhar Naim

    Abstract: We present Gecko, a compact and versatile text embedding model. Gecko achieves strong retrieval performance by leveraging a key idea: distilling knowledge from large language models (LLMs) into a retriever. Our two-step distillation process begins with generating diverse, synthetic paired data using an LLM. Next, we further refine the data quality by retrieving a set of candidate passages for each… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 18 pages

  10. arXiv:2403.07558  [pdf, other

    cs.GT cs.DS

    Controlling Delegations in Liquid Democracy

    Authors: Shiri Alouf-Heffetz, Tanmay Inamdar, Pallavi Jain, Yash More, Nimrod Talmon

    Abstract: In liquid democracy, agents can either vote directly or delegate their vote to a different agent of their choice. This results in a power structure in which certain agents possess more voting weight than others. As a result, it opens up certain possibilities of vote manipulation, including control and bribery, that do not exist in standard voting scenarios of direct democracy. Here we formalize a… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted in 23rd International Conference on Autonomous Agents and Multiagent Systems(AAMAS 2024)

  11. arXiv:2403.07328  [pdf, other

    cs.DS

    Satisfiability to Coverage in Presence of Fairness, Matroid, and Global Constraints

    Authors: Tanmay Inamdar, Pallavi Jain, Daniel Lokshtanov, Abhishek Sahu, Saket Saurabh, Anannya Upasana

    Abstract: In MaxSAT with Cardinality Constraint problem (CC-MaxSAT), we are given a CNF-formula $Φ$, and $k \ge 0$, and the goal is to find an assignment $β$ with at most $k$ variables set to true (also called a weight $k$-assignment) such that the number of clauses satisfied by $β$ is maximized. MaxCov can be seen as a special case of CC-MaxSAT, where the formula $Φ$ is monotone, i.e., does not contain any… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Abstract shortened due to arxiv restrictions

  12. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  13. arXiv:2403.04630  [pdf, ps, other

    cs.DS cs.CR

    Time-Aware Projections: Truly Node-Private Graph Statistics under Continual Observation

    Authors: Palak Jain, Adam Smith, Connor Wagaman

    Abstract: We describe the first algorithms that satisfy the standard notion of node-differential privacy in the continual release setting (i.e., without an assumed promise on input streams). Previous work addresses node-private continual release by assuming an unenforced promise on the maximum degree in a graph; indeed, the algorithms from these works exhibit blatant privacy violations when the degree bound… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  14. arXiv:2403.04265  [pdf, other

    cs.GT cs.DS

    Conflict and Fairness in Resource Allocation

    Authors: Susobhan Bandopadhyay, Aritra Banik, Sushmita Gupta, Pallavi Jain, Abhishek Sahu, Saket Saurabh, Prafullkumar Tale

    Abstract: In the standard model of fair allocation of resources to agents, every agent has some utility for every resource, and the goal is to assign resources to agents so that the agents' welfare is maximized. Motivated by job scheduling, interest in this problem dates back to the work of Deuermeyer et al. [SIAM J. on Algebraic Discrete Methods'82]. Recent works consider the compatibility between resource… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.04995

  15. arXiv:2402.13636  [pdf, other

    cs.CV cs.CL cs.CY

    A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models

    Authors: Ashutosh Sathe, Prachi Jain, Sunayana Sitaram

    Abstract: Vision-language models (VLMs) have gained widespread adoption in both industry and academia. In this study, we propose a unified framework for systematically evaluating gender, race, and age biases in VLMs with respect to professions. Our evaluation encompasses all supported inference modes of the recent VLMs, including image-to-text, text-to-text, text-to-image, and image-to-image. Additionally,… ▽ More

    Submitted 17 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  16. arXiv:2402.09360  [pdf, other

    cs.LG cs.AI

    HiRE: High Recall Approximate Top-$k$ Estimation for Efficient LLM Inference

    Authors: Yashas Samaga B L, Varun Yerram, Chong You, Srinadh Bhojanapalli, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

    Abstract: Autoregressive decoding with generative Large Language Models (LLMs) on accelerators (GPUs/TPUs) is often memory-bound where most of the time is spent on transferring model parameters from high bandwidth memory (HBM) to cache. On the other hand, recent works show that LLMs can maintain quality with significant sparsity/redundancy in the feedforward (FFN) layers by appropriately training the model… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  17. arXiv:2402.08644  [pdf, other

    cs.AI cs.CL

    Tandem Transformers for Inference Efficient LLMs

    Authors: Aishwarya P S, Pranav Ajit Nair, Yashas Samaga, Toby Boyd, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

    Abstract: The autoregressive nature of conventional large language models (LLMs) inherently limits inference speed, as tokens are generated sequentially. While speculative and parallel decoding techniques attempt to mitigate this, they face limitations: either relying on less accurate smaller models for generation or failing to fully leverage the base LLM's representations. We introduce a novel architectu… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  18. arXiv:2402.07519  [pdf, other

    cs.CL cs.CY

    MAFIA: Multi-Adapter Fused Inclusive LanguAge Models

    Authors: Prachi Jain, Ashutosh Sathe, Varun Gumma, Kabir Ahuja, Sunayana Sitaram

    Abstract: Pretrained Language Models (PLMs) are widely used in NLP for various tasks. Recent studies have identified various biases that such models exhibit and have proposed methods to correct these biases. However, most of the works address a limited set of bias dimensions independently such as gender, race, or religion. Moreover, the methods typically involve finetuning the full model to maintain the per… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024

  19. arXiv:2402.02719  [pdf, ps, other

    cs.DS cs.GT

    Budget-feasible Egalitarian Allocation of Conflicting Jobs

    Authors: Sushmita Gupta, Pallavi Jain, A. Mohanapriya, Vikash Tripathi

    Abstract: Allocating conflicting jobs among individuals while respecting a budget constraint for each individual is an optimization problem that arises in various real-world scenarios. In this paper, we consider the situation where each individual derives some satisfaction from each job. We focus on finding a feasible allocation of conflicting jobs that maximize egalitarian cost, i.e. the satisfaction of th… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted in 23rd International Conference on Autonomous Agents and Multiagent Systems(AAMAS 2024)

  20. arXiv:2401.15724  [pdf, other

    cs.CL

    RE-GAINS & EnChAnT: Intelligent Tool Manipulation Systems For Enhanced Query Responses

    Authors: Sahil Girhepuje, Siva Sankar Sajeev, Purvam Jain, Arya Sikder, Adithya Rama Varma, Ryan George, Akshay Govind Srinivasan, Mahendra Kurup, Ashmit Sinha, Sudip Mondal

    Abstract: Large Language Models (LLMs) currently struggle with tool invocation and chaining, as they often hallucinate or miss essential steps in a sequence. We propose RE-GAINS and EnChAnT, two novel frameworks that empower LLMs to tackle complex user queries by making API calls to external tools based on tool descriptions and argument lists. Tools are chained based on the expected output, without receivin… ▽ More

    Submitted 20 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  21. arXiv:2401.06837  [pdf, other

    cs.CL cs.AI

    Structsum Generation for Faster Text Comprehension

    Authors: Parag Jain, Andreea Marzoca, Francesco Piccinno

    Abstract: We consider the task of generating structured representations of text using large language models (LLMs). We focus on tables and mind maps as representative modalities. Tables are more organized way of representing data, while mind maps provide a visually dynamic and flexible approach, particularly suitable for sparse content. Despite the effectiveness of LLMs on different tasks, we show that curr… ▽ More

    Submitted 19 June, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: camera ready

  22. arXiv:2401.02412  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    LLM Augmented LLMs: Expanding Capabilities through Composition

    Authors: Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar

    Abstract: Foundational models with billions of parameters which have been trained on large corpora of data have demonstrated non-trivial skills in a variety of domains. However, due to their monolithic structure, it is challenging and expensive to augment them or impart new skills. On the other hand, due to their adaptation abilities, several new instances of these models are being trained towards new domai… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 17 pages, 2 figures, 8 tables

  23. arXiv:2312.09167  [pdf, other

    cs.GT

    Maximizing Nash Social Welfare under Two-Sided Preferences

    Authors: Pallavi Jain, Rohit Vaish

    Abstract: The maximum Nash social welfare (NSW) -- which maximizes the geometric mean of agents' utilities -- is a fundamental solution concept with remarkable fairness and efficiency guarantees. The computational aspects of NSW have been extensively studied for one-sided preferences where a set of agents have preferences over a set of resources. Our work deviates from this trend and studies NSW maximizatio… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  24. arXiv:2311.07463  [pdf, other

    cs.CL

    MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

    Authors: Sanchit Ahuja, Divyanshu Aggarwal, Varun Gumma, Ishaan Watts, Ashutosh Sathe, Millicent Ochieng, Rishav Hada, Prachi Jain, Maxamed Axmed, Kalika Bali, Sunayana Sitaram

    Abstract: There has been a surge in LLM evaluation research to understand LLM capabilities and limitations. However, much of this research has been confined to English, leaving LLM building and evaluation for non-English languages relatively unexplored. Several new LLMs have been introduced recently, necessitating their evaluation on non-English languages. This study aims to perform a thorough evaluation of… ▽ More

    Submitted 2 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 40 pages, 35 figures and 34 tables

  25. arXiv:2311.03376  [pdf, other

    cs.IR cs.LG stat.ML

    Blocked Collaborative Bandits: Online Collaborative Filtering with Per-Item Budget Constraints

    Authors: Soumyabrata Pal, Arun Sai Suggala, Karthikeyan Shanmugam, Prateek Jain

    Abstract: We consider the problem of \emph{blocked} collaborative bandits where there are multiple users, each with an associated multi-armed bandit problem. These users are grouped into \emph{latent} clusters such that the mean reward vectors of users within the same cluster are identical. Our goal is to design algorithms that maximize the cumulative reward accrued by all the users over time, under the \em… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: 44 pages, To Appear in NeurIPS 2023

  26. arXiv:2310.16568  [pdf, other

    cs.CL

    1-PAGER: One Pass Answer Generation and Evidence Retrieval

    Authors: Palak Jain, Livio Baldini Soares, Tom Kwiatkowski

    Abstract: We present 1-Pager the first system that answers a question and retrieves evidence using a single Transformer-based model and decoding process. 1-Pager incrementally partitions the retrieval corpus using constrained decoding to select a document and answer string, and we show that this is competitive with comparable retrieve-and-read alternatives according to both retrieval and answer accuracy met… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 (Findings)

  27. arXiv:2310.10636  [pdf, other

    cs.LG

    Dual-Encoders for Extreme Multi-Label Classification

    Authors: Nilesh Gupta, Devvrit Khatri, Ankit S Rawat, Srinadh Bhojanapalli, Prateek Jain, Inderjit Dhillon

    Abstract: Dual-encoder (DE) models are widely used in retrieval tasks, most commonly studied on open QA benchmarks that are often characterized by multi-class and limited training data. In contrast, their performance in multi-label and data-rich retrieval settings like extreme multi-label classification (XMC), remains under-explored. Current empirical evidence indicates that DE models fall significantly sho… ▽ More

    Submitted 17 March, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 27 pages, 8 figures

    Journal ref: ICLR 2024 camera-ready publication

  28. arXiv:2310.08891  [pdf, other

    cs.LG cs.IR

    EHI: End-to-end Learning of Hierarchical Index for Efficient Dense Retrieval

    Authors: Ramnath Kumar, Anshul Mittal, Nilesh Gupta, Aditya Kusupati, Inderjit Dhillon, Prateek Jain

    Abstract: Dense embedding-based retrieval is now the industry standard for semantic search and ranking problems, like obtaining relevant web documents for a given query. Such techniques use a two-stage process: (a) contrastive learning to train a dual encoder to embed both the query and documents and (b) approximate nearest neighbor search (ANNS) for finding similar documents for a given query. These two st… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  29. arXiv:2310.07707  [pdf, other

    cs.LG cs.CL cs.CV

    MatFormer: Nested Transformer for Elastic Inference

    Authors: Devvrit, Sneha Kudugunta, Aditya Kusupati, Tim Dettmers, Kaifeng Chen, Inderjit Dhillon, Yulia Tsvetkov, Hannaneh Hajishirzi, Sham Kakade, Ali Farhadi, Prateek Jain

    Abstract: Transformer models are deployed in a wide range of settings, from multi-accelerator clusters to standalone mobile phones. The diverse inference constraints in these scenarios necessitate practitioners to train foundation models such as PaLM 2, Llama, & ViTs as a series of models of varying sizes. Due to significant training costs, only a select few model sizes are trained and supported, limiting m… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 31 pages, 12 figures, first three authors contributed equally

  30. arXiv:2309.04995  [pdf, ps, other

    cs.DS cs.GT

    How to assign volunteers to tasks compatibly ? A graph theoretic and parameterized approach

    Authors: Sushmita Gupta, Pallavi Jain, Saket Saurabh

    Abstract: In this paper we study a resource allocation problem that encodes correlation between items in terms of \conflict and maximizes the minimum utility of the agents under a conflict free allocation. Admittedly, the problem is computationally hard even under stringent restrictions because it encodes a variant of the {\sc Maximum Weight Independent Set} problem which is one of the canonical hard proble… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  31. arXiv:2307.05339  [pdf, other

    eess.SP cs.LG

    A Self-Supervised Algorithm for Denoising Photoplethysmography Signals for Heart Rate Estimation from Wearables

    Authors: Pranay Jain, Cheng Ding, Cynthia Rudin, Xiao Hu

    Abstract: Smart watches and other wearable devices are equipped with photoplethysmography (PPG) sensors for monitoring heart rate and other aspects of cardiovascular health. However, PPG signals collected from such devices are susceptible to corruption from noise and motion artifacts, which cause errors in heart rate estimation. Typical denoising approaches filter or reconstruct the signal in ways that elim… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 13 pages, 6 figures

  32. arXiv:2306.06723  [pdf, ps, other

    cs.DS cs.CR

    Counting Distinct Elements in the Turnstile Model with Differential Privacy under Continual Observation

    Authors: Palak Jain, Iden Kalemaj, Sofya Raskhodnikova, Satchit Sivakumar, Adam Smith

    Abstract: Privacy is a central challenge for systems that learn from sensitive data sets, especially when a system's outputs must be continuously updated to reflect changing data. We consider the achievable error for differentially private continual release of a basic statistic - the number of distinct items - in a stream where items may be both inserted and deleted (the turnstile model). With only insertio… ▽ More

    Submitted 10 July, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  33. arXiv:2306.05785  [pdf, other

    cs.LG

    End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates

    Authors: Anshul Nasery, Hardik Shah, Arun Sai Suggala, Prateek Jain

    Abstract: Neural network (NN) compression via techniques such as pruning, quantization requires setting compression hyperparameters (e.g., number of channels to be pruned, bitwidths for quantization) for each layer either manually or via neural architecture search (NAS) which can be computationally expensive. We address this problem by providing an end-to-end technique that optimizes for model's Floating Po… ▽ More

    Submitted 13 June, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  34. arXiv:2306.03577  [pdf, other

    cs.CV

    An Open Patch Generator based Fingerprint Presentation Attack Detection using Generative Adversarial Network

    Authors: Anuj Rai, Ashutosh Anshul, Ashwini Jha, Prayag Jain, Ramprakash Sharma, Somnath Dey

    Abstract: The low-cost, user-friendly, and convenient nature of Automatic Fingerprint Recognition Systems (AFRS) makes them suitable for a wide range of applications. This spreading use of AFRS also makes them vulnerable to various security threats. Presentation Attack (PA) or spoofing is one of the threats which is caused by presenting a spoof of a genuine fingerprint to the sensor of AFRS. Fingerprint Pre… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  35. arXiv:2305.19435  [pdf, other

    cs.LG cs.IR

    AdANNS: A Framework for Adaptive Semantic Search

    Authors: Aniket Rege, Aditya Kusupati, Sharan Ranjit S, Alan Fan, Qingqing Cao, Sham Kakade, Prateek Jain, Ali Farhadi

    Abstract: Web-scale search systems learn an encoder to embed a given query which is then hooked into an approximate nearest neighbor search (ANNS) pipeline to retrieve similar data points. To accurately capture tail queries and data points, learned representations typically are rigid, high-dimensional vectors that are generally used as-is in the entire ANNS pipeline and can lead to computationally expensive… ▽ More

    Submitted 18 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 25 pages, 15 figures. NeurIPS 2023 camera ready publication

  36. arXiv:2305.06164  [pdf, other

    cs.CL cs.AI

    Conversational Semantic Parsing using Dynamic Context Graphs

    Authors: Parag Jain, Mirella Lapata

    Abstract: In this paper we consider the task of conversational semantic parsing over general purpose knowledge graphs (KGs) with millions of entities, and thousands of relation-types. We focus on models which are capable of interactively mapping user utterances into executable logical forms (e.g., Sparql) in the context of the conversational history. Our key idea is to represent information about an utteran… ▽ More

    Submitted 7 December, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: camera ready

  37. arXiv:2303.12528  [pdf, other

    cs.CL

    MEGA: Multilingual Evaluation of Generative AI

    Authors: Kabir Ahuja, Harshita Diddee, Rishav Hada, Millicent Ochieng, Krithika Ramesh, Prachi Jain, Akshay Nambi, Tanuja Ganu, Sameer Segal, Maxamed Axmed, Kalika Bali, Sunayana Sitaram

    Abstract: Generative AI models have shown impressive performance on many Natural Language Processing tasks such as language understanding, reasoning, and language generation. An important question being asked by the AI community today is about the capabilities and limits of these models, and it is clear that evaluating generative AI is very challenging. Most studies on generative LLMs have been restricted t… ▽ More

    Submitted 22 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: EMNLP 2023

  38. arXiv:2303.00242  [pdf, other

    cs.CL

    DIFFQG: Generating Questions to Summarize Factual Changes

    Authors: Jeremy R. Cole, Palak Jain, Julian Martin Eisenschlos, Michael J. Q. Zhang, Eunsol Choi, Bhuwan Dhingra

    Abstract: Identifying the difference between two versions of the same article is useful to update knowledge bases and to understand how articles evolve. Paired texts occur naturally in diverse situations: reporters write similar news stories and maintainers of authoritative websites must keep their information up to date. We propose representing factual changes between paired documents as question-answer pa… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 14 pages. Accepted at EACL 2023 (main, long)

  39. arXiv:2302.11593  [pdf, other

    quant-ph cond-mat.mes-hall cs.IT math.MG

    Quantum spherical codes

    Authors: Shubham P. Jain, Joseph T. Iosue, Alexander Barg, Victor V. Albert

    Abstract: We introduce a framework for constructing quantum codes defined on spheres by recasting such codes as quantum analogues of the classical spherical codes. We apply this framework to bosonic coding, obtaining multimode extensions of the cat codes that can outperform previous constructions while requiring a similar type of overhead. Our polytope-based cat codes consist of sets of points with large se… ▽ More

    Submitted 7 December, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: 5 + 12 pages, 3 figures, 5 tables

  40. arXiv:2302.07975  [pdf, other

    cs.LG cs.CR stat.ML

    Multi-Task Differential Privacy Under Distribution Skew

    Authors: Walid Krichene, Prateek Jain, Shuang Song, Mukund Sundararajan, Abhradeep Thakurta, Li Zhang

    Abstract: We study the problem of multi-task learning under user-level differential privacy, in which $n$ users contribute data to $m$ tasks, each involving a subset of users. One important aspect of the problem, that can significantly impact quality, is the distribution skew among tasks. Certain tasks may have much fewer data samples than others, making them more susceptible to the noise added for privacy.… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  41. arXiv:2302.00457  [pdf, other

    cs.LG cs.AI stat.ML

    Simplicity Bias in 1-Hidden Layer Neural Networks

    Authors: Depen Morwani, Jatin Batra, Prateek Jain, Praneeth Netrapalli

    Abstract: Recent works have demonstrated that neural networks exhibit extreme simplicity bias(SB). That is, they learn only the simplest features to solve a task at hand, even in the presence of other, more robust but more complex features. Due to the lack of a general and rigorous definition of features, these works showcase SB on semi-synthetic datasets such as Color-MNIST, MNIST-CIFAR where defining feat… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    ACM Class: I.5.1; I.2.6

  42. arXiv:2302.00332  [pdf, other

    cs.AI cs.CV

    iPAL: A Machine Learning Based Smart Healthcare Framework For Automatic Diagnosis Of Attention Deficit/Hyperactivity Disorder (ADHD)

    Authors: Abhishek Sharma, Arpit Jain, Shubhangi Sharma, Ashutosh Gupta, Prateek Jain, Saraju P. Mohanty

    Abstract: ADHD is a prevalent disorder among the younger population. Standard evaluation techniques currently use evaluation forms, interviews with the patient, and more. However, its symptoms are similar to those of many other disorders like depression, conduct disorder, and oppositional defiant disorder, and these current diagnosis techniques are not very effective. Thus, a sophisticated computing model h… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  43. arXiv:2301.13273  [pdf, other

    cs.LG cs.CR math.ST stat.ML

    Near Optimal Private and Robust Linear Regression

    Authors: Xiyang Liu, Prateek Jain, Weihao Kong, Sewoong Oh, Arun Sai Suggala

    Abstract: We study the canonical statistical estimation problem of linear regression from $n$ i.i.d.~examples under $(\varepsilon,δ)$-differential privacy when some response variables are adversarially corrupted. We propose a variant of the popular differentially private stochastic gradient descent (DP-SGD) algorithm with two innovations: a full-batch gradient descent to improve sample complexity and a nove… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  44. arXiv:2301.12217  [pdf, other

    cs.CL

    Semantic Parsing for Conversational Question Answering over Knowledge Graphs

    Authors: Laura Perez-Beltrachini, Parag Jain, Emilio Monti, Mirella Lapata

    Abstract: In this paper, we are interested in developing semantic parsers which understand natural language questions embedded in a conversation with a user and ground them to formal queries over definitions in a general purpose knowledge graph (KG) with very large vocabularies (covering thousands of concept names and relations, and millions of entities). To this end, we develop a dataset where user questio… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: EACL 2023

  45. arXiv:2301.07710  [pdf

    cs.LG cs.NE eess.SP

    Fully Elman Neural Network: A Novel Deep Recurrent Neural Network Optimized by an Improved Harris Hawks Algorithm for Classification of Pulmonary Arterial Wedge Pressure

    Authors: Masoud Fetanat, Michael Stevens, Pankaj Jain, Christopher Hayward, Erik Meijering, Nigel H. Lovell

    Abstract: Heart failure (HF) is one of the most prevalent life-threatening cardiovascular diseases in which 6.5 million people are suffering in the USA and more than 23 million worldwide. Mechanical circulatory support of HF patients can be achieved by implanting a left ventricular assist device (LVAD) into HF patients as a bridge to transplant, recovery or destination therapy and can be controlled by measu… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Journal ref: IEEE Transactions on Biomedical Engineering, 2022

  46. arXiv:2301.07040  [pdf, other

    cs.LG stat.ML

    Optimal Algorithms for Latent Bandits with Cluster Structure

    Authors: Soumyabrata Pal, Arun Sai Suggala, Karthikeyan Shanmugam, Prateek Jain

    Abstract: We consider the problem of latent bandits with cluster structure where there are multiple users, each with an associated multi-armed bandit problem. These users are grouped into \emph{latent} clusters such that the mean reward vectors of users within the same cluster are identical. At each round, a user, selected uniformly at random, pulls an arm and observes a corresponding noisy reward. The goal… ▽ More

    Submitted 11 July, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 48 pages. Accepted to AISTATS 2023. Added Experiments

  47. arXiv:2301.03040  [pdf

    cs.HC cs.AI

    Digital Twin: Where do humans fit in?

    Authors: Ashwin Agrawal, Robert Thiel, Pooja Jain, Vishal Singh, Martin Fischer

    Abstract: Digital Twin (DT) technology is far from being comprehensive and mature, resulting in their piecemeal implementation in practice where some functions are automated by DTs, and others are still performed by humans. This piecemeal implementation of DTs often leaves practitioners wondering what roles (or functions) to allocate to DTs in a work system, and how might it impact humans. A lack of knowled… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

    Comments: Accepted for publication in Automation in Construction

  48. arXiv:2212.07826  [pdf, other

    quant-ph cs.LG q-bio.BM

    Hybrid Quantum Generative Adversarial Networks for Molecular Simulation and Drug Discovery

    Authors: Prateek Jain, Srinjoy Ganguly

    Abstract: In molecular research, simulation \& design of molecules are key areas with significant implications for drug development, material science, and other fields. Current classical computational power falls inadequate to simulate any more than small molecules, let alone protein chains on hundreds of peptide. Therefore these experiment are done physically in wet-lab, but it takes a lot of time \& not p… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  49. arXiv:2211.07854  [pdf, other

    quant-ph cs.LG

    Variational Quantum Algorithms for Chemical Simulation and Drug Discovery

    Authors: Hasan Mustafa, Sai Nandan Morapakula, Prateek Jain, Srinjoy Ganguly

    Abstract: Quantum computing has gained a lot of attention recently, and scientists have seen potential applications in this field using quantum computing for Cryptography and Communication to Machine Learning and Healthcare. Protein folding has been one of the most interesting areas to study, and it is also one of the biggest problems of biochemistry. Each protein folds distinctively, and the difficulty of… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  50. arXiv:2211.01454  [pdf, other

    cs.LG

    Speeding up NAS with Adaptive Subset Selection

    Authors: Vishak Prasad C, Colin White, Paarth Jain, Sibasis Nayak, Ganesh Ramakrishnan

    Abstract: A majority of recent developments in neural architecture search (NAS) have been aimed at decreasing the computational cost of various techniques without affecting their final performance. Towards this goal, several low-fidelity and performance prediction methods have been considered, including those that train only on subsets of the training data. In this work, we present an adaptive subset select… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.