Skip to main content

Showing 1–21 of 21 results for author: Gupta, Y

  1. arXiv:2404.10779  [pdf, other

    cs.SE cs.LG

    Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations

    Authors: Mathav Raj J, Kushala VM, Harikrishna Warrier, Yogesh Gupta

    Abstract: There is a compelling necessity from enterprises for fine tuning LLMs (Large Language Models) o get them trained on proprietary domain knowledge. The challenge is to imbibe the LLMs with domain specific knowledge using the most optimial resource and cost and in the best possible time. Many enterprises rely on RAG (Retrieval Augmented Generation) which does not need LLMs to be ine-tuned but they ar… ▽ More

    Submitted 23 March, 2024; originally announced April 2024.

    Comments: 17 pages, 12 tables, 3 figures

  2. arXiv:2401.15290  [pdf, other

    cs.LG

    Benchmarking with MIMIC-IV, an irregular, spare clinical time series dataset

    Authors: Hung Bui, Harikrishna Warrier, Yogesh Gupta

    Abstract: Electronic health record (EHR) is more and more popular, and it comes with applying machine learning solutions to resolve various problems in the domain. This growing research area also raises the need for EHRs accessibility. Medical Information Mart for Intensive Care (MIMIC) dataset is a popular, public, and free EHR dataset in a raw format that has been used in numerous studies. However, despit… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 7 pages, 1 figure, 3 tables

  3. arXiv:2401.02677  [pdf, other

    cs.CV cs.AI

    Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

    Authors: Yatharth Gupta, Vishnu V. Jaddipal, Harish Prabhala, Sayak Paul, Patrick Von Platen

    Abstract: Stable Diffusion XL (SDXL) has become the best open source text-to-image model (T2I) for its versatility and top-notch image quality. Efficiently addressing the computational demands of SDXL models is crucial for wider reach and applicability. In this work, we introduce two scaled-down variants, Segmind Stable Diffusion (SSD-1B) and Segmind-Vega, with 1.3B and 0.74B parameter UNets, respectively,… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  4. arXiv:2310.18832  [pdf, other

    cs.AI

    Responsible AI (RAI) Games and Ensembles

    Authors: Yash Gupta, Runtian Zhai, Arun Suggala, Pradeep Ravikumar

    Abstract: Several recent works have studied the societal effects of AI; these include issues such as fairness, robustness, and safety. In many of these objectives, a learner seeks to minimize its worst-case loss over a set of predefined distributions (known as uncertainty sets), with usual examples being perturbed versions of the empirical distribution. In other words, aforementioned problems can be written… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  5. arXiv:2207.08391  [pdf, other

    cs.LG cs.DC

    Federated Learning for Non-IID Data via Client Variance Reduction and Adaptive Server Update

    Authors: Hiep Nguyen, Lam Phan, Harikrishna Warrier, Yogesh Gupta

    Abstract: Federated learning (FL) is an emerging technique used to collaboratively train a global machine learning model while keeping the data localized on the user devices. The main obstacle to FL's practical implementation is the Non-Independent and Identical (Non-IID) data distribution across users, which slows convergence and degrades performance. To tackle this fundamental issue, we propose a method (… ▽ More

    Submitted 29 July, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

  6. arXiv:2205.14331  [pdf

    cs.LG cs.AI

    Survival Analysis on Structured Data using Deep Reinforcement Learning

    Authors: Renith G, Harikrishna Warrier, Yogesh Gupta

    Abstract: Survival analysis is playing a major role in manufacturing sector by analyzing occurrence of any unwanted event based on the input data. Predictive maintenance, which is a part of survival analysis, helps to find any device failure based on the current incoming data from different sensor or any equipment. Deep learning techniques were used to automate the predictive maintenance problem to some ext… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

  7. arXiv:2204.07062  [pdf

    cs.MM cs.LG eess.IV

    Network state Estimation using Raw Video Analysis: vQoS-GAN based non-intrusive Deep Learning Approach

    Authors: Renith G, Harikrishna Warrier, Yogesh Gupta

    Abstract: Content based providers transmits real time complex signal such as video data from one region to another. During this transmission process, the signals usually end up distorted or degraded where the actual information present in the video is lost. This normally happens in the streaming video services applications. Hence there is a need to know the level of degradation that happened in the receiver… ▽ More

    Submitted 22 March, 2022; originally announced April 2022.

    Comments: 18 pages, 10 figures

  8. arXiv:2204.01487  [pdf

    cs.CY cs.CR

    Identifying Security Risks in NFT Platforms

    Authors: Yash Gupta, Jayanth Kumar, Dr. Andrew Reifers

    Abstract: This paper examines the effects of inherent risks in the emerging technology of non-fungible tokens and proposes an actionable set of solutions for stakeholders in this ecosystem and observers. Web3 and NFTs are a fast-growing 300 billion dollar economy with some clear, highly publicized harms that came to light recently. We set out to explore the risks to understand their nature and scope, and if… ▽ More

    Submitted 5 April, 2022; v1 submitted 18 March, 2022; originally announced April 2022.

  9. arXiv:2112.07337  [pdf, other

    cs.CL cs.AI

    Multi-Row, Multi-Span Distant Supervision For Table+Text Question

    Authors: Vishwajeet Kumar, Yash Gupta, Saneem Chemmengath, Jaydeep Sen, Soumen Chakrabarti, Samarth Bharadwaj, FeiFei Pan

    Abstract: Question answering (QA) over tables and linked text, also called TextTableQA, has witnessed significant research in recent years, as tables are often found embedded in documents along with related text. HybridQA and OTT-QA are the two best-known TextTableQA datasets, with questions that are best answered by combining information from both table cells and linked text passages. A common challenge in… ▽ More

    Submitted 11 June, 2023; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: ACL 2023

  10. arXiv:2107.05166  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Stateful Detection of Model Extraction Attacks

    Authors: Soham Pal, Yash Gupta, Aditya Kanade, Shirish Shevade

    Abstract: Machine-Learning-as-a-Service providers expose machine learning (ML) models through application programming interfaces (APIs) to developers. Recent work has shown that attackers can exploit these APIs to extract good approximations of such ML models, by querying them with samples of their choosing. We propose VarDetect, a stateful monitor that tracks the distribution of queries made by users of su… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

  11. arXiv:2106.13675  [pdf

    cs.HC

    Creating and Implementing a Smart Speaker

    Authors: Sanskar Jethi, Avinash Kumar Choudhary, Yash Gupta, Abhishek Chaudhary

    Abstract: We have seen significant advancements in Artificial Intelligence and Machine Learning in the 21st century. It has enabled a new technology where we can have a human-like conversation with the machines. The most significant use of this speech recognition and contextual understanding technology exists in the form of a Smart Speaker. We have a wide variety of Smart Speaker products available to us. T… ▽ More

    Submitted 30 May, 2021; originally announced June 2021.

    Journal ref: IT in Industry, Vol. 9, No.3, 2021

  12. arXiv:2102.10978  [pdf

    cs.LG cs.AI stat.AP

    Markov model with machine learning integration for fraud detection in health insurance

    Authors: Rohan Yashraj Gupta, Satya Sai Mudigonda, Pallav Kumar Baruah, Phani Krishna Kandala

    Abstract: Fraud has led to a huge addition of expenses in health insurance sector in India. The work is aimed to provide methods applied to health insurance fraud detection. The work presents two approaches - a markov model and an improved markov model using gradient boosting method in health insurance claims. The dataset 382,587 claims of which 38,082 claims are fraudulent. The markov based model gave the… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: 6 pages, 10 figures

  13. arXiv:1907.01960  [pdf, other

    cs.OH

    Fashion Retail: Forecasting Demand for New Items

    Authors: Pawan Kumar Singh, Yadunath Gupta, Nilpa Jha, Aruna Rajan

    Abstract: Fashion merchandising is one of the most complicated problems in forecasting, given the transient nature of trends in colours, prints, cuts, patterns, and materials in fashion, the economies of scale achievable only in bulk production, as well as geographical variations in consumption. Retailers that serve a large customer base spend a lot of money and resources to stay prepared for meeting changi… ▽ More

    Submitted 27 June, 2019; originally announced July 2019.

    Comments: KDD - AI4Fashion Workshop

  14. arXiv:1905.09165  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    A framework for the extraction of Deep Neural Networks by leveraging public data

    Authors: Soham Pal, Yash Gupta, Aditya Shukla, Aditya Kanade, Shirish Shevade, Vinod Ganapathy

    Abstract: Machine learning models trained on confidential datasets are increasingly being deployed for profit. Machine Learning as a Service (MLaaS) has made such models easily accessible to end-users. Prior work has developed model extraction attacks, in which an adversary extracts an approximation of MLaaS models by making black-box queries to it. However, none of these works is able to satisfy all the th… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

  15. arXiv:1904.12483  [pdf, other

    cs.CV

    Self-Attention Capsule Networks for Object Classification

    Authors: Assaf Hoogi, Brian Wilcox, Yachee Gupta, Daniel L. Rubin

    Abstract: We propose a novel architecture for object classification, called Self-Attention Capsule Networks (SACN). SACN is the first model that incorporates the Self-Attention mechanism as an integral layer within the Capsule Network (CapsNet). While the Self-Attention mechanism supplies a long-range dependencies, results in selecting the more dominant image regions to focus on, the CapsNet analyzes the re… ▽ More

    Submitted 19 November, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

  16. arXiv:1806.11424  [pdf, other

    cs.IR stat.ML

    Understanding Fashionability: What drives sales of a style?

    Authors: Aniket Jain, Yadunath Gupta, Pawan Kumar Singh, Aruna Rajan

    Abstract: We use customer demand data for fashion articles on Myntra, and derive a fashionability or style quotient, which represents customer demand for the stylistic content of a fashion article, decoupled with its commercials (price, offers, etc.). We demonstrate learning for assortment planning in fashion that would aim to keep a healthy mix of breadth and depth across various styles, and we show the re… ▽ More

    Submitted 28 June, 2018; originally announced June 2018.

  17. arXiv:1512.03184  [pdf, other

    cs.SI physics.soc-ph

    Social Network Analysis of the Caste-Based Reservation System in India

    Authors: Akrati Saxena, Jaspal Singh Saini, Yayati Gupta, Aishwarya Parasuram, Neeharika, S. R. S. Iyengar

    Abstract: It has been argued that the reservation system in India, which has existed since the time of Indian Independence (1947), has caused more havoc and degradation than progress. This being a popular public opinion, has not been based on any rigorous scientific study or research. In this paper, we revisit the cultural divide among the Indian population from a purely social network based approach. We st… ▽ More

    Submitted 8 December, 2018; v1 submitted 10 December, 2015; originally announced December 2015.

  18. arXiv:1507.07838  [pdf, other

    cs.SI physics.soc-ph

    Shifting Behaviour of Users: Towards Understanding the Fundamental Law of Social Networks

    Authors: Yayati Gupta, S. R. S. Iyengar, Jaspal Singh Saini, Nidhi Sridhar

    Abstract: Social Networking Sites (SNSs) are powerful marketing and communication tools. There are hundreds of SNSs that have entered and exited the market over time. The coexistence of multiple SNSs is a rarely observed phenomenon. Most coexisting SNSs either serve different purposes for its users or have cultural differences among them. The introduction of a new SNS with a better set of features can lead… ▽ More

    Submitted 7 November, 2015; v1 submitted 28 July, 2015; originally announced July 2015.

  19. arXiv:1507.07833  [pdf, other

    cs.SI physics.soc-ph

    Pseudo-Cores: The Terminus of an Intelligent Viral Meme's Trajectory

    Authors: Yayati Gupta, Debarati Das, S. R. S. Iyengar

    Abstract: Comprehending the virality of a meme can help us in addressing the problems pertaining to disciplines like epidemiology and digital marketing. Therefore, it is not surprising that memetics remains a highly analyzed research topic ever since the mid 1990s. Some scientists choose to investigate the intrinsic contagiousness of a meme while others study the problem from a network theory perspective. I… ▽ More

    Submitted 30 October, 2015; v1 submitted 28 July, 2015; originally announced July 2015.

  20. arXiv:1505.00457  [pdf, other

    cs.SI physics.soc-ph

    Modeling Memetics using Edge Diversity

    Authors: Yayati Gupta, Akrati Saxena, Debarati Das, S. R. S. Iyengar

    Abstract: The study of meme propagation and the prediction of meme trajectory are emerging areas of interest in the field of complex networks research. In addition to the properties of the meme itself, the structural properties of the underlying network decides the speed and the trajectory of the propagating meme. In this paper, we provide an artificial framework for studying the meme propagation patterns.… ▽ More

    Submitted 13 December, 2015; v1 submitted 3 May, 2015; originally announced May 2015.

  21. arXiv:1503.06301  [pdf, ps, other

    cs.PF

    Effective Handling of Urgent Jobs - Speed Up Scheduling for Computing Applications

    Authors: Yash Gupta, Kamalakar Karlapalem

    Abstract: A queue is required when a service provider is not able to handle jobs arriving over the time. In a highly flexible and dynamic environment, some jobs might demand for faster execution at run-time especially when the resources are limited and the jobs are competing for acquiring resources. A user might demand for speed up (reduced wait time) for some of the jobs present in the queue at run time. I… ▽ More

    Submitted 21 March, 2015; originally announced March 2015.

    Comments: Paper covering main contributions from MS Thesis of Yash Gupta http://web2py.iiit.ac.in/research_centres/publications/view_publication/mastersthesis/247 - presented in ACM format

    Report number: MS Thesis Number IIIT/TH/2014/7