Skip to main content

Showing 1–50 of 176 results for author: Saha, A

  1. arXiv:2406.18135  [pdf

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition for Hindi

    Authors: Anish Saha, A. G. Ramakrishnan

    Abstract: Automatic speech recognition (ASR) is a key area in computational linguistics, focusing on developing technologies that enable computers to convert spoken language into text. This field combines linguistics and machine learning. ASR models, which map speech audio to transcripts through supervised learning, require handling real and unrestricted text. Text-to-speech systems directly work with real… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.02450  [pdf, other

    cs.LG cs.AI

    A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies

    Authors: Md Mirajul Islam, Xi Yang, John Hostetter, Adittya Soukarjya Saha, Min Chi

    Abstract: A key challenge in e-learning environments like Intelligent Tutoring Systems (ITSs) is to induce effective pedagogical policies efficiently. While Deep Reinforcement Learning (DRL) often suffers from sample inefficiency and reward function design difficulty, Apprenticeship Learning(AL) algorithms can overcome them. However, most AL algorithms can not handle heterogeneity as they assume all demonst… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2406.00551  [pdf, other

    cs.LG cs.GT

    Strategic Linear Contextual Bandits

    Authors: Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu

    Abstract: Motivated by the phenomenon of strategic agents gaming a recommender system to maximize the number of times they are recommended to users, we study a strategic variant of the linear contextual bandit problem, where the arms can strategically misreport their privately observed contexts to the learner. We treat the algorithm design problem as one of mechanism design under uncertainty and propose the… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  5. arXiv:2404.16687  [pdf, other

    cs.CV

    NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng , et al. (89 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  6. arXiv:2404.09666  [pdf, other

    eess.IV cs.CV q-bio.QM

    Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis

    Authors: Alessa Hering, Sarah de Boer, Anindo Saha, Jasper J. Twilt, Mattias P. Heinrich, Derya Yakar, Maarten de Rooij, Henkjan Huisman, Joeran S. Bosma

    Abstract: The PI-CAI (Prostate Imaging: Cancer AI) challenge led to expert-level diagnostic algorithms for clinically significant prostate cancer detection. The algorithms receive biparametric MRI scans as input, which consist of T2-weighted and diffusion-weighted scans. These scans can be misaligned due to multiple factors in the scanning process. Image registration can alleviate this issue by predicting t… ▽ More

    Submitted 28 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  7. arXiv:2404.09067  [pdf, other

    cs.CV cs.AI

    Exploring Explainability in Video Action Recognition

    Authors: Avinab Saha, Shashank Gupta, Sravan Kumar Ankireddy, Karl Chahine, Joydeep Ghosh

    Abstract: Image Classification and Video Action Recognition are perhaps the two most foundational tasks in computer vision. Consequently, explaining the inner workings of trained deep neural networks is of prime importance. While numerous efforts focus on explaining the decisions of trained deep neural networks in image classification, exploration in the domain of its temporal version, video action recognit… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 6 pages, 10 figures, Accepted to the 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024

  8. arXiv:2403.16365  [pdf, other

    cs.LG cs.CR cs.CV

    Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion

    Authors: Hossein Souri, Arpit Bansal, Hamid Kazemi, Liam Fowl, Aniruddha Saha, Jonas Geiping, Andrew Gordon Wilson, Rama Chellappa, Tom Goldstein, Micah Goldblum

    Abstract: Modern neural networks are often trained on massive datasets that are web scraped with minimal human inspection. As a result of this insecure curation pipeline, an adversary can poison or backdoor the resulting model by uploading malicious data to the internet and waiting for a victim to scrape and train on it. Existing approaches for creating poisons and backdoors start with randomly sampled clea… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  9. arXiv:2403.15045  [pdf, ps, other

    cs.LG cs.CR

    DP-Dueling: Learning from Preference Feedback without Compromising User Privacy

    Authors: Aadirupa Saha, Hilal Asi

    Abstract: We consider the well-studied dueling bandit problem, where a learner aims to identify near-optimal actions using pairwise comparisons, under the constraint of differential privacy. We consider a general class of utility-based preference matrices for large (potentially unbounded) decision spaces and give the first differentially private dueling bandit algorithm for active learning with user prefere… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  10. arXiv:2403.13861  [pdf

    cs.LG stat.AP

    Machine Learning-based Layer-wise Detection of Overheating Anomaly in LPBF using Photodiode Data

    Authors: Nazmul Hasan, Apurba Kumar Saha, Andrew Wessman, Mohammed Shafae

    Abstract: Overheating anomaly detection is essential for the quality and reliability of parts produced by laser powder bed fusion (LPBF) additive manufacturing (AM). In this research, we focus on the detection of overheating anomalies using photodiode sensor data. Photodiode sensors can collect high-frequency data from the melt pool, reflecting the process dynamics and thermal history. Hence, the proposed m… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 12 pages (including references); 5 figures; 4 tables

  11. arXiv:2403.05589  [pdf, other

    cs.HC cs.AI

    Ergonomic Design of Computer Laboratory Furniture: Mismatch Analysis Utilizing Anthropometric Data of University Students

    Authors: Anik Kumar Saha, Md Abrar Jahin, Md. Rafiquzzaman, M. F. Mridha

    Abstract: Many studies have shown how ergonomically designed furniture improves productivity and well-being. As computers have become a part of students' academic lives, they will grow further in the future. We propose anthropometric-based furniture dimensions suitable for university students to improve computer laboratory ergonomics. We collected data from 380 participants and analyzed 11 anthropometric me… ▽ More

    Submitted 15 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  12. arXiv:2403.04085  [pdf, other

    cs.CL cs.CY

    Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations

    Authors: Abhishek Anand, Negar Mokhberian, Prathyusha Naresh Kumar, Anweasha Saha, Zihao He, Ashwin Rao, Fred Morstatter, Kristina Lerman

    Abstract: Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels show low confidence on high-disagreement data instances. While previous studies consider such instances as mislabeled, we argue that the reason the high-disagreem… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  13. arXiv:2403.00306  [pdf, other

    cs.DS

    qPMS Sigma -- An Efficient and Exact Parallel Algorithm for the Planted $(l, d)$ Motif Search Problem

    Authors: Saurav Dhar, Amlan Saha, Dhiman Goswami, Md. Abul Kashem Mia

    Abstract: Motif finding is an important step for the detection of rare events occurring in a set of DNA or protein sequences. Extraction of information about these rare events can lead to new biological discoveries. Motifs are some important patterns that have numerous applications including the identification of transcription factors and their binding sites, composite regulatory patterns, similarity betwee… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  14. arXiv:2402.18917  [pdf, other

    cs.LG cs.IR

    Stop Relying on No-Choice and Do not Repeat the Moves: Optimal, Efficient and Practical Algorithms for Assortment Optimization

    Authors: Aadirupa Saha, Pierre Gaillard

    Abstract: We address the problem of active online assortment optimization problem with preference feedback, which is a framework for modeling user choices and subsetwise utility maximization. The framework is useful in various real-world applications including ad placement, online retail, recommender systems, fine-tuning language models, amongst many. The problem, although has been studied in the past, lack… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  15. arXiv:2402.13573  [pdf, other

    cs.CV cs.AI cs.LG

    ToDo: Token Downsampling for Efficient Generation of High-Resolution Images

    Authors: Ethan Smith, Nayan Saxena, Aninda Saha

    Abstract: Attention mechanism has been crucial for image diffusion models, however, their quadratic computational complexity limits the sizes of images we can process within reasonable time and memory constraints. This paper investigates the importance of dense attention in generative image models, which often contain redundant features, making them suitable for sparser attention mechanisms. We propose a no… ▽ More

    Submitted 8 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: 2024, Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence

  16. arXiv:2402.05592  [pdf, other

    cs.HC

    MERP: Metaverse Extended Realtiy Portal

    Authors: Anisha Ghosh, Aditya Mitra, Anik Saha, Sibi Chakkaravarthy Sethuraman, Anitha Subramanian

    Abstract: A standardized control system called Metaverse Extended Reality Portal (MERP) is presented as a solution to the issues with conventional VR eyewear. The MERP system improves user awareness of the physical world while offering an immersive 3D view of the metaverse by using a shouldermounted projector to display a Heads-Up Display (HUD) in a designated Metaverse Experience Room. To provide natural a… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  17. arXiv:2401.12070  [pdf, other

    cs.CL cs.AI cs.LG

    Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

    Authors: Abhimanyu Hans, Avi Schwarzschild, Valeriia Cherepanova, Hamid Kazemi, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein

    Abstract: Detecting text generated by modern large language models is thought to be hard, as both LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a score based on contrasting two closely related language models is highly accurate at separating human-generated and machine-generated text. Based on this mechanism, we propose a novel LLM detector that only requires simple ca… ▽ More

    Submitted 1 July, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 20 pages, code available at https://github.com/ahans30/Binoculars

  18. arXiv:2401.10895  [pdf, other

    cs.LG cs.CE

    AI in Supply Chain Risk Assessment: A Systematic Literature Review and Bibliometric Analysis

    Authors: Md Abrar Jahin, Saleh Akram Naife, Anik Kumar Saha, M. F. Mridha

    Abstract: Supply chain risk assessment (SCRA) has witnessed a profound evolution through the integration of artificial intelligence (AI) and machine learning (ML) techniques, revolutionizing predictive capabilities and risk mitigation strategies. The significance of this evolution stems from the critical role of robust risk management strategies in ensuring operational resilience and continuity within moder… ▽ More

    Submitted 25 January, 2024; v1 submitted 12 December, 2023; originally announced January 2024.

  19. arXiv:2312.17229  [pdf, other

    cs.LG stat.ML

    Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources

    Authors: Rohan Deb, Aadirupa Saha

    Abstract: We consider the problem of reward maximization in the dueling bandit setup along with constraints on resource consumption. As in the classic dueling bandits, at each round the learner has to choose a pair of items from a set of $K$ items and observe a relative feedback for the current pair. Additionally, for both items, the learner also observes a vector of resource consumptions. The objective of… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  20. arXiv:2312.15444  [pdf, other

    cs.ET

    Variation-Resilient FeFET-Based In-Memory Computing Leveraging Probabilistic Deep Learning

    Authors: Bibhas Manna, Arnob Saha, Zhouhang Jiang, Kai Ni, Abhronil Sengupta

    Abstract: Reliability issues stemming from device level non-idealities of non-volatile emerging technologies like ferroelectric field-effect transistors (FeFET), especially at scaled dimensions, cause substantial degradation in the accuracy of In-Memory crossbar-based AI systems. In this work, we present a variation-aware design technique to characterize the device level variations and to mitigate their imp… ▽ More

    Submitted 13 March, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  21. arXiv:2312.11788  [pdf, other

    cs.LG math.OC

    Faster Convergence with Multiway Preferences

    Authors: Aadirupa Saha, Vitaly Feldman, Tomer Koren, Yishay Mansour

    Abstract: We address the problem of convex optimization with preference feedback, where the goal is to minimize a convex function given a weaker form of comparison queries. Each query consists of two points and the dueling feedback returns a (noisy) single-bit binary comparison of the function values of the two queried points. Here we consider the sign-function-based comparison feedback model and analyze th… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  22. Performance of externally validated machine learning models based on histopathology images for the diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer: A systematic review

    Authors: Ricardo Gonzalez, Peyman Nejat, Ashirbani Saha, Clinton J. V. Campbell, Andrew P. Norgan, Cynthia Lokker

    Abstract: Numerous machine learning (ML) models have been developed for breast cancer using various types of data. Successful external validation (EV) of ML models is important evidence of their generalizability. The aim of this systematic review was to assess the performance of externally validated ML models based on histopathology images for diagnosis, classification, prognosis, or treatment outcome predi… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Journal ref: Journal of Pathology Informatics. 2023;15:100348

  23. Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities

    Authors: Ricardo Gonzalez, Ashirbani Saha, Clinton J. V. Campbell, Peyman Nejat, Cynthia Lokker, Andrew P. Norgan

    Abstract: This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and t… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Journal ref: Journal of Pathology Informatics 15 (2024) 100347

  24. arXiv:2311.17586  [pdf, other

    cs.LG math.OC stat.ML

    Federated Online and Bandit Convex Optimization

    Authors: Kumar Kshitij Patel, Lingxiao Wang, Aadirupa Saha, Nati Sebro

    Abstract: We study the problems of distributed online and bandit convex optimization against an adaptive adversary. We aim to minimize the average regret on $M$ machines working in parallel over $T$ rounds with $R$ intermittent communications. Assuming the underlying cost functions are convex and can be generated adaptively, our results show that collaboration is not beneficial when the machines have access… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  25. arXiv:2311.15647  [pdf, other

    cs.LG cs.GT

    Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation

    Authors: Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu

    Abstract: We study a strategic variant of the multi-armed bandit problem, which we coin the strategic click-bandit. This model is motivated by applications in online recommendation where the choice of recommended items depends on both the click-through rates and the post-click rewards. Like in classical bandits, rewards follow a fixed unknown distribution. However, we assume that the click-rate of each arm… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  26. arXiv:2311.11185  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Dueling Optimization with a Monotone Adversary

    Authors: Avrim Blum, Meghal Gupta, Gene Li, Naren Sarayu Manoj, Aadirupa Saha, Yuanyuan Yang

    Abstract: We introduce and study the problem of dueling optimization with a monotone adversary, which is a generalization of (noiseless) dueling convex optimization. The goal is to design an online algorithm to find a minimizer $\mathbf{x}^{*}$ for a function $f\colon X \to \mathbb{R}$, where $X \subseteq \mathbb{R}^d$. In each round, the algorithm submits a pair of guesses, i.e., $\mathbf{x}^{(1)}$ and… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 21 pages. comments welcome

  27. arXiv:2311.11059  [pdf, other

    cs.CV cs.MM eess.IV

    HIDRO-VQA: High Dynamic Range Oracle for Video Quality Assessment

    Authors: Shreshth Saini, Avinab Saha, Alan C. Bovik

    Abstract: We introduce HIDRO-VQA, a no-reference (NR) video quality assessment model designed to provide precise quality evaluations of High Dynamic Range (HDR) videos. HDR videos exhibit a broader spectrum of luminance, detail, and color than Standard Dynamic Range (SDR) videos. As HDR content becomes increasingly popular, there is a growing demand for video quality assessment (VQA) algorithms that effecti… ▽ More

    Submitted 20 December, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: WACV 2024 Workshop Paper. Shreshth Saini, Avinab Saha contributed equally to this work

  28. arXiv:2311.10731  [pdf

    cs.LG physics.med-ph physics.soc-ph

    Gender-Based Comparative Study of Type 2 Diabetes Risk Factors in Kolkata, India: A Machine Learning Approach

    Authors: Rahul Jain, Anoushka Saha, Gourav Daga, Durba Bhattacharya, Madhura Das Gupta, Sourav Chowdhury, Suparna Roychowdhury

    Abstract: Type 2 diabetes mellitus represents a prevalent and widespread global health concern, necessitating a comprehensive assessment of its risk factors. This study aimed towards learning whether there is any differential impact of age, Lifestyle, BMI and Waist to height ratio on the risk of Type 2 diabetes mellitus in males and females in Kolkata, West Bengal, India based on a sample observed from the… ▽ More

    Submitted 14 October, 2023; originally announced November 2023.

    Comments: 10 pages, 7 tables,3 figures, submitted to a conference

  29. arXiv:2310.20524  [pdf, other

    cs.LG

    Group-Feature (Sensor) Selection With Controlled Redundancy Using Neural Networks

    Authors: Aytijhya Saha, Nikhil R. Pal

    Abstract: In this paper, we present a novel embedded feature selection method based on a Multi-layer Perceptron (MLP) network and generalize it for group-feature or sensor selection problems, which can control the level of redundancy among the selected features or groups. Additionally, we have generalized the group lasso penalty for feature selection to encompass a mechanism for selecting valuable group fea… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  30. arXiv:2310.20280  [pdf, other

    cs.LG cs.AI

    AutoMixer for Improved Multivariate Time-Series Forecasting on Business and IT Observability Data

    Authors: Santosh Palaskar, Vijay Ekambaram, Arindam Jati, Neelamadhav Gantayat, Avirup Saha, Seema Nagar, Nam H. Nguyen, Pankaj Dayama, Renuka Sindhgatta, Prateeti Mohapatra, Harshit Kumar, Jayant Kalagnanam, Nandyala Hemachandra, Narayan Rangaraj

    Abstract: The efficiency of business processes relies on business key performance indicators (Biz-KPIs), that can be negatively impacted by IT failures. Business and IT Observability (BizITObs) data fuses both Biz-KPIs and IT event channels together as multivariate time series data. Forecasting Biz-KPIs in advance can enhance efficiency and revenue through proactive corrective measures. However, BizITObs da… ▽ More

    Submitted 2 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted in the Thirty-Sixth Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-24)

  31. arXiv:2310.18628  [pdf, other

    cs.CL cs.LG

    Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation

    Authors: Hailin Chen, Amrita Saha, Steven Hoi, Shafiq Joty

    Abstract: With the rise of powerful closed-sourced LLMs (ChatGPT, GPT-4), there are increasing interests in distilling the capabilies of close-sourced LLMs to smaller open-sourced LLMs. Previous distillation methods usually prompt ChatGPT to generate a set of instructions and answers, for the student model to learn. However, such standard distillation approach neglects the merits and conditions of the stude… ▽ More

    Submitted 26 January, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023; Codes at: https://github.com/SalesforceAIResearch/PersDistill

  32. arXiv:2310.08992  [pdf, other

    cs.AI cs.CL cs.PL

    CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

    Authors: Hung Le, Hailin Chen, Amrita Saha, Akash Gokul, Doyen Sahoo, Shafiq Joty

    Abstract: Large Language Models (LLMs) have already become quite proficient at solving simpler programming tasks like those in HumanEval or MBPP benchmarks. However, solving more complex and competitive programming tasks is still quite challenging for these models - possibly due to their tendency to generate solutions as monolithic code blocks instead of decomposing them into logical sub-tasks and sub-modul… ▽ More

    Submitted 13 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  33. arXiv:2310.05914  [pdf, other

    cs.CL cs.LG

    NEFTune: Noisy Embeddings Improve Instruction Finetuning

    Authors: Neel Jain, Ping-yeh Chiang, Yuxin Wen, John Kirchenbauer, Hong-Min Chu, Gowthami Somepalli, Brian R. Bartoldson, Bhavya Kailkhura, Avi Schwarzschild, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein

    Abstract: We show that language model finetuning can be improved, sometimes dramatically, with a simple augmentation. NEFTune adds noise to the embedding vectors during training. Standard finetuning of LLaMA-2-7B using Alpaca achieves 29.79% on AlpacaEval, which rises to 64.69% using noisy embeddings. NEFTune also improves over strong baselines on modern instruction datasets. Models trained with Evol-Instru… ▽ More

    Submitted 10 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 25 pages, Code is available on Github: https://github.com/neelsjain/NEFTune

  34. arXiv:2310.02449  [pdf, other

    cs.DC

    Impact of geography on the importance of parameters in infectious disease models

    Authors: Arindam Saha, Maziar Ghorbani, Diana Suleimenova, Anastasia Anagnostou, Derek Groen

    Abstract: Agent-based models are widely used to predict infectious disease spread. For these predictions, one needs to understand how each input parameter affects the result. Here, some parameters may affect the sensitivities of others, requiring the analysis of higher order coefficients through e.g. Sobol sensitivity analysis. The geographical structures of real-world regions are distinct in that they are… ▽ More

    Submitted 20 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  35. FragQC: An Efficient Quantum Error Reduction Technique using Quantum Circuit Fragmentation

    Authors: Saikat Basu, Arnav Das, Amit Saha, Amlan Chakrabarti, Susmita Sur-Kolay

    Abstract: Quantum computers must meet extremely stringent qualitative and quantitative requirements on their qubits in order to solve real-life problems. Quantum circuit fragmentation techniques divide a large quantum circuit into a number of sub-circuits that can be executed on the smaller noisy quantum hardware available. However, the process of quantum circuit fragmentation involves finding an ideal cut… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 30 pages, 9 figures

    Journal ref: Journal of Systems and Software 2024

  36. Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization

    Authors: Abhisek Tiwari, Anisha Saha, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar

    Abstract: With the advancement of telemedicine, both researchers and medical practitioners are working hand-in-hand to develop various techniques to automate various medical operations, such as diagnosis report generation. In this paper, we first present a multi-modal clinical conversation summary generation task that takes a clinician-patient interaction (both textual and visual information) and generates… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  37. arXiv:2309.05961  [pdf, other

    cs.SI cs.CL cs.IR cs.LG

    Evaluating the Ebb and Flow: An In-depth Analysis of Question-Answering Trends across Diverse Platforms

    Authors: Rima Hazra, Agnik Saha, Somnath Banerjee, Animesh Mukherjee

    Abstract: Community Question Answering (CQA) platforms steadily gain popularity as they provide users with fast responses to their queries. The swiftness of these responses is contingent on a mixture of query-specific and user-related elements. This paper scrutinizes these contributing factors within the context of six highly popular CQA platforms, identified through their standout answering speed. Our inve… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted as POSTER

  38. arXiv:2309.00614  [pdf, other

    cs.LG cs.CL cs.CR

    Baseline Defenses for Adversarial Attacks Against Aligned Language Models

    Authors: Neel Jain, Avi Schwarzschild, Yuxin Wen, Gowthami Somepalli, John Kirchenbauer, Ping-yeh Chiang, Micah Goldblum, Aniruddha Saha, Jonas Geiping, Tom Goldstein

    Abstract: As Large Language Models quickly become ubiquitous, it becomes critical to understand their security vulnerabilities. Recent work shows that text optimizers can produce jailbreaking prompts that bypass moderation and alignment. Drawing from the rich body of work on adversarial machine learning, we approach these attacks with three questions: What threat models are practically useful in this domain… ▽ More

    Submitted 4 September, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: 12 pages

  39. arXiv:2308.15027  [pdf, ps, other

    cs.IR cs.CL

    Improving Neural Ranking Models with Traditional IR Methods

    Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

    Abstract: Neural ranking methods based on large transformer models have recently gained significant attention in the information retrieval community, and have been adopted by major commercial solutions. Nevertheless, they are computationally expensive to create, and require a great deal of labeled data for specialized corpora. In this paper, we explore a low resource alternative which is a bag-of-embedding… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Short paper, 4 pages

  40. arXiv:2308.03891  [pdf, other

    cs.CL

    A Cross-Domain Evaluation of Approaches for Causal Knowledge Extraction

    Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

    Abstract: Causal knowledge extraction is the task of extracting relevant causes and effects from text by detecting the causal relation. Although this task is important for language understanding and knowledge discovery, recent works in this domain have largely focused on binary classification of a text segment as causal or non-causal. In this regard, we perform a thorough analysis of three sequence tagging… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  41. arXiv:2307.11892  [pdf, ps, other

    cs.LG cs.AI cs.CR cs.CY

    On the Vulnerability of Fairness Constrained Learning to Malicious Noise

    Authors: Avrim Blum, Princewill Okoroafor, Aadirupa Saha, Kevin Stangl

    Abstract: We consider the vulnerability of fairness-constrained learning to small amounts of malicious noise in the training data. Konstantinov and Lampert (2021) initiated the study of this question and presented negative results showing there exist data distributions where for several fairness constraints, any proper learner will exhibit high vulnerability when group sizes are imbalanced. Here, we present… ▽ More

    Submitted 26 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    MSC Class: I.2

  42. arXiv:2307.09065  [pdf, other

    cs.CV cs.LG

    Learning Adaptive Neighborhoods for Graph Neural Networks

    Authors: Avishkar Saha, Oscar Mendez, Chris Russell, Richard Bowden

    Abstract: Graph convolutional networks (GCNs) enable end-to-end learning on graph structured data. However, many works assume a given graph structure. When the input graph is noisy or unavailable, one approach is to construct or learn a latent graph structure. These methods typically fix the choice of node degree for the entire graph, which is suboptimal. Instead, we propose a novel end-to-end differentiabl… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  43. arXiv:2306.13985  [pdf, other

    stat.ML cs.AI cs.LG

    Robust Classification of High-Dimensional Data using Data-Adaptive Energy Distance

    Authors: Jyotishka Ray Choudhury, Aytijhya Saha, Sarbojit Roy, Subhajit Dutta

    Abstract: Classification of high-dimensional low sample size (HDLSS) data poses a challenge in a variety of real-world situations, such as gene expression studies, cancer research, and medical imaging. This article presents the development and analysis of some classifiers that are specifically designed for HDLSS data. These classifiers are free of tuning parameters and are robust, in the sense that they are… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted to be published at the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 2023

  44. arXiv:2306.13651  [pdf, other

    cs.CL cs.LG

    Bring Your Own Data! Self-Supervised Evaluation for Large Language Models

    Authors: Neel Jain, Khalid Saifullah, Yuxin Wen, John Kirchenbauer, Manli Shu, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein

    Abstract: With the rise of Large Language Models (LLMs) and their ubiquitous deployment in diverse domains, measuring language model behavior on realistic data is imperative. For example, a company deploying a client-facing chatbot must ensure that the model will not respond to client requests with profanity. Current evaluations approach this problem using small, domain-specific datasets with human-curated… ▽ More

    Submitted 29 June, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: Code is available at https://github.com/neelsjain/BYOD. First two authors contributed equally. 21 pages, 22 figures

  45. arXiv:2306.12610  [pdf, other

    cs.CV

    Revisiting Image Classifier Training for Improved Certified Robust Defense against Adversarial Patches

    Authors: Aniruddha Saha, Shuhua Yu, Arash Norouzzadeh, Wan-Yi Lin, Chaithanya Kumar Mummadi

    Abstract: Certifiably robust defenses against adversarial patches for image classifiers ensure correct prediction against any changes to a constrained neighborhood of pixels. PatchCleanser arXiv:2108.09135 [cs.CV], the state-of-the-art certified defense, uses a double-masking strategy for robust classification. The success of this strategy relies heavily on the model's invariance to image pixel masking. In… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 12 pages, 5 figures

  46. arXiv:2306.04634  [pdf, other

    cs.LG cs.CL cs.CR

    On the Reliability of Watermarks for Large Language Models

    Authors: John Kirchenbauer, Jonas Geiping, Yuxin Wen, Manli Shu, Khalid Saifullah, Kezhi Kong, Kasun Fernando, Aniruddha Saha, Micah Goldblum, Tom Goldstein

    Abstract: As LLMs become commonplace, machine-generated text has the potential to flood the internet with spam, social media bots, and valueless content. Watermarking is a simple and effective strategy for mitigating such harms by enabling the detection and documentation of LLM-generated text. Yet a crucial question remains: How reliable is watermarking in realistic settings in the wild? There, watermarked… ▽ More

    Submitted 1 May, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages in the main body. Published at ICLR 2024. Code is available at https://github.com/jwkirchenbauer/lm-watermarking

  47. Study of Subjective and Objective Quality Assessment of Mobile Cloud Gaming Videos

    Authors: Avinab Saha, Yu-Chih Chen, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: We present the outcomes of a recent large-scale subjective study of Mobile Cloud Gaming Video Quality Assessment (MCG-VQA) on a diverse set of gaming videos. Rapid advancements in cloud services, faster video encoding technologies, and increased access to high-speed, low-latency wireless internet have all contributed to the exponential growth of the Mobile Cloud Gaming industry. Consequently, the… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Image Processing, 2023. The database will be publicly available by 1st week of July 2023

  48. arXiv:2305.05984  [pdf, other

    eess.IV cs.CV

    Uncertainty-Aware Semi-Supervised Learning for Prostate MRI Zonal Segmentation

    Authors: Matin Hosseinzadeh, Anindo Saha, Joeran Bosma, Henkjan Huisman

    Abstract: Quality of deep convolutional neural network predictions strongly depends on the size of the training dataset and the quality of the annotations. Creating annotations, especially for 3D medical image segmentation, is time-consuming and requires expert knowledge. We propose a novel semi-supervised learning (SSL) approach that requires only a relatively small number of annotations while being able t… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 9 pages

  49. arXiv:2305.02422  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    GAMIVAL: Video Quality Prediction on Mobile Cloud Gaming Content

    Authors: Yu-Chih Chen, Avinab Saha, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: The mobile cloud gaming industry has been rapidly growing over the last decade. When streaming gaming videos are transmitted to customers' client devices from cloud servers, algorithms that can monitor distorted video quality without having any reference video available are desirable tools. However, creating No-Reference Video Quality Assessment (NR VQA) models that can accurately predict the qual… ▽ More

    Submitted 29 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE SPL 2023. The implementation of GAMIVAL has been made available online: https://github.com/lskdream/GAMIVAL

    MSC Class: 68U10

    Journal ref: IEEE Signal Processing Letters, vol. 30, pp. 324-328, 2023

  50. arXiv:2304.10642  [pdf, other

    cs.CL

    Word Sense Induction with Knowledge Distillation from BERT

    Authors: Anik Saha, Alex Gittens, Bulent Yener

    Abstract: Pre-trained contextual language models are ubiquitously employed for language understanding tasks, but are unsuitable for resource-constrained systems. Noncontextual word embeddings are an efficient alternative in these settings. Such methods typically use one vector to encode multiple different meanings of a word, and incur errors due to polysemy. This paper proposes a two-stage method to distill… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.