Skip to main content

Showing 1–50 of 343 results for author: Sarkar, S

  1. arXiv:2407.08152  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Privacy-Preserving Data Deduplication for Enhancing Federated Learning of Language Models

    Authors: Aydin Abadi, Vishnu Asutosh Dasu, Sumanta Sarkar

    Abstract: Deduplication is a vital preprocessing step that enhances machine learning model performance and saves training time and energy. However, enhancing federated learning through deduplication poses challenges, especially regarding scalability and potential privacy violations if deduplication involves sharing all clients' data. In this paper, we address the problem of deduplication in a federated setu… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2407.06538  [pdf, other

    cs.CL

    Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study

    Authors: Aniruddha Roy, Pretam Ray, Ayush Maheshwari, Sudeshna Sarkar, Pawan Goyal

    Abstract: Neural Machine Translation (NMT) remains a formidable challenge, especially when dealing with low-resource languages. Pre-trained sequence-to-sequence (seq2seq) multi-lingual models, such as mBART-50, have demonstrated impressive performance in various low-resource NMT tasks. However, their pre-training has been confined to 50 languages, leaving out support for numerous low-resource languages, par… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Published at Seventh LoResMT Workshop at ACL 2024

  3. arXiv:2406.17720  [pdf, other

    cs.CV

    Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity

    Authors: Chih-Hsuan Yang, Benjamin Feuer, Zaki Jubery, Zi K. Deng, Andre Nakkab, Md Zahid Hasan, Shivani Chiranjeevi, Kelly Marshall, Nirmal Baishnab, Asheesh K Singh, Arti Singh, Soumik Sarkar, Nirav Merchant, Chinmay Hegde, Baskar Ganapathysubramanian

    Abstract: We introduce Arboretum, the largest publicly accessible dataset designed to advance AI for biodiversity applications. This dataset, curated from the iNaturalist community science platform and vetted by domain experts to ensure accuracy, includes 134.6 million images, surpassing existing datasets in scale by an order of magnitude. The dataset encompasses image-language paired data for a diverse set… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Preprint under review

  4. arXiv:2406.15211  [pdf, other

    cs.CL cs.AI

    How Effective is GPT-4 Turbo in Generating School-Level Questions from Textbooks Based on Bloom's Revised Taxonomy?

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: We evaluate the effectiveness of GPT-4 Turbo in generating educational questions from NCERT textbooks in zero-shot mode. Our study highlights GPT-4 Turbo's ability to generate questions that require higher-order thinking skills, especially at the "understanding" level according to Bloom's Revised Taxonomy. While we find a notable consistency between questions generated by GPT-4 Turbo and those ass… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted at Learnersourcing: Student-Generated Content @ Scale 2024

  5. arXiv:2406.15128  [pdf, other

    eess.IV cs.AI cs.CV

    A Wavelet Guided Attention Module for Skin Cancer Classification with Gradient-based Feature Fusion

    Authors: Ayush Roy, Sujan Sarkar, Sohom Ghosal, Dmitrii Kaplun, Asya Lyanova, Ram Sarkar

    Abstract: Skin cancer is a highly dangerous type of cancer that requires an accurate diagnosis from experienced physicians. To help physicians diagnose skin cancer more efficiently, a computer-aided diagnosis (CAD) system can be very helpful. In this paper, we propose a novel model, which uses a novel attention mechanism to pinpoint the differences in features across the spatial dimensions and symmetry of t… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  6. arXiv:2406.13081  [pdf, other

    cs.CV

    Class-specific Data Augmentation for Plant Stress Classification

    Authors: Nasla Saleem, Aditya Balu, Talukder Zaki Jubery, Arti Singh, Asheesh K. Singh, Soumik Sarkar, Baskar Ganapathysubramanian

    Abstract: Data augmentation is a powerful tool for improving deep learning-based image classifiers for plant stress identification and classification. However, selecting an effective set of augmentations from a large pool of candidates remains a key challenge, particularly in imbalanced and confounding datasets. We propose an approach for automated class-specific data augmentation using a genetic algorithm.… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.11868  [pdf, other

    cs.CY cs.AI

    Ethical Framework for Responsible Foundational Models in Medical Imaging

    Authors: Abhijit Das, Debesh Jha, Jasmer Sanjotra, Onkar Susladkar, Suramyaa Sarkar, Ashish Rauniyar, Nikhil Tomar, Vanshali Sharma, Ulas Bagci

    Abstract: Foundational models (FMs) have tremendous potential to revolutionize medical imaging. However, their deployment in real-world clinical settings demands extensive ethical considerations. This paper aims to highlight the ethical concerns related to FMs and propose a framework to guide their responsible development and implementation within medicine. We meticulously examine ethical issues such as pri… ▽ More

    Submitted 13 April, 2024; originally announced June 2024.

  8. arXiv:2406.00039  [pdf

    cs.CL

    How Ready Are Generative Pre-trained Large Language Models for Explaining Bengali Grammatical Errors?

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: Grammatical error correction (GEC) tools, powered by advanced generative artificial intelligence (AI), competently correct linguistic inaccuracies in user input. However, they often fall short in providing essential natural language explanations, which are crucial for learning languages and gaining a deeper understanding of the grammatical rules. There is limited exploration of these tools in low-… ▽ More

    Submitted 27 May, 2024; originally announced June 2024.

    Comments: Accepted at Educational Data Mining 2024

  9. arXiv:2405.11579  [pdf, ps, other

    cs.CL

    Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: In the era of generative artificial intelligence (AI), the fusion of large language models (LLMs) offers unprecedented opportunities for innovation in the field of modern education. We embark on an exploration of prompted LLMs within the context of educational and assessment applications to uncover their potential. Through a series of carefully crafted research questions, we investigate the effect… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted at EDM 2024

  10. arXiv:2405.10951  [pdf, other

    cs.CV cs.LG

    Block Selective Reprogramming for On-device Training of Vision Transformers

    Authors: Sreetama Sarkar, Souvik Kundu, Kai Zheng, Peter A. Beerel

    Abstract: The ubiquity of vision transformers (ViTs) for various edge applications, including personalized learning, has created the demand for on-device fine-tuning. However, training with the limited memory and computation power of edge devices remains a significant challenge. In particular, the memory required for training is much higher than that needed for inference, primarily due to the need to store… ▽ More

    Submitted 25 March, 2024; originally announced May 2024.

  11. arXiv:2405.08751  [pdf, other

    cs.CL cs.IR

    From Text to Context: An Entailment Approach for News Stakeholder Classification

    Authors: Alapan Kuila, Sudeshna Sarkar

    Abstract: Navigating the complex landscape of news articles involves understanding the various actors or entities involved, referred to as news stakeholders. These stakeholders, ranging from policymakers to opposition figures, citizens, and more, play pivotal roles in shaping news narratives. Recognizing their stakeholder types, reflecting their roles, political alignments, social standing, and more, is par… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted in SIGIR 2024

  12. arXiv:2405.00012  [pdf, other

    quant-ph cs.CC cs.DS

    A quantum neural network framework for scalable quantum circuit approximation of unitary matrices

    Authors: Rohit Sarma Sarkar, Bibhas Adhikari

    Abstract: In this paper, we develop a Lie group theoretic approach for parametric representation of unitary matrices. This leads to develop a quantum neural network framework for quantum circuit approximation of multi-qubit unitary gates. Layers of the neural networks are defined by product of exponential of certain elements of the Standard Recursive Block Basis, which we introduce as an alternative to Paul… ▽ More

    Submitted 7 February, 2024; originally announced May 2024.

    Comments: 58 pages. arXiv admin note: substantial text overlap with arXiv:2304.14096

  13. arXiv:2404.12498  [pdf

    cs.LG cs.AI eess.SY

    A Configurable Pythonic Data Center Model for Sustainable Cooling and ML Integration

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutierrez, Vineet Gundecha, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: There have been growing discussions on estimating and subsequently reducing the operational carbon footprint of enterprise data centers. The design and intelligent control for data centers have an important impact on data center carbon footprint. In this paper, we showcase PyDCM, a Python library that enables extremely fast prototyping of data center design and applies reinforcement learning-enabl… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning https://www.climatechange.ai/papers/neurips2023/15. arXiv admin note: substantial text overlap with arXiv:2310.03906

  14. arXiv:2404.10991  [pdf

    cs.AI cs.LG eess.SY

    Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves

    Authors: Soumyendu Sarkar, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ashwin Ramesh Babu, Avisek Naug, Alexandre Pichard, Mathieu Cocho

    Abstract: The industrial multi-generator Wave Energy Converters (WEC) must handle multiple simultaneous waves coming from different directions called spread waves. These complex devices in challenging circumstances need controllers with multiple objectives of energy capture efficiency, reduction of structural stress to limit maintenance, and proactive protection against high waves. The Multi-Agent Reinforce… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: IJCAI 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial IntelligenceAugust 2023

    Journal ref: IJCAI 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial IntelligenceAugust 2023, Article No 688, Pages 6201 to 6209

  15. arXiv:2404.10786  [pdf

    cs.DC cs.AI cs.LG cs.MA eess.SY

    Sustainability of Data Center Digital Twins with Reinforcement Learning

    Authors: Soumyendu Sarkar, Avisek Naug, Antonio Guillen, Ricardo Luna, Vineet Gundecha, Ashwin Ramesh Babu, Sajad Mousavi

    Abstract: The rapid growth of machine learning (ML) has led to an increased demand for computational power, resulting in larger data centers (DCs) and higher energy consumption. To address this issue and reduce carbon emissions, intelligent design and control of DC components such as IT servers, cabinets, HVAC cooling, flexible load shifting, and battery energy storage are essential. However, the complexity… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 20, pp. 22322-22330, Mar. 2024

  16. arXiv:2404.10575  [pdf, other

    cs.LG cs.AI cs.CV math.OC

    EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence

    Authors: Chung-Yiu Yau, Hoi-To Wai, Parameswaran Raman, Soumajyoti Sarkar, Mingyi Hong

    Abstract: A key challenge in contrastive learning is to generate negative samples from a large sample set to contrast with positive samples, for learning better encoding of the data. These negative samples often follow a softmax distribution which are dynamically updated during the training process. However, sampling from this distribution is non-trivial due to the high computational costs in computing the… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 20 pages

  17. The Impact of Machine Learning on Society: An Analysis of Current Trends and Future Implications

    Authors: Md Kamrul Hossain Siam, Manidipa Bhattacharjee, Shakik Mahmud, Md. Saem Sarkar, Md. Masud Rana

    Abstract: The Machine learning (ML) is a rapidly evolving field of technology that has the potential to greatly impact society in a variety of ways. However, there are also concerns about the potential negative effects of ML on society, such as job displacement and privacy issues. This research aimed to conduct a comprehensive analysis of the current and future impact of ML on society. The research included… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 12 pages

  18. arXiv:2404.08079  [pdf, other

    cs.LG cs.CV math.OC

    DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models

    Authors: Nastaran Saadati, Minh Pham, Nasla Saleem, Joshua R. Waite, Aditya Balu, Zhanhong Jiang, Chinmay Hegde, Soumik Sarkar

    Abstract: Recent advances in decentralized deep learning algorithms have demonstrated cutting-edge performance on various tasks with large pre-trained models. However, a pivotal prerequisite for achieving this level of competitiveness is the significant communication and computation overheads when updating these models, which prohibits the applications of them to real-world scenarios. To address this issue,… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 accepted paper, 22 pages, 12 figures

  19. arXiv:2404.06423  [pdf, other

    cs.RO cs.AI cs.LG

    Deep Reinforcement Learning-Based Approach for a Single Vehicle Persistent Surveillance Problem with Fuel Constraints

    Authors: Manav Mishra, Hritik Bana, Saswata Sarkar, Sujeevraja Sanjeevi, PB Sujit, Kaarthik Sundar

    Abstract: This article presents a deep reinforcement learning-based approach to tackle a persistent surveillance mission requiring a single unmanned aerial vehicle initially stationed at a depot with fuel or time-of-flight constraints to repeatedly visit a set of targets with equal priority. Owing to the vehicle's fuel or time-of-flight constraints, the vehicle must be regularly refueled, or its battery mus… ▽ More

    Submitted 2 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: 6 pages

    Report number: LA-UR-24-23186

  20. arXiv:2404.04361  [pdf, other

    cs.CL

    Deciphering Political Entity Sentiment in News with Large Language Models: Zero-Shot and Few-Shot Strategies

    Authors: Alapan Kuila, Sudeshna Sarkar

    Abstract: Sentiment analysis plays a pivotal role in understanding public opinion, particularly in the political domain where the portrayal of entities in news articles influences public perception. In this paper, we investigate the effectiveness of Large Language Models (LLMs) in predicting entity-specific sentiment from political news articles. Leveraging zero-shot and few-shot strategies, we explore the… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted in PoliticalNLP workshop co-located with LREC-COLING 2024

  21. arXiv:2403.19062  [pdf, other

    eess.SY cs.RO

    GENESIS-RL: GEnerating Natural Edge-cases with Systematic Integration of Safety considerations and Reinforcement Learning

    Authors: Hsin-Jung Yang, Joe Beck, Md Zahid Hasan, Ekin Beyazit, Subhadeep Chakraborty, Tichakorn Wongpiromsarn, Soumik Sarkar

    Abstract: In the rapidly evolving field of autonomous systems, the safety and reliability of the system components are fundamental requirements. These components are often vulnerable to complex and unforeseen environments, making natural edge-case generation essential for enhancing system resilience. This paper presents GENESIS-RL, a novel framework that leverages system-level safety considerations and rein… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  22. arXiv:2403.18985  [pdf

    cs.LG cs.AI cs.CR cs.CV cs.MA

    Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning

    Authors: Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Avisek Naug, Sahand Ghorbanpour

    Abstract: We present a generic Reinforcement Learning (RL) framework optimized for crafting adversarial attacks on different model types spanning from ECG signal analysis (1D), image classification (2D), and video classification (3D). The framework focuses on identifying sensitive regions and inducing misclassifications with minimal distortions and various distortion types. The novel RL method outperforms s… ▽ More

    Submitted 22 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: AAAI Proceedings reference: https://ojs.aaai.org/index.php/AAAI/article/view/30579

    Journal ref: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

  23. arXiv:2403.14092  [pdf

    cs.LG cs.AI cs.MA eess.SY

    Carbon Footprint Reduction for Sustainable Data Centers in Real-Time

    Authors: Soumyendu Sarkar, Avisek Naug, Ricardo Luna, Antonio Guillen, Vineet Gundecha, Sahand Ghorbanpour, Sajad Mousavi, Dejan Markovikj, Ashwin Ramesh Babu

    Abstract: As machine learning workloads significantly increase energy consumption, sustainable data centers with low carbon emissions are becoming a top priority for governments and corporations worldwide. This requires a paradigm shift in optimizing power consumption in cooling and IT loads, shifting flexible loads based on the availability of renewable energy in the power grid, and leveraging battery stor… ▽ More

    Submitted 25 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Journal ref: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

  24. arXiv:2403.07025  [pdf, other

    quant-ph cs.LG

    Enhancing Quantum Variational Algorithms with Zero Noise Extrapolation via Neural Networks

    Authors: Subhasree Bhattacharjee, Soumyadip Sarkar, Kunal Das, Bikramjit Sarkar

    Abstract: In the emergent realm of quantum computing, the Variational Quantum Eigensolver (VQE) stands out as a promising algorithm for solving complex quantum problems, especially in the noisy intermediate-scale quantum (NISQ) era. However, the ubiquitous presence of noise in quantum devices often limits the accuracy and reliability of VQE outcomes. This research introduces a novel approach to ameliorate t… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  25. arXiv:2403.03920  [pdf, other

    cs.AI cs.CL cs.HC

    Enhancing Instructional Quality: Leveraging Computer-Assisted Textual Analysis to Generate In-Depth Insights from Educational Artifacts

    Authors: Zewei Tian, Min Sun, Alex Liu, Shawon Sarkar, Jing Liu

    Abstract: This paper explores the transformative potential of computer-assisted textual analysis in enhancing instructional quality through in-depth insights from educational artifacts. We integrate Richard Elmore's Instructional Core Framework to examine how artificial intelligence (AI) and machine learning (ML) methods, particularly natural language processing (NLP), can analyze educational content, teach… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  26. arXiv:2402.18751  [pdf, other

    cs.LG cs.CV

    Multi-Sensor and Multi-temporal High-Throughput Phenotyping for Monitoring and Early Detection of Water-Limiting Stress in Soybean

    Authors: Sarah E. Jones, Timilehin Ayanlade, Benjamin Fallen, Talukder Z. Jubery, Arti Singh, Baskar Ganapathysubramanian, Soumik Sarkar, Asheesh K. Singh

    Abstract: Soybean production is susceptible to biotic and abiotic stresses, exacerbated by extreme weather events. Water limiting stress, i.e. drought, emerges as a significant risk for soybean production, underscoring the need for advancements in stress monitoring for crop breeding and production. This project combines multi-modal information to identify the most effective and efficient automated methods t… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 25 pages, 5 figures

  27. arXiv:2402.17346  [pdf, other

    physics.flu-dyn cs.LG

    Understanding the training of PINNs for unsteady flow past a plunging foil through the lens of input subdomain level loss function gradients

    Authors: Rahul Sundar, Didier Lucor, Sunetra Sarkar

    Abstract: Recently immersed boundary method-inspired physics-informed neural networks (PINNs) including the moving boundary-enabled PINNs (MB-PINNs) have shown the ability to accurately reconstruct velocity and recover pressure as a hidden variable for unsteady flow past moving bodies. Considering flow past a plunging foil, MB-PINNs were trained with global physics loss relaxation and also in conjunction wi… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  28. arXiv:2402.17337  [pdf, other

    cs.DC physics.comp-ph physics.flu-dyn

    Massive parallelization and performance enhancement of an immersed boundary method based unsteady flow solver

    Authors: Rahul Sundar, Dipanjan Majumdar, Chhote Lal Shah, Sunetra Sarkar

    Abstract: High-fidelity simulations of unsteady fluid flow are now possible with advancements in high-performance computing hardware and software frameworks. Since computational fluid dynamics (CFD) computations are dominated by linear algebraic routines, they can be significantly accelerated through massive parallelization on graphics processing units (GPUs). Thus, GPU implementation of high-fidelity CFD s… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  29. arXiv:2402.17008  [pdf, other

    cs.CL

    Benchmarking LLMs on the Semantic Overlap Summarization Task

    Authors: John Salvador, Naman Bansal, Mousumi Akter, Souvika Sarkar, Anupam Das, Shubhra Kanti Karmaker

    Abstract: Semantic Overlap Summarization (SOS) is a constrained multi-document summarization task, where the constraint is to capture the common/overlapping information between two alternative narratives. While recent advancements in Large Language Models (LLMs) have achieved superior performance in numerous summarization tasks, a benchmarking study of the SOS task using LLMs is yet to be performed. As LLMs… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  30. arXiv:2402.15589  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Prompting LLMs to Compose Meta-Review Drafts from Peer-Review Narratives of Scholarly Manuscripts

    Authors: Shubhra Kanti Karmaker Santu, Sanjeev Kumar Sinha, Naman Bansal, Alex Knipper, Souvika Sarkar, John Salvador, Yash Mahajan, Sri Guttikonda, Mousumi Akter, Matthew Freestone, Matthew C. Williams Jr

    Abstract: One of the most important yet onerous tasks in the academic peer-reviewing process is composing meta-reviews, which involves understanding the core contributions, strengths, and weaknesses of a scholarly manuscript based on peer-review narratives from multiple experts and then summarizing those multiple experts' perspectives into a concise holistic overview. Given the latest major developments in… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    ACM Class: I.2.7

  31. arXiv:2402.10344  [pdf, other

    cs.CV

    Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions

    Authors: Muhammad Arbab Arshad, Talukder Jubery, James Afful, Anushrut Jignasu, Aditya Balu, Baskar Ganapathysubramanian, Soumik Sarkar, Adarsh Krishnamurthy

    Abstract: We evaluate different Neural Radiance Fields (NeRFs) techniques for reconstructing (3D) plants in varied environments, from indoor settings to outdoor fields. Traditional techniques often struggle to capture the complex details of plants, which is crucial for botanical and agricultural understanding. We evaluate three scenarios with increasing complexity and compare the results with the point clou… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  32. arXiv:2402.08055  [pdf, other

    quant-ph cs.DC cs.ET

    A Quantum Algorithm Based Heuristic to Hide Sensitive Itemsets

    Authors: Abhijeet Ghoshal, Yan Li, Syam Menon, Sumit Sarkar

    Abstract: Quantum devices use qubits to represent information, which allows them to exploit important properties from quantum physics, specifically superposition and entanglement. As a result, quantum computers have the potential to outperform the most advanced classical computers. In recent years, quantum algorithms have shown hints of this promise, and many algorithms have been proposed for the quantum do… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Journal ref: Workshop on Information Technologies and Systems WITS 2023

  33. arXiv:2402.07281  [pdf, other

    cs.LG

    Can Tree Based Approaches Surpass Deep Learning in Anomaly Detection? A Benchmarking Study

    Authors: Santonu Sarkar, Shanay Mehta, Nicole Fernandes, Jyotirmoy Sarkar, Snehanshu Saha

    Abstract: Detection of anomalous situations for complex mission-critical systems holds paramount importance when their service continuity needs to be ensured. A major challenge in detecting anomalies from the operational data arises due to the imbalanced class distribution problem since the anomalies are supposed to be rare events. This paper evaluates a diverse array of machine learning-based anomaly detec… ▽ More

    Submitted 25 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  34. arXiv:2402.05521  [pdf, other

    cs.LG cs.AI cs.CR

    Linearizing Models for Efficient yet Robust Private Inference

    Authors: Sreetama Sarkar, Souvik Kundu, Peter A. Beerel

    Abstract: The growing concern about data privacy has led to the development of private inference (PI) frameworks in client-server applications which protects both data privacy and model IP. However, the cryptographic primitives required yield significant latency overhead which limits its wide-spread application. At the same time, changing environments demand the PI service to be robust against various natur… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  35. arXiv:2402.02145  [pdf, other

    cs.CL

    Analyzing Sentiment Polarity Reduction in News Presentation through Contextual Perturbation and Large Language Models

    Authors: Alapan Kuila, Somnath Jena, Sudeshna Sarkar, Partha Pratim Chakrabarti

    Abstract: In today's media landscape, where news outlets play a pivotal role in shaping public opinion, it is imperative to address the issue of sentiment manipulation within news text. News writers often inject their own biases and emotional language, which can distort the objectivity of reporting. This paper introduces a novel approach to tackle this problem by reducing the polarity of latent sentiments i… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted in ICON 2023

  36. arXiv:2401.16577  [pdf, other

    cs.CL cs.AI

    LLMs as On-demand Customizable Service

    Authors: Souvika Sarkar, Mohammad Fakhruddin Babar, Monowar Hasan, Shubhra Kanti Karmaker

    Abstract: Large Language Models (LLMs) have demonstrated remarkable language understanding and generation capabilities. However, training, deploying, and accessing these models pose notable challenges, including resource-intensive demands, extended training durations, and scalability issues. To address these issues, we introduce a concept of hierarchical, distributed LLM architecture that aims at enhancing… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  37. arXiv:2401.11023  [pdf, other

    quant-ph cs.DM

    Quantum circuit model for discrete-time three-state quantum walks on Cayley graphs

    Authors: Rohit Sarma Sarkar, Bibhas Adhikari

    Abstract: We develop qutrit circuit models for discrete-time three-state quantum walks on Cayley graphs corresponding to Dihedral groups $D_N$ and the additive groups of integers modulo any positive integer $N$. The proposed circuits comprise of elementary qutrit gates such as qutrit rotation gates, qutrit-$X$ gates and two-qutrit controlled-$X$ gates. First, we propose qutrit circuit representation of spec… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  38. arXiv:2401.07977  [pdf, ps, other

    cs.CL

    Leveraging External Knowledge Resources to Enable Domain-Specific Comprehension

    Authors: Saptarshi Sengupta, Connor Heaton, Prasenjit Mitra, Soumalya Sarkar

    Abstract: Machine Reading Comprehension (MRC) has been a long-standing problem in NLP and, with the recent introduction of the BERT family of transformer based language models, it has come a long way to getting solved. Unfortunately, however, when BERT variants trained on general text corpora are applied to domain-specific text, their performance inevitably degrades on account of the domain shift i.e. genre… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  39. arXiv:2401.07098  [pdf, other

    cs.CL

    A Novel Multi-Stage Prompting Approach for Language Agnostic MCQ Generation using GPT

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: We introduce a multi-stage prompting approach (MSP) for the generation of multiple choice questions (MCQs), harnessing the capabilities of GPT models such as text-davinci-003 and GPT-4, renowned for their excellence across various NLP tasks. Our approach incorporates the innovative concept of chain-of-thought prompting, a progressive technique in which the GPT model is provided with a series of in… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Accepted at ECIR 2024(short paper)

  40. arXiv:2401.06981  [pdf, ps, other

    cs.DS

    Online Matroid Intersection: Submodular Water-Filling and Matroidal Welfare Maximization

    Authors: Daniel Hathcock, Billy Jin, Kalen Patton, Sherry Sarkar, Michael Zlatin

    Abstract: We study two problems in online matroid intersection. First, we consider the problem of maximizing the size of a common independent set between a general matroid and a partition matroid whose parts arrive online. This captures the classic online bipartite matching problem when both matroids are partition matroids. Our main result is a $(1 - \frac{1}{e})$-competitive algorithm for the fractional ve… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  41. arXiv:2312.12338  [pdf, other

    cs.CY

    Smart Connected Farms and Networked Farmers to Tackle Climate Challenges Impacting Agricultural Production

    Authors: Behzad J. Balabaygloo, Barituka Bekee, Samuel W. Blair, Suzanne Fey, Fateme Fotouhi, Ashish Gupta, Kevin Menke, Anusha Vangala, Jorge C. M. Palomares, Aaron Prestholt, Vishesh K. Tanwar, Xu Tao, Matthew E. Carroll, Sajal Das, Gil Depaula, Peter Kyveryga, Soumik Sarkar, Michelle Segovia, Simone Sylvestri, Corinne Valdivia, Asheesh K. Singh

    Abstract: To meet the grand challenges of agricultural production including climate change impacts on crop production, a tight integration of social science, technology and agriculture experts including farmers are needed. There are rapid advances in information and communication technology, precision agriculture and data analytics, which are creating a fertile field for the creation of smart connected farm… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  42. arXiv:2312.03088  [pdf, other

    cs.CL

    LLMs for Multi-Modal Knowledge Extraction and Analysis in Intelligence/Safety-Critical Applications

    Authors: Brett Israelsen, Soumalya Sarkar

    Abstract: Large Language Models have seen rapid progress in capability in recent years; this progress has been accelerating and their capabilities, measured by various benchmarks, are beginning to approach those of humans. There is a strong demand to use such models in a wide variety of applications but, due to unresolved vulnerabilities and limitations, great care needs to be used before applying them to i… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: initial draft

  43. arXiv:2312.02722  [pdf, other

    cs.CG cs.DS

    Improved Algorithms for Minimum-Membership Geometric Set Cover

    Authors: Sathish Govindarajan, Siddhartha Sarkar

    Abstract: Bandyapadhyay et al. introduced the generalized minimum-membership geometric set cover (GMMGSC) problem [SoCG, 2023], which is defined as follows. We are given two sets $P$ and $P'$ of points in $\mathbb{R}^{2}$, $n=\max(|P|, |P'|)$, and a set $\mathcal{S}$ of $m$ axis-parallel unit squares. The goal is to find a subset $\mathcal{S}^{*}\subseteq \mathcal{S}$ that covers all the points in $P$ while… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: To appear in CALDAM 2024

  44. arXiv:2312.01032  [pdf, other

    cs.CL cs.AI

    Harnessing the Power of Prompt-based Techniques for Generating School-Level Questions using Large Language Models

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: Designing high-quality educational questions is a challenging and time-consuming task. In this work, we propose a novel approach that utilizes prompt-based techniques to generate descriptive and reasoning-based questions. However, current question-answering (QA) datasets are inadequate for conducting our experiments on prompt-based question generation (QG) in an educational setting. Therefore, we… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  45. arXiv:2311.13152  [pdf, other

    cs.CV

    Test-Time Augmentation for 3D Point Cloud Classification and Segmentation

    Authors: Tuan-Anh Vu, Srinjay Sarkar, Zhiyuan Zhang, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Data augmentation is a powerful technique to enhance the performance of a deep learning task but has received less attention in 3D deep learning. It is well known that when 3D shapes are sparsely represented with low point density, the performance of the downstream tasks drops significantly. This work explores test-time augmentation (TTA) for 3D point clouds. We are inspired by the recent revoluti… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: This paper is accepted in 3DV 2024

  46. arXiv:2311.12404  [pdf, other

    cs.CL cs.IR

    InterPrompt: Interpretable Prompting for Interrelated Interpersonal Risk Factors in Reddit Posts

    Authors: MSVPJ Sathvik, Surjodeep Sarkar, Chandni Saxena, Sunghwan Sohn, Muskan Garg

    Abstract: Mental health professionals and clinicians have observed the upsurge of mental disorders due to Interpersonal Risk Factors (IRFs). To simulate the human-in-the-loop triaging scenario for early detection of mental health disorders, we recognized textual indications to ascertain these IRFs : Thwarted Belongingness (TBe) and Perceived Burdensomeness (PBu) within personal narratives. In light of this,… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 5 pages

  47. arXiv:2311.10718  [pdf, ps, other

    q-fin.TR cs.LG

    Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration

    Authors: Soumyadip Sarkar

    Abstract: The realm of High-Frequency Trading (HFT) is characterized by rapid decision-making processes that capitalize on fleeting market inefficiencies. As the financial markets become increasingly competitive, there is a pressing need for innovative strategies that can adapt and evolve with changing market dynamics. Enter Reinforcement Learning (RL), a branch of machine learning where agents learn by int… ▽ More

    Submitted 13 September, 2023; originally announced November 2023.

  48. arXiv:2311.10548  [pdf, other

    cs.DC

    Efficient Profit Maximization in Reliability Concerned Static Vehicular Cloud System

    Authors: Suvarthi Sarkar, Akshat Arun, Harshit Surekha, Aryabartta Sahu

    Abstract: Modern electric VUs are equipped with a variety of increasingly potent computing, communication, and storage resources, and with this tremendous computation power in their arsenal can be used to enhance the computing power of regular cloud systems, which is termed as vehicular cloud. Unlike in the traditional cloud computing resources, these vehicular cloud resource moves around and participates i… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  49. arXiv:2311.09024  [pdf, other

    cs.CV

    Fast Certification of Vision-Language Models Using Incremental Randomized Smoothing

    Authors: A K Nirala, A Joshi, C Hegde, S Sarkar

    Abstract: A key benefit of deep vision-language models such as CLIP is that they enable zero-shot open vocabulary classification; the user has the ability to define novel class labels via natural language prompts at inference time. However, while CLIP-based zero-shot classifiers have demonstrated competitive performance across a range of domain shifts, they remain highly vulnerable to adversarial attacks. T… ▽ More

    Submitted 4 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  50. LocaliseBot: Multi-view 3D object localisation with differentiable rendering for robot grasping

    Authors: Sujal Vijayaraghavan, Redwan Alqasemi, Rajiv Dubey, Sudeep Sarkar

    Abstract: Robot grasp typically follows five stages: object detection, object localisation, object pose estimation, grasp pose estimation, and grasp planning. We focus on object pose estimation. Our approach relies on three pieces of information: multiple views of the object, the camera's extrinsic parameters at those viewpoints, and 3D CAD models of objects. The first step involves a standard deep learning… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.