Showing 1–50 of 84 results for author: Sharma, N

  1. arXiv:2407.08811  [pdf, other]

    eess.IV cs.CV

    CXR-Agent: Vision-language models for chest X-ray interpretation with uncertainty aware radiology reporting

    Authors: Naman Sharma

    Abstract: Recently, large vision-language models have shown potential when interpreting complex images and generating natural language descriptions using advanced reasoning. Medicine's inherently multimodal nature, incorporating scans and text-based medical histories to write reports, makes it conducive to benefit from these leaps in AI capabilities. We evaluate the publicly available, state-of-the-art, founda…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Supervised by Professor Ben Glocker

  2. arXiv:2407.06547  [pdf, other]

    cs.CL

    Deciphering Assamese Vowel Harmony with Featural InfoWaveGAN

    Authors: Sneha Ray Barman, Shakuntala Mahanta, Neeraj Kumar Sharma

    Abstract: Traditional approaches for understanding phonological learning have predominantly relied on curated text data. Although insightful, such approaches limit the knowledge captured in textual representations of the spoken language. To overcome this limitation, we investigate the potential of the Featural InfoWaveGAN model to learn iterative long-distance vowel harmony using raw speech data. We focus o…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: to be included in the Interspeech Proceedings

  3. arXiv:2407.05887  [pdf, other]

    cs.CL cs.AI cs.LG

    Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs

    Authors: Sanjeet Singh, Shreya Gupta, Niralee Gupta, Naimish Sharma, Lokesh Srivastava, Vibhu Agarwal, Ashutosh Modi

    Abstract: The consequences of a healthcare data breach can be devastating for the patients, providers, and payers. The average financial impact of a data breach in recent months has been estimated to be close to USD 10 million. This is especially significant for healthcare organizations in India that are managing rapid digitization while still establishing data governance procedures that align with the lett…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at BioNLP Workshop at ACL 2024; 21 pages (9 pages main content)

  4. arXiv:2407.05502  [pdf, other]

    cs.CL cs.AI cs.IR

    Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models

    Authors: Nikhil Sharma, Kenton Murray, Ziang Xiao

    Abstract: With Retrieval Augmented Generation (RAG), Large Language Models (LLMs) are playing a pivotal role in information search and are being adopted globally. Although the multilingual capability of LLMs offers new opportunities to bridge the language barrier, do these capabilities translate into real-life scenarios where linguistic divide and knowledge conflicts between multilingual sources are known o…

    Submitted 7 July, 2024; originally announced July 2024.

  5. arXiv:2405.18061  [pdf, other]

    cs.CL

    Context is Important in Depressive Language: A Study of the Interaction Between the Sentiments and Linguistic Markers in Reddit Discussions

    Authors: Neha Sharma, Kairit Sirts

    Abstract: Research exploring linguistic markers in individuals with depression has demonstrated that language usage can serve as an indicator of mental health. This study investigates the impact of discussion topic as context on linguistic markers and emotional expression in depression, using a Reddit dataset to explore interaction effects. Contrary to common findings, our sentiment analysis revealed a broa…

    Submitted 3 July, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2404.11949  [pdf, other]

    cs.CV cs.AI cs.LG

    Sketch-guided Image Inpainting with Partial Discrete Diffusion Process

    Authors: Nakul Sharma, Aditay Tripathi, Anirban Chakraborty, Anand Mishra

    Abstract: In this work, we study the task of sketch-guided image inpainting. Unlike the well-explored natural language-guided image inpainting, which excels in capturing semantic details, the relatively less-studied sketch-guided inpainting offers greater user control in specifying the object's shape and pose to be inpainted. As one of the early solutions to this task, we introduce a novel partial discrete…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted to NTIRE Workshop @ CVPR 2024

  7. arXiv:2403.13272  [pdf, other]

    cs.CY cs.CL cs.SI

    Community Needs and Assets: A Computational Analysis of Community Conversations

    Authors: Md Towhidul Absar Chowdhury, Naveen Sharma, Ashiqur R. KhudaBukhsh

    Abstract: A community needs assessment is a tool used by non-profits and government agencies to quantify the strengths and issues of a community, allowing them to allocate their resources better. Such approaches are transitioning towards leveraging social media conversations to analyze the needs of communities and the assets already present within them. However, manual analysis of exponentially increasing s…

    Submitted 19 March, 2024; originally announced March 2024.

  8. arXiv:2403.10507  [pdf, other]

    cs.SE

    Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization

    Authors: Ratnadira Widyasari, Jia Wei Ang, Truong Giang Nguyen, Neil Sharma, David Lo

    Abstract: Fault localization is a critical process that involves identifying specific program elements responsible for program failures. Manually pinpointing these elements, such as classes, methods, or statements, which are associated with a fault is laborious and time-consuming. To overcome this challenge, various fault localization tools have been developed. These tools typically generate a ranked list o…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: To appear at the 2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)

  9. arXiv:2403.06140  [pdf, other]

    cs.CE

    RADS : Restricted Anisotropic Diffusion Spectrum model for Axonal Health quantification in Multiple Sclerosis

    Authors: Nand Sharma

    Abstract: Axonal damage is the primary pathological correlate of long-term impairment in multiple sclerosis (MS). Our previous work using our method - diffusion basis spectrum imaging (DBSI) - demonstrated a strong, quantitative relationship between axial diffusivity and axonal damage. In the present work, we develop an extension of DBSI which can be used to quantify the fraction of diseased and healthy axo…

    Submitted 10 March, 2024; originally announced March 2024.

  10. arXiv:2402.13528  [pdf, other]

    cs.CY cs.CL cs.LG cs.SI

    Infrastructure Ombudsman: Mining Future Failure Concerns from Structural Disaster Response

    Authors: Md Towhidul Absar Chowdhury, Soumyajit Datta, Naveen Sharma, Ashiqur R. KhudaBukhsh

    Abstract: Current research concentrates on studying discussions on social media related to structural failures to improve disaster response strategies. However, detecting social web posts discussing concerns about anticipatory failures is under-explored. If such concerns are channeled to the appropriate authorities, it can aid in the prevention and mitigation of potential infrastructural failures. In this p…

    Submitted 21 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  11. arXiv:2402.05880  [pdf, other]

    cs.CL cs.AI cs.HC

    Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking

    Authors: Nikhil Sharma, Q. Vera Liao, Ziang Xiao

    Abstract: Large language model (LLM)-powered conversational search systems have already been used by hundreds of millions of people, and are believed to bring many benefits over conventional search. However, while decades of research and public discourse interrogated the risk of search systems in increasing selective exposure and creating echo chambers -- limiting exposure to diverse opinions and leading…

    Submitted 10 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted in CHI'24. Supplementary material will be available online with the official submission in CHI 2024

  12. arXiv:2402.01931  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS

    Digits micro-model for accurate and secure transactions

    Authors: Chirag Chhablani, Nikhita Sharma, Jordan Hosier, Vijay K. Gurbani

    Abstract: Automatic Speech Recognition (ASR) systems are used in the financial domain to enhance the caller experience by enabling natural language understanding and facilitating efficient and intuitive interactions. Increasing use of ASR systems requires that such systems exhibit very low error rates. The predominant ASR models to collect numeric data are large, general-purpose commercial models -- Google…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 7 pages, 1 figure, 5 tables

  13. Applications of Machine Learning to Optimizing Polyolefin Manufacturing

    Authors: Niket Sharma, Y. A. Liu

    Abstract: This chapter is a preprint from our book by , focusing on leveraging machine learning (ML) in chemical and polyolefin manufacturing optimization. It's crafted for both novices and seasoned professionals keen on the latest ML applications in chemical processes. We trace the evolution of AI and ML in chemical industries, delineate core ML components, and provide resources for ML beginners. A detaile…

    Submitted 18 January, 2024; originally announced January 2024.

  14. arXiv:2401.08987  [pdf, other]

    quant-ph cs.CR

    The Quantum Cryptography Approach: Unleashing the Potential of Quantum Key Reconciliation Protocol for Secure Communication

    Authors: Neha Sharma, Vikas Saxena

    Abstract: Quantum cryptography is the study of delivering secret communications across a quantum channel. Recently, Quantum Key Distribution (QKD) has been recognized as the most important breakthrough in quantum cryptography. This process facilitates two distant parties to share secure communications based on physical laws. The BB84 protocol was developed in 1984 and remains the most widely used among BB92…

    Submitted 17 January, 2024; originally announced January 2024.

  15. arXiv:2401.07070  [pdf, other]

    econ.TH cs.MA

    A Dynamic Agent Based Model of the Real Economy with Monopolistic Competition, Perfect Product Differentiation, Heterogeneous Agents, Increasing Returns to Scale and Trade in Disequilibrium

    Authors: Subhamon Supantha, Naresh Kumar Sharma

    Abstract: We have used agent-based modeling as our numerical method to artificially simulate a dynamic real economy where agents are rational maximizers of an objective function of Cobb-Douglas type. The economy is characterised by heterogeneous agents, acting out of local or imperfect information, monopolistic competition, perfect product differentiation, allowance for increasing returns to scale technolog…

    Submitted 13 January, 2024; originally announced January 2024.

  16. arXiv:2312.07813  [pdf, other]

    cs.OS cs.LG

    On a Foundation Model for Operating Systems

    Authors: Divyanshu Saxena, Nihal Sharma, Donghyun Kim, Rohit Dwivedula, Jiayi Chen, Chenxi Yang, Sriram Ravula, Zichao Hu, Aditya Akella, Sebastian Angel, Joydeep Biswas, Swarat Chaudhuri, Isil Dillig, Alex Dimakis, P. Brighten Godfrey, Daehyeok Kim, Chris Rossbach, Gang Wang

    Abstract: This paper lays down the research agenda for a domain-specific foundation model for operating systems (OSes). Our case for a foundation model revolves around the observations that several OS components such as CPU, memory, and network subsystems are interrelated and that OS traces offer the ideal dataset for a foundation model to grasp the intricacies of diverse OS components and their behavior in…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Machine Learning for Systems Workshop at 37th NeurIPS Conference, 2023, New Orleans, LA, USA

  17. arXiv:2310.08056  [pdf, other]

    cs.LG cs.AI

    Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation

    Authors: Shreyas Havaldar, Navodita Sharma, Shubhi Sareen, Karthikeyan Shanmugam, Aravindan Raghuveer

    Abstract: Learning from Label Proportions (LLP) is a learning problem where only aggregate level labels are available for groups of instances, called bags, during training, and the aim is to get the best performance at the instance-level on the test data. This setting arises in domains like advertising and medicine due to privacy considerations. We propose a novel algorithmic framework for this problem that…

    Submitted 20 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at The Twelfth International Conference on Learning Representations (ICLR 2024) & Oral Presentation at Regulatable ML @ NeurIPS 2023

  18. arXiv:2310.02462  [pdf, other]

    cs.RO cs.AI cs.HC

    Improved Inference of Human Intent by Combining Plan Recognition and Language Feedback

    Authors: Ifrah Idrees, Tian Yun, Naveen Sharma, Yunxin Deng, Nakul Gopalan, George Konidaris, Stefanie Tellex

    Abstract: Conversational assistive robots can aid people, especially those with cognitive impairments, to accomplish various tasks such as cooking meals, performing exercises, or operating machines. However, to interact with people effectively, robots must recognize human plans and goals from noisy observations of human actions, even when the user acts sub-optimally. Previous works on Plan and Goal Recognit…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Published in IROS 2023

  19. arXiv:2305.12741  [pdf, other]

    eess.AS cs.LG cs.SD q-bio.QM

    Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection

    Authors: Debarpan Bhattacharya, Neeraj Kumar Sharma, Debottam Dutta, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K K, Sadhana Gonuguntla, Murali Alagesan

    Abstract: This paper presents the Coswara dataset, a dataset containing a diverse set of respiratory sounds and rich metadata, recorded between April-2020 and February-2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects). The respiratory sounds contained nine sound categories associated with variants of breathing, cough and speech. The rich metadata contained demogr…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in Nature Scientific Data

  20. mlpack 4: a fast, header-only C++ machine learning library

    Authors: Ryan R. Curtin, Marcus Edel, Omar Shrit, Shubham Agrawal, Suryoday Basak, James J. Balamuta, Ryan Birmingham, Kartik Dutt, Dirk Eddelbuettel, Rishabh Garg, Shikhar Jaiswal, Aakash Kaushik, Sangyeon Kim, Anjishnu Mukherjee, Nanubala Gnana Sai, Nippun Sharma, Yashwant Singh Parihar, Roshan Swain, Conrad Sanderson

    Abstract: For over 15 years, the mlpack machine learning library has served as a "swiss army knife" for C++-based machine learning. Its efficient implementations of common and cutting-edge machine learning algorithms have been used in a wide variety of scientific and industrial applications. This paper overviews mlpack 4, a significant upgrade over its predecessor. The library has been significantly refacto…

    Submitted 1 February, 2023; originally announced February 2023.

    Journal ref: Journal of Open Source Software, Vol. 8, No. 82, 2023

  21. arXiv:2211.12926  [pdf, other]

    cs.CV cs.AI cs.LG

    Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification

    Authors: Nakul Sharma, Abhirama S. Penamakuri, Anand Mishra

    Abstract: In this paper, we study the problem of identifying logos of business brands in natural scenes in an open-set one-shot setting. This problem setup is significantly more challenging than traditionally-studied 'closed-set' and 'large-scale training samples per category' logo recognition settings. We propose a novel multi-view textual-visual encoding framework that encodes text appearing in the logos…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted to ICVGIP 2022

  22. arXiv:2210.00590   

    cs.CL

    Community Learning: Understanding A Community Through NLP for Positive Impact

    Authors: Md Towhidul Absar Chowdhury, Naveen Sharma

    Abstract: A post-pandemic world resulted in economic upheaval, particularly for the cities' communities. While significant work in NLP4PI focuses on national and international events, there is a gap in bringing such state-of-the-art methods into the community development field. In order to help with community development, we must learn about the communities we develop. To that end, we propose the task of co…

    Submitted 10 October, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: The article has been withdrawn as the work is incomplete at this point in time. There are significant evaluations required before this work is ready for pre-print. Furthermore, the dataset of NextDoor used in this paper is also not complete. As of this time this work is not applicable

  23. arXiv:2209.09865  [pdf, other]

    cs.RO

    Collisionless Pattern Discovery in Robot Swarms Using Deep Reinforcement Learning

    Authors: Nelson Sharma, Aswini Ghosh, Rajiv Misra, Supratik Mukhopadhyay, Gokarna Sharma

    Abstract: We present a deep reinforcement learning-based framework for automatically discovering patterns available in any given initial configuration of fat robot swarms. In particular, we model the problem of collision-less gathering and mutual visibility in fat robot swarms and discover patterns for solving them using our framework. We show that by shaping reward signals based on certain constraints like…

    Submitted 20 September, 2022; originally announced September 2022.

  24. arXiv:2207.08365  [pdf, other]

    cs.AI

    CausNet : Generational orderings based search for optimal Bayesian networks via dynamic programming with parent set constraints

    Authors: Nand Sharma, Joshua Millstein

    Abstract: Finding a globally optimal Bayesian Network using exhaustive search is a problem with super-exponential complexity, which severely restricts the number of variables that it can work for. We implement a dynamic programming based algorithm with built-in dimensionality reduction and parent set identification. This reduces the search space drastically and can be applied to large-dimensional data. We u…

    Submitted 17 July, 2022; originally announced July 2022.

  25. arXiv:2206.12309  [pdf, other]

    eess.AS cs.LG eess.SP

    Analyzing the impact of SARS-CoV-2 variants on respiratory sound signals

    Authors: Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K K, Sadhana Gonuguntla, Murali Alagesan

    Abstract: The COVID-19 outbreak resulted in multiple waves of infections that have been associated with different SARS-CoV-2 variants. Studies have reported differential impact of the variants on respiratory health of patients. We explore whether acoustic signals, collected from COVID-19 subjects, show computationally distinguishable acoustic patterns suggesting a possibility to predict the underlying virus…

    Submitted 24 June, 2022; originally announced June 2022.

    Journal ref: Interspeech, 2022

  26. arXiv:2206.05053  [pdf, other]

    cs.HC cs.LG cs.SD eess.AS eess.SP

    Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms

    Authors: Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K K, Sadhana Gonuguntla, Murali Alagesan

    Abstract: The COVID-19 pandemic has accelerated research on design of alternative, quick and effective COVID-19 diagnosis approaches. In this paper, we describe the Coswara tool, a website application designed to enable COVID-19 detection by analysing respiratory sound samples and health symptoms. A user using this service can log into a website using any device connected to the internet, provide their curr…

    Submitted 9 June, 2022; originally announced June 2022.

    Journal ref: Interspeech, 2022

  27. arXiv:2202.02481  [pdf, other]

    cs.CY cs.LG

    LotRec: A Recommender for Urban Vacant Lot Conversion

    Authors: Md Towhidul A Chowdhury, Naveen Sharma

    Abstract: Vacant lots are neglected properties in a city that lead to environmental hazards and poor standard of living for the community. Thus, reclaiming vacant lots and putting them to productive use is an important consideration for many cities. Given a large number of vacant lots and resource constraints for conversion, two key questions for a city are (1) whether to convert a vacant lot or not; and (2…

    Submitted 4 February, 2022; originally announced February 2022.

  28. arXiv:2201.07729  [pdf]

    cs.HC

    Ergonomics Integrated Design Methodology using Parameter Optimization, Computer-Aided Design, and Digital Human Modelling: A Case Study of a Cleaning Equipment

    Authors: Neelesh Kr. Sharma, Mayank Tiwari, Atul Thakur, Anindya K. Ganguli

    Abstract: Challenges of enhancing productivity by amplifying efficiency and man-machine compatibility of equipment can be achieved by adopting advanced technologies. This study aims to present and exemplify a methodology for incorporating ergonomics pro-actively into the design using computer-aided design and digital human modeling-based analysis. The cleaning equipment is parametrized to detect the critical…

    Submitted 5 April, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: page count: 33; word count (Excluding references and abstract): 5413; abstract word count: 161; number of figures: 11; number of tables: 3

  29. A Hybrid Science-Guided Machine Learning Approach for Modeling and Optimizing Chemical Processes

    Authors: Niket Sharma, Y. A. Liu

    Abstract: This study presents a broad perspective of hybrid process modeling and optimization combining the scientific knowledge and data analytics in bioprocessing and chemical engineering with a science-guided machine learning (SGML) approach. We divide the approach into two major categories. The first refers to the case where a data-based ML model complements and makes the first-principle science-based m…

    Submitted 24 January, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 29 pages, 12 figures, 1 table

  30. arXiv:2111.11885  [pdf, other]

    cs.CR cs.NI

    SACRIFICE: A Secure Road Condition Monitoring Scheme over Fog-based VANETs

    Authors: Nishttha Sharma, Jayasree Sengupta, Sipra Das Bit

    Abstract: With the rapid growth of Vehicular Ad-Hoc Networks (VANETs), huge amounts of road condition data are constantly being generated and sent to the cloud for processing. However, this introduces a significant load on the network bandwidth, causing delay in the network, and for a time-critical application like VANET such delay may have severe impact on real-time traffic management. This delay may be reduc…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 9 pages, 6 figures, 6 tables. Accepted at 14th International Conference on COMmunication Systems & NETworkS (COMSNETS 2022)

  31. arXiv:2110.02531  [pdf, other]

    cs.CV

    3D-FCT: Simultaneous 3D Object Detection and Tracking Using Feature Correlation

    Authors: Naman Sharma, Hocksoon Lim

    Abstract: 3D object detection using LiDAR data remains a key task for applications like autonomous driving and robotics. Unlike in the case of 2D images, LiDAR data is almost always collected over a period of time. However, most work in this area has focused on performing detection independent of the temporal domain. In this paper we present 3D-FCT, a Siamese network architecture that utilizes temporal info…

    Submitted 6 October, 2021; originally announced October 2021.

    ACM Class: I.2.10; I.4.0

  32. arXiv:2110.01177  [pdf, other]

    eess.AS cs.SD q-bio.QM

    The Second DiCOVA Challenge: Dataset and performance analysis for COVID-19 diagnosis using acoustics

    Authors: Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Debarpan Bhattacharya, Debottam Dutta, Pravin Mote, Sriram Ganapathy

    Abstract: The Second Diagnosis of COVID-19 using Acoustics (DiCOVA) Challenge aimed at accelerating the research in acoustics based detection of COVID-19, a topic at the intersection of acoustics, signal processing, machine learning, and healthcare. This paper presents the details of the challenge, which was an open call for researchers to analyze a dataset of audio recordings consisting of breathing, cough…

    Submitted 11 October, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

  33. Interactive GIS Web-Atlas for Twelve Pacific Islands Countries

    Authors: Fabrice Lartigou, Michael Govorov, Tofiga Aisake, Pankajeshwara N. Sharma

    Abstract: This article deals with the development of an interactive up-to-date Pacific Islands Web GIS Atlas. It focuses on the compilation of spatial data from the twelve member countries of the University of the South Pacific (Cook Islands, Fiji Islands, Kiribati Islands, Marshall Islands, Nauru, Niue, Tonga, Tuvalu, Tokelau, Solomon Islands, Vanuatu, and Western Samoa). A previous bitmap web Atlas was cr…

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: Project report article to Intergraph. (GeoMedia Research Laboratory Initiative)

  34. arXiv:2107.03263  [pdf, other]

    cs.LG

    Episodic Bandits with Stochastic Experts

    Authors: Nihal Sharma, Soumya Basu, Karthikeyan Shanmugam, Sanjay Shakkottai

    Abstract: We study a version of the contextual bandit problem where an agent can intervene through a set of stochastic expert policies. The agent interacts with the environment over episodes, with each episode having different context distributions; this results in the 'best expert' changing across episodes. Our goal is to develop an agent that tracks the best expert over episodes. We introduce the Empirica…

    Submitted 26 October, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

  35. Mentorship Network Structure: How Relationships Emerge Online and What They Mean for Amateur Creators

    Authors: Ruby Davis, Jenna Frens, Niharika Sharma, Meena Devii Muralikumar, Cecilia Aragon, Sarah Evans

    Abstract: Relationships form the core of connected learning. In this study, we apply and extend social network analysis methods to uncover the layered network structure of relationships among Fanfiction.net authors and reviewers. Fanfiction.net, one of the world's largest fanfiction communities, is a space where millions of young people engage with written media, connect over shared interests, and receive s…

    Submitted 26 June, 2021; originally announced June 2021.

    Comments: 10 pages, 2 figures, published in The Proceedings of the 2020 Connected Learning Summit

    Journal ref: Jeremiah H. Kalir and Danielle Filipiak. 2021. Proceedings of the 2020 Connected Learning Summit. 36-45

  36. arXiv:2106.10997  [pdf, other]

    eess.AS cs.SD

    Towards sound based testing of COVID-19 -- Summary of the first Diagnostics of COVID-19 using Acoustics (DiCOVA) Challenge

    Authors: Neeraj Kumar Sharma, Ananya Muguli, Prashant Krishnan, Rohit Kumar, Srikanth Raj Chetupalli, Sriram Ganapathy

    Abstract: The technology development for point-of-care tests (POCTs) targeting respiratory diseases has witnessed a growing demand in the recent past. Investigating the presence of acoustic biomarkers in modalities such as cough, breathing and speech sounds, and using them for building POCTs can offer fast, contactless and inexpensive testing. In view of this, over the past year, we launched the "Coswara"…

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: Manuscript in review in the Elsevier Computer Speech and Language journal

  37. Influence of Roles in Decision-Making during OSS Development -- A Study of Python

    Authors: Pankajeshwara Nand Sharma, Bastin Tony Roy Savarimuthu, Nigel Stanger

    Abstract: Governance has been highlighted as a key factor in the success of an Open Source Software (OSS) project. It is generally seen that in a mixed meritocracy and autocracy governance model, the decision-making (DM) responsibility regarding what features are included in the OSS is shared among members from select roles; prominently the project leader. However, less examination has been made whether mem…

    Submitted 3 June, 2021; originally announced June 2021.

  38. arXiv:2106.00639  [pdf, other]

    eess.AS cs.SD eess.SP

    Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms

    Authors: Srikanth Raj Chetupalli, Prashant Krishnan, Neeraj Sharma, Ananya Muguli, Rohit Kumar, Viral Nanda, Lancelot Mark Pinto, Prasanta Kumar Ghosh, Sriram Ganapathy

    Abstract: The research direction of identifying acoustic bio-markers of respiratory diseases has received renewed interest following the onset of the COVID-19 pandemic. In this paper, we design an approach to COVID-19 diagnostics using crowd-sourced multi-modal data. The data resource, consisting of acoustic signals like cough, breathing, and speech signals, along with the data of symptoms, are recorded using a…

    Submitted 5 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: The Manuscript is submitted to IEEE-EMBS Journal of Biomedical and Health Informatics on June 1, 2021

  39. arXiv:2104.12862  [pdf]

    eess.IV cs.CV cs.LG

    A digital score of tumour-associated stroma infiltrating lymphocytes predicts survival in head and neck squamous cell carcinoma

    Authors: Muhammad Shaban, Shan E Ahmed Raza, Mariam Hassan, Arif Jamshed, Sajid Mushtaq, Asif Loya, Nikolaos Batis, Jill Brooks, Paul Nankivell, Neil Sharma, Max Robinson, Hisham Mehanna, Syed Ali Khurram, Nasir Rajpoot

    Abstract: The infiltration of T-lymphocytes in the stroma and tumour is an indication of an effective immune response against the tumour, resulting in better survival. In this study, our aim is to explore the prognostic significance of tumour-associated stroma infiltrating lymphocytes (TASILs) in head and neck squamous cell carcinoma (HNSCC) through an AI based automated method. A deep learning based automa…

    Submitted 16 April, 2021; originally announced April 2021.

  40. arXiv:2103.09148  [pdf, other]

    eess.AS cs.SD

    DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics

    Authors: Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Sharma, Prashant Krishnan, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda

    Abstract: The DiCOVA challenge aims at accelerating research in diagnosing COVID-19 using acoustics (DiCOVA), a topic at the intersection of speech and audio processing, respiratory health diagnosis, and machine learning. This challenge is an open call for researchers to analyze a dataset of sound recordings collected from COVID-19 infected and non-COVID-19 individuals for a two-class classification. These…

    Submitted 17 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: To appear in Proceedings of Interspeech, 2021

  41. arXiv:2102.05232  [pdf, other]

    cs.SE

    Extracting Rationale for Open Source Software Development Decisions -- A Study of Python Email Archives

    Authors: Pankajeshwara Nand Sharma, Bastin Tony Roy Savarimuthu, Nigel Stanger

    Abstract: A sound Decision-Making (DM) process is key to the successful governance of software projects. In many Open Source Software Development (OSSD) communities, DM processes lie buried amongst vast amounts of publicly available data. Hidden within this data lies the rationale for decisions that led to the evolution and maintenance of software products. While there have been some efforts to extract DM pr…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 12 pages, 5 figures, 3 tables, appears in the proceedings of the 43rd International Conference on Software Engineering (ICSE 2021)

    ACM Class: D.2

  42. arXiv:2102.03749  [pdf, other]

    cs.IR

    Role of Attentive History Selection in Conversational Information Seeking

    Authors: Somil Gupta, Neeraj Sharma

    Abstract: The rise of intelligent assistant systems like Siri and Alexa has led to the emergence of Conversational Search, a research track of Information Retrieval (IR) that involves interactive and iterative information-seeking user-system dialog. Recently released OR-QuAC and TCAsT19 datasets narrow their research focus on the retrieval aspect of conversational search, i.e., fetching the relevant document…

    Submitted 7 February, 2021; originally announced February 2021.

  43. arXiv:2011.07124  [pdf, other]

    astro-ph.IM astro-ph.GA cs.LG eess.IV

    Survey2Survey: A deep learning generative model approach for cross-survey image mapping

    Authors: Brandon Buncher, Awshesh Nath Sharma, Matias Carrasco Kind

    Abstract: During the last decade, there has been an explosive growth in survey data and deep learning techniques, both of which have enabled great advances for astronomy. The amount of data from various surveys from multiple epochs with a wide range of wavelengths, albeit with varying brightness and quality, is overwhelming, and leveraging information from overlapping observations from different surveys has…

    Submitted 5 February, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: 24 pages, 19 figures. Accepted by MNRAS

  44. arXiv:2009.13978  [pdf, ps, other]

    cs.CR

    Anonymous proof-of-asset transactions using designated blind signatures

    Authors: Neetu Sharma, Rajeev Anand Sahu, Vishal Saraswat, Joaquin Garcia-Alfaro

    Abstract: We propose a scheme to preserve the anonymity of users in proof-of-asset transactions. We assume bitcoin-like cryptocurrency systems in which a user must prove the strength of its assets (i.e., solvency) prior to conducting further transactions. The traditional way of addressing such a problem is the use of blind signatures, i.e., a kind of digital signature whose properties satisfy the anonymity of…

    Submitted 26 October, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: 17 pages, extended conference version

  45. arXiv:2006.00364  [pdf, other]

    cs.AR

    CLARINET: A RISC-V Based Framework for Posit Arithmetic Empiricism

    Authors: Niraj Sharma, Riya Jain, Madhumita Mohan, Sachin Patkar, Rainer Leupers, Nikhil Rishiyur, Farhad Merchant

    Abstract: Many engineering and scientific applications require high precision arithmetic. IEEE 754-2008 compliant (floating-point) arithmetic is the de facto standard for performing these computations. Recently, posit arithmetic has been proposed as a drop-in replacement for floating-point arithmetic. The posit™ data representation and arithmetic claim several absolute advantages over the float…

    Submitted 27 October, 2021; v1 submitted 30 May, 2020; originally announced June 2020.

  46. Coswara -- A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis

    Authors: Neeraj Sharma, Prashant Krishnan, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Nirmala R., Prasanta Kumar Ghosh, Sriram Ganapathy

    Abstract: The COVID-19 pandemic presents global challenges transcending boundaries of country, race, religion, and economy. The current gold standard method for COVID-19 detection is the reverse transcription polymerase chain reaction (RT-PCR) testing. However, this method is expensive, time-consuming, and violates social distancing. Also, as the pandemic is expected to stay for a while, there is a need for…

    Submitted 11 August, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: A description of Coswara dataset to evaluate COVID-19 diagnosis using respiratory sounds

  47. Temporal Attribute Prediction via Joint Modeling of Multi-Relational Structure Evolution

    Authors: Sankalp Garg, Navodita Sharma, Woojeong Jin, Xiang Ren

    Abstract: Time series prediction is an important problem in machine learning. Previous methods for time series prediction did not involve additional information. With a lot of dynamic knowledge graphs available, we can use this additional information to predict the time series better. Recently, there has been a focus on the application of deep representation learning on dynamic graphs. These methods predict…

    Submitted 13 July, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: In Proceedings of IJCAI 2020. Code can be found at https://github.com/INK-USC/DArtNet . The sole copyright holder is IJCAI (International Joint Conferences on Artificial Intelligence), all rights reserved. Original Publication available at https://www.ijcai.org/Proceedings/2020/386

  48. arXiv:2003.02503  [pdf]

    cs.NI

    Dual link failure survivability with recovery time constraint: A Parallel cross connection backup route recovery strategy

    Authors: Dinesh Kumar, Rajiv Kumar, Neeru Sharma

    Abstract: In this paper, we propose a fast recovery strategy for a dual link failure in an elastic optical network. The elastic optical network is a promising solution to meet the next generation's higher bandwidth demand. The survivability of a high-speed network is crucial. As the network size increases, the probability of dual link failure and node failure also increases. Here, we proposed a parallel c…

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: 12 pages, 5 figures

    MSC Class: no

  49. arXiv:2002.08405  [pdf, other]

    cs.LG stat.ML

    On Under-exploration in Bandits with Mean Bounds from Confounded Data

    Authors: Nihal Sharma, Soumya Basu, Karthikeyan Shanmugam, Sanjay Shakkottai

    Abstract: We study a variant of the multi-armed bandit problem where side information in the form of bounds on the mean of each arm is provided. We develop the novel non-optimistic Global Under-Explore (GLUE) algorithm which uses the provided mean bounds (across all the arms) to infer pseudo-variances for each arm, which in turn decide the rate of exploration for the arms. We analyze the regret of GLUE and…

    Submitted 10 June, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

  50. arXiv:1909.02481  [pdf, other]

    cs.PL cs.CR

    Duet: An Expressive Higher-order Language and Linear Type System for Statically Enforcing Differential Privacy

    Authors: Joseph P. Near, David Darais, Chike Abuah, Tim Stevens, Pranav Gaddamadugu, Lun Wang, Neel Somani, Mu Zhang, Nikhil Sharma, Alex Shan, Dawn Song

    Abstract: During the past decade, differential privacy has become the gold standard for protecting the privacy of individuals. However, verifying that a particular program provides differential privacy often remains a manual task to be completed by an expert in the field. Language-based techniques have been proposed for fully automating proofs of differential privacy via type system design, however these re…

    Submitted 5 September, 2019; originally announced September 2019.

    Comments: Extended version of OOPSLA 2019 paper