Skip to main content

Showing 1–24 of 24 results for author: Chaudhary, S

  1. arXiv:2404.00665  [pdf, ps, other

    cs.IT

    On cumulative and relative cumulative past information generating function

    Authors: Santosh Kumar Chaudhary, Nitin Gupta, Achintya Roy

    Abstract: In this paper, we introduce the cumulative past information generating function (CPIG) and relative cumulative past information generating function (RCPIG). We study its properties. We establish its relation with generalized cumulative past entropy (GCPE). We defined CPIG stochastic order and its relation with dispersive order. We provide the results for the CPIG measure of the convoluted random v… ▽ More

    Submitted 22 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  2. arXiv:2403.10704  [pdf, other

    cs.LG cs.AI cs.CL

    PERL: Parameter Efficient Reinforcement Learning from Human Feedback

    Authors: Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has proven to be a strong method to align Pretrained Large Language Models (LLMs) with human preferences. But training models with RLHF is computationally expensive, and an overall complex process. In this work, we study RLHF where the underlying models are trained using the parameter efficient method of Low-Rank Adaptation (LoRA) introduced by Hu… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  3. arXiv:2402.05000  [pdf, other

    cs.CL

    Pedagogical Alignment of Large Language Models

    Authors: Shashank Sonkar, Kangqi Ni, Sapana Chaudhary, Richard G. Baraniuk

    Abstract: In this paper, we introduce the novel concept of pedagogically aligned Large Language Models (LLMs) that signifies a transformative shift in the application of LLMs within educational contexts. Rather than providing direct responses to user queries, pedagogically-aligned LLMs function as scaffolding tools, breaking complex problems into manageable subproblems and guiding students towards the final… ▽ More

    Submitted 12 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2310.14164  [pdf, other

    cs.LG math.OC

    $α$-Fair Contextual Bandits

    Authors: Siddhant Chaudhary, Abhishek Sinha

    Abstract: Contextual bandit algorithms are at the core of many applications, including recommender systems, clinical trials, and optimal portfolio selection. One of the most popular problems studied in the contextual bandit literature is to maximize the sum of the rewards in each round by ensuring a sublinear regret against the best-fixed context-dependent policy. However, in many applications, the cumulati… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  5. arXiv:2305.13725  [pdf, other

    cs.CL cs.IR

    Conversational Recommendation as Retrieval: A Simple, Strong Baseline

    Authors: Raghav Gupta, Renat Aksitov, Samrat Phatale, Simral Chaudhary, Harrison Lee, Abhinav Rastogi

    Abstract: Conversational recommendation systems (CRS) aim to recommend suitable items to users through natural language conversation. However, most CRS approaches do not effectively utilize the signal provided by these conversations. They rely heavily on explicit external knowledge e.g., knowledge graphs to augment the models' understanding of the items and attributes, which is quite hard to scale. To allev… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: To appear at the 5th NLP4ConvAI workshop

  6. arXiv:2302.12320  [pdf, other

    math.OC cs.LG eess.SY

    Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems

    Authors: Ting-Jui Chang, Sapana Chaudhary, Dileep Kalathil, Shahin Shahrampour

    Abstract: This paper addresses safe distributed online optimization over an unknown set of linear safety constraints. A network of agents aims at jointly minimizing a global, time-varying function, which is only partially observable to each individual agent. Therefore, agents must engage in local communications to generate a safe sequence of actions competitive with the best minimizer sequence in hindsight,… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  7. arXiv:2301.00659  [pdf, ps, other

    cs.IT math.CA math.ST

    On partial monotonicity of some extropy measures

    Authors: Nitin Gupta, Santosh Kumar Chaudhary

    Abstract: Gupta and Chaudhary [14] introduced general weighted extropy and studied related properties. In this paper, we study conditional extropy and define the monotonic behaviour of conditional extropy. Also, we obtain results on the convolution of general weighted extropy.

    Submitted 29 November, 2022; originally announced January 2023.

    MSC Class: 94A17; 62N05; 60E15

  8. arXiv:2209.14222  [pdf, ps, other

    cs.LG cs.AI cs.GT

    Online Subset Selection using $α$-Core with no Augmented Regret

    Authors: Sourav Sahoo, Siddhant Chaudhary, Samrat Mukhopadhyay, Abhishek Sinha

    Abstract: We revisit the classic problem of optimal subset selection in the online learning set-up. Assume that the set $[N]$ consists of $N$ distinct elements. On the $t$th round, an adversary chooses a monotone reward function $f_t: 2^{[N]} \to \mathbb{R}_+$ that assigns a non-negative reward to each subset of $[N].$ An online policy selects (perhaps randomly) a subset $S_t \subseteq [N]$ consisting of… ▽ More

    Submitted 9 February, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  9. arXiv:2209.13048  [pdf, other

    cs.LG cs.RO

    Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

    Authors: Desik Rengarajan, Sapana Chaudhary, Jaewon Kim, Dileep Kalathil, Srinivas Shakkottai

    Abstract: Meta reinforcement learning (Meta-RL) is an approach wherein the experience gained from solving a variety of tasks is distilled into a meta-policy. The meta-policy, when adapted over only a small (or just a single) number of steps, is able to perform near-optimally on a new, related task. However, a major challenge to adopting this approach to solve real-world problems is that they are often assoc… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022; first two authors contributed equally

  10. arXiv:2208.12964  [pdf

    eess.IV cs.CE

    Uniformly Sampled Polar and Cylindrical Grid Approach for 2D, 3D Image Reconstruction using Algebraic Algorithm

    Authors: Sudhir Kumar Chaudhary, Pankaj Wahi, Prabhat Munshi

    Abstract: Image reconstruction by Algebraic Methods (AM) outperforms the transform methods in situations where the data collection procedure is constrained by time, space, and radiation dose. AM algorithms can also be applied for the cases where these constraints are not present but their high computational and storage requirement prohibit their actual breakthrough in such cases. In the present work, we pro… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: 16 figures, 5 tables

  11. arXiv:2203.11977  [pdf, other

    cs.NI eess.SY

    YouTube over Google's QUIC vs Internet Middleboxes: A Tug of War between Protocol Sustainability and Application QoE

    Authors: Sapna Chaudhary, Prince Sachdeva, Abhijit Mondal, Sandip Chakraborty, Mukulika Maity

    Abstract: Middleboxes such as web proxies, firewalls, etc. are widely deployed in today's network infrastructure. As a result, most protocols need to adapt their behavior to co-exist. One of the most commonly used transport protocols, QUIC, adapts to such middleboxes by falling back to TCP, where they block it. In this paper, we argue that the blind fallback behavior of QUIC, i.e., not distinguishing betwee… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

  12. arXiv:2111.07430  [pdf, other

    cs.LG math.OC

    Safe Online Convex Optimization with Unknown Linear Safety Constraints

    Authors: Sapana Chaudhary, Dileep Kalathil

    Abstract: We study the problem of safe online convex optimization, where the action at each time step must satisfy a set of linear safety constraints. The goal is to select a sequence of actions to minimize the regret without violating the safety constraints at any time step (with high probability). The parameters that specify the linear safety constraints are unknown to the algorithm. The algorithm has acc… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

    Comments: 18 pages

  13. Smooth Imitation Learning via Smooth Costs and Smooth Policies

    Authors: Sapana Chaudhary, Balaraman Ravindran

    Abstract: Imitation learning (IL) is a popular approach in the continuous control setting as among other reasons it circumvents the problems of reward mis-specification and exploration in reinforcement learning (RL). In IL from demonstrations, an important challenge is to obtain agent policies that are smooth with respect to the inputs. Learning through imitation a policy that is smooth as a function of a l… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: To appear in the Proceedings of the Fifth Joint International Conference on Data Science and Management of Data (CoDS-COMAD 2022). Research Track. ACM DL

  14. arXiv:2104.05121  [pdf, other

    eess.IV cs.CV cs.LG

    Detecting COVID-19 and Community Acquired Pneumonia using Chest CT scan images with Deep Learning

    Authors: Shubham Chaudhary, Sadbhawna, Vinit Jakhetiya, Badri N Subudhi, Ujjwal Baid, Sharath Chandra Guntuku

    Abstract: We propose a two-stage Convolutional Neural Network (CNN) based classification framework for detecting COVID-19 and Community-Acquired Pneumonia (CAP) using the chest Computed Tomography (CT) scan images. In the first stage, an infection - COVID-19 or CAP, is detected using a pre-trained DenseNet architecture. Then, in the second stage, a fine-grained three-way classification is done using Efficie… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: Top Ranked Model Paper at the ICASSP 2021 COVID-19 Grand Challenge

  15. arXiv:2103.09052  [pdf, other

    cs.LG

    Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes

    Authors: Siddharth Nishtala, Lovish Madaan, Aditya Mate, Harshavardhan Kamarthi, Anirudh Grama, Divy Thakkar, Dhyanesh Narayanan, Suresh Chaudhary, Neha Madhiwalla, Ramesh Padmanabhan, Aparna Hegde, Pradeep Varakantham, Balaraman Ravindran, Milind Tambe

    Abstract: India has a maternal mortality ratio of 113 and child mortality ratio of 2830 per 100,000 live births. Lack of access to preventive care information is a major contributing factor for these deaths, especially in low resource households. We partner with ARMMAN, a non-profit based in India employing a call-based information program to disseminate health-related information to pregnant women and wome… ▽ More

    Submitted 18 October, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

    Comments: 7 pages. Camera-ready version for AASG 2021 Workshop

  16. arXiv:2012.11510  [pdf

    cs.LG

    Design Rule Checking with a CNN Based Feature Extractor

    Authors: Luis Francisco, Tanmay Lagare, Arpit Jain, Somal Chaudhary, Madhura Kulkarni, Divya Sardana, W. Rhett Davis, Paul Franzon

    Abstract: Design rule checking (DRC) is getting increasingly complex in advanced nodes technologies. It would be highly desirable to have a fast interactive DRC engine that could be used during layout. In this work, we establish the proof of feasibility for such an engine. The proposed model consists of a convolutional neural network (CNN) trained to detect DRC violations. The model was trained with artific… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

  17. arXiv:2007.04673  [pdf, other

    cs.CR

    The Road Not Taken: Re-thinking the Feasibility of Voice Calling Over Tor

    Authors: Piyush Kumar Sharma, Shashwat Chaudhary, Nikhil Hassija, Mukulika Maity, Sambuddho Chakravarty

    Abstract: Anonymous VoIP calls over the Internet holds great significance for privacy-conscious users, whistle-blowers and political activists alike. Prior research deems popular anonymization systems like Tor unsuitable for providing requisite performance guarantees that real-time applications like VoIP need. Their claims are backed by studies that may no longer be valid due to constant advancements in Tor… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

  18. arXiv:2006.07590  [pdf, other

    cs.CY cs.LG

    Missed calls, Automated Calls and Health Support: Using AI to improve maternal health outcomes by increasing program engagement

    Authors: Siddharth Nishtala, Harshavardhan Kamarthi, Divy Thakkar, Dhyanesh Narayanan, Anirudh Grama, Aparna Hegde, Ramesh Padmanabhan, Neha Madhiwalla, Suresh Chaudhary, Balaraman Ravindran, Milind Tambe

    Abstract: India accounts for 11% of maternal deaths globally where a woman dies in childbirth every fifteen minutes. Lack of access to preventive care information is a significant problem contributing to high maternal morbidity and mortality numbers, especially in low-income households. We work with ARMMAN, a non-profit based in India, to further the use of call-based information programs by early-on identi… ▽ More

    Submitted 6 July, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

  19. arXiv:1705.07848  [pdf, other

    cs.SE

    A Testbed for Experimenting Internet of Things Applications

    Authors: Parthkumar Patel, Jayraj Dave, Shreedhar Dalal, Pankesh Patel, Sanjay Chaudhary

    Abstract: The idea of IoT world has grown to multiple dimensions enclosing different technologies and standards which can provide solutions and goal oriented intelligence to the widespread things via network or internet. In spite of different advancement in technology, challenges related to assessment of IoT solutions under real scenarios and empirical deployments still hinder their evolvement and significa… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

    Comments: 13 pages

  20. arXiv:1606.02849  [pdf, ps, other

    cs.IT

    Time Optimal Spectrum Sensing

    Authors: Garimella Rama Murthy, Rhishi Pratap Singh, Samdarshi Abhijeet, Sachin Chaudhary

    Abstract: Spectrum sensing is a fundamental operation in cognitive radio environment. It gives information about spectrum availability by scanning the bands. Usually a fixed amount of time is given to scan individual bands. Most of the times, historical information about the traffic in the spectrum bands is not used. But this information gives the idea, how busy a specific band is. Therefore, instead of sca… ▽ More

    Submitted 9 June, 2016; originally announced June 2016.

  21. Developing Postfix-GP Framework for Symbolic Regression Problems

    Authors: Vipul K. Dabhi, Sanjay Chaudhary

    Abstract: This paper describes Postfix-GP system, postfix notation based Genetic Programming (GP), for solving symbolic regression problems. It presents an object-oriented architecture of Postfix-GP framework. It assists the user in understanding of the implementation details of various components of Postfix-GP. Postfix-GP provides graphical user interface which allows user to configure the experiment, to v… ▽ More

    Submitted 7 July, 2015; originally announced July 2015.

    Comments: 8 pages, 6 figures

  22. arXiv:1409.6470  [pdf, other

    cs.SI physics.soc-ph

    An Efficient Heuristic for Betweenness-Ordering

    Authors: Rishi Ranjan Singh, Shubham Chaudhary, Manas Agarwal

    Abstract: Centrality measures, erstwhile popular amongst the sociologists and psychologists, have seen broad and increasing applications across several disciplines of late. Amongst a plethora of application specific definitions available in the literature to rank the vertices, closeness centrality, betweenness centrality and eigenvector centrality (page-rank) have been the most important and widely applied… ▽ More

    Submitted 22 March, 2017; v1 submitted 23 September, 2014; originally announced September 2014.

    Comments: This is an expanded and extended version of the results appeared in CompleNet 2015

  23. A Survey on Techniques of Improving Generalization Ability of Genetic Programming Solutions

    Authors: Vipul K. Dabhi, Sanjay Chaudhary

    Abstract: In the field of empirical modeling using Genetic Programming (GP), it is important to evolve solution with good generalization ability. Generalization ability of GP solutions get affected by two important issues: bloat and over-fitting. We surveyed and classified existing literature related to different techniques used by GP research community to deal with these issues. We also point out limitatio… ▽ More

    Submitted 6 November, 2012; originally announced November 2012.

  24. arXiv:0904.2769  [pdf, ps, other

    cs.SE

    Non Homogeneous Poisson Process Model based Optimal Modular Software Testing using Fault Tolerance

    Authors: Amit K Awasthi, Sanjay Chaudhary

    Abstract: In software development process we come across various modules. Which raise the idea of priority of the different modules of a software so that important modules are tested on preference. This approach is desirable because it is not possible to test each module regressively due to time and cost constraints. This paper discuss on some parameters, required to prioritize several modules of a softwa… ▽ More

    Submitted 10 May, 2009; v1 submitted 17 April, 2009; originally announced April 2009.