Skip to main content

Showing 1–13 of 13 results for author: Rajamani, S

  1. arXiv:2309.12938  [pdf, other

    cs.AI cs.SE

    Frustrated with Code Quality Issues? LLMs can Help!

    Authors: Nalin Wadhwa, Jui Pradhan, Atharv Sonwane, Surya Prakash Sahu, Nagarajan Natarajan, Aditya Kanade, Suresh Parthasarathy, Sriram Rajamani

    Abstract: As software projects progress, quality of code assumes paramount importance as it affects reliability, maintainability and security of software. For this reason, static analysis tools are used in developer workflows to flag code quality issues. However, developers need to spend extra efforts to revise their code to improve code quality based on the tool findings. In this work, we investigate the u… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  2. arXiv:2309.12499  [pdf, other

    cs.SE

    CodePlan: Repository-level Coding using LLMs and Planning

    Authors: Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet

    Abstract: Software engineering activities such as package migration, fixing errors reports from static analysis or testing, and adding type annotations or other specifications to a codebase, involve pervasively editing the entire repository of code. We formulate these activities as repository-level coding tasks. Recent tools like GitHub Copilot, which are powered by Large Language Models (LLMs), have succ… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  3. arXiv:2309.09495  [pdf, other

    cs.HC cs.SE

    PwR: Exploring the Role of Representations in Conversational Programming

    Authors: Pradyumna YM, Vinod Ganesan, Dinesh Kumar Arumugam, Meghna Gupta, Nischith Shadagopan, Tanay Dixit, Sameer Segal, Pratyush Kumar, Mohit Jain, Sriram Rajamani

    Abstract: Large Language Models (LLMs) have revolutionized programming and software engineering. AI programming assistants such as GitHub Copilot X enable conversational programming, narrowing the gap between human intent and code generation. However, prior literature has identified a key challenge--there is a gap between user's mental model of the system's understanding after a sequence of natural language… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 23 pages, 3 figures, 2 tables, under submission for ACM CHI 2024

    ACM Class: H.5.2

  4. arXiv:2307.12465  [pdf, other

    cs.SE

    StaticFixer: From Static Analysis to Static Repair

    Authors: Naman Jain, Shubham Gandhi, Atharv Sonwane, Aditya Kanade, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Static analysis tools are traditionally used to detect and flag programs that violate properties. We show that static analysis tools can also be used to perturb programs that satisfy a property to construct variants that violate the property. Using this insight we can construct paired data sets of unsafe-safe program pairs, and learn strategies to automatically repair property violations. We prese… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  5. arXiv:2306.10763  [pdf, other

    cs.CL cs.AI cs.LG cs.PL cs.SE

    Guiding Language Models of Code with Global Context using Monitors

    Authors: Lakshya A Agrawal, Aditya Kanade, Navin Goyal, Shuvendu K. Lahiri, Sriram K. Rajamani

    Abstract: Language models of code (LMs) work well when the surrounding code provides sufficient context. This is not true when it becomes necessary to use types, functionality or APIs defined elsewhere in the repository or a linked library, especially those not seen during training. LMs suffer from limited awareness of such global context and end up hallucinating. Integrated development environments (IDEs… ▽ More

    Submitted 3 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 and to appear as "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context" at https://neurips.cc/virtual/2023/poster/70362 . Contents: 11 pages, 15 additional pages of appendix, 13 figures, 3 tables

    ACM Class: I.2.2; I.2.7; I.2.5

  6. arXiv:2301.10477  [pdf, other

    cs.SD cs.CY eess.AS

    HEAR4Health: A blueprint for making computer audition a staple of modern healthcare

    Authors: Andreas Triantafyllopoulos, Alexander Kathan, Alice Baird, Lukas Christ, Alexander Gebhard, Maurice Gerczuk, Vincent Karas, Tobias Hübner, Xin Jing, Shuo Liu, Adria Mallol-Ragolta, Manuel Milling, Sandra Ottl, Anastasia Semertzidou, Srividya Tirunellai Rajamani, Tianhao Yan, Zijiang Yang, Judith Dineley, Shahin Amiriparian, Katrin D. Bartl-Pokorny, Anton Batliner, Florian B. Pokorny, Björn W. Schuller

    Abstract: Recent years have seen a rapid increase in digital medicine research in an attempt to transform traditional healthcare systems to their modern, intelligent, and versatile equivalents that are adequately equipped to tackle contemporary challenges. This has led to a wave of applications that utilise AI technologies; first and foremost in the fields of medical imaging, but also in the use of wearable… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

  7. arXiv:2205.07861  [pdf, other

    cs.LG cs.AI

    Depression Diagnosis and Forecast based on Mobile Phone Sensor Data

    Authors: Xiangheng He, Andreas Triantafyllopoulos, Alexander Kathan, Manuel Milling, Tianhao Yan, Srividya Tirunellai Rajamani, Ludwig Küster, Mathias Harrer, Elena Heber, Inga Grossmann, David D. Ebert, Björn W. Schuller

    Abstract: Previous studies have shown the correlation between sensor data collected from mobile phones and human depression states. Compared to the traditional self-assessment questionnaires, the passive data collected from mobile phones is easier to access and less time-consuming. In particular, passive mobile phone data can be collected on a flexible time interval, thus detecting moment-by-moment psycholo… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Accepted by EMBC 2022

  8. arXiv:2205.03391  [pdf, other

    cs.LG cs.AI

    Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting

    Authors: Alexander Kathan, Andreas Triantafyllopoulos, Xiangheng He, Manuel Milling, Tianhao Yan, Srividya Tirunellai Rajamani, Ludwig Küster, Mathias Harrer, Elena Heber, Inga Grossmann, David D. Ebert, Björn W. Schuller

    Abstract: Digital health applications are becoming increasingly important for assessing and monitoring the wellbeing of people suffering from mental health conditions like depression. A common target of said applications is to predict the results of self-assessed Patient-Health-Questionnaires (PHQ), indicating current symptom severity of depressive individuals. In this work, we explore the potential of usin… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  9. arXiv:2204.05021  [pdf, other

    cs.SE cs.IR cs.PL

    Landmarks and Regions: A Robust Approach to Data Extraction

    Authors: Suresh Parthasarathy, Lincy Pattanaik, Anirudh Khatry, Arun Iyer, Arjun Radhakrishna, Sriram Rajamani, Mohammad Raza

    Abstract: We propose a new approach to extracting data items or field values from semi-structured documents. Examples of such problems include extracting passenger name, departure time and departure airport from a travel itinerary, or extracting price of an item from a purchase receipt. Traditional approaches to data extraction use machine learning or program synthesis to process the whole document to extra… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: To be published at PLDI,2022

  10. arXiv:2112.02969  [pdf, other

    cs.SE cs.PL

    Jigsaw: Large Language Models meet Program Synthesis

    Authors: Naman Jain, Skanda Vaidyanath, Arun Iyer, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Large pre-trained language models such as GPT-3, Codex, and Google's language model are now capable of generating code from natural language specifications of programmer intent. We view these developments with a mixture of optimism and caution. On the optimistic side, such large language models have the potential to improve productivity by providing an automated AI pair programmer for every progra… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to ICSE'22

  11. arXiv:2007.06835  [pdf, other

    cs.LG cs.AI cs.PL cs.SE stat.ML

    Programming by Rewards

    Authors: Nagarajan Natarajan, Ajaykrishna Karthikeyan, Prateek Jain, Ivan Radicek, Sriram Rajamani, Sumit Gulwani, Johannes Gehrke

    Abstract: We formalize and study ``programming by rewards'' (PBR), a new approach for specifying and synthesizing subroutines for optimizing some quantitative metric such as performance, resource utilization, or correctness over a benchmark. A PBR specification consists of (1) input features $x$, and (2) a reward function $r$, modeled as a black-box component (which we can only run), that assigns a reward f… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  12. arXiv:1603.07292  [pdf, other

    cs.LG cs.AI cs.PL stat.ML

    Debugging Machine Learning Tasks

    Authors: Aleksandar Chakarov, Aditya Nori, Sriram Rajamani, Shayak Sen, Deepak Vijaykeerthy

    Abstract: Unlike traditional programs (such as operating systems or word processors) which have large amounts of code, machine learning tasks use programs with relatively small amounts of code (written in machine learning libraries), but voluminous amounts of data. Just like developers of traditional programs debug errors in their code, developers of machine learning tasks debug and fix errors in their data… ▽ More

    Submitted 23 March, 2016; originally announced March 2016.

    ACM Class: D.2.5; I.2.3

  13. arXiv:0803.3230  [pdf, ps, other

    cs.CR cs.OS cs.PL

    A Type System for Data-Flow Integrity on Windows Vista

    Authors: Avik Chaudhuri, Prasad Naldurg, Sriram Rajamani

    Abstract: The Windows Vista operating system implements an interesting model of multi-level integrity. We observe that in this model, trusted code can be blamed for any information-flow attack; thus, it is possible to eliminate such attacks by static analysis of trusted code. We formalize this model by designing a type system that can efficiently enforce data-flow integrity on Windows Vista. Typechecking… ▽ More

    Submitted 7 May, 2008; v1 submitted 21 March, 2008; originally announced March 2008.

    ACM Class: D.4.6; D.2.4; F.3.1