Skip to main content

Showing 1–20 of 20 results for author: Hellendoorn, V

  1. arXiv:2404.15236  [pdf, other

    cs.SE

    Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models

    Authors: Aidan Z. H. Yang, Sophia Kolak, Vincent J. Hellendoorn, Ruben Martins, Claire Le Goues

    Abstract: Language models have improved by orders of magnitude with the recent emergence of Transformer-based Large Language Models (LLMs). LLMs have demonstrated their ability to generate natural code that is highly similar to code written by professional developers. One intermediate value an LLM can emit is entropy, which measures the naturalness of a token of code. We hypothesize that entropy can be used… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  2. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2311.00931  [pdf, other

    cs.LG

    Learning Defect Prediction from Unrealistic Data

    Authors: Kamel Alrashedy, Vincent J. Hellendoorn, Alessandro Orso

    Abstract: Pretrained models of code, such as CodeBERT and CodeT5, have become popular choices for code understanding and generation tasks. Such models tend to be large and require commensurate volumes of training data, which are rarely available for downstream tasks. Instead, it has become popular to train models with far larger but less realistic datasets, such as functions with artificially injected bugs.… ▽ More

    Submitted 20 January, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  5. arXiv:2310.01726  [pdf, other

    cs.SE cs.LG

    Large Language Models for Test-Free Fault Localization

    Authors: Aidan Z. H. Yang, Ruben Martins, Claire Le Goues, Vincent J. Hellendoorn

    Abstract: Fault Localization (FL) aims to automatically localize buggy lines of code, a key first step in many manual and automatic debugging tasks. Previous FL techniques assume the provision of input tests, and often require extensive program analysis, program instrumentation, or data preprocessing. Prior work on deep learning for APR struggles to learn from small datasets and produces limited results on… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  6. arXiv:2310.01602  [pdf, other

    cs.SE cs.AI

    CAT-LM: Training Language Models on Aligned Code And Tests

    Authors: Nikitha Rao, Kush Jain, Uri Alon, Claire Le Goues, Vincent J. Hellendoorn

    Abstract: Testing is an integral part of the software development process. Yet, writing tests is time-consuming and therefore often neglected. Classical test generation tools such as EvoSuite generate behavioral test suites by optimizing for coverage, but tend to produce tests that are hard to understand. Language models trained on code can generate code that is highly similar to that written by humans, but… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  7. arXiv:2307.08177  [pdf, other

    cs.SE cs.AI cs.HC

    Using an LLM to Help With Code Understanding

    Authors: Daye Nam, Andrew Macvean, Vincent Hellendoorn, Bogdan Vasilescu, Brad Myers

    Abstract: Understanding code is challenging, especially when working in new and complex development environments. Code comments and documentation can help, but are typically scarce or hard to navigate. Large language models (LLMs) are revolutionizing the process of writing code. Can they do the same for helping understand it? In this study, we provide a first investigation of an LLM-based conversational UI… ▽ More

    Submitted 16 January, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

  8. arXiv:2306.03268  [pdf, other

    cs.CL cs.SE

    "Medium" LMs of Code in the Era of LLMs: Lessons From StackOverflow

    Authors: Manisha Mukherjee, Vincent J. Hellendoorn

    Abstract: Large pre-trained neural language models have brought immense progress to both NLP and software engineering. Models in OpenAI's GPT series now dwarf Google's BERT and Meta's RoBERTa, which previously set new benchmarks on a wide range of NLP applications. These models are trained on massive corpora of heterogeneous data from web crawls, which enables them to learn general language patterns and sem… ▽ More

    Submitted 24 January, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

  9. arXiv:2305.20015  [pdf, other

    cs.SE cs.AI

    AI for Low-Code for AI

    Authors: Nikitha Rao, Jason Tsay, Kiran Kate, Vincent J. Hellendoorn, Martin Hirzel

    Abstract: Low-code programming allows citizen developers to create programs with minimal coding effort, typically via visual (e.g. drag-and-drop) interfaces. In parallel, recent AI-powered tools such as Copilot and ChatGPT generate programs from natural language instructions. We argue that these modalities are complementary: tools like ChatGPT greatly reduce the need to memorize large APIs but still require… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  10. arXiv:2210.16886  [pdf, other

    cs.CL cs.LG

    DiffusER: Discrete Diffusion via Edit-based Reconstruction

    Authors: Machel Reid, Vincent J. Hellendoorn, Graham Neubig

    Abstract: In text generation, models that generate text from scratch one token at a time are currently the dominant paradigm. Despite being performant, these models lack the ability to revise existing text, which limits their usability in many practical scenarios. We look to address this, with DiffusER (Diffusion via Edit-based Reconstruction), a new edit-based generative model for text based on denoising d… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Preprint. Work in progress

  11. arXiv:2208.07461  [pdf, other

    cs.LG cs.PL cs.SE

    A Library for Representing Python Programs as Graphs for Machine Learning

    Authors: David Bieber, Kensen Shi, Petros Maniatis, Charles Sutton, Vincent Hellendoorn, Daniel Johnson, Daniel Tarlow

    Abstract: Graph representations of programs are commonly a central element of machine learning for code research. We introduce an open source Python library python_graphs that applies static analysis to construct graph representations of Python programs suitable for training machine learning models. Our library admits the construction of control-flow graphs, data-flow graphs, and composite ``program graphs'… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 21 pages, 14 figures

  12. Comments on Comments: Where Code Review and Documentation Meet

    Authors: Nikitha Rao, Jason Tsay, Martin Hirzel, Vincent J. Hellendoorn

    Abstract: A central function of code review is to increase understanding; helping reviewers understand a code change aids in knowledge transfer and finding bugs. Comments in code largely serve a similar purpose, helping future readers understand the program. It is thus natural to study what happens when these two forms of understanding collide. We ask: what documentation-related comments do reviewers make a… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

  13. arXiv:2202.13169  [pdf, other

    cs.PL cs.CL

    A Systematic Evaluation of Large Language Models of Code

    Authors: Frank F. Xu, Uri Alon, Graham Neubig, Vincent J. Hellendoorn

    Abstract: Large language models (LMs) of code have recently shown tremendous promise in completing code and synthesizing code from natural language descriptions. However, the current state-of-the-art code LMs (e.g., Codex (Chen et al., 2021)) are not publicly available, leaving many questions about their model and data design decisions. We aim to fill in some of these blanks through a systematic evaluation… ▽ More

    Submitted 4 May, 2022; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: DL4C@ICLR 2022, and MAPS@PLDI 2022

  14. arXiv:2110.02870  [pdf, other

    cs.CL cs.SE

    Capturing Structural Locality in Non-parametric Language Models

    Authors: Frank F. Xu, Junxian He, Graham Neubig, Vincent J. Hellendoorn

    Abstract: Structural locality is a ubiquitous feature of real-world datasets, wherein data points are organized into local hierarchies. Some examples include topical clusters in text or project hierarchies in source code repositories. In this paper, we explore utilizing this structural locality within non-parametric language models, which generate sequences that reference retrieved examples from an external… ▽ More

    Submitted 1 February, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  15. Memorization and Generalization in Neural Code Intelligence Models

    Authors: Md Rafiqul Islam Rabin, Aftab Hussain, Mohammad Amin Alipour, Vincent J. Hellendoorn

    Abstract: Deep Neural Networks (DNNs) are increasingly being used in software engineering and code intelligence tasks. These are powerful tools that are capable of learning highly generalizable patterns from large datasets through millions of parameters. At the same time, their large capacity can render them prone to memorizing data points. Recent work suggests that the memorization risk manifests especiall… ▽ More

    Submitted 12 September, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Information and Software Technology, IST Journal 2022, Elsevier

  16. arXiv:2106.03353  [pdf, other

    cs.SE cs.LG cs.PL

    Understanding Neural Code Intelligence Through Program Simplification

    Authors: Md Rafiqul Islam Rabin, Vincent J. Hellendoorn, Mohammad Amin Alipour

    Abstract: A wide range of code intelligence (CI) tools, powered by deep neural networks, have been developed recently to improve programming productivity and perform program analysis. To reliably use such tools, developers often need to reason about the behavior of the underlying models and the factors that affect them. This is especially challenging for tools backed by deep neural networks. Various methods… ▽ More

    Submitted 9 September, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: The 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE'21)

  17. arXiv:2008.10707  [pdf, other

    cs.SE cs.LG cs.PL

    Patching as Translation: the Data and the Metaphor

    Authors: Yangruibo Ding, Baishakhi Ray, Premkumar Devanbu, Vincent J. Hellendoorn

    Abstract: Machine Learning models from other fields, like Computational Linguistics, have been transplanted to Software Engineering tasks, often quite successfully. Yet a transplanted model's initial success at a given task does not necessarily mean it is well-suited for the task. In this work, we examine a common example of this phenomenon: the conceit that "software patching is like language translation".… ▽ More

    Submitted 31 August, 2020; v1 submitted 24 August, 2020; originally announced August 2020.

  18. Learning Lenient Parsing & Typing via Indirect Supervision

    Authors: Toufique Ahmed, Premkumar Devanbu, Vincent Hellendoorn

    Abstract: Both professional coders and teachers frequently deal with imperfect (fragmentary, incomplete, ill-formed) code. Such fragments are common in STACKOVERFLOW; students also frequently produce ill-formed code, for which instructors, TAs (or students themselves) must find repairs. In either case, the developer experience could be greatly improved if such code could somehow be parsed & typed; this make… ▽ More

    Submitted 9 February, 2021; v1 submitted 13 October, 2019; originally announced October 2019.

    Comments: Accepted at EMSE (Empirical Software Engineering Journal)

    Report number: 29

    Journal ref: Empirical Software Engineering volume 26 (2021)

  19. arXiv:1903.06089  [pdf, ps, other

    cs.SE

    Are My Invariants Valid? A Learning Approach

    Authors: Vincent J. Hellendoorn, Premkumar T. Devanbu, Oleksandr Polozov, Mark Marron

    Abstract: Ensuring that a program operates correctly is a difficult task in large, complex systems. Enshrining invariants -- desired properties of correct execution -- in code or comments can support maintainability and help sustain correctness. Tools that can automatically infer and recommend invariants can thus be very beneficial. However, current invariant-suggesting tools, such as Daikon, suffer from hi… ▽ More

    Submitted 15 March, 2019; v1 submitted 14 March, 2019; originally announced March 2019.

    Comments: 10 pages

  20. arXiv:1506.01159  [pdf, other

    cs.SE

    On the "Naturalness" of Buggy Code

    Authors: Baishakhi Ray, Vincent Hellendoorn, Saheel Godhane, Zhaopeng Tu, Alberto Bacchelli, Premkumar Devanbu

    Abstract: Real software, the kind working programmers produce by the kLOC to solve real-world problems, tends to be "natural", like speech or natural language; it tends to be highly repetitive and predictable. Researchers have captured this naturalness of software through statistical models and used them to good effect in suggestion engines, porting tools, coding standards checkers, and idiom miners. This s… ▽ More

    Submitted 10 September, 2015; v1 submitted 3 June, 2015; originally announced June 2015.

    Comments: 12 pages

    MSC Class: 68N30