Skip to main content

Showing 1–5 of 5 results for author: Karampatsis, R

  1. arXiv:2111.01564  [pdf, other

    cs.LG cs.SC

    MultiplexNet: Towards Fully Satisfied Logical Constraints in Neural Networks

    Authors: Nicholas Hoernle, Rafael Michael Karampatsis, Vaishak Belle, Kobi Gal

    Abstract: We propose a novel way to incorporate expert knowledge into the training of deep neural networks. Many approaches encode domain constraints directly into the network architecture, requiring non-trivial or domain-specific engineering. In contrast, our approach, called MultiplexNet, represents domain knowledge as a logical formula in disjunctive normal form (DNF) which is easy to encode and to elici… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: Submitted to AAAI2022

  2. arXiv:2004.13214  [pdf, ps, other

    cs.SE cs.LG

    SCELMo: Source Code Embeddings from Language Models

    Authors: Rafael - Michael Karampatsis, Charles Sutton

    Abstract: Continuous embeddings of tokens in computer programs have been used to support a variety of software development tools, including readability, code search, and program repair. Contextual embeddings are common in natural language processing but have not been previously applied in software engineering. We introduce a new set of deep contextualized word representations for computer programs based on… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: 12 pages

  3. Big Code != Big Vocabulary: Open-Vocabulary Models for Source Code

    Authors: Rafael-Michael Karampatsis, Hlib Babii, Romain Robbes, Charles Sutton, Andrea Janes

    Abstract: Statistical language modeling techniques have successfully been applied to large source code corpora, yielding a variety of new software development tools, such as tools for code suggestion, improving readability, and API migration. A major issue with these techniques is that code introduces new vocabulary at a far higher rate than natural language, as new identifier names proliferate. Both large… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: 13 pages; to appear in Proceedings of ICSE 2020

  4. How Often Do Single-Statement Bugs Occur? The ManySStuBs4J Dataset

    Authors: Rafael-Michael Karampatsis, Charles Sutton

    Abstract: Program repair is an important but difficult software engineering problem. One way to achieve acceptable performance is to focus on classes of simple bugs, such as bugs with single statement fixes, or that match a small set of bug templates. However, it is very difficult to estimate the recall of repair techniques for simple bugs, as there are no datasets about how often the associated bugs occur… ▽ More

    Submitted 10 April, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: 5 pages; to appear in Proceedings of MSR 2020

  5. arXiv:1903.05734  [pdf, ps, other

    cs.SE cs.LG

    Maybe Deep Neural Networks are the Best Choice for Modeling Source Code

    Authors: Rafael-Michael Karampatsis, Charles Sutton

    Abstract: Statistical language modeling techniques have successfully been applied to source code, yielding a variety of new software development tools, such as tools for code suggestion and improving readability. A major issue with these techniques is that code introduces new vocabulary at a far higher rate than natural language, as new identifier names proliferate. But traditional language models limit the… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.