Skip to main content

Showing 1–50 of 110 results for author: Khomh, F

  1. arXiv:2406.18075  [pdf, other

    cs.SE

    A Context-Driven Approach for Co-Auditing Smart Contracts with The Support of GPT-4 code interpreter

    Authors: Mohamed Salah Bouafif, Chen Zheng, Ilham Ahmed Qasse, Ed Zulkoski, Mohammad Hamdaqa, Foutse Khomh

    Abstract: The surge in the adoption of smart contracts necessitates rigorous auditing to ensure their security and reliability. Manual auditing, although comprehensive, is time-consuming and heavily reliant on the auditor's expertise. With the rise of Large Language Models (LLMs), there is growing interest in leveraging them to assist auditors in the auditing process (co-auditing). However, the effectivenes… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2405.13932  [pdf, other

    cs.SE cs.AI

    Chain of Targeted Verification Questions to Improve the Reliability of Code Generated by LLMs

    Authors: Sylvain Kouemo Ngassom, Arghavan Moradi Dakhel, Florian Tambon, Foutse Khomh

    Abstract: LLM-based assistants, such as GitHub Copilot and ChatGPT, have the potential to generate code that fulfills a programming task described in a natural language description, referred to as a prompt. The widespread accessibility of these assistants enables users with diverse backgrounds to generate code and integrate it into software projects. However, studies show that code generated by LLMs is pron… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 10 pages, 2 figures

  3. arXiv:2405.13740  [pdf, other

    cs.SE cs.LG

    Mining Action Rules for Defect Reduction Planning

    Authors: Khouloud Oueslati, Gabriel Laberge, Maxime Lamothe, Foutse Khomh

    Abstract: Defect reduction planning plays a vital role in enhancing software quality and minimizing software maintenance costs. By training a black box machine learning model and "explaining" its predictions, explainable AI for software engineering aims to identify the code characteristics that impact maintenance risks. However, post-hoc explanations do not always faithfully reflect what the original model… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2405.12450  [pdf, other

    cs.SE cs.AI

    PathOCL: Path-Based Prompt Augmentation for OCL Generation with GPT-4

    Authors: Seif Abukhalaf, Mohammad Hamdaqa, Foutse Khomh

    Abstract: The rapid progress of AI-powered programming assistants, such as GitHub Copilot, has facilitated the development of software applications. These assistants rely on large language models (LLMs), which are foundation models (FMs) that support a wide range of tasks related to understanding and generating language. LLMs have demonstrated their ability to express UML model specifications using formal l… ▽ More

    Submitted 6 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Updated affiliations. This paper has been accepted to be published in the 2024 IEEE/ACM First International Conference on AI Foundation Models and Software Engineering (Forge)

  5. arXiv:2404.12241  [pdf, other

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller , et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu… ▽ More

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  6. arXiv:2404.00897  [pdf, ps, other

    cs.LG cs.AI cs.SE

    Machine Learning Robustness: A Primer

    Authors: Houssem Ben Braiek, Foutse Khomh

    Abstract: This chapter explores the foundational concept of robustness in Machine Learning (ML) and its integral role in establishing trustworthiness in Artificial Intelligence (AI) systems. The discussion begins with a detailed definition of robustness, portraying it as the ability of ML models to maintain stable performance across varied and unexpected environmental conditions. ML robustness is dissected… ▽ More

    Submitted 3 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  7. arXiv:2403.08937  [pdf, other

    cs.SE cs.AI

    Bugs in Large Language Models Generated Code: An Empirical Study

    Authors: Florian Tambon, Arghavan Moradi Dakhel, Amin Nikanjam, Foutse Khomh, Michel C. Desmarais, Giuliano Antoniol

    Abstract: Large Language Models (LLMs) for code have gained significant attention recently. They can generate code in different programming languages based on provided prompts, fulfilling a long-lasting dream in Software Engineering (SE), i.e., automatic code generation. Similar to human-written code, LLM-generated code is prone to bugs, and these bugs have not yet been thoroughly examined by the community.… ▽ More

    Submitted 18 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 47 pages, 7 figures

  8. arXiv:2402.09299  [pdf, other

    cs.SE cs.LG

    Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code

    Authors: Vahid Majdinasab, Amin Nikanjam, Foutse Khomh

    Abstract: Code auditing ensures that the developed code adheres to standards, regulations, and copyright protection by verifying that it does not contain code from protected sources. The recent advent of Large Language Models (LLMs) as coding assistants in the software development process poses new challenges for code auditing. The dataset for training these models is mainly collected from publicly availabl… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Submitted to TOSEM (ACM Transactions on Software Engineering and Methodology)

  9. arXiv:2402.08801  [pdf, ps, other

    cs.SE cs.AI

    ChatGPT vs LLaMA: Impact, Reliability, and Challenges in Stack Overflow Discussions

    Authors: Leuson Da Silva, Jordan Samhi, Foutse Khomh

    Abstract: Since its release in November 2022, ChatGPT has shaken up Stack Overflow, the premier platform for developers' queries on programming and software development. Demonstrating an ability to generate instant, human-like responses to technical questions, ChatGPT has ignited debates within the developer community about the evolving role of human-driven platforms in the age of generative AI. Two months… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 36 pages, 9 figures

  10. arXiv:2401.13177  [pdf, other

    cs.SE cs.CY cs.LG

    Deep Learning Model Reuse in the HuggingFace Community: Challenges, Benefit and Trends

    Authors: Mina Taraghi, Gianolli Dorcelus, Armstrong Foundjem, Florian Tambon, Foutse Khomh

    Abstract: The ubiquity of large-scale Pre-Trained Models (PTMs) is on the rise, sparking interest in model hubs, and dedicated platforms for hosting PTMs. Despite this trend, a comprehensive exploration of the challenges that users encounter and how the community leverages PTMs remains lacking. To address this gap, we conducted an extensive mixed-methods empirical study by focusing on discussion forums and… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE SANER 2024

  11. arXiv:2401.03069  [pdf, other

    cs.SE cs.LG

    Towards Enhancing the Reproducibility of Deep Learning Bugs: An Empirical Study

    Authors: Mehil B. Shah, Mohammad Masudur Rahman, Foutse Khomh

    Abstract: Context: Deep learning has achieved remarkable progress in various domains. However, like any software system, deep learning systems contain bugs, some of which can have severe impacts, as evidenced by crashes involving autonomous vehicles. Despite substantial advancements in deep learning techniques, little research has focused on reproducing deep learning bugs, which is an essential step for the… ▽ More

    Submitted 18 June, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Under Major Revision at the EMSE (Empirical Software Engineering) Journal

  12. arXiv:2312.15536  [pdf, other

    cs.SE

    Harnessing Pre-trained Generalist Agents for Software Engineering Tasks

    Authors: Paulina Stevia Nouwou Mindom, Amin Nikanjam, Foutse Khomh

    Abstract: Nowadays, we are witnessing an increasing adoption of Artificial Intelligence (AI) to develop techniques aimed at improving the reliability, effectiveness, and overall quality of software systems. Deep reinforcement learning (DRL) has recently been successfully used for automation in complex tasks such as game testing and solving the job-shop scheduling problem. However, these specialized DRL agen… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  13. arXiv:2312.15068  [pdf, other

    cs.SE cs.CL cs.LG

    Refining GPT-3 Embeddings with a Siamese Structure for Technical Post Duplicate Detection

    Authors: Xingfang Wu, Heng Li, Nobukazu Yoshioka, Hironori Washizaki, Foutse Khomh

    Abstract: One goal of technical online communities is to help developers find the right answer in one place. A single question can be asked in different ways with different wordings, leading to the existence of duplicate posts on technical forums. The question of how to discover and link duplicate posts has garnered the attention of both developer communities and researchers. For example, Stack Overflow ado… ▽ More

    Submitted 4 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: SANER 2024

  14. arXiv:2312.14279  [pdf, other

    cs.SE cs.CL cs.LG

    Characterizing and Classifying Developer Forum Posts with their Intentions

    Authors: Xingfang Wu, Eric Laufer, Heng Li, Foutse Khomh, Santhosh Srinivasan, Jayden Luo

    Abstract: With the rapid growth of the developer community, the amount of posts on online technical forums has been growing rapidly, which poses difficulties for users to filter useful posts and find important information. Tags provide a concise feature dimension for users to locate their interested posts and for search engines to index the most relevant posts according to the queries. However, most tags ar… ▽ More

    Submitted 10 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Journal of Empirical Software Engineering, 40 pages

  15. arXiv:2312.12604  [pdf

    cs.SE cs.LG

    Studying the Practices of Testing Machine Learning Software in the Wild

    Authors: Moses Openja, Foutse Khomh, Armstrong Foundjem, Zhen Ming, Jiang, Mouna Abidi, Ahmed E. Hassan

    Abstract: Background: We are witnessing an increasing adoption of machine learning (ML), especially deep learning (DL) algorithms in many software systems, including safety-critical systems such as health care systems or autonomous driving vehicles. Ensuring the software quality of these systems is yet an open challenge for the research community, mainly due to the inductive nature of ML software systems. T… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  16. arXiv:2311.12019  [pdf, other

    cs.SE

    An Empirical Study of Self-Admitted Technical Debt in Machine Learning Software

    Authors: Aaditya Bhatia, Foutse Khomh, Bram Adams, Ahmed E Hassan

    Abstract: The emergence of open-source ML libraries such as TensorFlow and Google Auto ML has enabled developers to harness state-of-the-art ML algorithms with minimal overhead. However, during this accelerated ML development process, said developers may often make sub-optimal design and implementation decisions, leading to the introduction of technical debt that, if not addressed promptly, can have a signi… ▽ More

    Submitted 9 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

  17. arXiv:2311.11177  [pdf, other

    cs.SE

    Assessing the Security of GitHub Copilot Generated Code -- A Targeted Replication Study

    Authors: Vahid Majdinasab, Michael Joshua Bishop, Shawn Rasheed, Arghavan Moradidakhel, Amjed Tahir, Foutse Khomh

    Abstract: AI-powered code generation models have been developing rapidly, allowing developers to expedite code generation and thus improve their productivity. These models are trained on large corpora of code (primarily sourced from public repositories), which may contain bugs and vulnerabilities. Several concerns have been raised about the security of the code generated by these models. Recent studies have… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  18. arXiv:2311.00801  [pdf, other

    cs.LG cs.SE

    GIST: Generated Inputs Sets Transferability in Deep Learning

    Authors: Florian Tambon, Foutse Khomh, Giuliano Antoniol

    Abstract: To foster the verifiability and testability of Deep Neural Networks (DNN), an increasing number of methods for test case generation techniques are being developed. When confronted with testing DNN models, the user can apply any existing test generation technique. However, it needs to do so for each technique and each DNN model under test, which can be expensive. Therefore, a paradigm shift could… ▽ More

    Submitted 20 May, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: accepted for publication in the "ACM Transactions on Software Engineering and Methodology" journal

  19. arXiv:2310.12805  [pdf, other

    cs.LG

    Detection and Evaluation of bias-inducing Features in Machine learning

    Authors: Moses Openja, Gabriel Laberge, Foutse Khomh

    Abstract: The cause-to-effect analysis can help us decompose all the likely causes of a problem, such as an undesirable business situation or unintended harm to the individual(s). This implies that we can identify how the problems are inherited, rank the causes to help prioritize fixes, simplify a complex problem and visualize them. In the context of machine learning (ML), one can use cause-to-effect analys… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 65 pages, manuscript accepted at EMSE journal, manuscript number, EMSE-D-22-00330R3

  20. arXiv:2310.09575  [pdf, other

    cs.SE

    Common Challenges of Deep Reinforcement Learning Applications Development: An Empirical Study

    Authors: Mohammad Mehdi Morovati, Florian Tambon, Mina Taraghi, Amin Nikanjam, Foutse Khomh

    Abstract: Machine Learning (ML) is increasingly being adopted in different industries. Deep Reinforcement Learning (DRL) is a subdomain of ML used to produce intelligent agents. Despite recent developments in DRL technology, the main challenges that developers face in the development of DRL applications are still unknown. To fill this gap, in this paper, we conduct a large-scale empirical study of 927 DRL-r… ▽ More

    Submitted 18 May, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Submitted to Empirical Software Engineering journal

  21. A Large-Scale Exploratory Study of Android Sports Apps in the Google Play Store

    Authors: Bhagya Chembakottu, Heng Li, Foutse Khomh

    Abstract: Prior studies on mobile app analysis often analyze apps across different categories or focus on a small set of apps within a category. These studies either provide general insights for an entire app store which consists of millions of apps, or provide specific insights for a small set of apps. However, a single app category can often contain tens of thousands to hundreds of thousands of apps. For… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Journal paper

    Journal ref: Information and Software Technology (2023): 107321

  22. arXiv:2310.01765  [pdf, other

    cs.LG cs.DB

    Data Cleaning and Machine Learning: A Systematic Literature Review

    Authors: Pierre-Olivier Côté, Amin Nikanjam, Nafisa Ahmed, Dmytro Humeniuk, Foutse Khomh

    Abstract: Context: Machine Learning (ML) is integrated into a growing number of systems for various applications. Because the performance of an ML model is highly dependent on the quality of the data it has been trained on, there is a growing interest in approaches to detect and repair data errors (i.e., data cleaning). Researchers are also exploring how ML can be used for data cleaning; hence creating a du… ▽ More

    Submitted 30 May, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Published in the Automated Software Engineering Journal

  23. arXiv:2308.16557  [pdf, other

    cs.SE

    Effective Test Generation Using Pre-trained Large Language Models and Mutation Testing

    Authors: Arghavan Moradi Dakhel, Amin Nikanjam, Vahid Majdinasab, Foutse Khomh, Michel C. Desmarais

    Abstract: One of the critical phases in software development is software testing. Testing helps with identifying potential bugs and reducing maintenance costs. The goal of automated test generation tools is to ease the development of tests by suggesting efficient bug-revealing tests. Recently, researchers have leveraged Large Language Models (LLMs) of code to generate unit tests. While the code coverage of… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 16 pages, 3 figures

  24. arXiv:2308.12762  [pdf, other

    cs.RO cs.NE

    Reinforcement Learning Informed Evolutionary Search for Autonomous Systems Testing

    Authors: Dmytro Humeniuk, Foutse Khomh, Giuliano Antoniol

    Abstract: Evolutionary search-based techniques are commonly used for testing autonomous robotic systems. However, these approaches often rely on computationally expensive simulator-based models for test scenario evaluation. To improve the computational efficiency of the search-based testing, we propose augmenting the evolutionary search (ES) with a reinforcement learning (RL) agent trained using surrogate r… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  25. arXiv:2308.12445  [pdf, other

    cs.LG cs.AI cs.SE stat.ML

    An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems

    Authors: Ahmed Haj Yahmed, Rached Bouchoucha, Houssem Ben Braiek, Foutse Khomh

    Abstract: Deep reinforcement learning (DRL) is increasingly applied in large-scale productions like Netflix and Facebook. As with most data-driven systems, DRL systems can exhibit undesirable behaviors due to environmental drifts, which often occur in constantly-changing production settings. Continual Learning (CL) is the inherent self-healing approach for adapting the DRL agent in response to the environme… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in The 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  26. arXiv:2308.12438  [pdf, other

    cs.LG cs.AI cs.SE

    Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges

    Authors: Ahmed Haj Yahmed, Altaf Allah Abbassi, Amin Nikanjam, Heng Li, Foutse Khomh

    Abstract: Deep reinforcement learning (DRL), leveraging Deep Learning (DL) in reinforcement learning, has shown significant potential in achieving human-level autonomy in a wide range of domains, including robotics, computer vision, and computer games. This potential justifies the enthusiasm and growing interest in DRL in both academia and industry. However, the community currently focuses mostly on the dev… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in The International Conference on Software Maintenance and Evolution (ICSME 2023)

  27. On the Effectiveness of Log Representation for Log-based Anomaly Detection

    Authors: Xingfang Wu, Heng Li, Foutse Khomh

    Abstract: Logs are an essential source of information for people to understand the running status of a software system. Due to the evolving modern software architecture and maintenance methods, more research efforts have been devoted to automated log analysis. In particular, machine learning (ML) has been widely used in log analysis tasks. In ML-based log analysis tasks, converting textual log data into num… ▽ More

    Submitted 8 April, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted by Journal of Empirical Software Engineering (EMSE)

    Journal ref: Empirical Software Engineering (2023) 28:137

  28. arXiv:2308.03952  [pdf, other

    cs.CR cs.SE

    Exploring Security Practices in Infrastructure as Code: An Empirical Study

    Authors: Alexandre Verdet, Mohammad Hamdaqa, Leuson Da Silva, Foutse Khomh

    Abstract: Cloud computing has become popular thanks to the widespread use of Infrastructure as Code (IaC) tools, allowing the community to conveniently manage and configure cloud infrastructure using scripts. However, the scripting process itself does not automatically prevent practitioners from introducing misconfigurations, vulnerabilities, or privacy risks. As a result, ensuring security relies on practi… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 50 pages, 13 figures, 10 tables

  29. arXiv:2307.14512  [pdf, other

    cs.SE cs.LG

    Bug Characterization in Machine Learning-based Systems

    Authors: Mohammad Mehdi Morovati, Amin Nikanjam, Florian Tambon, Foutse Khomh, Zhen Ming, Jiang

    Abstract: Rapid growth of applying Machine Learning (ML) in different domains, especially in safety-critical areas, increases the need for reliable ML components, i.e., a software component operating based on ML. Understanding the bugs characteristics and maintenance challenges in ML-based systems can help developers of these systems to identify where to focus maintenance and testing efforts, by giving insi… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  30. arXiv:2307.13777  [pdf, other

    cs.SE cs.AI

    An Empirical Study on Bugs Inside PyTorch: A Replication Study

    Authors: Sharon Chee Yin Ho, Vahid Majdinasab, Mohayeminul Islam, Diego Elias Costa, Emad Shihab, Foutse Khomh, Sarah Nadi, Muhammad Raza

    Abstract: Software systems are increasingly relying on deep learning components, due to their remarkable capability of identifying complex data patterns and powering intelligent behaviour. A core enabler of this change in software development is the availability of easy-to-use deep learning libraries. Libraries like PyTorch and TensorFlow empower a large variety of intelligent systems, offering a multitude… ▽ More

    Submitted 1 August, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  31. arXiv:2306.15007  [pdf, other

    cs.SE cs.LG

    Quality Issues in Machine Learning Software Systems

    Authors: Pierre-Olivier Côté, Amin Nikanjam, Rached Bouchoucha, Ilan Basta, Mouna Abidi, Foutse Khomh

    Abstract: Context: An increasing demand is observed in various domains to employ Machine Learning (ML) for solving complex problems. ML models are implemented as software components and deployed in Machine Learning Software Systems (MLSSs). Problem: There is a strong need for ensuring the serving quality of MLSSs. False or poor decisions of such systems can lead to malfunction of other systems, significant… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    MSC Class: 68T05

  32. arXiv:2306.01788  [pdf, other

    cs.SE cs.AI cs.LG

    Responsible Design Patterns for Machine Learning Pipelines

    Authors: Saud Hakem Al Harbi, Lionel Nganyewou Tidjon, Foutse Khomh

    Abstract: Integrating ethical practices into the AI development process for artificial intelligence (AI) is essential to ensure safe, fair, and responsible operation. AI ethics involves applying ethical principles to the entire life cycle of AI systems. This is essential to mitigate potential risks and harms associated with AI, such as algorithm biases. To achieve this goal, responsible design patterns (RDP… ▽ More

    Submitted 7 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: 20 pages, 4 figures, 5 tables

  33. arXiv:2305.00323  [pdf

    cs.SE cs.AI cs.LG

    Leveraging Data Mining Algorithms to Recommend Source Code Changes

    Authors: AmirHossein Naghshzan, Saeed Khalilazar, Pierre Poilane, Olga Baysal, Latifa Guerrouj, Foutse Khomh

    Abstract: Context: Recent research has used data mining to develop techniques that can guide developers through source code changes. To the best of our knowledge, very few studies have investigated data mining techniques and--or compared their results with other algorithms or a baseline. Objectives: This paper proposes an automatic method for recommending source code changes using four data mining algorithm… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  34. arXiv:2303.16244  [pdf, other

    cs.SE cs.AI

    On Codex Prompt Engineering for OCL Generation: An Empirical Study

    Authors: Seif Abukhalaf, Mohammad Hamdaqa, Foutse Khomh

    Abstract: The Object Constraint Language (OCL) is a declarative language that adds constraints and object query expressions to MOF models. Despite its potential to provide precision and conciseness to UML models, the unfamiliar syntax of OCL has hindered its adoption. Recent advancements in LLMs, such as GPT-3, have shown their capability in many NLP tasks, including semantic parsing and text generation. Co… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 10 pages. Full abstract in the pre-print. Accepted to be published to the 2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR)

  35. arXiv:2301.05651  [pdf, other

    cs.LG cs.SE

    Mutation Testing of Deep Reinforcement Learning Based on Real Faults

    Authors: Florian Tambon, Vahid Majdinasab, Amin Nikanjam, Foutse Khomh, Giuliano Antonio

    Abstract: Testing Deep Learning (DL) systems is a complex task as they do not behave like traditional systems would, notably because of their stochastic nature. Nonetheless, being able to adapt existing testing techniques such as Mutation Testing (MT) to DL settings would greatly improve their potential verifiability. While some efforts have been made to extend MT to the Supervised Learning paradigm, little… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: Accepted to the International Conference of Software Testing (ICST2023)

  36. arXiv:2301.04234  [pdf, other

    cs.SE

    Studying Logging Practice in Machine Learning-based Applications

    Authors: Patrick Loic Foalem, Foutse Khomh, Heng Li

    Abstract: Logging is a common practice in traditional software development. Several research works have been done to investigate the different characteristics of logging practices in traditional software systems (e.g., Android applications, JAVA applications, C/C++ applications). Nowadays, we are witnessing more and more development of Machine Learning-based applications (ML-based applications). Today, ther… ▽ More

    Submitted 10 January, 2023; originally announced January 2023.

  37. arXiv:2301.01234  [pdf, other

    cs.RO cs.NE

    AmbieGen: A Search-based Framework for Autonomous Systems Testing

    Authors: Dmytro Humeniuk, Foutse Khomh, Giuliano Antoniol

    Abstract: Thorough testing of safety-critical autonomous systems, such as self-driving cars, autonomous robots, and drones, is essential for detecting potential failures before deployment. One crucial testing stage is model-in-the-loop testing, where the system model is evaluated by executing various scenarios in a simulator. However, the search space of possible parameters defining these test scenarios is… ▽ More

    Submitted 1 January, 2023; originally announced January 2023.

    Comments: 17 pages, 10 figures

  38. arXiv:2212.13245  [pdf, other

    cs.SE

    Studying the Characteristics of AIOps Projects on GitHub

    Authors: Roozbeh Aghili, Heng Li, Foutse Khomh

    Abstract: Artificial Intelligence for IT Operations (AIOps) leverages AI approaches to handle the massive amount of data generated during the operations of software systems. Prior works have proposed various AIOps solutions to support different tasks in system operations and maintenance, such as anomaly detection. In this study, we conduct an in-depth analysis of open-source AIOps projects to understand the… ▽ More

    Submitted 5 September, 2023; v1 submitted 26 December, 2022; originally announced December 2022.

    Comments: 46 pages, 8 pages of references, 14 figures, 16 tables

  39. arXiv:2212.02614  [pdf

    cs.LG cs.AI cs.CY

    Can Ensembling Pre-processing Algorithms Lead to Better Machine Learning Fairness?

    Authors: Khaled Badran, Pierre-Olivier Côté, Amanda Kolopanis, Rached Bouchoucha, Antonio Collante, Diego Elias Costa, Emad Shihab, Foutse Khomh

    Abstract: As machine learning (ML) systems get adopted in more critical areas, it has become increasingly crucial to address the bias that could occur in these systems. Several fairness pre-processing algorithms are available to alleviate implicit biases during model training. These algorithms employ different concepts of fairness, often leading to conflicting strategies with consequential trade-offs betwee… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  40. arXiv:2211.15733  [pdf, other

    cs.SE cs.AI

    An Empirical Study of Library Usage and Dependency in Deep Learning Frameworks

    Authors: Mohamed Raed El aoun, Lionel Nganyewou Tidjon, Ben Rombaut, Foutse Khomh, Ahmed E. Hassan

    Abstract: Recent advances in deep learning (dl) have led to the release of several dl software libraries such as pytorch, Caffe, and TensorFlow, in order to assist machine learning (ml) practitioners in developing and deploying state-of-the-art deep neural networks (DNN), but they are not able to properly cope with limitations in the dl libraries such as testing or data processing. In this paper, we present… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  41. arXiv:2211.01535  [pdf, other

    cs.CR cs.AI cs.LG

    Reliable Malware Analysis and Detection using Topology Data Analysis

    Authors: Lionel Nganyewou Tidjon, Foutse Khomh

    Abstract: Increasingly, malwares are becoming complex and they are spreading on networks targeting different infrastructures and personal-end devices to collect, modify, and destroy victim information. Malware behaviors are polymorphic, metamorphic, persistent, able to hide to bypass detectors and adapt to new environments, and even leverage machine learning techniques to better damage targets. Thus, it mak… ▽ More

    Submitted 8 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

  42. arXiv:2209.03438  [pdf, other

    cs.LG cs.CE eess.SY

    SmOOD: Smoothness-based Out-of-Distribution Detection Approach for Surrogate Neural Networks in Aircraft Design

    Authors: Houssem Ben Braiek, Ali Tfaily, Foutse Khomh, Thomas Reid, Ciro Guida

    Abstract: Aircraft industry is constantly striving for more efficient design optimization methods in terms of human efforts, computation time, and resource consumption. Hybrid surrogate optimization maintains high results quality while providing rapid design assessments when both the surrogate model and the switch mechanism for eventually transitioning to the HF model are calibrated properly. Feedforward ne… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  43. arXiv:2209.03431  [pdf, other

    cs.LG cs.CE eess.SY

    Physics-Guided Adversarial Machine Learning for Aircraft Systems Simulation

    Authors: Houssem Ben Braiek, Thomas Reid, Foutse Khomh

    Abstract: In the context of aircraft system performance assessment, deep learning technologies allow to quickly infer models from experimental measurements, with less detailed system knowledge than usually required by physics-based modeling. However, this inexpensive model development also comes with new challenges regarding model trustworthiness. This work presents a novel approach, physics-guided adversar… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  44. arXiv:2208.13116  [pdf, other

    cs.SE cs.AI cs.LG

    An Empirical Study on the Usage of Automated Machine Learning Tools

    Authors: Forough Majidi, Moses Openja, Foutse Khomh, Heng Li

    Abstract: The popularity of automated machine learning (AutoML) tools in different domains has increased over the past few years. Machine learning (ML) practitioners use AutoML tools to automate and optimize the process of feature engineering, model training, and hyperparameter optimization and so on. Recent work performed qualitative studies on practitioners' experiences of using AutoML tools and compared… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: 10 pages, 2 reference pages, 7 figures, accepted at the International Conference on Software Maintenance and Evolution (ICSME) 2022

  45. arXiv:2208.12136  [pdf, other

    cs.SE cs.LG

    A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks

    Authors: Paulina Stevia Nouwou Mindom, Amin Nikanjam, Foutse Khomh

    Abstract: Software testing activities scrutinize the artifacts and the behavior of a software product to find possible defects and ensure that the product meets its expected requirements. Recently, Deep Reinforcement Learning (DRL) has been successfully employed in complex testing tasks such as game testing, regression testing, and test case prioritization to automate the process and provide continuous adap… ▽ More

    Submitted 29 June, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at EMSE (Empirical Software Engineering journal) 2023

  46. arXiv:2208.08982  [pdf, ps, other

    cs.SE cs.LG

    Quality issues in Machine Learning Software Systems

    Authors: Pierre-Olivier Côté, Amin Nikanjam, Rached Bouchoucha, Foutse Khomh

    Abstract: Context: An increasing demand is observed in various domains to employ Machine Learning (ML) for solving complex problems. ML models are implemented as software components and deployed in Machine Learning Software Systems (MLSSs). Problem: There is a strong need for ensuring the serving quality of MLSSs. False or poor decisions of such systems can lead to malfunction of other systems, significant… ▽ More

    Submitted 22 August, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted as a registered report by ICSME 2022

  47. arXiv:2208.08918  [pdf, other

    cs.SE

    Data-access performance anti-patterns in data-intensive systems

    Authors: Biruk Asmare Muse, Kawser Wazed Nafi, Foutse Khomh, Giuliano Antoniol

    Abstract: Data-intensive systems handle variable, high volume, and high-velocity data generated by human and digital devices. Like traditional software, data-intensive systems are prone to technical debts introduced to cope-up with the pressure of time and resource constraints on developers. Data-access is a critical component of data-intensive systems as it determines the overall performance and functional… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

  48. A Probabilistic Framework for Mutation Testing in Deep Neural Networks

    Authors: Florian Tambon, Foutse Khomh, Giuliano Antoniol

    Abstract: Context: Mutation Testing (MT) is an important tool in traditional Software Engineering (SE) white-box testing. It aims to artificially inject faults in a system to evaluate a test suite's capability to detect them, assuming that the test suite defects finding capability will then translate to real faults. If MT has long been used in SE, it is only recently that it started gaining the attention of… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: 23 pages, 5 figures, 4 tables. Submitted to Information and Software Technology Journal

    Journal ref: Information and Software Technology. 155 (2023) 107129

  49. DiverGet: A Search-Based Software Testing Approach for Deep Neural Network Quantization Assessment

    Authors: Ahmed Haj Yahmed, Houssem Ben Braiek, Foutse Khomh, Sonia Bouzidi, Rania Zaatour

    Abstract: Quantization is one of the most applied Deep Neural Network (DNN) compression strategies, when deploying a trained DNN model on an embedded system or a cell phone. This is owing to its simplicity and adaptability to a wide range of applications and circumstances, as opposed to specific Artificial Intelligence (AI) accelerators and compilers that are often designed only for certain specific hardwar… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted for publication in The Empirical Software Engineering Journal (EMSE)

  50. Dev2vec: Representing Domain Expertise of Developers in an Embedding Space

    Authors: Arghavan Moradi Dakhel, Michel C. Desmarais, Foutse Khomh

    Abstract: Accurate assessment of the domain expertise of developers is important for assigning the proper candidate to contribute to a project or to attend a job role. Since the potential candidate can come from a large pool, the automated assessment of this domain expertise is a desirable goal. While previous methods have had some success within a single software project, the assessment of a developer's do… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: 30 pages, 5 figures