Skip to main content

Showing 1–29 of 29 results for author: Nikanjam, A

  1. arXiv:2407.08890  [pdf, other

    cs.SE cs.AI cs.LG

    DeepCodeProbe: Towards Understanding What Models Trained on Code Learn

    Authors: Vahid Majdinasab, Amin Nikanjam, Foutse Khomh

    Abstract: Machine learning models trained on code and related artifacts offer valuable support for software maintenance but suffer from interpretability issues due to their complex internal variables. These concerns are particularly significant in safety-critical applications where the models' decision-making processes must be reliable. The specific features and representations learned by these models remai… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    ACM Class: I.2.5; D.2.3

  2. arXiv:2404.12241  [pdf, other

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller , et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu… ▽ More

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2403.08937  [pdf, other

    cs.SE cs.AI

    Bugs in Large Language Models Generated Code: An Empirical Study

    Authors: Florian Tambon, Arghavan Moradi Dakhel, Amin Nikanjam, Foutse Khomh, Michel C. Desmarais, Giuliano Antoniol

    Abstract: Large Language Models (LLMs) for code have gained significant attention recently. They can generate code in different programming languages based on provided prompts, fulfilling a long-lasting dream in Software Engineering (SE), i.e., automatic code generation. Similar to human-written code, LLM-generated code is prone to bugs, and these bugs have not yet been thoroughly examined by the community.… ▽ More

    Submitted 18 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 47 pages, 7 figures

  4. arXiv:2402.09299  [pdf, other

    cs.SE cs.LG

    Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code

    Authors: Vahid Majdinasab, Amin Nikanjam, Foutse Khomh

    Abstract: Code auditing ensures that the developed code adheres to standards, regulations, and copyright protection by verifying that it does not contain code from protected sources. The recent advent of Large Language Models (LLMs) as coding assistants in the software development process poses new challenges for code auditing. The dataset for training these models is mainly collected from publicly availabl… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Submitted to TOSEM (ACM Transactions on Software Engineering and Methodology)

  5. arXiv:2312.15536  [pdf, other

    cs.SE

    Harnessing Pre-trained Generalist Agents for Software Engineering Tasks

    Authors: Paulina Stevia Nouwou Mindom, Amin Nikanjam, Foutse Khomh

    Abstract: Nowadays, we are witnessing an increasing adoption of Artificial Intelligence (AI) to develop techniques aimed at improving the reliability, effectiveness, and overall quality of software systems. Deep reinforcement learning (DRL) has recently been successfully used for automation in complex tasks such as game testing and solving the job-shop scheduling problem. However, these specialized DRL agen… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  6. arXiv:2310.09575  [pdf, other

    cs.SE

    Common Challenges of Deep Reinforcement Learning Applications Development: An Empirical Study

    Authors: Mohammad Mehdi Morovati, Florian Tambon, Mina Taraghi, Amin Nikanjam, Foutse Khomh

    Abstract: Machine Learning (ML) is increasingly being adopted in different industries. Deep Reinforcement Learning (DRL) is a subdomain of ML used to produce intelligent agents. Despite recent developments in DRL technology, the main challenges that developers face in the development of DRL applications are still unknown. To fill this gap, in this paper, we conduct a large-scale empirical study of 927 DRL-r… ▽ More

    Submitted 18 May, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Submitted to Empirical Software Engineering journal

  7. arXiv:2310.01765  [pdf, other

    cs.LG cs.DB

    Data Cleaning and Machine Learning: A Systematic Literature Review

    Authors: Pierre-Olivier Côté, Amin Nikanjam, Nafisa Ahmed, Dmytro Humeniuk, Foutse Khomh

    Abstract: Context: Machine Learning (ML) is integrated into a growing number of systems for various applications. Because the performance of an ML model is highly dependent on the quality of the data it has been trained on, there is a growing interest in approaches to detect and repair data errors (i.e., data cleaning). Researchers are also exploring how ML can be used for data cleaning; hence creating a du… ▽ More

    Submitted 30 May, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Published in the Automated Software Engineering Journal

  8. arXiv:2308.16557  [pdf, other

    cs.SE

    Effective Test Generation Using Pre-trained Large Language Models and Mutation Testing

    Authors: Arghavan Moradi Dakhel, Amin Nikanjam, Vahid Majdinasab, Foutse Khomh, Michel C. Desmarais

    Abstract: One of the critical phases in software development is software testing. Testing helps with identifying potential bugs and reducing maintenance costs. The goal of automated test generation tools is to ease the development of tests by suggesting efficient bug-revealing tests. Recently, researchers have leveraged Large Language Models (LLMs) of code to generate unit tests. While the code coverage of… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 16 pages, 3 figures

  9. arXiv:2308.12438  [pdf, other

    cs.LG cs.AI cs.SE

    Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges

    Authors: Ahmed Haj Yahmed, Altaf Allah Abbassi, Amin Nikanjam, Heng Li, Foutse Khomh

    Abstract: Deep reinforcement learning (DRL), leveraging Deep Learning (DL) in reinforcement learning, has shown significant potential in achieving human-level autonomy in a wide range of domains, including robotics, computer vision, and computer games. This potential justifies the enthusiasm and growing interest in DRL in both academia and industry. However, the community currently focuses mostly on the dev… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in The International Conference on Software Maintenance and Evolution (ICSME 2023)

  10. arXiv:2307.14512  [pdf, other

    cs.SE cs.LG

    Bug Characterization in Machine Learning-based Systems

    Authors: Mohammad Mehdi Morovati, Amin Nikanjam, Florian Tambon, Foutse Khomh, Zhen Ming, Jiang

    Abstract: Rapid growth of applying Machine Learning (ML) in different domains, especially in safety-critical areas, increases the need for reliable ML components, i.e., a software component operating based on ML. Understanding the bugs characteristics and maintenance challenges in ML-based systems can help developers of these systems to identify where to focus maintenance and testing efforts, by giving insi… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  11. arXiv:2306.15007  [pdf, other

    cs.SE cs.LG

    Quality Issues in Machine Learning Software Systems

    Authors: Pierre-Olivier Côté, Amin Nikanjam, Rached Bouchoucha, Ilan Basta, Mouna Abidi, Foutse Khomh

    Abstract: Context: An increasing demand is observed in various domains to employ Machine Learning (ML) for solving complex problems. ML models are implemented as software components and deployed in Machine Learning Software Systems (MLSSs). Problem: There is a strong need for ensuring the serving quality of MLSSs. False or poor decisions of such systems can lead to malfunction of other systems, significant… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    MSC Class: 68T05

  12. arXiv:2305.16323  [pdf

    cs.SE cs.LG

    Detecting Concept Drift for the reliability prediction of Software Defects using Instance Interpretation

    Authors: Zeynab Chitsazian, Saeed Sedighian Kashi, Amin Nikanjam

    Abstract: In the context of Just-In-Time Software Defect Prediction (JIT-SDP), Concept drift (CD) can occur due to changes in the software development process, the complexity of the software, or changes in user behavior that may affect the stability of the JIT-SDP model over time. Additionally, the challenge of class imbalance in JIT-SDP data poses a potential risk to the accuracy of CD detection methods if… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

  13. arXiv:2301.05651  [pdf, other

    cs.LG cs.SE

    Mutation Testing of Deep Reinforcement Learning Based on Real Faults

    Authors: Florian Tambon, Vahid Majdinasab, Amin Nikanjam, Foutse Khomh, Giuliano Antonio

    Abstract: Testing Deep Learning (DL) systems is a complex task as they do not behave like traditional systems would, notably because of their stochastic nature. Nonetheless, being able to adapt existing testing techniques such as Mutation Testing (MT) to DL settings would greatly improve their potential verifiability. While some efforts have been made to extend MT to the Supervised Learning paradigm, little… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: Accepted to the International Conference of Software Testing (ICST2023)

  14. arXiv:2208.12136  [pdf, other

    cs.SE cs.LG

    A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks

    Authors: Paulina Stevia Nouwou Mindom, Amin Nikanjam, Foutse Khomh

    Abstract: Software testing activities scrutinize the artifacts and the behavior of a software product to find possible defects and ensure that the product meets its expected requirements. Recently, Deep Reinforcement Learning (DRL) has been successfully employed in complex testing tasks such as game testing, regression testing, and test case prioritization to automate the process and provide continuous adap… ▽ More

    Submitted 29 June, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at EMSE (Empirical Software Engineering journal) 2023

  15. arXiv:2208.08982  [pdf, ps, other

    cs.SE cs.LG

    Quality issues in Machine Learning Software Systems

    Authors: Pierre-Olivier Côté, Amin Nikanjam, Rached Bouchoucha, Foutse Khomh

    Abstract: Context: An increasing demand is observed in various domains to employ Machine Learning (ML) for solving complex problems. ML models are implemented as software components and deployed in Machine Learning Software Systems (MLSSs). Problem: There is a strong need for ensuring the serving quality of MLSSs. False or poor decisions of such systems can lead to malfunction of other systems, significant… ▽ More

    Submitted 22 August, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted as a registered report by ICSME 2022

  16. arXiv:2206.15331  [pdf, other

    cs.SE cs.LG

    GitHub Copilot AI pair programmer: Asset or Liability?

    Authors: Arghavan Moradi Dakhel, Vahid Majdinasab, Amin Nikanjam, Foutse Khomh, Michel C. Desmarais, Zhen Ming, Jiang

    Abstract: Automatic program synthesis is a long-lasting dream in software engineering. Recently, a promising Deep Learning (DL) based solution, called Copilot, has been proposed by OpenAI and Microsoft as an industrial product. Although some studies evaluate the correctness of Copilot solutions and report its issues, more empirical evaluations are necessary to understand how developers can benefit from it e… ▽ More

    Submitted 14 April, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: 27 pages, 8 figures

  17. arXiv:2206.14322  [pdf, other

    cs.LG

    An Empirical Study of Challenges in Converting Deep Learning Models

    Authors: Moses Openja, Amin Nikanjam, Ahmed Haj Yahmed, Foutse Khomh, Zhen Ming, Jiang

    Abstract: There is an increase in deploying Deep Learning (DL)-based software systems in real-world applications. Usually DL models are developed and trained using DL frameworks that have their own internal mechanisms/formats to represent and train DL models, and usually those formats cannot be recognized by other frameworks. Moreover, trained models are usually deployed in environments different from where… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in ICSME 2022

  18. arXiv:2206.12311  [pdf, other

    cs.SE cs.LG

    Bugs in Machine Learning-based Systems: A Faultload Benchmark

    Authors: Mohammad Mehdi Morovati, Amin Nikanjam, Foutse Khomh, Zhen Ming, Jiang

    Abstract: The rapid escalation of applying Machine Learning (ML) in various domains has led to paying more attention to the quality of ML components. There is then a growth of techniques and tools aiming at improving the quality of ML components and integrating them into the ML-based system safely. Although most of these tools use bugs' lifecycle, there is no standard benchmark of bugs to assess their perfo… ▽ More

    Submitted 16 January, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

  19. arXiv:2203.13195  [pdf

    cs.NE cs.AI

    Novel Metric based on Walsh Coefficients for measuring problem difficulty in Estimation of Distribution Algorithms

    Authors: Saeed Ghadiri, Amin Nikanjam

    Abstract: Estimation of distribution algorithms are evolutionary algorithms that use extracted information from the population instead of traditional genetic operators to generate new solutions. This information is represented as a probabilistic model and the effectiveness of these algorithms is dependent on the quality of these models. However, some studies have shown that even multivariate EDAs fail to bu… ▽ More

    Submitted 24 February, 2022; originally announced March 2022.

  20. arXiv:2112.13314  [pdf, other

    cs.SE cs.LG

    Silent Bugs in Deep Learning Frameworks: An Empirical Study of Keras and TensorFlow

    Authors: Florian Tambon, Amin Nikanjam, Le An, Foutse Khomh, Giuliano Antoniol

    Abstract: Deep Learning (DL) frameworks are now widely used, simplifying the creation of complex models as well as their integration to various applications even to non DL experts. However, like any other programs, they are prone to bugs. This paper deals with the subcategory of bugs named silent bugs: they lead to wrong behavior but they do not cause system crashes or hangs, nor show an error message to th… ▽ More

    Submitted 1 September, 2023; v1 submitted 25 December, 2021; originally announced December 2021.

  21. arXiv:2111.04865  [pdf, other

    cs.LG

    On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods

    Authors: Paulina Stevia Nouwou Mindom, Amin Nikanjam, Foutse Khomh, John Mullins

    Abstract: The increasing adoption of Reinforcement Learning in safety-critical systems domains such as autonomous vehicles, health, and aviation raises the need for ensuring their safety. Existing safety mechanisms such as adversarial training, adversarial detection, and robust learning are not always adapted to all disturbances in which the agent is deployed. Those disturbances include moving adversaries w… ▽ More

    Submitted 9 November, 2021; v1 submitted 8 November, 2021; originally announced November 2021.

  22. arXiv:2109.03991  [pdf, other

    cs.SE cs.LG

    The challenge of reproducible ML: an empirical study on the impact of bugs

    Authors: Emilio Rivera-Landos, Foutse Khomh, Amin Nikanjam

    Abstract: Reproducibility is a crucial requirement in scientific research. When results of research studies and scientific papers have been found difficult or impossible to reproduce, we face a challenge which is called reproducibility crisis. Although the demand for reproducibility in Machine Learning (ML) is acknowledged in the literature, a main barrier is inherent non-determinism in ML training and infe… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  23. arXiv:2108.09501  [pdf, other

    cs.LG stat.ML

    A Stochastic Variance-Reduced Coordinate Descent Algorithm for Learning Sparse Bayesian Network from Discrete High-Dimensional Data

    Authors: Nazanin Shajoonnezhad, Amin Nikanjam

    Abstract: This paper addresses the problem of learning a sparse structure Bayesian network from high-dimensional discrete data. Compared to continuous Bayesian networks, learning a discrete Bayesian network is a challenging problem due to the large parameter space. Although many approaches have been developed for learning continuous Bayesian networks, few approaches have been proposed for the discrete ones.… ▽ More

    Submitted 24 September, 2022; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: Accepted for publication by International Journal of Machine Learning and Cybernetics

  24. How to Certify Machine Learning Based Safety-critical Systems? A Systematic Literature Review

    Authors: Florian Tambon, Gabriel Laberge, Le An, Amin Nikanjam, Paulina Stevia Nouwou Mindom, Yann Pequignot, Foutse Khomh, Giulio Antoniol, Ettore Merlo, François Laviolette

    Abstract: Context: Machine Learning (ML) has been at the heart of many innovations over the past years. However, including it in so-called 'safety-critical' systems such as automotive or aeronautic has proven to be very challenging, since the shift in paradigm that ML brings completely changes traditional certification approaches. Objective: This paper aims to elucidate challenges related to the certifica… ▽ More

    Submitted 1 December, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: 60 pages (92 pages with references and complements), submitted to a journal (Automated Software Engineering). Changes: Emphasizing difference traditional software engineering / ML approach. Adding Related Works, Threats to Validity and Complementary Materials. Adding a table listing papers reference for each section/subsections

    Journal ref: Autom Softw Eng 29, 38 (2022)

  25. arXiv:2107.09807  [pdf

    cs.MA cs.AI cs.LG

    Improved Reinforcement Learning in Cooperative Multi-agent Environments Using Knowledge Transfer

    Authors: Mahnoosh Mahdavimoghaddam, Amin Nikanjam, Monireh Abdoos

    Abstract: Nowadays, cooperative multi-agent systems are used to learn how to achieve goals in large-scale dynamic environments. However, learning in these environments is challenging: from the effect of search space size on learning time to inefficient cooperation among agents. Moreover, reinforcement learning algorithms may suffer from a long time of convergence in such environments. In this paper, a commu… ▽ More

    Submitted 17 January, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: Accepted for publication by The Journal of Supercomputing

  26. arXiv:2107.02279  [pdf, other

    cs.SE cs.LG

    Design Smells in Deep Learning Programs: An Empirical Study

    Authors: Amin Nikanjam, Foutse Khomh

    Abstract: Nowadays, we are witnessing an increasing adoption of Deep Learning (DL) based software systems in many industries. Designing a DL program requires constructing a deep neural network (DNN) and then training it on a dataset. This process requires that developers make multiple architectural (e.g., type, size, number, and order of layers) and configuration (e.g., optimizer, regularization methods, an… ▽ More

    Submitted 7 July, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: Accepted for publication by ICSME 2021

  27. arXiv:2105.08095  [pdf, other

    cs.SE cs.LG

    Automatic Fault Detection for Deep Learning Programs Using Graph Transformations

    Authors: Amin Nikanjam, Houssem Ben Braiek, Mohammad Mehdi Morovati, Foutse Khomh

    Abstract: Nowadays, we are witnessing an increasing demand in both corporates and academia for exploiting Deep Learning (DL) to solve complex real-world problems. A DL program encodes the network structure of a desirable DL model and the process by which the model learns from the training dataset. Like any software, a DL program can be faulty, which implies substantial challenges of software quality assuran… ▽ More

    Submitted 30 May, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

  28. arXiv:2101.00135  [pdf, other

    cs.SE cs.LG

    Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach

    Authors: Amin Nikanjam, Mohammad Mehdi Morovati, Foutse Khomh, Houssem Ben Braiek

    Abstract: A growing demand is witnessed in both industry and academia for employing Deep Learning (DL) in various domains to solve real-world problems. Deep Reinforcement Learning (DRL) is the application of DL in the domain of Reinforcement Learning (RL). Like any software systems, DRL applications can fail because of faults in their programs. In this paper, we present the first attempt to categorize fault… ▽ More

    Submitted 28 November, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

  29. arXiv:2008.03456  [pdf, other

    cs.RO

    Cyrus 2D Simulation Team Description Paper2018

    Authors: Nader Zare, Mohsen Sadeghipour, Ashkan Keshavarzi, Mahtab Sarvmaili, Amin Nikanjam, Reza Aghayari, Arad Firouzkoohi, Mohammad Abolnejad, Sina Elahimanesh, Amin Akhgari

    Abstract: Cyrus 2D Soccer Simulation was established 2012 with the aim of research and develop in multi agents systems. This year we have joined with Ziziphus for collaboration and speed up our researches. This paper express a brief description of a method for predicting player's behavior in a multi agent system using neural network with a dataset in three level (low, mid, high). The dataset was obtained fr… ▽ More

    Submitted 8 August, 2020; originally announced August 2020.