Skip to main content

Showing 1–50 of 50 results for author: Cordy, M

  1. arXiv:2406.14361  [pdf, other

    cs.AI eess.SY

    Robustness Analysis of AI Models in Critical Energy Systems

    Authors: Pantelis Dogoulis, Matthieu Jimenez, Salah Ghamizi, Maxime Cordy, Yves Le Traon

    Abstract: This paper analyzes the robustness of state-of-the-art AI-based models for power grid operations under the $N-1$ security criterion. While these models perform well in regular grid settings, our results highlight a significant loss in accuracy following the disconnection of a line.%under this security criterion. Using graph theory-based analysis, we demonstrate the impact of node connectivity on t… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.00775  [pdf, other

    cs.LG cs.CR

    Constrained Adaptive Attack: Effective Adversarial Attack Against Deep Neural Networks for Tabular Data

    Authors: Thibault Simonetto, Salah Ghamizi, Maxime Cordy

    Abstract: State-of-the-art deep learning models for tabular data have recently achieved acceptable performance to be deployed in industrial settings. However, the robustness of these models remains scarcely explored. Contrary to computer vision, there are no effective attacks to properly evaluate the adversarial robustness of deep tabular models due to intrinsic properties of tabular data, such as categoric… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  3. arXiv:2404.14419  [pdf, other

    cs.SE cs.CL cs.LG

    Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence Smoothing

    Authors: Qiang Hu, Jin Wen, Maxime Cordy, Yuheng Huang, Xiaofei Xie, Lei Ma

    Abstract: Large language models (LLMs) achieved great success in multiple application domains and attracted huge attention from different research communities recently. Unfortunately, even for the best LLM, there still exist many faults that LLM cannot correctly predict. Such faults will harm the usability of LLMs. How to quickly reveal them in LLMs is important, but challenging. The reasons are twofold, 1)… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  4. arXiv:2402.15769  [pdf, other

    cs.SE cs.AI

    Importance Guided Data Augmentation for Neural-Based Code Understanding

    Authors: Zeming Dong, Qiang Hu, Xiaofei Xie, Maxime Cordy, Mike Papadakis, Jianjun Zhao

    Abstract: Pre-trained code models lead the era of code intelligence. Many models have been designed with impressive performance recently. However, one important problem, data augmentation for code data that automatically helps developers prepare training data lacks study in the field of code learning. In this paper, we introduce a general data augmentation framework, GenCode, to enhance the training of code… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  5. arXiv:2402.04823  [pdf, other

    cs.LG

    How Realistic Is Your Synthetic Data? Constraining Deep Generative Models for Tabular Data

    Authors: Mihaela Cătălina Stoian, Salijona Dyrmishi, Maxime Cordy, Thomas Lukasiewicz, Eleonora Giunchiglia

    Abstract: Deep Generative Models (DGMs) have been shown to be powerful tools for generating tabular data, as they have been increasingly able to capture the complex distributions that characterize them. However, to generate realistic synthetic data, it is often not enough to have a good approximation of their distribution, as it also requires compliance with constraints that encode essential background know… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024

  6. arXiv:2311.04503  [pdf, other

    cs.LG

    Constrained Adaptive Attacks: Realistic Evaluation of Adversarial Examples and Robust Training of Deep Neural Networks for Tabular Data

    Authors: Thibault Simonetto, Salah Ghamizi, Antoine Desjardins, Maxime Cordy, Yves Le Traon

    Abstract: State-of-the-art deep learning models for tabular data have recently achieved acceptable performance to be deployed in industrial settings. However, the robustness of these models remains scarcely explored. Contrary to computer vision, there is to date no realistic protocol to properly evaluate the adversarial robustness of deep tabular models due to intrinsic properties of tabular data such as ca… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  7. arXiv:2309.05381  [pdf, other

    cs.SE cs.AI

    Hazards in Deep Learning Testing: Prevalence, Impact and Recommendations

    Authors: Salah Ghamizi, Maxime Cordy, Yuejun Guo, Mike Papadakis, And Yves Le Traon

    Abstract: Much research on Machine Learning testing relies on empirical studies that evaluate and show their potential. However, in this context empirical results are sensitive to a number of parameters that can adversely impact the results of the experiments and potentially lead to wrong conclusions (Type I errors, i.e., incorrectly rejecting the Null Hypothesis). To this end, we survey the related literat… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  8. arXiv:2308.01314  [pdf, other

    cs.LG cs.SE stat.ML

    Evaluating the Robustness of Test Selection Methods for Deep Neural Networks

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Wei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Testing deep learning-based systems is crucial but challenging due to the required time and labor for labeling collected raw data. To alleviate the labeling effort, multiple test selection methods have been proposed where only a subset of test data needs to be labeled while satisfying testing requirements. However, we observe that such methods with reported promising results are only evaluated und… ▽ More

    Submitted 29 July, 2023; originally announced August 2023.

    Comments: 12 pages

  9. arXiv:2306.01250  [pdf, other

    cs.SE

    Active Code Learning: Benchmarking Sample-Efficient Training of Code Models

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon

    Abstract: The costly human effort required to prepare the training data of machine learning (ML) models hinders their practical development and usage in software engineering (ML4Code), especially for those with limited budgets. Therefore, efficiently training models of code with less human effort has become an emergent problem. Active learning is such a technique to address this issue that allows developers… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 12 pages, ongoing work

  10. arXiv:2305.15587  [pdf, other

    cs.CL cs.AI

    How do humans perceive adversarial text? A reality check on the validity and naturalness of word-based adversarial attacks

    Authors: Salijona Dyrmishi, Salah Ghamizi, Maxime Cordy

    Abstract: Natural Language Processing (NLP) models based on Machine Learning (ML) are susceptible to adversarial attacks -- malicious algorithms that imperceptibly modify input text to force models into making incorrect predictions. However, evaluations of these attacks ignore the property of imperceptibility or study it under limited settings. This entails that adversarial perturbations would not pass any… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  11. arXiv:2304.02688  [pdf, other

    cs.LG cs.CV stat.ML

    Going Further: Flatness at the Rescue of Early Stopping for Adversarial Example Transferability

    Authors: Martin Gubri, Maxime Cordy, Yves Le Traon

    Abstract: Transferability is the property of adversarial examples to be misclassified by other models than the surrogate model for which they were crafted. Previous research has shown that early stopping the training of the surrogate model substantially increases transferability. A common hypothesis to explain this is that deep neural networks (DNNs) first learn robust features, which are more generic, thus… ▽ More

    Submitted 20 February, 2024; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Version 2: originally submitted in April 2023 and revised in February 2024

  12. arXiv:2303.06808  [pdf, other

    cs.SE cs.AI

    Boosting Source Code Learning with Data Augmentation: An Empirical Study

    Authors: Zeming Dong, Qiang Hu, Yuejun Guo, Zhenya Zhang, Maxime Cordy, Mike Papadakis, Yves Le Traon, Jianjun Zhao

    Abstract: The next era of program understanding is being propelled by the use of machine learning to solve software problems. Recent studies have shown surprising results of source code learning, which applies deep neural networks (DNNs) to various critical software tasks, e.g., bug detection and clone detection. This success can be greatly attributed to the utilization of massive high-quality training data… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

  13. arXiv:2303.05213  [pdf, other

    cs.SE

    ACoRe: Automated Goal-Conflict Resolution

    Authors: Luiz Carvalho, Renzo Degiovanni, Matìas Brizzio, Maxime Cordy, Nazareno Aguirre, Yves Le Traon, Mike Papadakis

    Abstract: System goals are the statements that, in the context of software requirements specification, capture how the software should behave. Many times, the understanding of stakeholders on what the system should do, as captured in the goals, can lead to different problems, from clearly contradicting goals, to more subtle situations in which the satisfaction of some goals inhibits the satisfaction of othe… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  14. arXiv:2302.10594  [pdf, other

    cs.SE

    The Importance of Discerning Flaky from Fault-triggering Test Failures: A Case Study on the Chromium CI

    Authors: Guillaume Haben, Sarra Habchi, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Flaky tests are tests that pass and fail on different executions of the same version of a program under test. They waste valuable developer time by making developers investigate false alerts (flaky test failures). To deal with this problem, many prediction methods that identify flaky tests have been proposed. While promising, the actual utility of these methods remains unclear since they have not… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  15. arXiv:2302.02907  [pdf, other

    cs.CV cs.CR cs.LG

    GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

    Authors: Salah Ghamizi, Jingfeng Zhang, Maxime Cordy, Mike Papadakis, Masashi Sugiyama, Yves Le Traon

    Abstract: While leveraging additional training data is well established to improve adversarial robustness, it incurs the unavoidable cost of data collection and the heavy computation to train models. To mitigate the costs, we propose Guided Adversarial Training (GAT), a novel adversarial training technique that exploits auxiliary tasks under a limited set of training data. Our approach extends single-task m… ▽ More

    Submitted 25 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  16. arXiv:2301.12284  [pdf, other

    cs.SE

    Assertion Inferring Mutants

    Authors: Aayush Garg, Renzo Degiovanni, Facundo Molina, Mike Papadakis, Nazareno Aguirre, Maxime Cordy, Yves Le Traon

    Abstract: Specification inference techniques aim at (automatically) inferring a set of assertions that capture the exhibited software behaviour by generating and filtering assertions through dynamic test executions and mutation testing. Although powerful, such techniques are computationally expensive due to a large number of assertions, test cases and mutated versions that need to be executed. To overcome t… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

  17. arXiv:2212.08130  [pdf, other

    eess.IV cs.CV cs.LG

    On Evaluating Adversarial Robustness of Chest X-ray Classification: Pitfalls and Best Practices

    Authors: Salah Ghamizi, Maxime Cordy, Michail Papadakis, Yves Le Traon

    Abstract: Vulnerability to adversarial attacks is a well-known weakness of Deep Neural Networks. While most of the studies focus on natural images with standardized benchmarks like ImageNet and CIFAR, little research has considered real world applications, in particular in the medical domain. Our research shows that, contrary to previous claims, robustness of chest x-ray classification is much harder to eva… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  18. arXiv:2210.03123  [pdf, other

    cs.LG cs.AI

    On the Effectiveness of Hybrid Pooling in Mixup-Based Graph Learning for Language Processing

    Authors: Zeming Dong, Qiang Hu, Zhenya Zhang, Yuejun Guo, Maxime Cordy, Mike Papadakis, Yves Le Traon, Jianjun Zhao

    Abstract: Graph neural network (GNN)-based graph learning has been popular in natural language and programming language processing, particularly in text and source code classification. Typically, GNNs are constructed by incorporating alternating layers which learn transformations of graph node features, along with graph pooling layers that use graph pooling operators (e.g., Max-pooling) to effectively reduc… ▽ More

    Submitted 21 May, 2024; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted by Journal of Systems and Software (JSS) 2024

  19. arXiv:2210.03003  [pdf, other

    cs.SE cs.AI

    MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation

    Authors: Zeming Dong, Qiang Hu, Yuejun Guo, Maxime Cordy, Mike Papadakis, Zhenya Zhang, Yves Le Traon, Jianjun Zhao

    Abstract: Inspired by the great success of Deep Neural Networks (DNNs) in natural language processing (NLP), DNNs have been increasingly applied in source code analysis and attracted significant attention from the software engineering community. Due to its data-driven nature, a DNN model requires massive and high-quality labeled training data to achieve expert-level performance. Collecting such data is ofte… ▽ More

    Submitted 10 January, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted by SANER 2023

  20. arXiv:2207.13129  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity

    Authors: Martin Gubri, Maxime Cordy, Mike Papadakis, Yves Le Traon, Koushik Sen

    Abstract: We propose transferability from Large Geometric Vicinity (LGV), a new technique to increase the transferability of black-box adversarial attacks. LGV starts from a pretrained surrogate model and collects multiple weight sets from a few additional training epochs with a constant and high learning rate. LGV exploits two geometric properties that we relate to transferability. First, models that belon… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  21. arXiv:2207.11018   

    cs.SE

    Learning from what we know: How to perform vulnerability prediction using noisy historical data

    Authors: Aayush Garg, Renzo Degiovanni, Matthieu Jimenez, Maxime Cordy, Mike Papadakis, Yves LeTraon

    Abstract: Vulnerability prediction refers to the problem of identifying system components that are most likely to be vulnerable. Typically, this problem is tackled by training binary classifiers on historical data. Unfortunately, recent research has shown that such approaches underperform due to the following two reasons: a) the imbalanced nature of the problem, and b) the inherently noisy historical data,… ▽ More

    Submitted 25 July, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Please do not consider this new version of the article for citations. The article (with its previous versions) is already available here: arXiv:2012.11701

  22. arXiv:2207.10942  [pdf, other

    cs.SE cs.AI

    Aries: Efficient Testing of Deep Neural Networks via Labeling-Free Accuracy Estimation

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Deep learning (DL) plays a more and more important role in our daily life due to its competitive performance in industrial application domains. As the core of DL-enabled systems, deep neural networks (DNNs) need to be carefully evaluated to ensure the produced models match the expected requirements. In practice, the \emph{de facto standard} to assess the quality of DNNs in the industry is to check… ▽ More

    Submitted 3 February, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: accepted to ICSE'23, preprint version

  23. arXiv:2207.10143  [pdf, other

    cs.SE

    What Made This Test Flake? Pinpointing Classes Responsible for Test Flakiness

    Authors: Sarra Habchi, Guillaume Haben, Jeongju Sohn, Adriano Franci, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Flaky tests are defined as tests that manifest non-deterministic behaviour by passing and failing intermittently for the same version of the code. These tests cripple continuous integration with false alerts that waste developers' time and break their trust in regression testing. To mitigate the effects of flakiness, both researchers and industrial experts proposed strategies and tools to detect a… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted at the 38th IEEE International Conference on Software Maintenance and Evolution (ICSME)

  24. arXiv:2206.05480  [pdf, ps, other

    cs.SE cs.AI

    CodeS: Towards Code Model Generalization Under Distribution Shift

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Distribution shift has been a longstanding challenge for the reliable deployment of deep learning (DL) models due to unexpected accuracy degradation. Although DL has been becoming a driving force for large-scale source code analysis in the big code era, limited progress has been made on distribution shift analysis and benchmarking for source code tasks. To fill this gap, this paper initiates to pr… ▽ More

    Submitted 4 February, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: accepted by ICSE'23-NIER

  25. arXiv:2205.08809  [pdf

    cs.SE

    Software Fairness: An Analysis and Survey

    Authors: Ezekiel Soremekun, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: In the last decade, researchers have studied fairness as a software property. In particular, how to engineer fair software systems? This includes specifying, designing, and validating fairness properties. However, the landscape of works addressing bias as a software engineering concern is unclear, i.e., techniques and studies that analyze the fairness properties of learning-based software. In this… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  26. arXiv:2204.04220  [pdf, other

    cs.LG cs.AI cs.SE

    Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment

    Authors: Qiang Hu, Yuejun Guo, Maxime Cordy, Xiaofei Xie, Wei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Deep Neural Networks (DNNs) have gained considerable attention in the past decades due to their astounding performance in different applications, such as natural language modeling, self-driving assistance, and source code understanding. With rapid exploration, more and more complex DNN architectures have been proposed along with huge pre-trained model parameters. The common way to use such DNN mod… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 12 pages

  27. arXiv:2204.03994  [pdf, other

    cs.LG cs.AI cs.SE

    LaF: Labeling-Free Model Selection for Automated Deep Neural Network Reusing

    Authors: Qiang Hu, Yuejun Guo, Maxime Cordy, Xiaofei Xie, Mike Papadakis, Yves Le Traon

    Abstract: Applying deep learning to science is a new trend in recent years which leads DL engineering to become an important problem. Although training data preparation, model architecture design, and model training are the normal processes to build DL models, all of them are complex and costly. Therefore, reusing the open-sourced pre-trained model is a practical way to bypass this hurdle for developers. Gi… ▽ More

    Submitted 20 January, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: 22 pages

  28. arXiv:2202.03277  [pdf, other

    cs.LG cs.CR

    On The Empirical Effectiveness of Unrealistic Adversarial Hardening Against Realistic Adversarial Attacks

    Authors: Salijona Dyrmishi, Salah Ghamizi, Thibault Simonetto, Yves Le Traon, Maxime Cordy

    Abstract: While the literature on security attacks and defense of Machine Learning (ML) systems mostly focuses on unrealistic adversarial examples, recent research has raised concern about the under-explored field of realistic adversarial attacks and their implications on the robustness of real-world systems. Our paper paves the way for a better understanding of adversarial robustness against realistic atta… ▽ More

    Submitted 21 May, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: S&P 2023

  29. arXiv:2112.04919  [pdf, ps, other

    cs.SE

    A Qualitative Study on the Sources, Impacts, and Mitigation Strategies of Flaky Tests

    Authors: Sarra Habchi, Guillaume Haben, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Test flakiness forms a major testing concern. Flaky tests manifest non-deterministic outcomes that cripple continuous integration and lead developers to investigate false alerts. Industrial reports indicate that on a large scale, the accrual of flaky tests breaks the trust in test suites and entails significant computational cost. To alleviate this, practitioners are constrained to identify flaky… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

  30. arXiv:2112.02542  [pdf, other

    cs.LG cs.AI

    Robust Active Learning: Sample-Efficient Training of Robust Deep Learning Models

    Authors: Yuejun Guo, Qiang Hu, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Active learning is an established technique to reduce the labeling cost to build high-quality machine learning models. A core component of active learning is the acquisition function that determines which data should be selected to annotate. State-of-the-art acquisition functions -- and more largely, active learning techniques -- have been designed to maximize the clean performance (e.g. accuracy)… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: 10 pages

  31. arXiv:2112.01218  [pdf, other

    cs.SE

    GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses

    Authors: Wei Ma, Mengjie Zhao, Ezekiel Soremekun, Qiang Hu, Jie Zhang, Mike Papadakis, Maxime Cordy, Xiaofei Xie, Yves Le Traon

    Abstract: Code embedding is a keystone in the application of machine learning on several Software Engineering (SE) tasks. To effectively support a plethora of SE tasks, the embedding needs to capture program syntax and semantics in a way that is generic. To this end, we propose the first self-supervised pre-training approach (called GraphCode2Vec) which produces task-agnostic embedding of lexical and progra… ▽ More

    Submitted 21 January, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  32. arXiv:2112.01156  [pdf, other

    cs.AI cs.LG

    A Unified Framework for Adversarial Attack and Defense in Constrained Feature Space

    Authors: Thibault Simonetto, Salijona Dyrmishi, Salah Ghamizi, Maxime Cordy, Yves Le Traon

    Abstract: The generation of feasible adversarial examples is necessary for properly assessing models that work in constrained feature space. However, it remains a challenging task to enforce constraints into attacks that were designed for computer vision. We propose a unified framework to generate feasible adversarial examples that satisfy given domain constraints. Our framework can handle both linear and n… ▽ More

    Submitted 3 May, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  33. arXiv:2111.03382  [pdf, other

    cs.SE

    Discerning Legitimate Failures From False Alerts: A Study of Chromium's Continuous Integration

    Authors: Guillaume Haben, Sarra Habchi, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Flakiness is a major concern in Software testing. Flaky tests pass and fail for the same version of a program and mislead developers who spend time and resources investigating test failures only to discover that they are false alerts. In practice, the defacto approach to address this concern is to rerun failing tests hoping that they would pass and manifest as false alerts. Nonetheless, completely… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

  34. arXiv:2110.15053  [pdf, other

    cs.LG cs.AI cs.CV

    Adversarial Robustness in Multi-Task Learning: Promises and Illusions

    Authors: Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Vulnerability to adversarial attacks is a well-known weakness of Deep Neural networks. While most of the studies focus on single-task neural networks with computer vision datasets, very little research has considered complex multi-task models that are common in real applications. In this paper, we evaluate the design choices that impact the robustness of multi-task deep learning networks. We provi… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  35. arXiv:2109.12838  [pdf, other

    cs.LG cs.CR cs.CV

    MUTEN: Boosting Gradient-Based Adversarial Attacks via Mutant-Based Ensembles

    Authors: Yuejun Guo, Qiang Hu, Maxime Cordy, Michail Papadakis, Yves Le Traon

    Abstract: Deep Neural Networks (DNNs) are vulnerable to adversarial examples, which causes serious threats to security-critical applications. This motivated much research on providing mechanisms to make models more robust against adversarial attacks. Unfortunately, most of these defenses, such as gradient masking, are easily overcome through different attack means. In this paper, we propose MUTEN, a low-cos… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  36. Automated Repair of Unrealisable LTL Specifications Guided by Model Counting

    Authors: Matías Brizzio, Maxime Cordy, Mike Papadakis, César Sánchez, Nazareno Aguirre, Renzo Degiovanni

    Abstract: The reactive synthesis problem consists of automatically producing correct-by-construction operational models of systems from high-level formal specifications of their behaviours. However, specifications are often unrealisable, meaning that no system can be synthesised from the specification. To deal with this problem, we present AuRUS, a search-based approach to repair unrealisable Linear-Time Te… ▽ More

    Submitted 14 April, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

  37. arXiv:2104.07441  [pdf, other

    cs.SE

    On the Use of Mutation in Injecting Test Order-Dependency

    Authors: Sarra Habchi, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Background: Test flakiness is identified as a major issue that compromises the regression testing process of complex software systems. Flaky tests manifest non-deterministic behaviour, send confusing signals to developers, and break their trust in test suites. Both industrial reports and research studies highlighted the negative impact of flakiness on software quality and developers' productivity.… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  38. arXiv:2012.11701  [pdf, other

    cs.CR cs.SE

    Learning from What We Know: How to Perform Vulnerability Prediction using Noisy Historical Data

    Authors: Aayush Garg, Renzo Degiovanni, Matthieu Jimenez, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Vulnerability prediction refers to the problem of identifying system components that are most likely to be vulnerable. Typically, this problem is tackled by training binary classifiers on historical data. Unfortunately, recent research has shown that such approaches underperform due to the following two reasons: a) the imbalanced nature of the problem, and b) the inherently noisy historical data,… ▽ More

    Submitted 25 July, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: The article was accepted in Empirical Software Engineering (EMSE) on July 02, 2022

  39. Influence-Driven Data Poisoning in Graph-Based Semi-Supervised Classifiers

    Authors: Adriano Franci, Maxime Cordy, Martin Gubri, Mike Papadakis, Yves Le Traon

    Abstract: Graph-based Semi-Supervised Learning (GSSL) is a practical solution to learn from a limited amount of labelled data together with a vast amount of unlabelled data. However, due to their reliance on the known labels to infer the unknown labels, these algorithms are sensitive to data quality. It is therefore essential to study the potential threats related to the labelled data, more specifically, la… ▽ More

    Submitted 11 May, 2022; v1 submitted 14 December, 2020; originally announced December 2020.

  40. IBIR: Bug Report driven Fault Injection

    Authors: Ahmed Khanfir, Anil Koyuncu, Mike Papadakis, Maxime Cordy, Tegawendé F. Bissyandé, Jacques Klein, Yves Le Traon

    Abstract: Much research on software engineering and software testing relies on experimental studies based on fault injection. Fault injection, however, is not often relevant to emulate real-world software faults since it "blindly" injects large numbers of faults. It remains indeed challenging to inject few but realistic faults that target a particular functionality in a program. In this work, we introduce I… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

  41. arXiv:2011.05074  [pdf, other

    cs.LG stat.ML

    Efficient and Transferable Adversarial Examples from Bayesian Neural Networks

    Authors: Martin Gubri, Maxime Cordy, Mike Papadakis, Yves Le Traon, Koushik Sen

    Abstract: An established way to improve the transferability of black-box evasion attacks is to craft the adversarial examples on an ensemble-based surrogate to increase diversity. We argue that transferability is fundamentally related to uncertainty. Based on a state-of-the-art Bayesian Deep Learning technique, we propose a new method to efficiently build a surrogate by sampling approximately from the poste… ▽ More

    Submitted 18 June, 2022; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: Accepted at UAI 2022

  42. arXiv:2006.07087  [pdf, other

    cs.CY physics.soc-ph q-bio.PE

    Data-driven Simulation and Optimization for Covid-19 Exit Strategies

    Authors: Salah Ghamizi, Renaud Rwemalika, Lisa Veiber, Maxime Cordy, Tegawende F. Bissyande, Mike Papadakis, Jacques Klein, Yves Le Traon

    Abstract: The rapid spread of the Coronavirus SARS-2 is a major challenge that led almost all governments worldwide to take drastic measures to respond to the tragedy. Chief among those measures is the massive lockdown of entire countries and cities, which beyond its global economic impact has created some deep social and psychological tensions within populations. While the adopted mitigation measures (incl… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  43. arXiv:2001.02941  [pdf, other

    cs.SE

    Killing Stubborn Mutants with Symbolic Execution

    Authors: Thierry Titcheu Chekam, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: We introduce SeMu, a Dynamic Symbolic Execution technique that generates test inputs capable of killing stubborn mutants (killable mutants that remain undetected after a reasonable amount of testing). SeMu aims at mutant propagation (triggering erroneous states to the program output) by incrementally searching for divergent program behaviours between the original and the mutant versions. We model… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

  44. arXiv:1912.03197  [pdf, other

    cs.SE

    FlakiMe: Laboratory-Controlled Test Flakiness Impact Assessment. A Case Study on Mutation Testing and Program Repair

    Authors: Maxime Cordy, Renaud Rwemalika, Mike Papadakis, Mark Harman

    Abstract: Much research on software testing makes an implicit assumption that test failures are deterministic such that they always witness the presence of the same defects. However, this assumption is not always true because some test failures are due to so-called flaky tests, i.e., tests with non-deterministic outcomes. Unfortunately, flaky tests have major implications for testing and test-dependent acti… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

  45. arXiv:1912.01487  [pdf, other

    cs.CR cs.LG

    Adversarial Embedding: A robust and elusive Steganography and Watermarking technique

    Authors: Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: We propose adversarial embedding, a new steganography and watermarking technique that embeds secret information within images. The key idea of our method is to use deep neural networks for image classification and adversarial attacks to embed secret information within images. Thus, we use the attacks to embed an encoding of the message within images and the related deep neural network outputs to e… ▽ More

    Submitted 14 November, 2019; originally announced December 2019.

  46. arXiv:1904.13195  [pdf, other

    cs.LG cs.SE stat.ML

    Test Selection for Deep Learning Systems

    Authors: Wei Ma, Mike Papadakis, Anestis Tsakmalis, Maxime Cordy, Yves Le Traon

    Abstract: Testing of deep learning models is challenging due to the excessive number and complexity of computations involved. As a result, test data selection is performed manually and in an ad hoc way. This raises the question of how we can automatically select candidate test data to test deep learning models. Recent research has focused on adapting test selection metrics from code-based software testing (… ▽ More

    Submitted 30 April, 2019; originally announced April 2019.

  47. arXiv:1904.04612  [pdf, other

    cs.LG cs.CV

    Automated Search for Configurations of Deep Neural Network Architectures

    Authors: Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Deep Neural Networks (DNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view DNNs as configurable systems and propose an end-to-end framework that allows the configuration, evaluation and automated search for DNN architectures. Therefore, our contribution is threefold. First, we model the… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  48. State Machine Flattening: Mapping Study and Assessment

    Authors: Xavier Devroey, Gilles Perrouin, Maxime Cordy, Axel Legay, Pierre-Yves Schobbens, Patrick Heymans

    Abstract: State machine formalisms equipped with hierarchy and parallelism allow to compactly model complex system behaviours. Such models can then be transformed into executable code or inputs for model-based testing and verification techniques. Generated artifacts are mostly flat descriptions of system behaviour. \emph{Flattening} is thus an essential step of these transformations. To assess the importanc… ▽ More

    Submitted 21 March, 2014; originally announced March 2014.

  49. arXiv:1311.1343  [pdf, other

    cs.SE

    Verification for Reliable Product Lines

    Authors: Maxime Cordy, Patrick Heymans, Pierre-Yves Schobbens, Amir Molzam Sharifloo, Carlo Ghezzi, Axel Legay

    Abstract: Many product lines are critical, and therefore reliability is a vital part of their requirements. Reliability is a probabilistic property. We therefore propose a model for feature-aware discrete-time Markov chains as a basis for verifying probabilistic properties of product lines, including reliability. We compare three verification techniques: The enumerative technique uses PRISM, a state-of-the-… ▽ More

    Submitted 6 November, 2013; originally announced November 2013.

  50. Towards Statistical Prioritization for Software Product Lines Testing

    Authors: Xavier Devroey, Maxime Cordy, Gilles Perrouin, Pierre-Yves Schobbens, Axel Legay, Patrick Heymans

    Abstract: Software Product Lines (SPL) are inherently difficult to test due to the combinatorial explosion of the number of products to consider. To reduce the number of products to test, sampling techniques such as combinatorial interaction testing have been proposed. They usually start from a feature model and apply a coverage criterion (e.g. pairwise feature interaction or dissimilarity) to generate trac… ▽ More

    Submitted 23 January, 2014; v1 submitted 9 October, 2013; originally announced October 2013.

    Comments: Extended version published at VaMoS '14 (http://dx.doi.org/10.1145/2556624.2556635)