Skip to main content

Showing 1–21 of 21 results for author: Costa, D E

  1. arXiv:2407.11955  [pdf, other

    cs.SE

    A Transformer-based Approach for Augmenting Software Engineering Chatbots Datasets

    Authors: Ahmad Abdellatif, Khaled Badran, Diego Elias Costa, Emad Shihab

    Abstract: Background: The adoption of chatbots into software development tasks has become increasingly popular among practitioners, driven by the advantages of cost reduction and acceleration of the software development process. Chatbots understand users' queries through the Natural Language Understanding component (NLU). To yield reasonable performance, NLUs have to be trained with extensive, high-quality… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2401.16340  [pdf, other

    cs.SE

    The role of library versions in Developer-ChatGPT conversations

    Authors: Rachna Raj, Diego Elias Costa

    Abstract: The latest breakthroughs in large language models (LLM) have empowered software development tools, such as ChatGPT, to aid developers in complex tasks. Developers use ChatGPT to write code, review code changes, and even debug their programs. In these interactions, ChatGPT often recommends code snippets that depend on external libraries. However, code from libraries changes over time, invalidating… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  3. arXiv:2401.13667  [pdf, ps, other

    cs.SE

    Predicting the Impact of Crashes Across Release Channels

    Authors: Suhaib Mujahid, Diego Elias Costa, Marco Castelluccio

    Abstract: Software maintenance faces a persistent challenge with crash bugs, especially across diverse release channels catering to distinct user bases. Nightly builds, favoured by enthusiasts, often reveal crashes that are cheaper to fix but may differ significantly from those in stable releases. In this paper, we emphasize the need for a data-driven solution to predict the impact of crashes happening on n… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  4. arXiv:2311.07786  [pdf, other

    cs.SE cs.LG

    Predicting the First Response Latency of Maintainers and Contributors in Pull Requests

    Authors: SayedHassan Khatoonabadi, Ahmad Abdellatif, Diego Elias Costa, Emad Shihab

    Abstract: The success of a Pull Request (PR) depends on the responsiveness of the maintainers and the contributor during the review process. Being aware of the expected waiting times can lead to better interactions and managed expectations for both the maintainers and the contributor. In this paper, we propose a machine-learning approach to predict the first response latency of the maintainers following the… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Manuscript submitted to IEEE Transactions on Software Engineering (TSE)

  5. arXiv:2310.07847  [pdf, other

    cs.SE

    Dependency Practices for Vulnerability Mitigation

    Authors: Abbas Javan Jafari, Diego Elias Costa, Ahmad Abdellatif, Emad Shihab

    Abstract: Relying on dependency packages accelerates software development, but it also increases the exposure to security vulnerabilities that may be present in dependencies. While developers have full control over which dependency packages (and which version) they use, they have no control over the dependencies of their dependencies. Such transitive dependencies, which often amount to a greater number than… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  6. arXiv:2308.08667  [pdf, other

    cs.SE

    Where to Go Now? Finding Alternatives for Declining Packages in the npm Ecosystem

    Authors: Suhaib Mujahid, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab

    Abstract: Software ecosystems (e.g., npm, PyPI) are the backbone of modern software developments. Developers add new packages to ecosystems every day to solve new problems or provide alternative solutions, causing obsolete packages to decline in their importance to the community. Packages in decline are reused less overtime and may become less frequently maintained. Thus, developers usually migrate their de… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  7. arXiv:2307.13777  [pdf, other

    cs.SE cs.AI

    An Empirical Study on Bugs Inside PyTorch: A Replication Study

    Authors: Sharon Chee Yin Ho, Vahid Majdinasab, Mohayeminul Islam, Diego Elias Costa, Emad Shihab, Foutse Khomh, Sarah Nadi, Muhammad Raza

    Abstract: Software systems are increasingly relying on deep learning components, due to their remarkable capability of identifying complex data patterns and powering intelligent behaviour. A core enabler of this change in software development is the availability of easy-to-use deep learning libraries. Libraries like PyTorch and TensorFlow empower a large variety of intelligent systems, offering a multitude… ▽ More

    Submitted 1 August, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  8. arXiv:2305.18150  [pdf, other

    cs.SE cs.LG

    Understanding the Helpfulness of Stale Bot for Pull-based Development: An Empirical Study of 20 Large Open-Source Projects

    Authors: SayedHassan Khatoonabadi, Diego Elias Costa, Suhaib Mujahid, Emad Shihab

    Abstract: Pull Requests (PRs) that are neither progressed nor resolved clutter the list of PRs, making it difficult for the maintainers to manage and prioritize unresolved PRs. To automatically track, follow up, and close such inactive PRs, Stale bot was introduced by GitHub. Despite its increasing adoption, there are ongoing debates on whether using Stale bot alleviates or exacerbates the problem of inacti… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Manuscript submitted to ACM Transactions on Software Engineering and Methodology

  9. arXiv:2305.15675  [pdf, other

    cs.SE

    Dependency Update Strategies and Package Characteristics

    Authors: Abbas Javan Jafari, Diego Elias Costa, Emad Shihab, Rabe Abdalkareem

    Abstract: Managing project dependencies is a key maintenance issue in software development. Developers need to choose an update strategy that allows them to receive important updates and fixes while protecting them from breaking changes. Semantic Versioning was proposed to address this dilemma but many have opted for more restrictive or permissive alternatives. This empirical study explores the association… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  10. arXiv:2302.00042  [pdf, other

    cs.SE

    Diversity Awareness in Software Engineering Participant Research

    Authors: Riya Dutta, Diego Elias Costa, Emad Shihab, Tanja Tajmel

    Abstract: Diversity and inclusion are necessary prerequisites for shaping technological innovation that benefits society as a whole. A common indicator of diversity consideration is the representation of different social groups among software engineering (SE) researchers, developers, and students. However, this does not necessarily entail that diversity is considered in the SE research itself. In our stud… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

  11. arXiv:2212.02614  [pdf

    cs.LG cs.AI cs.CY

    Can Ensembling Pre-processing Algorithms Lead to Better Machine Learning Fairness?

    Authors: Khaled Badran, Pierre-Olivier Côté, Amanda Kolopanis, Rached Bouchoucha, Antonio Collante, Diego Elias Costa, Emad Shihab, Foutse Khomh

    Abstract: As machine learning (ML) systems get adopted in more critical areas, it has become increasingly crucial to address the bias that could occur in these systems. Several fairness pre-processing algorithms are available to alleviate implicit biases during model training. These algorithms employ different concepts of fairness, often leading to conflicting strategies with consequential trade-offs betwee… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  12. arXiv:2208.05860  [pdf, other

    cs.SE

    Achievement Unlocked: A Case Study on Gamifying DevOps Practices in Industry

    Authors: Patrick Ayoup, Diego Elias Costa, Emad Shihab

    Abstract: Gamification is the use of game elements such as points, leaderboards, and badges in a non-game context to encourage a desired behavior from individuals interacting with an environment. Recently, gamification has found its way into software engineering contexts as a means to promote certain activities to practitioners. Previous studies investigated the use of gamification to promote the adoption o… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  13. arXiv:2207.14711  [pdf, other

    cs.SE

    Not All Dependencies are Equal: An Empirical Study on Production Dependencies in NPM

    Authors: Jasmine Latendresse, Suhaib Mujahid, Diego Elias Costa, Emad Shihab

    Abstract: Modern software systems are often built by leveraging code written by others in the form of libraries and packages to accelerate their development. While there are many benefits to using third-party packages, software projects often become dependent on a large number of software packages. Consequently, developers are faced with the difficult challenge of maintaining their project dependencies by k… ▽ More

    Submitted 29 August, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: The 37th IEEE/ACM International Conference on Automated Software Engineering

  14. arXiv:2206.10358  [pdf, other

    cs.SE

    Open Source Software: An Approach to Controlling Usage and Risk in Application Ecosystems

    Authors: Stan Zajdel, Diego Elias Costa, Hafedh Mili

    Abstract: The Open Source Software movement has been growing exponentially for a number of years with no signs of slowing. Driving this growth is the widespread availability of libraries and frameworks that provide many functionalities. Developers are saving time and money incorporating this functionality into their applications resulting in faster more feature-rich releases. Despite the growing success and… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  15. arXiv:2110.15447  [pdf, other

    cs.SE cs.LG

    On Wasted Contributions: Understanding the Dynamics of Contributor-Abandoned Pull Requests

    Authors: SayedHassan Khatoonabadi, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab

    Abstract: Pull-based development has enabled numerous volunteers to contribute to open-source projects with fewer barriers. Nevertheless, a considerable amount of pull requests (PRs) with valid contributions are abandoned by their contributors, wasting the effort and time put in by both the contributors and maintainers. To better understand the underlying dynamics of contributor-abandoned PRs, we conduct a… ▽ More

    Submitted 20 May, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Manuscript accepted for publication in ACM Transactions on Software Engineering and Methodology (TOSEM)

  16. arXiv:2107.13320  [pdf, other

    cs.DC cs.SE

    A Case Study on the Stability of Performance Tests for Serverless Applications

    Authors: Simon Eismann, Diego Elias Costa, Lizhi Liao, Cor-Paul Bezemer, Weiyi Shang, André van Hoorn, Samuel Kounev

    Abstract: Context. While in serverless computing, application resource management and operational concerns are generally delegated to the cloud provider, ensuring that serverless applications meet their performance requirements is still a responsibility of the developers. Performance testing is a commonly used performance assessment practice; however, it traditionally requires visibility of the resource env… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    ACM Class: C.4; D.2

  17. Towards Using Package Centrality Trend to Identify Packages in Decline

    Authors: Suhaib Mujahid, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab, Mohamed Aymen Saied, Bram Adams

    Abstract: Due to their increasing complexity, today's software systems are frequently built by leveraging reusable code in the form of libraries and packages. Software ecosystems (e.g., npm) are the primary enablers of this code reuse, providing developers with a platform to share their own and use others' code. These ecosystems evolve rapidly: developers add new packages every day to solve new problems or… ▽ More

    Submitted 19 October, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: Accepted in the Special Issue on Collaboration and Innovation Dynamics in Software Ecosystems

    ACM Class: D.2.0; D.2.13

    Journal ref: IEEE Transactions on Engineering Management Journal (TEM), 2021

  18. arXiv:2012.02640  [pdf, other

    cs.SE cs.AI cs.CL

    A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering

    Authors: Ahmad Abdellatif, Khaled Badran, Diego Elias Costa, Emad Shihab

    Abstract: Chatbots are envisioned to dramatically change the future of Software Engineering, allowing practitioners to chat and inquire about their software projects and interact with different services using natural language. At the heart of every chatbot is a Natural Language Understanding (NLU) component that enables the chatbot to understand natural language input. Recently, many NLU platforms were prov… ▽ More

    Submitted 22 July, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Journal ref: IEEE Transactions on Software Engineering (TSE), 2021

  19. Dependency Smells in JavaScript Projects

    Authors: Abbas Javan Jafari, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab, Nikolaos Tsantalis

    Abstract: Dependency management in modern software development poses many challenges for developers who wish to stay up to date with the latest features and fixes whilst ensuring backwards compatibility. Project maintainers have opted for varied, and sometimes conflicting, approaches for maintaining their dependencies. Opting for unsuitable approaches can introduce bugs and vulnerabilities into the project,… ▽ More

    Submitted 18 August, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Keywords: Dependency smells, Software ecosystems, Dependency management, npm

    Journal ref: IEEE Transactions on Software Engineering (TSE), 2021

  20. arXiv:2009.09019  [pdf, other

    cs.SE

    On the Threat of npm Vulnerable Dependencies in Node.js Applications

    Authors: Mahmoud Alfadel, Diego Elias Costa, Mouafak Mokhallalati, Emad Shihab, Bram Adams

    Abstract: Software vulnerabilities have a large negative impact on the software systems that we depend on daily. Reports on software vulnerabilities always paint a grim picture, with some reports showing that 83% of organizations depend on vulnerable software. However, our experience leads us to believe that, in the grand scheme of things, these software vulnerabilities may have less impact than what is rep… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

  21. arXiv:2006.09973  [pdf, other

    cs.SE cs.LG cs.PL

    Breaking Type Safety in Go: An Empirical Study on the Usage of the unsafe Package

    Authors: Diego Elias Costa, Suhaib Mujahid, Rabe Abdalkareem, Emad Shihab

    Abstract: A decade after its first release, the Go programming language has become a major programming language in the development landscape. While praised for its clean syntax and C-like performance, Go also contains a strong static type-system that prevents arbitrary type casting and arbitrary memory access, making the language type-safe by design. However, to give developers the possibility of implementi… ▽ More

    Submitted 22 July, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Journal ref: IEEE Transactions on Software Engineering (TSE), 2021