Author ORCID Identifier
Mahmoud Jahanshahi https://orcid.org/0000-0003-4408-1183
David Reid https://orcid.org/0000-0002-0973-7127
Audris Mockus https://orcid.org/0000-0002-7987-7598
Document Type
Article
Publication Date
2025
DOI
https://doi.org/10.1145/3715907
Abstract
In Open Source Software, resources of any project are open for reuse by introducing dependencies or copying the resource itself. In contrast to dependency-based reuse, the infrastructure to systematically support copy-based reuse appears to be entirely missing. Our aim is to enable future research and tool development to increase efficiency and reduce the risks of copy-based reuse. We seek a better understanding of such reuse by measuring its prevalence and identifying factors affecting the propensity to reuse. To identify reused artifacts and trace their origins, our method exploits World of Code infrastructure. We begin with a set of theory-derived factors related to the propensity to reuse, sample instances of different reuse types, and survey developers to better understand their intentions. Our results indicate that copy-based reuse is common, with many developers being aware of it when writing code. The propensity for a file to be reused varies greatly among languages and between source code and binary files, consistently decreasing over time. Files introduced by popular projects are more likely to be reused, but at least half of reused resources originate from “small” and “medium” projects. Developers had various reasons for reuse but were generally positive about using a package manager.
Recommended Citation
Jahanshahi, Mahmoud; Reid, David; and Mockus, Audris, "Beyond Dependencies: The Role of Copy-Based Reuse in Open Source Software Development" (2025). Electrical Engineering and Computer Science Publications and Other Works.
https://trace.tennessee.edu/utk-comppubs/11