Showing 1–12 of 12 results for author: Di Cosmo, R

  1. arXiv:2501.10415

    cs.DL cs.IR cs.LG cs.SE

    Making Software FAIR: A machine-assisted workflow for the research software lifecycle

    Authors: Petr Knoth, Laurent Romary, Patrice Lopez, Roberto Di Cosmo, Pavel Smrz, Tomasz Umerle, Melissa Harrison, Alain Monteil, Matteo Cancellieri, David Pride

    Abstract: A key issue hindering discoverability, attribution and reusability of open research software is that its existence often remains hidden within the manuscript of research papers. For these resources to become first-class bibliographic records, they first need to be identified and subsequently registered with persistent identifiers (PIDs) to be made FAIR (Findable, Accessible, Interoperable and Reus…

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: 5 pages

  2. The Software Heritage Open Science Ecosystem

    Authors: Roberto Di Cosmo, Stefano Zacchiroli

    Abstract: Software Heritage is the largest public archive of software source code and associated development history, as captured by modern version control systems. As of July 2023, it has archived more than 16 billion unique source code files coming from more than 250 million collaborative development projects. In this chapter, we describe the Software Heritage ecosystem, focusing on research and open scie…

    Submitted 16 October, 2023; originally announced October 2023.

    Journal ref: Software Ecosystems, Springer International Publishing, pp.33-61, 2023

  3. Dependency Solving Is Still Hard, but We Are Getting Better at It

    Authors: Pietro Abate, Roberto Di Cosmo, Georgios Gousios, Stefano Zacchiroli

    Abstract: Dependency solving is a hard (NP-complete) problem in all non-trivial component models due to either mutually incompatible versions of the same packages or explicitly declared package conflicts. As such, software upgrade planning needs to rely on highly specialized dependency solvers, lest falling into pitfalls such as incompleteness - a combination of package versions that satisfy dependency constr…

    Submitted 16 November, 2020; originally announced November 2020.

    Journal ref: 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER), Feb 2020, London, Canada. pp.547-551

  4. Archiving and referencing source code with Software Heritage

    Authors: Roberto Di Cosmo

    Abstract: Software, and software source code in particular, is widely used in modern research. It must be properly archived, referenced, described and cited in order to build a stable and long lasting corpus of scientific knowledge. In this article we show how the Software Heritage universal source code archive provides a means to fully address the first two concerns, by archiving seamlessly all publicly avai…

    Submitted 31 March, 2020; originally announced April 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1909.10760

  5. Referencing Source Code Artifacts: a Separate Concern in Software Citation

    Authors: Roberto Di Cosmo, Morane Gruenpeter, Stefano Zacchiroli

    Abstract: Among the entities involved in software citation, software source code requires special attention, due to the role it plays in ensuring scientific reproducibility. To reference source code we need identifiers that are not only unique and persistent, but also support integrity checking intrinsically. Suitable identifiers must guarantee that denoted objects will always stay the same, without r…

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: Computing in Science & Engineering, IEEE, In press

  6. arXiv:1909.10760

    cs.DL

    How to use Software Heritage for archiving and referencing your source code: guidelines and walkthrough

    Authors: Roberto Di Cosmo

    Abstract: Software source code is an essential research output, and many research communities strongly encourage making the source code of the artefact available by archiving it in publicly-accessible long-term archives. Software Heritage is a non profit, long term universal archive specifically designed for software source code, and able to store not only a software artifact, but also its full development h…

    Submitted 24 September, 2019; originally announced September 2019.

  7. arXiv:1906.08076

    cs.SE

    Growth and Duplication of Public Source Code over Time: Provenance Tracking at Scale

    Authors: Guillaume Rousseau, Roberto Di Cosmo, Stefano Zacchiroli

    Abstract: We study the evolution of the largest known corpus of publicly available source code, i.e., the Software Heritage archive (4B unique source code files, 1B commits capturing their development histories across 50M software projects). On such corpus we quantify the growth rate of original, never-seen-before source code files and commits. We find the growth rates to be exponential over a period of mor…

    Submitted 19 June, 2019; originally announced June 2019.

  8. Attributing and Referencing (Research) Software: Best Practices and Outlook from Inria

    Authors: Pierre Alliez, Roberto Di Cosmo, Benjamin Guedj, Alain Girault, Mohand-Said Hacid, Arnaud Legrand, Nicolas P. Rougier

    Abstract: Software is a fundamental pillar of modern scientific research, not only in computer science, but actually across all fields and disciplines. However, there is a lack of adequate means to cite and reference software, for many reasons. An obvious first reason is software authorship, which can range from a single developer to a whole team, and can even vary in time. The panorama is even more complex than…

    Submitted 25 November, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

    Journal ref: Computing in Science & Engineering 2020

  9. arXiv:1110.1354

    cs.SE

    Sources of Inter-package Conflicts in Debian

    Authors: Cyrille Artho, Roberto Di Cosmo, Kuniyasu Suzaki, Stefano Zacchiroli

    Abstract: Inter-package conflicts require the presence of two or more packages in a particular configuration, and thus tend to be harder to detect and localize than conventional (intra-package) defects. Hundreds of such inter-package conflicts go undetected by the normal testing and distribution process until they are later reported by a user. The reason for this is that current meta-data is not fine-graine…

    Submitted 6 October, 2011; originally announced October 2011.

    Comments: LoCoCo 2011 International Workshop on Logics for Component Configuration, Perugia, Italy (2011)

  10. Aligning component upgrades

    Authors: Roberto Di Cosmo, Olivier Lhomme, Claude Michel

    Abstract: Modern software systems, like GNU/Linux distributions or Eclipse-based development environment, are often deployed by selecting components out of large component repositories. Maintaining such software systems by performing component upgrades is a complex task, and the users need to have an expressive preferences language at their disposal to specify the kind of upgrades they are interested in. R…

    Submitted 1 September, 2011; originally announced September 2011.

    Comments: In Proceedings LoCoCo 2011, arXiv:1108.6097

    Journal ref: EPTCS 65, 2011, pp. 1-11

  11. arXiv:0905.4226

    cs.SE

    Strong Dependencies between Software Components

    Authors: Pietro Abate, Jaap Boender, Roberto Di Cosmo, Stefano Zacchiroli

    Abstract: Component-based systems often describe context requirements in terms of explicit inter-component dependencies. Studying large instances of such systems, such as free and open source software (FOSS) distributions, in terms of declared dependencies between packages is appealing. It is however also misleading when the language to express dependencies is as expressive as boolean formulae, which is oft…

    Submitted 26 May, 2009; originally announced May 2009.

  12. Package upgrades in FOSS distributions: details and challenges

    Authors: Roberto Di Cosmo, Stefano Zacchiroli, Paulo Trezentos

    Abstract: The upgrade problems faced by Free and Open Source Software distributions have characteristics not easily found elsewhere. We describe the structure of packages and their role in the upgrade process. We show that state of the art package managers have shortcomings inhibiting their ability to cope with frequent upgrade failures. We survey current countermeasures to such failures, argue that they…

    Submitted 10 February, 2009; originally announced February 2009.

    Journal ref: Proceedings of the 1st International Workshop on Hot Topics in Software Upgrades, Nashville, Tennessee, United States (2008)