Toward Long-Term and Archivable Reproducibility
dc.contributor.author | Akhlaghi, Mohammad | |
dc.contributor.author | Infante-Sainz, Raul | |
dc.contributor.author | Roukema, Boudewijn F. | |
dc.contributor.author | Khellat, Mohammadreza | |
dc.contributor.author | Valls-Gabaud, David | |
dc.contributor.author | Baena-Galle, Roberto | |
dc.date | 2021 | |
dc.date.accessioned | 2021-12-21T13:50:28Z | |
dc.date.available | 2021-12-21T13:50:28Z | |
dc.identifier.issn | 1521-9615 | |
dc.identifier.uri | https://reunir.unir.net/handle/123456789/12238 | |
dc.description.abstract | Analysis pipelines commonly use high-level technologies that are popular when created, but are unlikely to be readable, executable, or sustainable in the long term. A set of criteria is introduced to address this problem: completeness (no execution requirement beyond a minimal Unix-like operating system, no administrator privileges, no network connection, and storage primarily in plain text); modular design; minimal complexity; scalability; verifiable inputs and outputs; version control; linking analysis with narrative; and free and open-source software. As a proof of concept, we introduce "Maneage" (managing data lineage), enabling cheap archiving, provenance extraction, and peer verification that has been tested in several research publications. We show that longevity is a realistic requirement that does not sacrifice immediate or short-term reproducibility. The caveats (with proposed solutions) are then discussed and we conclude with the benefits for the various stakeholders. This article is itself a Maneage'd project (project commit 313db0b). Appendices: Two comprehensive appendices that review the longevity of existing solutions are available as supplementary "Web extras" in the IEEE Computer Society Digital Library at http://doi.ieeecomputersociety.org/10.1109/MCSE.2021.3072860. Reproducibility: All products are available in zenodo.4913277; the Git history of this paper's source is at git.maneage.org/paper-concept.git, which is also archived in Software Heritage: swh:1:dir:33fea87068c1612daf011f161b97787b9a0df39f. Clicking on the SWHIDs in the digital format will provide more "context" for the same content. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Computing in Science & Engineering | es_ES |
dc.relation.ispartofseries | ;vol. 23, no. 3 | |
dc.relation.uri | https://ieeexplore.ieee.org/document/9403875 | es_ES |
dc.rights | openAccess | es_ES |
dc.subject | software | es_ES |
dc.subject | containers | es_ES |
dc.subject | kernel | es_ES |
dc.subject | libraries | es_ES |
dc.subject | tools | es_ES |
dc.subject | virtual machining | es_ES |
dc.subject | buildings | es_ES |
dc.subject | workflow management | es_ES |
dc.subject | systems | es_ES |
dc.subject | database management | es_ES |
dc.subject | information technology and systems | es_ES |
dc.subject | knowledge and data engineering tools and techniques | es_ES |
dc.subject | computers in other systems | es_ES |
dc.subject | computer applications | es_ES |
dc.subject | WOS(2) | es_ES |
dc.subject | Scopus | es_ES |
dc.title | Toward Long-Term and Archivable Reproducibility | es_ES |
dc.type | article | es_ES |
reunir.tag | ~ARI | es_ES |
dc.identifier.doi | http://dx.doi.org/10.1109/MCSE.2021.3072860 |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |