Toward Long-Term and Archivable Reproducibility
Authors: Akhlaghi, Mohammad; Infante-Sainz, Raul; Roukema, Boudewijn F.; Khellat, Mohammadreza; Valls-Gabaud, David; Baena-Galle, Roberto
Date: 2021
Journal / publisher: Computing in Science & Engineering
Item type: article
Web address: https://ieeexplore.ieee.org/document/9403875
Abstract:
Analysis pipelines commonly use high-level technologies that are popular when created, but are unlikely to be readable, executable, or sustainable in the long term. A set of criteria is introduced to address this problem: completeness (no execution requirement beyond a minimal Unix-like operating system, no administrator privileges, no network connection, and storage primarily in plain text); modular design; minimal complexity; scalability; verifiable inputs and outputs; version control; linking analysis with narrative; and free and open-source software. As a proof of concept, we introduce "Maneage" (managing data lineage), which enables cheap archiving, provenance extraction, and peer verification, and which has been tested in several research publications. We show that longevity is a realistic requirement that does not sacrifice immediate or short-term reproducibility. The caveats (with proposed solutions) are then discussed, and we conclude with the benefits for the various stakeholders. This article is itself a Maneage'd project (project commit 313db0b).
Appendices: Two comprehensive appendices that review the longevity of existing solutions are available as supplementary "Web extras" in the IEEE Computer Society Digital Library at http://doi.ieeecomputersociety.org/10.1109/MCSE.2021.3072860.
Reproducibility: All products are available in zenodo.4913277. The Git history of this paper's source is at git.maneage.org/paper-concept.git, which is also archived in Software Heritage: swh:1:dir:33fea87068c1612daf011f161b97787b9a0df39f. Clicking on the SWHIDs in the digital format will provide more "context" for the same content.
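The "verifiable inputs and outputs" criterion listed in the abstract is commonly met by recording a checksum for each file and comparing it on every later run. The following is a minimal sketch of that idea only, assuming SHA-256 as the hash; the helper names `sha256_hex` and `matches_recorded` and the sample bytes are hypothetical, and this is not taken from Maneage's own implementation.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of `data` as a lowercase hex string."""
    return hashlib.sha256(data).hexdigest()


def matches_recorded(data: bytes, recorded_hex: str) -> bool:
    """True if `data` hashes to the checksum recorded earlier."""
    return sha256_hex(data) == recorded_hex.strip().lower()


# Hypothetical input: in a real project these bytes would be read from
# an input dataset or an output product of the analysis.
sample_input = b"column_a,column_b\n1,2\n"

# Record the checksum once (for example, in a plain-text file under
# version control), then compare against it on every later run.
recorded = sha256_hex(sample_input)

unchanged_ok = matches_recorded(sample_input, recorded)
tampered_ok = matches_recorded(sample_input + b"3,4\n", recorded)
```

Storing the recorded checksum as plain text alongside the analysis source keeps the check consistent with the plain-text and version-control criteria named in the abstract.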
Related items
Showing items related by title, author, or subject.
- Galaxy And Mass Assembly (GAMA): extended intragroup light in a group at z=0.2 from deep Hyper Suprime-Cam images
  Martinez-Lombilla, Cristina; Brough, Sarah; Montes, Mireia; Baena-Galle, Roberto; Akhlaghi, Mohammad; Infante-Sainz, Raul; Driver, Simon P.; Holwerda, Benne W.; Pimbblet, Kevin A.; Robotham, Aaron S. G. (Monthly Notices of the Royal Astronomical Society, 2023)
  We present a pilot study to assess the potential of Hyper Suprime-Cam Public Data Release 2 (HSC-PDR2) images for the analysis of extended faint structures within groups of galaxies. We examine the intragroup light (IGL) ...
- Stellar Population Properties in the Stellar Streams around SPRC047
  Laine, Seppo; Martínez-Delgado, David; Webb, Kristi; Akhlaghi, Mohammad; Baena-Gallé, Roberto; Paudel, Sanjaya; Stein, Michael; Erkal, Denis (The Astrophysical Journal, 2024)
  We have investigated the properties (e.g., age, metallicity) of the stellar populations of a ringlike tidal stellar stream (or streams) around the edge-on galaxy SPRC047 (z = 0.031) using spectral energy distribution (SED) ...
- Star-image Centering with Deep Learning: HST/WFPC2 Images
  Casetti-Dinescu, Dana I.; Girard, Terrence M.; Baena-Galle, Roberto; Martone, Max; Schwendemann, Kate (Publications of the Astronomical Society of the Pacific, 2023)
  A deep learning (DL) algorithm is built and tested for its ability to determine centers of star images in HST/WFPC2 exposures, in filters F555W and F814W. These archival observations hold great potential for proper-motion ...





