Skip to main content

Showing 1–7 of 7 results for author: Crosas, M

Searching in archive cs. Search in all archives.
.
  1. Packaging research artefacts with RO-Crate

    Authors: Stian Soiland-Reyes, Peter Sefton, Mercè Crosas, Leyla Jael Castro, Frederik Coppens, José M. Fernández, Daniel Garijo, Björn Grüning, Marco La Rosa, Simone Leo, Eoghan Ó Carragáin, Marc Portier, Ana Trisovic, RO-Crate Community, Paul Groth, Carole Goble

    Abstract: An increasing number of researchers support reproducibility by including pointers to and descriptions of datasets, software and methods in their publications. However, scientific articles may be ambiguous, incomplete and difficult to process by automated systems. In this paper we introduce RO-Crate, an open, community-driven, and lightweight approach to packaging research artefacts along with thei… ▽ More

    Submitted 6 December, 2021; v1 submitted 14 August, 2021; originally announced August 2021.

    Comments: 44 pages. Accepted for Data Science

    ACM Class: H.1.1; H.3.2

    Journal ref: Data Science 2022

  2. arXiv:2103.12793  [pdf, other

    cs.SE cs.DL

    A large-scale study on research code quality and execution

    Authors: Ana Trisovic, Matthew K. Lau, Thomas Pasquier, Mercè Crosas

    Abstract: This article presents a study on the quality and execution of research code from publicly-available replication datasets at the Harvard Dataverse repository. Research code is typically created by a group of scientists and published together with academic papers to facilitate research transparency and reproducibility. For this study, we define ten questions to address aspects impacting research rep… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: 30 pages

  3. arXiv:2005.02985  [pdf, other

    cs.DL cs.SE

    Advancing computational reproducibility in the Dataverse data repository platform

    Authors: Ana Trisovic, Philip Durbin, Tania Schlatter, Gustavo Durand, Sonia Barbosa, Danny Brooke, Mercè Crosas

    Abstract: Recent reproducibility case studies have raised concerns showing that much of the deposited research has not been reproducible. One of their conclusions was that the way data repositories store research data and code cannot fully facilitate reproducibility due to the absence of a runtime environment needed for the code execution. New specialized reproducibility tools provide cloud-based computatio… ▽ More

    Submitted 16 June, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 6 pages, 5 figures

  4. arXiv:1905.08674  [pdf

    cs.CY cs.DL

    Software Citation Implementation Challenges

    Authors: Daniel S. Katz, Daina Bouquin, Neil P. Chue Hong, Jessica Hausman, Catherine Jones, Daniel Chivvis, Tim Clark, Mercè Crosas, Stephan Druskat, Martin Fenner, Tom Gillespie, Alejandra Gonzalez-Beltran, Morane Gruenpeter, Ted Habermann, Robert Haines, Melissa Harrison, Edwin Henneken, Lorraine Hwang, Matthew B. Jones, Alastair A. Kelly, David N. Kennedy, Katrin Leinweber, Fernando Rios, Carly B. Robinson, Ilian Todorov , et al. (2 additional authors not shown)

    Abstract: The main output of the FORCE11 Software Citation working group (https://www.force11.org/group/software-citation-working-group) was a paper on software citation principles (https://doi.org/10.7717/peerj-cs.86) published in September 2016. This paper laid out a set of six high-level principles for software citation (importance, credit and attribution, unique identification, persistence, accessibilit… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  5. arXiv:1803.05808  [pdf, other

    cs.DL

    Sharing and Preserving Computational Analyses for Posterity with encapsulator

    Authors: Thomas Pasquier, Matthew K. Lau, Xueyuan Han, Elizabeth Fong, Barbara S. Lerner, Emery Boose, Merce Crosas, Aaron M. Ellison, Margo Seltzer

    Abstract: Open data and open-source software may be part of the solution to science's "reproducibility crisis", but they are insufficient to guarantee reproducibility. Requiring minimal end-user expertise, encapsulator creates a "time capsule" with reproducible code in a self-contained computational environment. encapsulator provides end-users with a fully-featured desktop environment for reproducible resea… ▽ More

    Submitted 6 May, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: 11 pages, 6 figures

  6. arXiv:1506.05632  [pdf

    cs.CY cs.DL

    An Open Science Platform for the Next Generation of Data

    Authors: Latanya Sweeney, Merce Crosas

    Abstract: Imagine an online work environment where researchers have direct and immediate access to myriad data sources and tools and data management resources, useful throughout the research lifecycle. This is our vision for the next generation of the Dataverse Network: an Open Science Platform (OSP). For the first time, researchers would be able to seamlessly access and create primary and derived data from… ▽ More

    Submitted 18 June, 2015; originally announced June 2015.

    Comments: 32 pages, 8 figures

    ACM Class: H.3.1; H.3.2; H.3.3; H.3.5; H.3.6; H.3.7; H.2.7; H.2.8

  7. arXiv:1401.2134  [pdf, other

    cs.DL astro-ph.IM cs.CY

    10 Simple Rules for the Care and Feeding of Scientific Data

    Authors: Alyssa Goodman, Alberto Pepe, Alexander W. Blocker, Christine L. Borgman, Kyle Cranmer, Mercè Crosas, Rosanne Di Stefano, Yolanda Gil, Paul Groth, Margaret Hedstrom, David W. Hogg, Vinay Kashyap, Ashish Mahabal, Aneta Siemiginowska, Aleksandra Slavkovic

    Abstract: This article offers a short guide to the steps scientists can take to ensure that their data and associated analyses continue to be of value and to be recognized. In just the past few years, hundreds of scholarly papers and reports have been written on questions of data sharing, data provenance, research reproducibility, licensing, attribution, privacy, and more, but our goal here is not to review… ▽ More

    Submitted 9 January, 2014; originally announced January 2014.

    Comments: Accepted in PLOS Computational Biology. This paper was written collaboratively, on the web, in the open, using Authorea. The living version of this article, which includes sources and history, is available at http://www.authorea.com/3410/