<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.156917.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Case Study</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>A libraries reproducibility hackathon: connecting students to university research and testing the longevity of published code</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 3 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Griego</surname>
                        <given-names>Chasz</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2051-7491</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Scotti</surname>
                        <given-names>Kristen</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Terveen</surname>
                        <given-names>Elizabeth</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Chan</surname>
                        <given-names>Joseph</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Sheng</surname>
                        <given-names>Daisy</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Gonz&#x00e1;lez-Espinoza</surname>
                        <given-names>Alfredo</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Warren</surname>
                        <given-names>Christopher</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-9881-682X</uri>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>University Libraries, Carnegie Mellon University, Pittsburgh, Pennsylvania, 15213, USA</aff>
                <aff id="a2">
                    <label>2</label>Carnegie Mellon University School of Computer Science, Pittsburgh, Pennsylvania, 15213, USA</aff>
                <aff id="a3">
                    <label>3</label>Carnegie Mellon University Department of English, Pittsburgh, Pennsylvania, 15213, USA</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:cgriego@andrew.cmu.edu">cgriego@andrew.cmu.edu</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>9</day>
                <month>9</month>
                <year>2025</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2024</year>
            </pub-date>
            <volume>13</volume>
            <elocation-id>1305</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>4</day>
                    <month>9</month>
                    <year>2025</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Griego C et al.</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/13-1305/pdf"/>
            <abstract>
                <sec>
                    <title>Background</title>
                    <p>Reproducibility is a basis of scientific integrity, yet it remains a significant challenge across disciplines in computational science. This reproducibility crisis is now being met with an Open Science movement, which has risen to prominence within the scientific community and academic libraries especially. At the Carnegie Mellon University Libraries, the Open Science and Data Collaborations (OSDC) Program promotes Open Science practices with resources, services, and events. Hosting hackathons in academic libraries may show promise for furthering such efforts.</p>
                </sec>
                <sec>
                    <title>Methods</title>
                    <p>To address the need for reproducible computational research and promote Open Science within the community, members of the OSDC Program organized a single-day hackathon centered around reproducibility. Partnering with a faculty researcher in English and Digital Humanities, we invited community members to reuse Python code and data from a research publication deposited to Harvard Dataverse. We also published these materials as a compute capsule in Code Ocean that participants could also access. Additionally, we investigated ways to use ChatGPT to troubleshoot errors from rerunning this code.</p>
                </sec>
                <sec>
                    <title>Results</title>
                    <p>Three students from the School of Computer Science participated in this hackathon. Accessing materials from Harvard Dataverse, these students found success reproducing most of the data visualizations, but they required some manual setup and modifications to address depreciated libraries used in the code. Alternatively, we found Code Ocean to be a highly accessible option, free from depreciation risk. Last, ChatGPT also aided in finding and addressing the same roadblocks to successfully reproduce the same figures as the participating students.</p>
                </sec>
                <sec>
                    <title>Conclusions</title>
                    <p>This hackathon allowed several students an opportunity to interact with and evaluate real research outputs, testing the reproducibility of computational data analyses. Partnering with faculty opened opportunities to improve open research materials. This case study outlines one approach for other academic libraries to highlight challenges that face reproducibility in an interactive setting.</p>
                </sec>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Reproducibility</kwd>
                <kwd>Hackathon</kwd>
                <kwd>Academic Libraries</kwd>
                <kwd>Open Science</kwd>
                <kwd>Metascience</kwd>
                <kwd>Digital Humanities</kwd>
                <kwd>Computational Research</kwd>
                <kwd>Software</kwd>
                <kwd>Community Engagement</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>This version of the article includes further context and details that we agreed to add following peer review. Note that the abstract has been modified to follow the formatting for a case study article, which we hope allows readers to follow our process as they find the article. Also, we have added further points in the conclusion section, which we believe strengthens the message derived from the outcomes of this work. Finally, Figure 1 was modified to give readers a more informative graphic that will again aid in explaining our process.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>Reproducibility in scientific research is a highly regarded concept that measures both the credibility and soundness of rigorous studies.
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>,
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup> In the context of research, the term reproducibility is often loosely defined or used interchangeably with replicability. The National Academy of Sciences convened in 2019 to develop a definition
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>
                </sup> and differentiate between reproducibility, replicability, and generalizability.
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>,
                    <xref ref-type="bibr" rid="ref3">3</xref>
                </sup> Reproducibility, which is generally consistent with 
                <italic toggle="yes">computational</italic> reproducibility, is defined as consistent computational outcomes by utilizing identical input data, procedures, methods, code, and analytical conditions. Replicability is defined as uniform outcomes across research projects designed to address the same scientific question, where each project collects its own data. And finally, generalizability is defined as how well findings of a study can be applied to different contexts or populations beyond the original.</p>
            <p>Modern research continues to face multiple challenges surrounding reproducibility. The adoptable skills and practices that promote reproducibility may vary between fields,
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup> awareness or guidance is often lacking,
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>,
                    <xref ref-type="bibr" rid="ref6">6</xref>,
                    <xref ref-type="bibr" rid="ref7">7</xref>
                </sup> and the time and resources devoted to teaching and practicing reproducibility may be deficient.
                <sup>
                    <xref ref-type="bibr" rid="ref8">8</xref>
                </sup> Computation research, in particular, faces barriers to reproducibility
                <sup>
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup> with software inconsistencies, changing versions and dependencies,
                <sup>
                    <xref ref-type="bibr" rid="ref10">10</xref>
                </sup> and insufficient documentation of datasets
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup> or methods.
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>
                </sup> For example, Liu and Salganik
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup> found that the complexity of employed computational methods impacts reproducibility. The more common methods in a given field may ease reproducibility, as they are familiar and well-documented, but more advanced methods can pose greater challenges, requiring specialized knowledge, environments, and resources that are not as readily available or standardized.</p>
            <p>Investing in tools and practices around computational reproducibility enhances the transparency and reliability of scientific findings by enabling independent verification of results and fostering further research on established findings. Such a shift in computational research practice is important since it increases research credibility and efficiency and avoids wasted efforts in attempting to build on unreliable results.
                <sup>
                    <xref ref-type="bibr" rid="ref13">13</xref>,
                    <xref ref-type="bibr" rid="ref14">14</xref>
                </sup> The scientific community is increasingly advocating for the use of various tools and practices aimed at improving reproducibility of computational research,
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>,
                    <xref ref-type="bibr" rid="ref15">15</xref>
                </sup> including adopting open-source software,
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref16">16</xref>
                </sup> conducting version control (e.g., Git and GitHub),
                <sup>
                    <xref ref-type="bibr" rid="ref17">17</xref>
                </sup> encapsulating computing environments (e.g., Docker and Guix),
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>,
                    <xref ref-type="bibr" rid="ref18">18</xref>,
                    <xref ref-type="bibr" rid="ref19">19</xref>
                </sup> and following community-driven standards and frameworks. Reproducible journals like Image Processing On Line (IPOL)
                <sup>
                    <xref ref-type="bibr" rid="ref20">20</xref>
                </sup> and repositories like Zenodo and Software Heritage are further supporting this shifting practice, allowing further sharing and archiving of research software and source code.</p>
            <p>In general, scientific communities are upping advocacy through the Open Science movement.
                <sup>
                    <xref ref-type="bibr" rid="ref21">21</xref>
                </sup> Open Science promotes transparency of research process, data, methodologies, and outputs; enhancing accessibility and usability by the broader scientific community.
                <sup>
                    <xref ref-type="bibr" rid="ref22">22</xref>,
                    <xref ref-type="bibr" rid="ref23">23</xref>
                </sup> Reproducibility is fundamental to open science,
                <sup>
                    <xref ref-type="bibr" rid="ref23">23</xref>,
                    <xref ref-type="bibr" rid="ref24">24</xref>
                </sup> by ensuring verifiable scientific claims to the community; made possible via open sharing of datasets, methodology, code, and using open source software. Overall, alignment with reproducible practices moves the scientific community towards a more collaborative environment.
                <sup>
                    <xref ref-type="bibr" rid="ref22">22</xref>
                </sup>
            </p>
            <p>At the Carnegie Mellon University Libraries, a dedicated Open Science &amp; Data Collaborations Program (OSDC) was established in 2018, which highlights the Libraries&#x2019; resources and tools that foster open, transparent, and reproducible research.
                <sup>
                    <xref ref-type="bibr" rid="ref25">25</xref>
                </sup> In this program, librarians and functional specialists serve as advocates, consultants, and collaborators for researchers accessing the existing Open Science infrastructure in the Libraries, including CMU&#x2019;s institutional repository, KiltHub,
                <sup>
                    <xref ref-type="bibr" rid="ref26">26</xref>
                </sup> and research data management services.
                <sup>
                    <xref ref-type="bibr" rid="ref27">27</xref>
                </sup> In addition, the Libraries have held multiple iterations of the Open Science Symposium,
                <sup>
                    <xref ref-type="bibr" rid="ref28">28</xref>
                </sup> which gathered global researchers and thought leaders in academia, industry, and publishing to discuss the ways that Open Science has transformed research. The OSDC program also created roles for STEM PhDs to transition into Open Science and librarianship through an Open Science Postdoctoral Associate position, with Chasz Griego and Kristen Scotti being the first and second candidates in this role, respectively. Through this postdoctoral role, Griego considered ways to explore reproducibility integration and evaluation among researchers and students at CMU.
                <sup>
                    <xref ref-type="bibr" rid="ref29">29</xref>
                </sup>
            </p>
            <p>Motivated by the challenges surrounding reproducibility and the growing awareness of Open Science, members of the CMU Libraries and OSDC Program organized a single-day hackathon centered around reproducibility. Hackathons, which are time-bound events for groups to collectively solve or explore a technical problem, often find libraries as a suitable host as a neutral and welcoming space on campus that harbors learning, discovery, and collaboration.
                <sup>
                    <xref ref-type="bibr" rid="ref30">30</xref>
                </sup> The CMU Libraries has effectively hosted multiple hackathons, including Biohackathons and a recent AI Literacy Resource Hackathon.
                <sup>
                    <xref ref-type="bibr" rid="ref31">31</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref33">33</xref>
                </sup>
            </p>
            <p>Libraries and hackathons surrounding reproducibility is also not a novel concept. The Center for Digital Scholarship at Leiden University held ReprohackNL in 2019, where participants attempted to reproduce published results using authors&#x2019; provided data and code in a single day event.
                <sup>
                    <xref ref-type="bibr" rid="ref34">34</xref>
                </sup>
            </p>
            <p>At the ReprohackNL event, the organizers observed how participants faced challenges with insufficient documentation, data behind a paywall, and problems with code and proprietary software. The CMU Libraries Reproducibility Hackathon aimed to offer a similar model, but here, we focused on the reproducibility of a study with shared data and code around a subject that could create interdisciplinary collaboration across students and researchers at CMU.</p>
        </sec>
        <sec id="sec2">
            <title>Purpose of the event</title>
            <p>The CMU Libraries Reproducibility Hackathon was an event that allowed any participant the opportunity to reproduce research results produced by a university professor. The purpose of such an event is to increase awareness of reproducibility as a realistic piece of the research life cycle, and demonstrate the outcomes of research scrutiny, which could theoretically be done by anyone. Most are familiar with the concept of reproducibility at a surface level, but many may not actually have a grasp on what that may look like, how it may happen, or how impactful it could be for any study. In an extremely idealistic world, a person with any background, experience, or research interest would have access to every material behind a particular study. These materials, again in an ideal case, would include all resources and guidance for a person to reproduce each result, assuring the findings first-hand. In reality, most research is very distanced from such an idealistic scenario, however, an event like a reproducibility hackathon can actually attempt to measure this distance for researchers willing to participate. For this hackathon, we offered one academic researcher the opportunity to put their research outputs up to scrutiny. Regardless of the outcome, a volunteer researcher helps set an example. If the results are highly reproducible, this presents a recognizable feat, and if the results prove more difficult to reproduce, this presents an opportunity to reflect on ways to improve. Work that is less reproducible does not have to be met with shame or guilt, but a humbling opportunity to learn and do better.</p>
            <p>For the CMU Libraries Reproducibility Hackathon, we teamed up with Christopher Warren, a Professor of English and Associate Department Head in the Dietrich College of Humanities and Social Sciences at CMU. As the subject for a reproducibility assessment, Warren offered the content of a published data analysis of the Oxford Dictionary of National Biography (ODNB),
                <sup>
                    <xref ref-type="bibr" rid="ref35">35</xref>,
                    <xref ref-type="bibr" rid="ref36">36</xref>
                </sup> a collection of biographies for over 60,000 influential figures in British history. Warren&#x2019;s work is credited as an &#x201c;audit&#x201d; of the ODNB,
                <sup>
                    <xref ref-type="bibr" rid="ref37">37</xref>
                </sup> revealing the biases and assumptions hidden beneath the vastness of big data infrastructure. This subject was well aligned with the event, because with an analysis such as Warren&#x2019;s, which scrutinizes the soundness of a massive data collection, the Python code that reveals the findings must also hold up to its own audit, ensuring that changing versions and dependencies do not prevent repeat analyses. With Warren offering his humanities research as an exemplary case, we offer further dialog around inclusivity of all research areas beyond STEM in the Open Science movement.
                <sup>
                    <xref ref-type="bibr" rid="ref38">38</xref>
                </sup>
            </p>
        </sec>
        <sec id="sec3">
            <title>Event execution</title>
            <p>We openly distributed a call for participants to anyone interested from CMU and other institutions in the greater Pittsburgh area. The call noted that knowledge and awareness of scientific programming with Python and/or R was a desired, but not required, condition for participation. Participants submitted their interest and background through a short form and had the option to attend one of two information sessions a week prior to the event. During the information session, participants received a link to Warren&#x2019;s manuscript to provide sufficient time to read into the details of the work. They were asked to not search for any items related to the work, avoiding the risk of early access to data and code posted to a repository.</p>
            <p>The hackathon was scheduled for just one day, with participants and organizers convening in a library space during typical business hours (9am - 5pm). A project page on Open Science Framework (OSF)
                <sup>
                    <xref ref-type="bibr" rid="ref39">39</xref>
                </sup> was created for the event, where participants could access event information and guidance, a copy of Warren&#x2019;s manuscript, and documents with links to access data and code. In addition to the published manuscript, Warren deposited supplementary research outputs to Harvard Dataverse, which includes a Jupyter Notebook, CSV files, Open Refine transformation JSON files, and others.
                <sup>
                    <xref ref-type="bibr" rid="ref36">36</xref>
                </sup> This collection of outputs served as one option for participants to use to attempt to reproduce the study (
                <xref ref-type="fig" rid="f1">
Figure 1</xref>). This option is arguably open, having research outputs deposited in a repository to ensure secure storage, public access, and findability with a DOI, and this option used non-proprietary file formats such as CSV and JSON, and open-source Python code written in notebooks. Such attributes greatly promote reusability once users download each output and ensure that Python and all necessary libraries are installed.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>
Figure 1. </label>
                <caption>
                    <title>Diagram depicting the two options to reproduce research during the hackathon.</title>
                    <p>The first option (top) uses research outputs stored in Harvard Dataverse, and the second option (bottom) uses a capsule in Code Ocean. The end of the diagram (right) shows a reproduced version of 
                        <xref ref-type="fig" rid="f1">Figure 1</xref> from Warren&#x2019;s work.
                        <sup>
                            <xref ref-type="bibr" rid="ref31">31</xref>
                        </sup> Icons attributed to the Noun Project, the Dataverse Project, and Code Ocean.</p>
                </caption>
                <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/187759/160ae7c8-5283-4ed4-afec-3144a01e7399_figure1.gif"/>
            </fig>
            <p>We also provided a second option that aims to greatly facilitate reproducible computational research (
                <xref ref-type="fig" rid="f1">
Figure 1</xref>). The same data and code found in Harvard Dataverse were organized and configured in a Code Ocean capsule.
                <sup>
                    <xref ref-type="bibr" rid="ref35">35</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref40">40</xref>
                </sup> Code Ocean is a platform that allows researchers to archive research data and code along with preserved language and library versions plus dependencies and a metadata record in a self-contained capsule. Capsules are discoverable with a DOI, and outside users can perform reproducible runs that compile and execute code in a virtual environment, producing results that match the original analysis.</p>
        </sec>
        <sec id="sec4">
            <title>Observations and reflections</title>
            <p>The reproducibility hackathon hosted three undergraduate participants, who were each in the first or second year of their degree program in the School of Computer Science with established Python programming experience. Each student is an author of this work, with the following summarizing the notable outcomes, experiences, and contributions of these participants. Some notes are also listed briefly in 
                <xref ref-type="table" rid="T1">
Table 1</xref>. One participant, Elizabeth Terveen, chose the option of downloading the materials from Harvard Dataverse and attempting to run the Jupyter Notebook in a new Python environment working with the Anaconda package manager. To run the code, Terveen had to install four Python libraries (Pandas, Nltk, Matplotlib, and Plotly). One of the existing import statements in the notebook, regarding Plotly, returned an error due to a module depreciation.</p>
            <table-wrap id="T1" orientation="portrait" position="float">
                <label>
Table 1. </label>
                <caption>
                    <title>Technical issues experienced from participants attempting to reproduce results by running the code corresponding to Warren&#x2019;s work.
                        <sup>
                            <xref ref-type="bibr" rid="ref31">31</xref>
                        </sup>
                    </title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Notable issues</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Several Python libraries required installation</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Some import statements needed to be updated because of module depreciation</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">New arguments were required to successfully run a function</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">A function keyword argument name was updated</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">File paths in the code were not consistent with file organization in the repository</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>Of the twenty-five figures produced in the notebook, Terveen noted fourteen figures that were successfully reproduced, but six of these fourteen figures did require some troubleshooting. This included supplying values for arguments in functions that were not specified in the original code, but a resulting error stated as required. The name of a keyword argument related to a Matplotlib function had also become depreciated, which required an update to the original code. Eight figures were not successfully reproduced because of file paths in the code that were not consistent with directory structure Terveen had after downloading from the repository. The reproducibility of two figures, however, werewas not documented during the event.</p>
            <p>A second participant, Joseph Chan, also chose the option to download the materials from Harvard Dataverse. Chan immediately noted that if a &#x201c;requirements.txt&#x201d; file was included with the original deposit, any user trying to reproduce the results would be able to download and install the appropriate packages and versions. Based on Chan&#x2019;s setup, five python libraries and Jupyter needed to be installed, including Pandas, Nltk, Pickle, Numpy, and Matplotlib. Additionally, a plotting library was deprecated, requiring an import statement to be updated. Once again, keyword arguments were depreciated and had to be updated. Two figures did not display upon running and were unable to be solved.</p>
            <p>Chan also created a modified version of the research outputs as a zip file uploaded to OSF (
                <ext-link ext-link-type="uri" xlink:href="https://osf.io/hvbzm">https://osf.io/hvbzm</ext-link>), which contains a requirements.txt file, all necessary data, and all the code from the notebook in a single Python (.py) file. With this, users can simply reproduce the work by executing a shell script from a terminal. Copies of figures that were successfully reproduced were uploaded to OSF (
                <ext-link ext-link-type="uri" xlink:href="https://osf.io/gqk8e/">https://osf.io/gqk8e/</ext-link>), with titles altered with either a note or a name to verify that the figures were original copies (see 
                <xref ref-type="fig" rid="f1">
Figure 1</xref>).</p>
            <p>Overall, this process of reproducing results using files downloaded from a repository produced several challenges with requirements to install packages, understand depreciations for some packages, and troubleshooting changes in syntax from evolving versions of software. Though there was trial and error, participants were able to reproduce a majority of the figures from this study. Additionally, participants provided their own insight to develop an updated collection of research outputs that help others reproduce this work more easily.</p>
        </sec>
        <sec id="sec5">
            <title>Additional outcomes and lessons learned</title>
            <p>Alfredo Gonz&#x00e1;lez-Espinoza, our colleague who participated in the assessment as well, offered an interesting anecdote while pursuing reproducibility via both Harvard Dataverse and Code Ocean (
                <ext-link ext-link-type="uri" xlink:href="https://osf.io/brn6d">https://osf.io/brn6d</ext-link>). In order to run the code downloaded from the repository, he had to install Python, Jupyter, all required libraries, and found the same depreciation errors as other participants; however, he was successful with reproducibility via the Code Ocean capsule. Gonz&#x00e1;lez-Espinoza&#x2019;s scenario was interesting because he used a gaming laptop to reproduce from the repository, which doesn&#x2019;t make a difference when the issue is installing software, but for the capsule he accessed the code via an old Chromebook tablet. This individual experience calls to attention the fact that a platform like Code Ocean may promote accessibility for people that may have limited access to computers, and instead only have access to devices that one can&#x2019;t use to download and install software as easily.</p>
            <p>The hackathon offered an additional opportunity for exploration. Kristen Scotti investigated ways to troubleshoot reproducing the output of the code using generative AI. Python, like many other programming languages, undergoes frequent updates, deprecating functions or changing syntax overtime, making author-provided code more difficult to work with or potentially incompatible with newer versions. Here, Scotti ran the author-provided code through ChatGPT to request updates (
                <ext-link ext-link-type="uri" xlink:href="https://osf.io/r65ev">https://osf.io/r65ev</ext-link>). ChatGPT&#x2019;s programming capabilities include the ability to interpret, debug, and suggest updates to code.
                <sup>
                    <xref ref-type="bibr" rid="ref41">41</xref>
                </sup> ChatGPT updated code successfully for all recreated plots except one, which also provided issues for other participants. The likely culprit was formatting issues due to syntax changes and complexity of the provided code block. A more effective approach might be to ask ChatGPT to convert the code in smaller sections, though this often requires someone experienced in the language to fine-tune both the prompts and the outputs.</p>
            <p>Outside the hackathon setting, we revisited the capsule version of Warren&#x2019;s research to test how easily his work could be adapted within this platform. For instance, figures 28 and 29 in the manuscript have prompted discussion in other sources.
                <sup>
                    <xref ref-type="bibr" rid="ref37">37</xref>,
                    <xref ref-type="bibr" rid="ref42">42</xref>
                </sup> Figure 29, in particular, displays the historical significance of selected women in the ODNB, and the span of living years of these women from 1830-2000 (listed here in 
                <xref ref-type="fig" rid="f2">
Figure 2b</xref>). However, horizontal bars representing a span of years could instead incidentally suggest the number of women belonging to the corresponding historical significance. We addressed this ambiguity by creating a new version of the capsule on Code Ocean
                <sup>
                    <xref ref-type="bibr" rid="ref43">43</xref>
                </sup> where additional cells in the Jupyter Notebook create new versions of these figures which explicitly state the numbers of years each vertical bar spans (
                <xref ref-type="fig" rid="f2">
Figure 2a</xref>).</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>
Figure 2. </label>
                <caption>
                    <title>Adapted version (a) of Figure 29 (b) from Warren&#x2019;s work.
                        <sup>
                            <xref ref-type="bibr" rid="ref31">31</xref>
                        </sup>
                    </title>
                </caption>
                <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/187759/160ae7c8-5283-4ed4-afec-3144a01e7399_figure2.gif"/>
            </fig>
        </sec>
        <sec id="sec6" sec-type="discussion|conclusions">
            <title>Discussion and conclusions</title>
            <p>We have outlined the execution and outcomes of a Reproducibility Hackathon, hosted by CMU Libraries, focusing on the reusability of data and code shared by a faculty member from the university. This event allowed students to interact with real research outputs to learn potentially new skills and fields of study while simultaneously offering their perspective. In reviewing the reproducibility of research code from the participating faculty member, Christopher Warren, students showed that manual setup was needed to rerun research code, and along with that, depreciations required them to write updates and modifications. One participating student sought a way to encapsulate the code so that everything was packaged and shareable, enabling an outside user to properly reproduce the run by executing a single script. Lastly, we attempted to rerun the code while using a ChatGPT API to debug. This option identified and addressed the same errors that student participants met and successfully reproduced the same figures.</p>
            <p>The Reproducibility Hackathon presented just one of countless cases of research code that won&#x2019;t rerun reliable as it. Python code and its many libraries for scientific analysis will update regularly, where over the years, significant changes like those we found will likely happen. Documentation is important to circumvent these changes to code so that external users are confidently aware of the original file organization and hierarchy, versions and dependencies of software libraries, and even potentially the original operating system or computing hardware. Though even with all details noted, an external user may lack all of the resources necessary to fully reproduce the analysis, but the combination of open source licensing, thorough documentation, and full transparency gives these users confident starting points to further research progress.</p>
            <p>Code Ocean presented an alternative to sharing code without manual installations or depreciations. An alternative of Warren&#x2019;s outputs was assembled and published in a reproducible capsule on Code Ocean. This option presented itself as a more accessible way for a user to interact with computational research, as a participating colleague revealed the ease of interacting with the capsule on a tablet device. Further applying this option, we modified Warren&#x2019;s code in Code Ocean to produce an alternative depiction of data visualizations presented in the original work. Doing this offered an example of how a reproducibility-centered platform can promote research as ongoing conversation that openly invites community members to offer constructive insights.</p>
            <p>In addition to Code Ocean, we also promoted the usage of Open Science Framework (OSF) throughout the Reproducibility Hackathon. Both platforms did greatly facilitate access of research documents, data, and code, but we also highlight that these platforms invoke terms of use related to the research data, software, and other content that is stored and manipulated within them. These terms, or the nature of sharing research material on these platforms from such companies, may not always align with the goals or values of individual researchers, teams, or institutions, and discretion is always recommended. These platforms are just a few of the numerous options, including institutional platforms and services, available to researchers to share open and reproducible outputs.</p>
            <p>Hosting a reproducibility hackathon at an academic institution created an open environment for researchers to put their published work to the test, opening themselves to constructive feedback around reproducibility, and allowing students or other researchers to interact with real research artifacts. The CMU Libraries reproducibility hackathon allowed three undergraduate students from the School of Computer Science to interact with a unique study in Digital Humanities. This was particularly meaningful because these students may not have otherwise had the opportunity to interact with real research in this way and see examples of the work done in this discipline. Collaborating with a faculty researcher, like Warren, further sets examples of researchers who not only open up research materials, but embraced opportunities to revisit and improve their work to maintain its relevance and scholarly integrity.</p>
            <p>We initially aimed for a higher number of participants, but we only received interest from eight students, with three being the listed participants here. For future iterations of this event, we will consider other factors to gain interest and retention. The time commitment of a full day may be excessive, considering class schedules. A shorter event, hosted after most classes in the evening could promote higher participation. Furthermore, the subject matter of the research may not be widely interesting across campus, suggesting a more targeted form of marketing, either to individual academic departments or courses. While we stated several benefits from our collaboration with Warren, other interested faculty researchers may have varying abilities to commit for subsequent events at any given time. While our execution was not one-size-fits-all, nor optimum, we have presented our experience here to offer a blueprint for other academic libraries to execute similar events, boost awareness of the threats to preserving research code, and highlight how new platforms eliminate these threats to reproducibility and enhance adaptation along the research lifecycle.</p>
        </sec>
        <sec id="sec7">
            <title>Ethics and consent</title>
            <p>Ethical approval and consent were not required.</p>
        </sec>
        <sec id="sec8">
            <title>Author contributions</title>
            <p>

                <bold>Chasz G:</bold> Conceptualization, Data Curation, Formal Analysis, Methodology, Project Administration, Resources, Software, Supervision, Validation, Writing &#x2013; Original Draft Preparation; 
                <bold>Kristen S:</bold> Conceptualization, Data Curation, Formal Analysis, Resources, Software, Validation, Visualization, Writing &#x2013; Original Draft Preparation; 
                <bold>Elizabeth T:</bold> Data Curation, Formal Analysis, Validation, Visualization; 
                <bold>Joseph C:</bold> Data Curation, Formal Analysis, Software, Validation; 
                <bold>Daisy S:</bold> Data Curation, Visualization; 
                <bold>Alfredo G-E:</bold> Data Curation, Formal Analysis, Validation; 

                <bold>Christopher W:</bold> Conceptualization, Data Curation, Resources, Software, Visualization, Writing &#x2013; Review &amp; Editing</p>
        </sec>
    </body>
    <back>
        <sec id="sec11" sec-type="data-availability">
            <title>Data availability</title>
            <p>Open Science Framework: Reproducibility Hackathon 2024. 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.17605/OSF.IO/AQ69R">https://doi.org/10.17605/OSF.IO/AQ69R</ext-link>.
                <sup>
                    <xref ref-type="bibr" rid="ref39">39</xref>
                </sup>
            </p>
            <p>The project contains the following underlying data:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>All original documents and materials prepared and produced from the event.</p>
                    </list-item>
                </list>
            </p>
            <p>Data are available under the terms of the 
                <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International license</ext-link> (CC-BY 4.0).</p>
            <p>Harvard Dataverse: Replication Data for: Historiography&#x2019;s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB). 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.7910/DVN/D3KFLP">https://doi.org/10.7910/DVN/D3KFLP</ext-link>.
                <sup>
                    <xref ref-type="bibr" rid="ref40">40</xref>
                </sup>
            </p>
            <p>The project contains the following underlying data:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>All data and code supporting the original work that was the focus of the hackathon.</p>
                    </list-item>
                </list>
            </p>
            <p>Data are available under the terms of the 
                <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/publicdomain/zero/1.0/">Creative Commons 1.0 Universal Dead</ext-link> (CC0 1.0)</p>
            <p>Code Ocean: Replication Data for: Historiography&#x2019;s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB) (Version 1). 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.24433/CO.6313661.v1">https://doi.org/10.24433/CO.6313661.v1</ext-link>.
                <sup>
                    <xref ref-type="bibr" rid="ref35">35</xref>
                </sup>
            </p>
            <p>The project contains the following underlying data:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>All data, code, and software environment details supporting the original work that was the focus of the hackathon.</p>
                    </list-item>
                </list>
            </p>
            <p>Data and code are available under the terms of the Creative Commons CC0 licenses with no rights reserved.</p>
            <p>Code Ocean: Replication Data for: Historiography&#x2019;s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB) (Version 2). 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.24433/CO.6313661.v2">https://doi.org/10.24433/CO.6313661.v2</ext-link>.
                <sup>
                    <xref ref-type="bibr" rid="ref43">43</xref>
                </sup>
            </p>
            <p>The project contains the following underlying data:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>All data, code, and software environment details supporting the original work that was the focus of the hackathon as well as appended code that adapts Figures 28 and 29 from the original study.</p>
                    </list-item>
                </list>
            </p>
            <p>Data and code are available under the terms of the Creative Commons CC0 1.0 licenses with no rights reserved.</p>
        </sec>
        <ack>
            <title>Acknowledgements</title>
            <p>We thank Melanie Gainey and Tom Hughes from the CMU Libraries for useful discussions about hackathons and their support during the event.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Buckheit</surname>
                            <given-names>JB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Donoho</surname>
                            <given-names>DL</given-names>
                        </name>
</person-group>:
                    <chapter-title>WaveLab and Reproducible Research.</chapter-title>
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Antoniadis</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Oppenheim</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>, editors.
                    <source>

                        <italic toggle="yes">Wavelets Stat.</italic>
</source>
                    <publisher-loc>New York, NY</publisher-loc>:
                    <publisher-name>Springer</publisher-name>;<year>1995</year>; pp.<fpage>55</fpage>&#x2013;<lpage>81</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-1-4612-2544-7_5</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Barba</surname>
                            <given-names>LA</given-names>
                        </name>
</person-group>:
                    <article-title>Terminologies for reproducible research.</article-title>
                    <source>

                        <italic toggle="yes">ArXiv Prepr. ArXiv180203311.</italic>
</source>
                    <year>2018</year>.</mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <article-title>Reproducibility and Replicability in Science.</article-title>
                    <source>

                        <italic toggle="yes">Natl. Acad. Sci.</italic>
</source>
                    <year>2019</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/books/NBK547537/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Baker</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Baker</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>1,500 scientists lift the lid on reproducibility.</article-title>
                    <source>

                        <italic toggle="yes">Nature.</italic>
</source>
                    <year>2016</year>;<volume>533</volume>:<fpage>452</fpage>&#x2013;<lpage>454</lpage>.
                    <pub-id pub-id-type="doi">10.1038/533452a</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mullane</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Williams</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>Enhancing reproducibility: Failures from Reproducibility Initiatives underline core challenges.</article-title>
                    <source>

                        <italic toggle="yes">Biochem. Pharmacol.</italic>
</source>
                    <year>2017</year>;<volume>138</volume>:<fpage>7</fpage>&#x2013;<lpage>18</lpage>.
                    <pub-id pub-id-type="pmid">28396196</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.bcp.2017.04.008</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kohrs</surname>
                            <given-names>FE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Auer</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bannach-Brown</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Eleven strategies for making reproducible research and open science training the norm at research institutions.</article-title>
                    <source>

                        <italic toggle="yes">elife.</italic>
</source>
                    <year>2023</year>;<volume>12</volume>:<fpage>e89736</fpage>.
                    <pub-id pub-id-type="pmid">37994903</pub-id>
                    <pub-id pub-id-type="doi">10.7554/eLife.89736</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10666927</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rich-Edwards</surname>
                            <given-names>JW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maney</surname>
                            <given-names>DL</given-names>
                        </name>
</person-group>:
                    <article-title>Best practices to promote rigor and reproducibility in the era of sex-inclusive research.</article-title>
                    <source>

                        <italic toggle="yes">elife.</italic>
</source>
                    <year>2023</year>;<volume>12</volume>:<fpage>e90623</fpage>.
                    <pub-id pub-id-type="pmid">37917121</pub-id>
                    <pub-id pub-id-type="doi">10.7554/eLife.90623</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10622144</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="other">
                    <article-title>Six factors affecting reproducibility in life science research and how to handle them.</article-title>
                    <year>n.d.</year>(accessed May 28, 2024).
                    <ext-link ext-link-type="uri" xlink:href="https://www.nature.com/articles/d42473-019-00004-y">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Antunes</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hill</surname>
                            <given-names>DR</given-names>
                        </name>
</person-group>:
                    <article-title>Reproducibility, Replicability and Repeatability: A survey of reproducible research with a focus on high performance computing.</article-title>
                    <source>

                        <italic toggle="yes">Comput. Sci. Rev.</italic>
</source>
                    <year>2024</year>;<volume>53</volume>:<fpage>100655</fpage>.</mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="book">
                    <source>

                        <italic toggle="yes">Implementing Reproducible Research.</italic>
</source>
                    <publisher-name>Routledge CRC Press</publisher-name>;<year>n.d.</year>(accessed May 28, 2024).
                    <ext-link ext-link-type="uri" xlink:href="https://www.routledge.com/Implementing-Reproducible-Research/Stodden-Leisch-Peng/p/book/9780367576172">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>DM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Salganik</surname>
                            <given-names>MJ</given-names>
                        </name>
</person-group>:
                    <article-title>Successes and struggles with computational reproducibility: lessons from the fragile families challenge.</article-title>
                    <source>

                        <italic toggle="yes">Socius.</italic>
</source>
                    <year>2019</year>;<volume>5</volume>:<fpage>2378023119849803</fpage>.
                    <pub-id pub-id-type="pmid">37309413</pub-id>
                    <pub-id pub-id-type="doi">10.1177/2378023119849803</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10260256</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Piccolo</surname>
                            <given-names>SR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Frampton</surname>
                            <given-names>MB</given-names>
                        </name>
</person-group>:
                    <article-title>Tools and techniques for computational reproducibility.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2016</year>;<volume>5</volume>(<issue>1</issue>):<fpage>s13742</fpage>&#x2013;<lpage>s13016</lpage>.
                    <pub-id pub-id-type="doi">10.1186/s13742-016-0135-4</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>RD</given-names>
                        </name>
</person-group>:
                    <article-title>Reproducible Research in Computational Science.</article-title>
                    <source>

                        <italic toggle="yes">Science.</italic>
</source>
                    <year>2011</year>;<volume>334</volume>(<issue>6060</issue>):<fpage>1226</fpage>&#x2013;<lpage>1227</lpage>.
                    <pub-id pub-id-type="doi">10.1126/science.1213847</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ioannidis</surname>
                            <given-names>JPA</given-names>
                        </name>
</person-group>:
                    <article-title>Why Most Published Research Findings Are False.</article-title>
                    <source>

                        <italic toggle="yes">PLoS Med.</italic>
</source>
                    <year>2005</year>;<volume>2</volume>(<issue>8</issue>):<fpage>e124</fpage>.
                    <pub-id pub-id-type="pmid">16060722</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pmed.0020124</pub-id>
                    <pub-id pub-id-type="pmcid">PMC1182327</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Brito</surname>
                            <given-names>JJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Moore</surname>
                            <given-names>JH</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Recommendations to enhance rigor and reproducibility in biomedical research.</article-title>
                    <source>

                        <italic toggle="yes">GigaScience.</italic>
</source>
                    <year>2020</year>;<volume>9</volume>(<issue>6</issue>):<fpage>giaa056</fpage>.
                    <pub-id pub-id-type="pmid">32479592</pub-id>
                    <pub-id pub-id-type="doi">10.1093/gigascience/giaa056</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7263079</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fogel</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>How To Run A Successful Free Software Project-Producing Open Source Software.</article-title>
                    <source>

                        <italic toggle="yes">CreateSpace.</italic>
</source>
                    <year>2009</year>. Accessed: Aug. 26, 2025.
                    <ext-link ext-link-type="uri" xlink:href="https://dl.acm.org/doi/abs/10.5555/1593439">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ram</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>Git can facilitate greater reproducibility and increased transparency in science, Source Code.</article-title>
                    <source>

                        <italic toggle="yes">Biol. Med.</italic>
</source>
                    <year>2013</year>;<volume>8</volume>:<fpage>1</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="doi">10.1186/1751-0473-8-7</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Boettiger</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>An introduction to Docker for reproducible research.</article-title>
                    <source>

                        <italic toggle="yes">ACM SIGOPS Oper. Syst. Rev.</italic>
</source>
                    <year>2015</year>;<volume>49</volume>(<issue>1</issue>):<fpage>71</fpage>&#x2013;<lpage>79</lpage>.
                    <pub-id pub-id-type="doi">10.1145/2723872.2723882</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cheifet</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>Promoting reproducibility with Code Ocean.</article-title>
                    <source>

                        <italic toggle="yes">Genome Biol.</italic>
</source>
                    <year>2021</year>;<volume>22</volume>(<issue>1</issue>):<fpage>65</fpage>.
                    <pub-id pub-id-type="pmid">33608018</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s13059-021-02299-x</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7893895</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nicola&#x00ef;</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>The approach to reproducible research of the Image Processing On Line (IPOL) Journal.</article-title>
                    <source>

                        <italic toggle="yes">Informatio.</italic>
</source>
                    <year>2022</year>;<volume>27</volume>(<issue>1</issue>):<fpage>76</fpage>&#x2013;<lpage>112</lpage>.</mixed-citation>
            </ref>
            <ref id="ref21">
                <label>21</label>
                <mixed-citation publication-type="other">
                    <collab>T. UNESCO</collab>
                    <article-title>UNESCO recommendation on open science.</article-title>
                    <publisher-name>United Nations Educational, Scientific and Cultural Organization</publisher-name>;<year>2021</year>.</mixed-citation>
            </ref>
            <ref id="ref22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>McKiernan</surname>
                            <given-names>EC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bourne</surname>
                            <given-names>PE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Brown</surname>
                            <given-names>CT</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>How open science helps researchers succeed.</article-title>
                    <source>

                        <italic toggle="yes">elife.</italic>
</source>
                    <year>2016</year>;<volume>5</volume>.
                    <pub-id pub-id-type="pmid">27387362</pub-id>
                    <pub-id pub-id-type="doi">10.7554/elife.16800</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4973366</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Munaf&#x00f2;</surname>
                            <given-names>MR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nosek</surname>
                            <given-names>BA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bishop</surname>
                            <given-names>DVM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A manifesto for reproducible science.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Hum. Behav.</italic>
</source>
                    <year>2017</year>;<volume>1</volume>:<fpage>0021</fpage>&#x2013;<lpage>0021</lpage>.
                    <pub-id pub-id-type="pmid">33954258</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41562-016-0021</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7610724</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Munaf&#x00f2;</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>Open science and research reproducibility.</article-title>
                    <source>

                        <italic toggle="yes">Ecancermedicalscience.</italic>
</source>
                    <year>2016</year>;<volume>10</volume>.
                    <pub-id pub-id-type="pmid">27350794</pub-id>
                    <pub-id pub-id-type="doi">10.3332/ecancer.2016.ed56</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4898932</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gainey</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Campbell</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Implementation and assessment of an end-to-end Open Science &amp; Data Collaborations program.</article-title>
                    <source>

                        <italic toggle="yes">F1000 Res.</italic>
</source>
                    <year>2022</year>;<volume>11</volume>:<fpage>501</fpage>.
                    <pub-id pub-id-type="pmid">36545378</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.110355.2</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9748376</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <label>26</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Scherer</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Valen</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Balancing Multiple Roles of Repositories.</italic>
</source>
                    <publisher-name>Developing a Comprehensive Repository at Carnegie Mellon University, Publications</publisher-name>;<year>2019</year>; vol.<volume>7</volume>; p.<fpage>30</fpage>.
                    <pub-id pub-id-type="doi">10.3390/publications7020030</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <label>27</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Van Tuyl</surname>
                            <given-names>SI</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Developing a research data management services infrastructure at.</italic>
</source>
                    <publisher-name>Carnegie Mellon University, Carnegie Mellon University</publisher-name>;<year>2013</year>.</mixed-citation>
            </ref>
            <ref id="ref28">
                <label>28</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gainey</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Van Gulick</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Carnegie Mellon&#x2019;s first Open Science Symposium - Themes about research data and their reuse.</article-title>
                    <year>2019</year>. (accessed April 1, 2024).
                    <ext-link ext-link-type="uri" xlink:href="https://kilthub.cmu.edu/articles/poster/Carnegie_Mellon_s_first_Open_Science_Symposium_-_Themes_about_research_data_and_their_reuse/7928714/1">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <label>29</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Griego</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Integrating and Evaluating Best Practices in Research Reproducibility.</italic>
</source>
                    <publisher-name>Carnegie Mellon University</publisher-name>;<year>2023</year>. (accessed March 9, 2024).
                    <ext-link ext-link-type="uri" xlink:href="https://osf.io/4vgqh/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <label>30</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Longmeier</surname>
                            <given-names>MM</given-names>
                        </name>
</person-group>:
                    <article-title>Hackathons and libraries: the evolving landscape 2014-2020.</article-title>
                    <year>2021</year>.</mixed-citation>
            </ref>
            <ref id="ref31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bongiovanni</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Slayton</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Agate</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>AI Literacy Resource Hackathon.</article-title>
                    <source>

                        <italic toggle="yes">Open Sci. Framew.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.17605/OSF.IO/WS2CE</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref32">
                <label>32</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Al Khleifat</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Smith</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Blobner</surname>
                            <given-names>B</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>SnpReportR: A Tool for Clinical Reporting of RNAseq Expression and Variants.</article-title>
                    <source>

                        <italic toggle="yes">BioHackrXiv.</italic>
</source>
                    <year>2021</year>.</mixed-citation>
            </ref>
            <ref id="ref33">
                <label>33</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kubica</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kumar</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tan</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The fourth annual Carnegie Mellon Libraries hackathon for biomedical data management, knowledge graphs, and deep learning.</article-title>
                    <source>

                        <italic toggle="yes">BioHackrXiv.</italic>
</source>
                    <year>2023</year>.</mixed-citation>
            </ref>
            <ref id="ref34">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hettne</surname>
                            <given-names>KM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Proppert</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nab</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>ReprohackNL 2019: how libraries can promote research reproducibility through community engagement.</article-title>
                    <source>

                        <italic toggle="yes">IASSIST Q.</italic>
</source>
                    <year>2020</year>;<volume>44</volume>(<issue>1/2</issue>):<fpage>1</fpage>&#x2013;<lpage>10</lpage>.
                    <pub-id pub-id-type="doi">10.29173/iq977</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref35">
                <label>35</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Warren</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Replication Data for: Historiography&#x2019;s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB).</article-title>
                    <source>

                        <italic toggle="yes">Code Ocean.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.24433/CO.6313661.V1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Warren</surname>
                            <given-names>CN</given-names>
                        </name>
</person-group>:
                    <article-title>Historiography&#x2019;s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB).</article-title>
                    <source>

                        <italic toggle="yes">J. Cult. Anal.</italic>
</source>
                    <year>2018</year>;<volume>3</volume>(<issue>1</issue>).
                    <pub-id pub-id-type="doi">10.22148/16.028</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <label>37</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Guldi</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">The Dangerous Art of Text Mining: A Methodology for Digital History.</italic>
</source>
                    <publisher-loc>Cambridge</publisher-loc>:
                    <publisher-name>Cambridge University Press</publisher-name>;<year>2023</year>.
                    <pub-id pub-id-type="doi">10.1017/9781009263016</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref38">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Longley Arthur</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hearn</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <article-title>Toward open research: A narrative review of the challenges and opportunities for open humanities.</article-title>
                    <source>

                        <italic toggle="yes">J. Commun.</italic>
</source>
                    <year>2021</year>;<volume>71</volume>(<issue>5</issue>):<fpage>827</fpage>&#x2013;<lpage>853</lpage>.</mixed-citation>
            </ref>
            <ref id="ref39">
                <label>39</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Griego</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Warren</surname>
                            <given-names>CN</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Scotti</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Reproducibility Hackathon.</article-title>
                    <year>2024</year>;<volume>2024</volume>. (accessed April 23, 2024).
                    <ext-link ext-link-type="uri" xlink:href="https://osf.io/aq69r/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref40">
                <label>40</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Warren</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Replication Data for: Historiography&#x2019;s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB).</article-title>
                    <source>

                        <italic toggle="yes">Harvard Dataverse.</italic>
</source>
                    <year>Nov. 19, 2018</year>.
                    <pub-id pub-id-type="doi">10.7910/DVN/D3KFLP</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref41">
                <label>41</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Biswas</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Role of ChatGPT in Computer Programming.: ChatGPT in Computer Programming.</article-title>
                    <source>

                        <italic toggle="yes">Mesop. J. Comput. Sci.</italic>
</source>
                    <year>2023</year>;<volume>2023</volume>:<fpage>8</fpage>&#x2013;<lpage>16</lpage>.</mixed-citation>
            </ref>
            <ref id="ref42">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Guldi</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>From Critique to Audit: A Pragmatic Response to the Climate Emergency from the Humanities and Social Sciences, and a Call to Action.</article-title>
                    <source>

                        <italic toggle="yes">KNOW J. Form. Knowl.</italic>
</source>
                    <year>2021</year>;<volume>5</volume>:<fpage>169</fpage>&#x2013;<lpage>196</lpage>.
                    <pub-id pub-id-type="doi">10.1086/716854</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <label>43</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Warren</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Scotti</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>Replication Data for: Historiography&#x2019;s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB).</article-title>
                    <source>

                        <italic toggle="yes">Code Ocean.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.24433/CO.6313661.V2</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report413599">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.187759.r413599</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Eglen</surname>
                        <given-names>Stephen J</given-names>
                    </name>
                    <xref ref-type="aff" rid="r413599a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-8607-8025</uri>
                </contrib>
                <aff id="r413599a1">
                    <label>1</label>Cambridge University, Cambridge, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>I am one of the leads on the project https://codecheck.org.uk</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>15</day>
                <month>10</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Eglen SJ</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport413599" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.156917.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>As this paper has already been through review, it looks like most of the key points have already been noted and corrected.</p>
            <p> </p>
            <p> I found the study easy to read and am supportive of indexing. I have just a few minor suggestions that might improve the paper.</p>
            <p> </p>
            <p> "Eight figures were not successfully reproduced because of file paths in the code": could you fix the code, or did the author get feedback about what went wrong here?</p>
            <p> </p>
            <p> "were was not documented"&#x00a0;</p>
            <p> </p>
            <p> Was the author present when you did the hackathon? (Our experience in codecheck hackathons is that they go much quicker, as expected, when authors are present to address any issues.)</p>
            <p> </p>
            <p> Thank you for providing the supplementary information that shows what you did at the event; could you possibly annotate (e.g. a README) for your osf.io collection so that people can easily see what you have shared? &#x00a0;It wasn't immediately clear to me what you had provided.</p>
            <p> </p>
            <p> As a general comment, if you were to run another hackathon along these lines, is there anything that you would do differently next time?</p>
            <p>Is the case presented with sufficient detail to be useful for teaching or other practitioners?</p>
            <p>Partly</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Is the background of the case&#x2019;s history and progression described in sufficient detail?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Computational Neuroscience; Reproducible Research (codecheck.org.uk)</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report412602">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.187759.r412602</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Gomez-Diaz</surname>
                        <given-names>Teresa</given-names>
                    </name>
                    <xref ref-type="aff" rid="r412602a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-7834-145X</uri>
                </contrib>
                <contrib contrib-type="author">
                    <name>
                        <surname>Recio</surname>
                        <given-names>Tomas</given-names>
                    </name>
                    <xref ref-type="aff" rid="r412602a2">2</xref>
                    <role>Co-referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-1011-295X</uri>
                </contrib>
                <aff id="r412602a1">
                    <label>1</label>Universite Gustave Eiffel / Laboratoire d'informatique Gaspard-Monge / CNRS, Marne-la-Vall&#x00e9;e, &#x00ce;le-de-France, France</aff>
                <aff id="r412602a2">
                    <label>2</label>Universidad Nebrija, Madrid, Spain</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>6</day>
                <month>10</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Gomez-Diaz T and Recio T</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport412602" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.156917.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The second version is fine. The authors have replied to all comments, and they have fixed the main issues. This work relates a very interesting reproducibility experience that took place at the Carnegie Mellon University Libraries.</p>
            <p>Is the case presented with sufficient detail to be useful for teaching or other practitioners?</p>
            <p>Partly</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Is the background of the case&#x2019;s history and progression described in sufficient detail?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Research Software, Research Data, Open Science, dissemination, evaluation</p>
            <p>We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report355235">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.172297.r355235</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Gomez-Diaz</surname>
                        <given-names>Teresa</given-names>
                    </name>
                    <xref ref-type="aff" rid="r355235a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-7834-145X</uri>
                </contrib>
                <contrib contrib-type="author">
                    <name>
                        <surname>Recio</surname>
                        <given-names>Tomas</given-names>
                    </name>
                    <xref ref-type="aff" rid="r355235a2">2</xref>
                    <role>Co-referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-1011-295X</uri>
                </contrib>
                <aff id="r355235a1">
                    <label>1</label>Universite Gustave Eiffel / Laboratoire d'informatique Gaspard-Monge / CNRS, Marne-la-Vall&#x00e9;e, &#x00ce;le-de-France, France</aff>
                <aff id="r355235a2">
                    <label>2</label>Universidad Nebrija, Madrid, Spain</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>7</day>
                <month>1</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Gomez-Diaz T and Recio T</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport355235" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.156917.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>CASE STUDY</p>
            <p> A libraries reproducibility hackathon: connecting students to University research and testing the longevity of published code 
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/articles/13-1305/v1">https://f1000research.com/articles/13-1305/v1</ext-link>
            </p>
            <p> </p>
            <p> This case study refers to a hackathon conducted by the members of the Carnegie Mellon University (CMU) Libraries in order to reproduce research results produced by Professor Christopher Warren on data analysis of the Oxford Dictionary of National Biography (ODNB). The event was open for participants coming from CMU and other institutions in the greater Pittsburgh area and was conducted by members of the CMU Libraries. This experience gives an interesting setting to explore reproducibility of published research results by a Professor of the University who also provides data and the produced software involved in the research, and shows a good example of a collaboration between Librarians and members (students?) of the Computer Science School.</p>
            <p> </p>
            <p> However, there are a few points in this manuscript that need attention.</p>
            <p> </p>
            <p> 1. Abstract. As included in the Case Studies guidelines [ 
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/for-authors/article-guidelines/case-studies">https://f1000research.com/for-authors/article-guidelines/case-studies</ext-link> ], abstract should be structured into 
                <bold>Background, Methods, Results,</bold> and 
                <bold>Conclusions</bold> . In particular, the conclusions should be better presented in the abstract (and in the manuscript, see later).</p>
            <p> </p>
            <p> 2. Hackathon context. As explained in the document (see also [ 
                <ext-link ext-link-type="uri" xlink:href="https://www.library.cmu.edu/about/news/2024-04/reproducibility-hackathon">https://www.library.cmu.edu/abou)t/news/2024-04/reproducibility-hackathon</ext-link> ], the event was open to a large (institutional) public, but the title of the manuscript refers to students. More insight about the participants (how many, how many students, which level, which scientific background&#x2026;) should be provided in order to better seize the complexity of the event. Please, clarify if the three authors of the manuscript coming from the Computer Science School where also participants of the event and if they are students or Members of the School. Moreover, it was indicated that &#x201c;participants could have knowledge and awareness of scientific programming with Python and/or R&#x201d; was desired, but was not required. Was this a difficulty? How many participants had or had not programming knowledge?</p>
            <p> </p>
            <p> 3. Problems with data and code It is mentioned that: &#x201c;At this hackathon event, the organizers observed how participants faced challenges with insufficient documentation, data behind a paywall, and problems with code and proprietary software.&#x201d; But there is no further information about these observations.</p>
            <p> </p>
            <p> 4. Reproducibility success. It is mentioned that: &#x201c;Of the figures produced in the notebook, fourteen figures were successfully reproduced by Terveen &#x2026; and eight figures were not successfully reproduced.&#x201d; Or that &#x201c;Though there was trial and error, participants were able to successfully reproduce figures from this study&#x201d;. But which is the percentage of success? How many figures were in the initial work by Prof. Warren and how many were reproduced by all the participants?</p>
            <p> </p>
            <p> 5. Insufficient documentation and code longevity. As noticed by participant Chan, a &#x201c;requirements.txt&#x201d; file could help to facilitate the understanding of the software context of the code produced by Prof. Warren. Our work (see Gomez-Diaz T, et al., 2022 [Ref 1]) goes further in this kind of recommendations, proposing flexible procedures for research software and research data dissemination.</p>
            <p> It is extremely idealistic to expect that software released at some time will find easily runable environments, unless there is the available resources (human, funding) to keep the necessary maintenance of the produced software (who can use Windows version 5?). To accompany the research produced software with good documentation and description of the necessary components will help to get to reproducibility environments, that is, to follow appropriate dissemination procedures will be of help for reproducibility issues. Nevertheless, it will be not always possible to ensure the full reproducibility and, in our opinion, a partial solution consists in to release the research software as free/open source software, where the accessibility and the study of the source code is confirmed by the use of free/open source licenses.</p>
            <p> </p>
            <p> 6. Conclusions. Conclusions are poorly described, and more information on the benefits, found drawbacks and other consequences of the event, for participants, students, Prof. Warren and event organizers should be provided to complete this case study.</p>
            <p> </p>
            <p> 7. References. We would like to suggest the inclusion of these references: 
                <list list-type="bullet">
                    <list-item>
                        <p>for Open Science: UNESCO: UNESCO Recommendation on Open Science. 2021 [Ref 2]</p>
                    </list-item>
                    <list-item>
                        <p>for Free/Open Source Software: K. Fogel (2005-2022). Producing Open Source Software. How to Run a Successful Free Software Project,&#x00a0; 
                            <ext-link ext-link-type="uri" xlink:href="https://producingoss.com/">https://producingoss.com</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <p>for an example of a Journal giving a reproducibility environment for their publications:</p>
                    </list-item>
                    <list-item>
                        <p>IPOL Journal &#x00b7; Image Processing On Line [ 
                            <ext-link ext-link-type="uri" xlink:href="https://www.ipol.im/">https://www.ipol.im/</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <p>Nicola&#x00ef;, A., et al., 2022 [Ref 3]</p>
                    </list-item>
                </list> 8. Use of privately owned companies. We would like to raise awareness about the use of privately owned companies like Code Ocean [ 
                <ext-link ext-link-type="uri" xlink:href="https://codeocean.com/">https://codeocean.com/</ext-link> ] for the storage and manipulation of data and code produced within the scientific community. Usually the scientific community is not well aware of legal issues related to the terms of use of these platforms and their consequences, like, for example: `` 
                <italic>You hereby grant Code Ocean a worldwide, perpetual, irrevocable, royalty-free right and license to use and exploit any ideas, suggestions, comments, recommendations, enhancement requests or other input provided by you about the Platform to Code Ocean.</italic> '' (see point 3. 
                <bold>Feedback
                    <bold> </bold>
                </bold> in [ 
                <ext-link ext-link-type="uri" xlink:href="https://codeocean.com/terms-of-use">https://codeocean.com/terms-of-use</ext-link> ]). Regarding OSF, the Center for Open Science, Inc. (referred to as "the COS", "Open Science Framework", "OSF") declares to be a nonprofit organization, but also provides with similar terms of use: `` 
                <italic>by depositing any Content (including data and links, but not the linked content) in a COS account, you grant the COS a perpetual, irrevocable, worldwide, royalty-free license to store, reproduce, transmit, distribute, publicly perform, and publicly display such Happy&#x2026;</italic> '' (see further information in [ 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/CenterForOpenScience/cos.io/blob/master/TERMS_OF_USE.md">https://github.com/CenterForOpenScience/cos.io/blob/master/TERMS_OF_USE.md</ext-link> ]). Universities and other Research Performing Institutions, and, in particular, University and other institutional Libraries, could study the proposition of University or institutionally owned platforms and/or services to deal with their scientific production.</p>
            <p> </p>
            <p> 9. To complete the background context of this case study, some information on previously hosted hackathons by the CMU Libraries, like Biohackathons and an AI Literacy Resource Hackathon, could be included as they can be of interest for the Hackathon community.
                <bold> </bold>
            </p>
            <p>Is the case presented with sufficient detail to be useful for teaching or other practitioners?</p>
            <p>Partly</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Is the background of the case&#x2019;s history and progression described in sufficient detail?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Research Software, Research Data, Open Science, dissemination, evaluation</p>
            <p>We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however we have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-355235-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Research Software vs. Research Data II: Protocols for Research Data dissemination and evaluation in the Open Science context.</article-title>
                        <source>
                            <italic>F1000Res</italic>
                        </source>.<year>2022</year>;<volume>11</volume>:
                        <elocation-id>10.12688/f1000research.78459.2</elocation-id>
                        <fpage>117</fpage>
                        <pub-id pub-id-type="pmid">36483317</pub-id>
                        <pub-id pub-id-type="doi">10.12688/f1000research.78459.2</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-355235-2">
                    <label>2</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>UNESCO Recommendation on Open Science</article-title>.<year>2021</year>;
                        <elocation-id>10.54677/MNMH8546</elocation-id>
                        <pub-id pub-id-type="doi">10.54677/MNMH8546</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-355235-3">
                    <label>3</label>
                    <mixed-citation>
                        <person-group person-group-type="author"/>:
                        <article-title>The approach to reproducible research of the Image Processing On Line (IPOL) journal</article-title>.
                        <source>
                            <italic>http://www.scielo.edu.uy/pdf/info/v27n1/2301-1378-info-27-01-76.pdf</italic>
                        </source>.<year>2022</year>;</mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment14503-355235">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Griego</surname>
                            <given-names>Chasz</given-names>
                        </name>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>4</day>
                    <month>9</month>
                    <year>2025</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you, we really appreciate your review and recommendations. We are grateful that you could offer your experience and perspective to help strengthen our paper, as well as offer questions that helped us add more clarity and critical points in our writing. Please see our responses to your comments, where we also note any changes made to the original manuscript:</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;abstract should be structured into Background, Methods, Results, and Conclusions . In particular, the conclusions should be better presented in the abstract (and in the manuscript, see later).&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thank you for bringing this to our attention. We have restructured the abstract, resulting in something that we think will be easier for a reader to navigate. We also revised our conclusions, both here and later in the manuscript (also mentioned later in this response).</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;More insight about the participants (how many, how many students, which level, which scientific background&#x2026;).... Please, clarify if the three authors of the manuscript coming from the Computer Science School where also participants of the event and if they are students or Members of the School.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thank you for noting this ambiguity. We've added a statement clarifying that the three authors from the School of Computer Science were undergraduate students participating in the event. This clarification did significantly improve our narrative and summary.&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;it was indicated that &#x201c;participants could have knowledge and awareness of scientific programming with Python and/or R&#x201d; was desired, but was not required. Was this a difficulty? How many participants had or had not programming knowledge?&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>This is good to clarify. Each student already had an established skillset with Python programming. With the limited pool of participants, a desire for programming knowledge was not a challenge, but this could be something to consider for future events, especially considering the aspect of reproducibility by any person or team.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;It is mentioned that: &#x201c;At this hackathon event, the organizers observed how participants faced challenges with insufficient documentation, data behind a paywall, and problems with code and proprietary software.&#x201d; But there is no further information about these observations.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>We are grateful that you noted this because it actually revealed an error in our editing. This sentence is in reference to the cited hackathon in the previous statement (ReprohackNL). The paragraph break between these statements was an error. We fixed this error to hopefully make better sense.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;But which is the percentage of success? How many figures were in the initial work by Prof. Warren and how many were reproduced by all the participants?&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thank you for pointing out these details, which are a bit vague. We've edited the manuscript to mention that the original work had twenty-five figures to reproduce. We also changed the wording in the latter sentence to read with more accuracy. Our paper is not focusing on if the entire work was reproduced during this event, rather, the paper is focusing on the details and nuances of research code that are revealed when a group comes together to try to reproduce the outcomes.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;Our work (see Gomez-Diaz T, et al., 2022 [Ref 1]) goes further in this kind of recommendations, proposing flexible procedures for research software and research data dissemination. It is extremely idealistic to expect that software released at some time will find easily runable environments, unless there is the available resources&#x2026; Nevertheless, it will be not always possible to ensure the full reproducibility and, in our opinion, a partial solution consists in to release the research software as free/open source software, where the accessibility and the study of the source code is confirmed by the use of free/open source licenses.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>These were all excellent points, especially those written in your work. We decided to add another paragraph to the conclusion section to address how reproducibility also relies on adequate resources and systems, which their availability may also depreciate with time, and that documentation offers further guidance and transparency to help other users reuse research software and data, even when resources are scarce. We appreciated the opportunity to add this critical aspect to our conclusions.&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;Conclusions are poorly described, and more information on the benefits, found drawbacks and other consequences of the event, for participants, students, Prof. Warren and event organizers should be provided to complete this case study.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thank you for this suggestion. Your review greatly helped us better communicate the impact and implications of running this type of hackathon. It also adds further context for readers to learn how to readapt this model. The final paragraphs of this paper now offer further detail about the benefits, drawbacks, and lessons learned.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;We would like to suggest the inclusion of these references&#x2026;&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thank you for these great recommendations. These references were added to the appropriate places in the manuscript.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;Use of privately owned companies&#x2026;&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>We found your point here one of the most important to call to attention. In our libraries, and even within our Open Science initiatives, we often promote tools and platforms citing the benefits of places that host your research materials and offer increased accessibility and interoperability. However, it is true that while these companies create these products, they also have business models to sustain and insurance around their products. This was important to note in our concluding section to highlight that researchers and librarians may have rightful reservations when using these products.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report339316">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.172297.r339316</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Antunes</surname>
                        <given-names>Benjamin</given-names>
                    </name>
                    <xref ref-type="aff" rid="r339316a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-0700-6558</uri>
                </contrib>
                <aff id="r339316a1">
                    <label>1</label>Universit&#x00e9; Clermont Auvergne, Clermont-Ferrand, France</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>4</day>
                <month>12</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Antunes B</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport339316" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.156917.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This paper presents a reproducibility hackathon where students attempted to replicate the work of Professor Christopher Warren. In my opinion, this is an excellent initiative. Education is a critical avenue for improving reproducibility, as it raises awareness among students and future researchers about its importance.</p>
            <p> </p>
            <p> The paper is not highly technical; instead, it provides an overview of the types of challenges encountered when attempting to reproduce results. It highlights the use of literate programming and ChatGPT to facilitate code reproduction and illustrates common difficulties, such as handling software dependencies and library versioning in Python. The hackathon focused on research in the field of Digital Humanities.</p>
            <p> </p>
            <p> My Remarks:</p>
            <p> Introduction &#x2013; First Paragraph: As noted by Barba (2018), computer science was one of the only field to use swaped definitions between reproducibility and replicability (reproducibility is doing the same experiment to obtain the same results, while replicability is doing another experiment to obtain the same scientific conclusion). ACM recently adopted the NISO-recommended terminology for computer science:</p>
            <p> </p>
            <p> "
                <italic>*As a result of discussions with the National Information Standards Organization (NISO), it was recommended that ACM harmonize its terminology and definitions with those used in the broader scientific research community, and ACM agreed with NISO&#x2019;s recommendation to swap the terms &#x201c;reproducibility&#x201d; and &#x201c;replication&#x201d; with the existing definitions used by ACM as part of its artifact review and badging initiative. ACM took action to update all prior badging to ensure consistency.</italic>"</p>
            <p> 
                <ext-link ext-link-type="uri" xlink:href="https://www.acm.org/publications/policies/artifact-review-and-badging-current">https://www.acm.org/publications/policies/artifact-review-and-badging-current</ext-link>
            </p>
            <p> </p>
            <p> This shift swaped the definition of reproducibility and replicability. However, from an epistemological standpoint, reproducibility remains the broader, general term.</p>
            <p> The paper omits a crucial concept in computer science: repeatability (bitwise repeatability), which is indispensable for debugging (in the context of computer science)</p>
            <p> Code execution depends on the entire software environment. Open-source software is essential, starting with the operating system (e.g., Linux). However, Docker might not be the best tool for encapsulating computing environments (in the context of reproducible research), as it was not initially designed for reproducibility. I suggest exploring Guix, which is purpose-built for reproducible research, and Software Heritage for archiving software.</p>
            <p> </p>
            <p> You rightly emphasize the importance of sharing code and data, aligning with the open science movement. It would be helpful to mention tools like Software Heritage and Zenodo for sharing resources. I noticed the mention of a locally managed public repository, KiltHub. If this repository is not accessible to researchers outside Carnegie Mellon University, highlighting alternative platforms like Zenodo remains relevant.</p>
            <p> </p>
            <p> The inclusion of Figure 1 might not add value to the reader. This paper is a practical feedback, primarily aimed at showcasing the challenges of reproducing work. Instead, figures illustrating reproducibility workflows or tools (e.g., Table 1) might be more informative.</p>
            <p> </p>
            <p> While Python is widely used due to its ease of use, it is not ideal for reproducibility because of its instability compared to languages like Bash or C, that are more stable. Nonetheless, I agree that results should be reproducible by all, including non-experts, so Python remains a good practical choice.</p>
            <p> </p>
            <p> The paper briefly mentions Docker in the introduction but does not elaborate on its usage. It may be more relevant to discuss tools like Guix and Software Heritage, which were explicitly created to address reproducibility challenges. While the paper focuses on Digital Humanities, tools such as CodeOcean (originating from bioinformatics) and FAIR principles are broadly applicable.</p>
            <p> </p>
            <p> The paper focus on Python Jupyter notebooks and literate programming is a solid first step for enabling reproducibility. However, other fields like biology and physics, among others, also face reproducibility crises. Broader discussions could include data storage, workflow tools (for example Snakemake to keep it simple), virtual machines, containers, floating-point arithmetic issues, or parallel computing. But of course, the objective of the paper is to present the hackathon, so we cannot talk about everything. I just write it here if you want to take a look.</p>
            <p> </p>
            <p> Computers, often seen as &#x201c;black boxes,&#x201d; create reproducibility challenges across disciplines. A short introduction to computer science and reproducible research should be incorporated into all scientific curricula. I can recommend you the MOOC: 
                <ext-link ext-link-type="uri" xlink:href="https://www.fun-mooc.fr/en/courses/reproducible-research-methodological-principles-transparent-scie/">https://www.fun-mooc.fr/en/courses/reproducible-research-methodological-principles-transparent-scie/</ext-link>
            </p>
            <p> In addition, I have written one survey about reproducible research, even if it was more oriented to High performance computing, you might benefit from its read:</p>
            <p> Antunes B, et al., 2024 (Ref 1)</p>
            <p> </p>
            <p> One detail: In the PDF version of the paper, the abstract text is not justified. Shouldn&#x2019;t it be?</p>
            <p> </p>
            <p> Conclusion</p>
            <p> This hackathon is an excellent initiative to raise awareness among students and young researchers about the challenges of reproducibility. While the paper provides a high-level overview, focusing on literate programming with Jupyter notebooks and the problem of software dependency hell, it serves as a good introduction to the topic.</p>
            <p>Is the case presented with sufficient detail to be useful for teaching or other practitioners?</p>
            <p>Yes</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Not applicable</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Is the background of the case&#x2019;s history and progression described in sufficient detail?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>My research area if reproducible research in high performance computing. This paper could benefit from having broader information about reproducibility tools, however, it is a short paper describing a Hackathon.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-339316-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Reproducibility, Replicability and Repeatability: A survey of reproducible research with a focus on high performance computing</article-title>.
                        <source>
                            <italic>Computer Science Review</italic>
                        </source>.<year>2024</year>;<volume>53</volume>:
                        <elocation-id>10.1016/j.cosrev.2024.100655</elocation-id>
                        <pub-id pub-id-type="doi">10.1016/j.cosrev.2024.100655</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment14502-339316">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Griego</surname>
                            <given-names>Chasz</given-names>
                        </name>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>4</day>
                    <month>9</month>
                    <year>2025</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for your review. We really appreciate your thoughtful assessment of our work, and we are thrilled to receive your perspective and recognition. Please see our responses to your comments, where we also note any changes made to the original manuscript:</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;ACM recently adopted the NISO-recommended terminology for computer science&#x2026;&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:</bold> We appreciate you noting the consistency of the definitions of reproducibility and replicability, between ACM (and NISO) with what is mentioned&#x00a0; in our paper, especially in the context of artifact badging. This was good to note, showing how there exists multiple guidelines that follow this convention.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;The paper omits a crucial concept in computer science: repeatability (bitwise repeatability), which is indispensable for debugging (in the context of computer science)&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>We definitely agree that bitwise repeatability is highly notable for fully reproducible computer science, and we're grateful that you note it here. However, as this paper is addressing a less technical scope of reproducible computational research (ie. basic data science and analysis, in contrast to high performance computing), we are content with the scope of detail on computational reproducible that we chose to mention.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;It would be helpful to mention tools like Software Heritage and Zenodo for sharing resources. I noticed the mention of a locally managed public repository, KiltHub. If this repository is not accessible to researchers outside Carnegie Mellon University, highlighting alternative platforms like Zenodo remains relevant.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thank you for mentioning both of these platforms and suggesting we mention them. We agree and have included them in the third paragraph of the introduction, as they are greatly contributing to sharing and preserving software. We agree that a repository like Zenodo is a great alternative to an institutional repository, like KiltHub. However, we prefer to not highlight Zotero or other platforms in the paragraph with KiltHub, since the paragraph was designed to focus specifically on the background behind CMU's Open Science infrastructure, rather than the array of options available for open and reproducible research.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;Instead, figures illustrating reproducibility workflows or tools (e.g., Table 1) might be more informative.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>This is a great suggestion. We instead decided to convert this figure into a diagram that illustrates the two options (paths) to reproducibility. This diagram still uses one of the original data visualizations as an example, but in much better context.&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;The paper briefly mentions Docker in the introduction but does not elaborate on its usage. It may be more relevant to discuss tools like Guix and Software Heritage, which were explicitly created to address reproducibility challenges.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Those are fair points. Again, we only mention tools/platforms and avoid adding excessive detail for the sake of the paper&#x2019;s scope. However, we have now included mentions to Guix and Software Heritage based on your recommendation.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;Broader discussions could include data storage, workflow tools (for example Snakemake to keep it simple), virtual machines, containers, floating-point arithmetic issues, or parallel computing.&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>We really appreciate these points you mention. Reproducibility, let alone computational reproducibility, is so nuanced in each field. Issues surrounding data storage, high-throughput computing, and other software architectures are important to consider, and we hope that efforts like this hackathon, and conversations around it, will promote further awareness and practice that leads to these factors being further addressed.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;I can recommend you the MOOC&#x2026;. In addition, I have written one survey about reproducible research, even if it was more oriented to High performance computing, you might benefit from its read&#x2026;&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thank you for sharing the MOOC. The content is very reflective of the practices we aim to instill in our students through the libraries, especially topics around structured Markdown notes, Git version control, and notebooks.</p>
                <p> </p>
                <p> Thank you for also sharing your survey. We hope to learn further about reproducibility challenges, especially with high performance computing, and apply this to further efforts. We are especially grateful that we could receive your comments, seeing that you authored such a thorough and extensive survey on the topic. We will also include this reference in the manuscript.</p>
                <p> </p>
                <p> 
                    <italic>&#x201c;One detail: In the PDF version of the paper, the abstract text is not justified. Shouldn&#x2019;t it be?&#x201d;</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Response:&#x00a0;</bold>Thanks for asking this. We will redirect this to the editor</p>
            </body>
        </sub-article>
    </sub-article>
</article>
