<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.75071.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>SPARClink: an interactive tool to visualize the impact of the SPARC program</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Soundarajan</surname>
                        <given-names>Sanjay</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2829-8032</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Kuruppu</surname>
                        <given-names>Sachira</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-3829-6797</uri>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Singh</surname>
                        <given-names>Ashutosh</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Kim</surname>
                        <given-names>Jongchan</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Achalla</surname>
                        <given-names>Monalisa</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Fair Data Innovations Hub, California Medical Innovations Institute, San Diego, California, USA</aff>
                <aff id="a2">
                    <label>2</label>Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand</aff>
                <aff id="a3">
                    <label>3</label>Electrical and Computer Engineering Department, Northeastern University, Boston, Massachusetts, USA</aff>
                <aff id="a4">
                    <label>4</label>Data Science, The George Washington University, Washington, District of Columbia, USA</aff>
                <aff id="a5">
                    <label>5</label>Clarkson Center for Complex Systems Science, Clarkson University, Post, Potsdam, New York, USA</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:ssoundarajan@calmi2.org">ssoundarajan@calmi2.org</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold> NIH SPARC is the primary funder for the publication of this research article. </p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>31</day>
                <month>1</month>
                <year>2022</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2022</year>
            </pub-date>
            <volume>11</volume>
            <elocation-id>124</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>21</day>
                    <month>1</month>
                    <year>2022</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Soundarajan S et al.</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/11-124/pdf"/>
            <abstract>
                <p>The National Institutes of Health (NIH) Stimulating Peripheral Activity to Relieve Conditions (SPARC) program seeks to accelerate the development of therapeutic devices that modulate electrical activity in nerves to improve organ function. SPARC-funded researchers are generating rich datasets from neuromodulation research that are curated and shared according to FAIR (Findable, Accessible, Interoperable, and Reusable) guidelines and are accessible to the public on the SPARC data portal. Keeping track of the utilization of these datasets within the larger research community is a feature that will benefit data-generating researchers in showcasing the impact of their SPARC outcomes. This will also allow the SPARC program to display the impact of the FAIR data curation and sharing practices that have been implemented. This manuscript provides the methods and outcomes of SPARClink, our web tool for visualizing the impact of SPARC, which won the Second prize at the 2021 SPARC FAIR Codeathon. With SPARClink, we built a system that automatically and continuously finds new published SPARC scientific outputs (datasets, publications, protocols) and the external resources referring to them. SPARC datasets and protocols are queried using publicly accessible REST application programming interfaces (APIs, provided by Pennsieve and Protocols.io) and stored in a publicly accessible database. Citation information for these resources is retrieved using the NIH reporter API and National Center for Biotechnology Information (NCBI) Entrez system. A novel knowledge graph-based structure was created to visualize the results of these queries and showcase the impact that the FAIR data principles can have on the research landscape when they are adopted by a consortium.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Visualization</kwd>
                <kwd>machine-learning</kwd>
                <kwd>citations</kwd>
                <kwd>FAIR</kwd>
                <kwd>data sharing</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>The National Institutes of Health (NIH) Common Fund&#x2019;s Stimulating Peripheral Activity to Relieve Conditions (SPARC) program aims to transform our understanding of nerve-organ interactions with the intent of advancing bioelectronic medicine towards treatments that change lives.
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>
                </sup> The SPARC program employs a Findable, Accessible, Interoperable, and Reusable (FAIR) first approach for its datasets, protocols, and publications, hence enabling the data to be easily reused by research communities globally. The SPARC 
                <ext-link ext-link-type="uri" xlink:href="https://sparc.science/">data portal</ext-link> can be used as the gateway to access fully curated datasets at any time.
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup> Using the portal, researchers can search for data used in real-world experiments to verify or corroborate studies in device development. There is also potential for the data generated by the SPARC program to be useful outside the current field of study showcasing the benefits of multi-discipline data generation and sharing.
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>
                </sup>
            </p>
            <p>All SPARC datasets are curated by the researchers according to the SPARC Data Standards (SDS), a data and metadata structure derived from the Brain Imaging Data Structure (BIDS).
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup> Several resources are made available to SPARC researchers for making their data FAIR, such as the cloud data platform 
                <ext-link ext-link-type="uri" xlink:href="https://app.pennsieve.io/">Pennsieve</ext-link>, the curated vocabulary selector and annotation platform 
                <ext-link ext-link-type="uri" xlink:href="https://scicrunch.org/">SciCrunch</ext-link>, the open-source computational modeling platform 
                <ext-link ext-link-type="uri" xlink:href="https://osparc.io/">o
                    <sup>2</sup>S
                    <sup>2</sup>PARC</ext-link>, the online microscopy image viewer 
                <ext-link ext-link-type="uri" xlink:href="https://www.biolucida.net/">Biolucida</ext-link>, and the data curation software 
                <ext-link ext-link-type="uri" xlink:href="https://fairdataihub.org/sodaforsparc">SODA</ext-link>.
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup> The datasets submitted by researchers also follow an extensive curation process where teams from the SPARC Data Resource Center (DRC) examine the submitted data and work with the researchers to ensure all aspects of the FAIR data principles are being followed.
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref7">7</xref>
                </sup> Once these datasets are made public, access to them is provided through the Pennsieve Discover service and sparc.science, the official access point of the SPARC Portal.
                <sup>
                    <xref ref-type="bibr" rid="ref8">8</xref>
                </sup>
            </p>
            <p>While the submission and curation of data are simplified by such tools, one of the greater benefits of the FAIR guidelines is the ability to reuse data in other studies by other researchers around the world. However, a researcher who has submitted a dataset might not always be aware of the reuse of their original submitted data since current citation indexing tools, like Google Scholar, do not account for datasets. To address this shortcoming, we developed SPARClink during the 2021 SPARC FAIR Codeathon (July 12th, 2021 &#x2013; July 26th, 2021),
                <sup>
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup> a system that queries all external publications using open source tools and platforms and creates a database and visualizations of citations that are helpful to showcase the impact of the SPARC consortium. In this instance, we define impact as the frequency of citations of SPARC-funded resources. By using citations as the key measure in SPARClink, we have created a method for showcasing the reuse of generated data and the benefits that FAIR data generation practices have on the overall scientific community. A visual representation of the reuse of data will allow both researchers and the general public to see the benefits of the concept of FAIR data and the immediate utilization of publicly funded datasets in advancing the field of bioelectronic medicine.</p>
        </sec>
        <sec id="sec2" sec-type="methods">
            <title>Methods</title>
            <p>Our solution can broadly be categorized into four steps. The first step involves the backend extraction of data using various application programming interfaces (APIs). The second step is setting up and storing the extracted data on a real-time database. The third step involves using machine learning to improve user experience by developing context-sensitive word clouds and smart keyword searches in the portal. The final step is used to create an engaging visualization that users of the SPARClink system will be able to interact with to view the extracted data. A visual representation of this workflow is shown in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>The flow of data between the submodules of SPARClink.</title>
                </caption>
                <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/78888/b8cae2e1-e9d2-4306-9fe5-95f4cf0676bc_figure1.gif"/>
            </fig>
            <sec id="sec3">
                <title>Extraction of data using APIs</title>
                <p>We used the dataset information retrieved directly from the Pennsieve data storage platform by running the Pennsieve API to gather all publicly available SPARC datasets.
                    <sup>
                        <xref ref-type="bibr" rid="ref10">10</xref>
                    </sup> The protocols stored on Protocols.io under the SPARC group were also queried via this method.
                    <sup>
                        <xref ref-type="bibr" rid="ref11">11</xref>
                    </sup> A list of public and published DOIs was created in our database with additional information regarding the study authors and descriptions.</p>
                <p>We used NIH RePORTER to retrieve data about the papers published as part of SPARC funding. Research articles that reference or mention these datasets, protocols, and publications were queried from NCBI (PubMed, PubMed central) repositories using the search endpoint of their Python API.
                    <sup>
                        <xref ref-type="bibr" rid="ref12">12</xref>
                    </sup> 
                    <xref ref-type="fig" rid="f2">Figure 2</xref> shows the overall flow of data between the APIs and resources queried to get the data. The NIH RePORTER API uses project number (also known as the award number) of NIH funding associated with SPARC datasets (this is provided by the author as additional metadata required when publishing a dataset) as an input to get details including a study identifier, name of the organization that received funding, country of the organization, amount of funding received and keywords of the project topic. The NCBI API uses an identifier for PubMed Central articles to retrieve information such as article name, journal name, year of publications, and authors.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Methods implemented to gather citations of datasets, protocols, SPARC, and external publications.</title>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/78888/b8cae2e1-e9d2-4306-9fe5-95f4cf0676bc_figure2.gif"/>
                </fig>
            </sec>
            <sec id="sec4">
                <title>Storing extracted data in a database</title>
                <p>We used Google&#x2019;s Firebase real-time database to store all the information retrieved via the NIH RePORTER system. The data was stored in a JSON format with read access available to anyone via a dedicated URL. The data in this database was split up into four separate sections labeled Awards, Datasets, Publications, and Protocols. All the entries within this database were given a unique identifier. These identifiers were used to link the data within the database to form a relational database. The links within the data represent the citations or use of resources within other publications. All publications within the database were uniquely identified as either SPARC-funded publications or non-SPARC publications (external publications that cite SPARC datasets and publications.)</p>
            </sec>
            <sec id="sec5">
                <title>Displaying the extracted data to the user</title>
                <p>The front-end demo of the SPARClink web page uses 
                    <ext-link ext-link-type="uri" xlink:href="https://vuejs.org/">Vue.js</ext-link> to create a functional prototype of the SPARClink system. An interactive force-based undirected graph visualization was created using the D3.js JavaScript library. The choice to represent the results through such a graph was motivated by the desire to show an intuitively understandable way of showing the connected nature of citations and data reuse. The website itself is hosted on Vercel as a static front end.
                    <sup>
                        <xref ref-type="bibr" rid="ref13">13</xref>
                    </sup> On the webpage, the visualizations can be filtered by key terms or resource type to get a better understanding of the resources created using the SPARC program. A screenshot of the webpage is shown in 
                    <xref ref-type="fig" rid="f3">Figure 3</xref>.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>The design of the SPARClink webpage where results from the machine learning module are shown alongside the visualizations of SPARClink.</title>
                        <p>The visualizations and the results in this figure have been filtered with the vagus and cardiac keywords.</p>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/78888/b8cae2e1-e9d2-4306-9fe5-95f4cf0676bc_figure3.gif"/>
                </fig>
            </sec>
            <sec id="sec6">
                <title>Machine Learning Data Indexing Engine</title>
                <p>To provide some additional functionality on the front-end demo of SPARClink, we used machine learning algorithms to enhance the user experience. We called this function of the SPARClink project the Machine Learning Data Indexing Engine.</p>
                <p>We used the Symspell algorithm present in the scikit learn package and trained it on the vocabulary built using the SPARClink database.
                    <sup>
                        <xref ref-type="bibr" rid="ref14">14</xref>
                    </sup> We used delete-only edit candidate generation for generating different combinations of spelling errors, and used both character-level embedding and word embedding for recommending the most probable correct spelling. The output of the spell correction algorithm was used to generate sentence-level embedding and was then compared with the embeddings of different descriptions of the items in the dataset. We obtained a ranking of all the items in the dataset based on their similarity with the searched string. The top 10 were chosen to be shown on the front end.</p>
                <p>This module was also used to generate keywords using the 
                    <ext-link ext-link-type="uri" xlink:href="https://maartengr.github.io/KeyBERT/">keyBERT</ext-link> pretrained model.
                    <sup>
                        <xref ref-type="bibr" rid="ref15">15</xref>
                    </sup> It generated the top 50 keywords associated with the whole document. It also made use of the Maximal marginal relevance algorithm to pick keywords that have a higher distance among them.
                    <sup>
                        <xref ref-type="bibr" rid="ref16">16</xref>
                    </sup> This ensures diversity among the chosen keywords.</p>
                <p>The engine also contains algorithms that learn vector embeddings of the descriptors of the elements present in the SPARClink database. Based on these vector embeddings, the algorithms compute the similarity between the vector representation of each word in the vocabulary with the vector representing the whole dataset and find keywords that would describe the resource. A word cloud is generated based on the relevance of these results to further enhance the user experience.</p>
            </sec>
        </sec>
        <sec id="sec7" sec-type="results">
            <title>Results</title>
            <p>Using SPARClink, researchers can aggregate all the resources created through the SPARC program and quantify their impact. The visualization created by the SPARClink system is shown in 
                <xref ref-type="fig" rid="f4">Figure 4</xref>. The nodes in the undirected graph signify a unique SPARC resource (publication, protocol, and dataset) and the edges in the graph signify the citations or references as found by SPARClink. A well-connected graph of datasets and publications were observed but a significant number of protocols were seemingly distinct from the rest of the resources despite being pulled from the SPARC protocols.io group. This could be associated with protocols that are published on protocols.io but for which the associated datasets have not been made public yet.</p>
            <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                <label>Figure 4. </label>
                <caption>
                    <title>An interactive visualization created by SPARClink showing the connected nature of all SPARC resources.</title>
                </caption>
                <graphic id="gr4" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/78888/b8cae2e1-e9d2-4306-9fe5-95f4cf0676bc_figure4.gif"/>
            </fig>
            <p>The word map generated from the main dataset visualizations is shown in 
                <xref ref-type="fig" rid="f5">Figure 5</xref>. The size of the word with respect to its neighbors corresponds to the frequency and significance of the word within all the searchable metadata that we have indexed. Selecting any of the words in this map will automatically filter the SPARClink visualizations. Using a keyword filter on the graph will also prompt the top-ranking items for the keyword to be displayed on the side of the page. This ranking is shown as a scrollable list, as seen in 
                <xref ref-type="fig" rid="f6">Figure 6</xref>. Both the word map and top-ranked recommendations are continuously updating themselves when new input terms are entered via the SPARClink webpage.</p>
            <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                <label>Figure 5. </label>
                <caption>
                    <title>The word maps created by SPARClink are a visual representation of the most significant words shown in the graph-based visualization.</title>
                </caption>
                <graphic id="gr5" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/78888/b8cae2e1-e9d2-4306-9fe5-95f4cf0676bc_figure5.gif"/>
            </fig>
            <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                <label>Figure 6. </label>
                <caption>
                    <title>A list of resources that are recommended by SPARClink when a search term filter is provided by the user.</title>
                </caption>
                <graphic id="gr6" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/78888/b8cae2e1-e9d2-4306-9fe5-95f4cf0676bc_figure6.gif"/>
            </fig>
        </sec>
        <sec id="sec8" sec-type="discussion|conclusions">
            <title>Discussion and conclusions</title>
            <p>Using FAIR standards can greatly improve the use of data across multiple disciplines and potentially lead to new and exciting discoveries in the field of biomedical science. The benefits of employing the FAIR data principles for data generation, curation, and sharing can, however, be hard to quantify for researchers or members of the general public. Using a system like SPARClink, researchers at all levels can get up-to-date feedback on the use of their data and all the advantages that the FAIR standards provide to efforts in advancing biomedical science. In this work, we developed such a tool for the SPARC program to enable quantification of the reuse of the FAIR SPARC resources (datasets, manuscripts, protocols).</p>
            <p>The primary challenge in accomplishing this task lies in the fact that the SPARC datasets and protocols are not referenced in the bibliography of research manuscripts, which is the common practice. Instead, the SPARC dataset and protocol identifiers or URLs are only mentioned in the text or under supplementary materials, which makes querying this information a challenging task. Furthermore, datasets created in the SPARC program can be embargoed for up to 12 months to allow researchers enough time to document and publish their findings. However, protocols are made public immediately since protocols.io does not have an option to embargo the open publishing of these protocols. This could also add to the sparse graphs and we can expect the connectedness of this graph to improve as time goes on.</p>
            <p>In the future, we plan on adding the Google Scholar system as an additional resource for data extraction. This should improve the connectedness of our extracted data network as well. Additional filtering functions and performance improvements for very large numbers of nodes are also planned. Currently, the tool is hosted on an independent webpage, but we also aim to integrate it directly within the SPARC portal so that visitors can conveniently visualize the reuse and impact of the different SPARC-generated resources.</p>
        </sec>
        <sec id="sec9">
            <title>Data availability</title>
            <p>At the time of publication, the SPARClink system visualizations can be found at 
                <ext-link ext-link-type="uri" xlink:href="https://sparclink.vercel.app">https://sparclink.vercel.app</ext-link> and are expected to be always online going forward. The backend system that queries all the publications is currently paused due to a lack of system resources. The code for SPARClink has been developed to be accessible to anyone who wants to fork the repository from GitHub and run a local version of this project. Instructions on how to run the modules locally are also available in the GitHub repository. The database of currently extracted citation data can be queried via REST protocols using the links provided below. The machine learning data indexing engine is hosted on a web server provided by 
                <ext-link ext-link-type="uri" xlink:href="http://pythonanywhere.com">pythonanywhere.com</ext-link> and is publicly accessible via its API endpoints. This module is also available to be run in local configuration seamlessly.</p>
        </sec>
        <sec id="sec10">
            <title>Software availability</title>
            <p>Source code available from: 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/fairdataihub/SPARClink">https://github.com/fairdataihub/SPARClink</ext-link>
            </p>
            <p>Archived source code as at time of publication: 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.5550844">https://doi.org/10.5281/zenodo.5550844</ext-link>
            </p>
            <p>License: MIT</p>
        </sec>
        <sec id="sec11">
            <title>Author endorsement</title>
            <p>David Nickerson confirms that the author has an appropriate level of expertise to conduct this research and confirms that the submission is of an acceptable scientific standard. David Nickerson declares they were an organizer of the Hackathon in which the work described in this paper was performed. Affiliation: Auckland Bioengineering Institute, University of Auckland, New Zealand.</p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgments</title>
            <p>We would like to thank the NIH Common Fund&#x2019;s SPARC Program and the organizers of the 2021 SPARC FAIR Codeathon for their support during the development of this project.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="other">
                    <collab>National Institutes of Health</collab>:
                    <article-title>Stimulating Peripheral Activity to Relieve Conditions (SPARC).</article-title>
                    <year>2014 [cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://commonfund.nih.gov/sparc">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="other">
                    <collab>National Institutes of Health</collab>:
                    <article-title>SPARC Portal.</article-title>
                    <year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://sparc.science/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Quey</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schiefer</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kiran</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>KnowMore: An Automated Knowledge Discovery Tool for the FAIR SPARC Datasets.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2021 [cited 2021 Oct 22]</year>; p.<fpage>2021.08.08.455581</fpage>.
                    <pub-id pub-id-type="doi">10.1101/2021.08.08.455581.abstract</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bandrowski</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Grethe</surname>
                            <given-names>JS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pilko</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>SPARC Data Structure: Rationale and Design of a FAIR Standard for Biomedical Research Data.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2021 [cited 2021 Oct 22]</year>; p.<fpage>2021.02.10.430563</fpage>.
                    <pub-id pub-id-type="doi">10.1101/2021.02.10.430563v2.abstract</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Patel</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Srivastava</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Aghasafari</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>SPARC: SODA, an interactive software for curating SPARC datasets.</article-title>
                    <source>

                        <italic toggle="yes">FASEB J.</italic>
</source>
                    <year>2020 Apr</year>;<volume>34</volume>(<issue>S1</issue>):<fpage>1</fpage>&#x2013;<lpage>1</lpage>.
                    <pub-id pub-id-type="doi">10.1096/fasebj.2020.34.s1.02483</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Osanlouy</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bandrowski</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bono</surname>
                            <given-names>B</given-names>
                            <prefix>de</prefix>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The SPARC DRC: Building a Resource for the Autonomic Nervous System Community.</article-title>
                    <source>

                        <italic toggle="yes">Front Physiol.</italic>
</source>
                    <year>2021 Jun 24</year>;<volume>12</volume>:<fpage>693735</fpage>.
                    <pub-id pub-id-type="pmid">34248680</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fphys.2021.693735</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wilkinson</surname>
                            <given-names>MD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dumontier</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Aalbersberg</surname>
                            <given-names>IJJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The FAIR Guiding Principles for scientific data management and stewardship.</article-title>
                    <source>

                        <italic toggle="yes">Sci Data.</italic>
</source>
                    <year>2016 Mar 15</year>;<volume>3</volume>:<fpage>160018</fpage>.
                    <pub-id pub-id-type="pmid">26978244</pub-id>
                    <pub-id pub-id-type="doi">10.1038/sdata.2016.18</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="other">
                    <collab>The University of Pennsylvania</collab>:
                    <article-title>Pennsieve Discover.</article-title>
                    <year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://discover.pennsieve.io/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="other">
                    <collab>SPARC</collab>:
                    <article-title>2021 SPARC FAIR Codeathon. SPARC Portal.</article-title>
                    <year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://sparc.science/help/2021-sparc-fair-codeathon">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="other">
                    <collab>The University of Pennsylvania</collab>:
                    <article-title>Pennsieve API.</article-title>
                    <year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://docs.pennsieve.io/reference/discover_datasets">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="other">
                    <collab>Protocols I</collab>:
                    <article-title>Protocols.io for developers.</article-title>
                    <year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://www.protocols.io/developers">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="other">
                    <collab>National Institutes of Health</collab>:
                    <article-title>NIH RePORTER API.</article-title>
                    <year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://api.reporter.nih.gov/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Soundarajan</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>SPARClink Portal.</article-title>
                    <year>2021 [cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://sparclink.vercel.app/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Garbe</surname>
                            <given-names>W</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">SymSpell: SymSpell: 1 million times faster spelling correction &amp; fuzzy search through Symmetric Delete spelling correction algorithm.</italic>
</source>
                    <publisher-name>Github</publisher-name>;<year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/wolfgarbe/SymSpell">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Grootendorst</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">KeyBERT: Minimal keyword extraction with BERT.</italic>
</source>
                    <publisher-name>Github</publisher-name>;<year>[cited 2021 Oct 22]</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/MaartenGr/KeyBERT">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Carbinell</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Goldstein</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries.</italic>
</source>
                    <publisher-name>ACM SIGIR Forum</publisher-name>;<year>2017</year>;<volume>51</volume>:<fpage>209</fpage>&#x2013;<lpage>210</lpage>.
                    <pub-id pub-id-type="doi">10.1145/3130348.3130369</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report160993">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.78888.r160993</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Pinot de Moira</surname>
                        <given-names>Angela</given-names>
                    </name>
                    <xref ref-type="aff" rid="r160993a1">1</xref>
                    <xref ref-type="aff" rid="r160993a2">2</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3593-8472</uri>
                </contrib>
                <aff id="r160993a1">
                    <label>1</label>National Heart and Lung Institute, Imperial College London, London, UK</aff>
                <aff id="r160993a2">
                    <label>2</label>Section of Epidemiology, University of Copenhagen, Copenhagen, Denmark</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>16</day>
                <month>2</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Pinot de Moira A</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport160993" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.75071.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This is a clearly written paper describing SPARClink, a web tool that queries all external publications and creates a database and visualisations of publications that have utilised SPARC-funded resources, including datasets, publications and protocols. As well as documenting the impact of the SPARC consortium, the tool will be an invaluable resource for researchers utilising SPARC outputs for their research.</p>
            <p> </p>
            <p> My main comment to the paper is regarding the aim to demonstrate the benefits of the FAIR principles. Currently, the paper mainly focuses on the retrieval of any publication utilising SPARC resources, i.e. including publications directly funded by SPARC. To demonstrate how the tool can be used to highlight/quantify the benefits of the FAIR principles, it would be useful to provide examples of this in the paper, i.e. demonstrate the reuse of SPARC data. If I have understood correctly, this is possible by filtering extracted publications on whether they are SPARC-funded or non-SPARC funded. It would be useful to provide a visualization with the non-SPARC funded filter applied, so that the extent of data reuse can be seen.</p>
            <p> </p>
            <p> My second suggestion is to include a box of key terminology in the paper. There are a lot of terms that may not be understood by the reader and although links to definitions are provided, a glossary box may aid reading.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Epidemiology, FAIR principles</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report160964">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.78888.r160964</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Fouad</surname>
                        <given-names>Karim</given-names>
                    </name>
                    <xref ref-type="aff" rid="r160964a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <contrib contrib-type="author">
                    <name>
                        <surname>Torres-Espin</surname>
                        <given-names>Abel</given-names>
                    </name>
                    <xref ref-type="aff" rid="r160964a2">2</xref>
                    <xref ref-type="aff" rid="r160964a1">1</xref>
                    <role>Co-referee</role>
                </contrib>
                <aff id="r160964a1">
                    <label>1</label>University of Alberta, Edmonton, Canada</aff>
                <aff id="r160964a2">
                    <label>2</label>University of California San Francisco, San Francisco, CA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>26</day>
                <month>1</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Fouad K and Torres-Espin A</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport160964" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.75071.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This is a very succinct article introducing a web tool for querying and visualizing the scientific output of the SPARC program and how it has been utilized by the research community. The process begins with extracting and storing related data via APIs. Then word clouds and key word searches allow users to refine their search to then visualize the relation between the SPARC output products. The product is a very useful tool to explore the ongoing impact and use of SPARC related research and products. Potentially the impact of the tool is not really on what the authors mention (i.e., show the impact of FAIR data sharing on the overall scientific community) and thus somewhat overstated. The visualization tool simply provides a summary of the reuse of SPARC related work products, but not a comparison to other approaches for data sharing that would allow for assessing the impact of FAIR. The strength lies in the demonstration of data reuse, specific impact of their data sets, etc. This is a highly valuable tool on so many levels including strategies for future research and general education of the value of data sharing.</p>
            <p> </p>
            <p> The manuscript would benefit from a few clarifications and details especially in the figure legends. For example, the difference between a SPARC publication and a data set should be explained. Are SPARC data sets not published?</p>
            <p> </p>
            <p> Figure 3 show a lot of white space, and the legend would benefit from more detail. What does define the size of the nodes in the visualization? Did the authors consider directional links between the items in the form of a directed graph, which would simplify comprehension of what is shown? For example, the edge of the graph could show the direction from the node citing to the one receptor of a citation.</p>
            <p> </p>
            <p> Figure 4 requires a legend for the different colors, an explanation for the different size of the nodes, and once again would benefit from directional edges. Lastly, not surprisingly the overwhelming links to publications mask the reuse of data sets. Maybe a different filter could be applied to show that important relation.</p>
            <p> </p>
            <p> Minor suggestions: 
                <list list-type="bullet">
                    <list-item>
                        <p>In the Abstract, consider adding &#x2018;electrical&#x2019; in front of neuromodulation.</p>
                    </list-item>
                    <list-item>
                        <p>3d paragraph in introduction, consider not to refer (i.e., these tools) when starting a new paragraph.</p>
                    </list-item>
                    <list-item>
                        <p>Please define &#x201c;citations of SPARC-funded resources&#x201d;.</p>
                    </list-item>
                </list> </p>
            <p> </p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Neuroplasticity, data sharing</p>
            <p>We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report140292">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.78888.r140292</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Zeng</surname>
                        <given-names>Tao</given-names>
                    </name>
                    <xref ref-type="aff" rid="r140292a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r140292a1">
                    <label>1</label>AS Key Laboratory of Computational Biology, Bio-Med Big Data Center, University of Chinese Academy of Sciences, Shanghai, China</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>27</day>
                <month>6</month>
                <year>2022</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Zeng T</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport140292" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.75071.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In the paper &#x201c;SPARClink: an interactive tool to visualize the impact of the SPARC program&#x201d;, authors aim to introduce a web tool SPARClink for visualizing the impact of SPARC, whose methods and outcomes support FAIR guidelines.</p>
            <p> </p>
            <p> SPARClink should be a useful tool/software for supporting SPARC program and corresponding consortium. For the work and introduction in this paper, I have several suggestions: 
                <list list-type="order">
                    <list-item>
                        <p>SPARC program employs FAIR, so, it is necessary to introduce more about corresponding function of SPARClink serving each FAIR guideline, e.g. for &#x201c;Reusable&#x201d;, what data or protocol can be reused and how other researchers can obtain and reuse them.</p>
                    </list-item>
                    <list-item>
                        <p>Similar to other programs, the data produced in SPARC would have raw data, pre-processed data, or analyzed data, or summary data, etc. The organization or level of SPARC data is better to be clearly introduced, and supply detailed cases of how SPARClink can manage and share these data.</p>
                    </list-item>
                    <list-item>
                        <p>In current implementation, &#x201c;interactive force-based undirected graph visualization&#x201d; is simple, and the directed relation is better to consider, e.g. the paper and its public data, and the paper and its reused data, would have different relation directions. Also, in the abstract, the authors stated &#x201c;a novel knowledge graph-based structure was created to visualize &#x2026;&#x201d; Thus, the novelty of the network structure and representation should have highlights in revision.</p>
                    </list-item>
                    <list-item>
                        <p>For interactive visualization shown in Figure 4, or used in current web tool, the network has less information - it is better to directly show the information about each node or edge on network in web page in a scalable manner, especially for large knowledge network.</p>
                    </list-item>
                    <list-item>
                        <p>The word maps shown in Figure 5 would be a general function in many other tools, it is necessary to offer the interesting word or sentence association between key words and SPARC outcomes.</p>
                    </list-item>
                    <list-item>
                        <p>Indeed, for FAIR, the software SPARClink itself is suggested to supply Docker version, so other users can easily reuse such useful framework into their applications.</p>
                    </list-item>
                </list>
            </p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Partly</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Machine learning and bioinformatics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
</article>
