<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.131014.3</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>RBP-Tar &#x2013; a searchable database for experimental RBP binding sites</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 3; peer review: 2 approved, 4 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Gresova</surname>
                        <given-names>Katarina</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Racek</surname>
                        <given-names>Tomas</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Martinek</surname>
                        <given-names>Vlastimil</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Cechak</surname>
                        <given-names>David</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Svobodova</surname>
                        <given-names>Radka</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Alexiou</surname>
                        <given-names>Panagiotis</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3437-7482</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a3">3</xref>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, Czech Republic</aff>
                <aff id="a2">
                    <label>2</label>Central European Institute of Technology (CEITEC), Masaryk University, Brno, Czech Republic</aff>
                <aff id="a3">
                    <label>3</label>Centre for Molecular Medicine &amp; Biobanking, University of Malta, Msida, Malta</aff>
                <aff id="a4">
                    <label>4</label>Department of Applied Biomedical science, University of Malta, Msida, Malta</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:panagiotis.alexiou@um.edu.mt">panagiotis.alexiou@um.edu.mt</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>25</day>
                <month>11</month>
                <year>2024</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2023</year>
            </pub-date>
            <volume>12</volume>
            <elocation-id>755</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>20</day>
                    <month>11</month>
                    <year>2024</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Gresova K et al.</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/12-755/pdf"/>
            <abstract>
                <sec>
                    <title>Background</title>
                    <p>
RNA-binding proteins (RBPs) play a critical role in regulating gene expression by binding to specific sites on RNA molecules. Identifying these binding sites is crucial for understanding the many functions of RBPs in cellular function, development and disease. Current experimental methods for identifying RBP binding sites, such as ultra-violet (UV) crosslinking and immunoprecipitation (CLIP), and especially the enhanced CLIP (eCLIP) protocol, were developed to identify authentic RBP binding sites experimentally.</p>
                </sec>
                <sec>
                    <title>Methods</title>
                    <p>To make this data more accessible to the scientific community, we have developed RBP-Tar (
                        <ext-link ext-link-type="uri" xlink:href="https://ncbr.muni.cz/RBP-Tar">https://ncbr.muni.cz/RBP-Tar
</ext-link>), a web server and database that utilises eCLIP data for 167 RBPs mapped on the human genome. The web server allows researchers to easily search and retrieve binding site information by genomic location and RBP name.</p>
                </sec>
                <sec>
                    <title>Use case</title>
                    <p>Researchers can produce lists of all known RBP binding sites on a gene of interest, or produce lists of binding sites for one RBP on different genomic loci.</p>
                </sec>
                <sec>
                    <title>Conclusions</title>
                    <p>Our future goal is to continue to populate the web server with additional experimental datasets from CLIP experiments as they become available and processed, making it an increasingly valuable resource for the scientific community.</p>
                </sec>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>RNA Binding Proteins</kwd>
                <kwd>CLIP</kwd>
                <kwd>RBP</kwd>
                <kwd>Web-server</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1">
                    <funding-source>ELIXIR-CZ</funding-source>
                    <award-id>LM2018131</award-id>
                </award-group>
                <award-group id="fund-2">
                    <funding-source>MEYS</funding-source>
                    <award-id>LM2018131</award-id>
                </award-group>
                <award-group id="fund-3">
                    <funding-source>Horizon europe </funding-source>
                    <award-id>101086768</award-id>
                </award-group>
                <award-group id="fund-4">
                    <funding-source>Ministry of Education, Youth and Sports of the Czech Republic</funding-source>
                    <award-id>e-INFRACZLM2018140</award-id>
                </award-group>
                <funding-statement>Computational resources were supplied by the project "e-Infrastruktura CZ" (e-INFRA CZ LM2018140) supported by the Ministry of Education, Youth and Sports of the Czech Republic. Computational resources were provided by the ELIXIR-CZ project (LM2018131), part of the international ELIXIR infrastructure. Core Facility Biological Data Management and Analysis of CEITEC Masaryk University, supported by ELIXIR CZ research infrastructure (MEYS Grant No: LM2018131), is gratefully acknowledged for obtaining the scientific data presented in this paper. The Horizon Europe, ERA Chair grant (BioGeMT, ID: 101086768) supplied funding for publication fees.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 2</title>
                <p>In this update, we have added a p-value column to provide statistical context for evaluating RNA binding protein binding sites. Additionally, we introduced filtering and sorting by p-value to enhance usability, allowing users to prioritize data based on significance.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>RNA binding proteins (RBPs) are key players in a broad spectrum of RNA regulation, including all stages of the RNA lifecycle (
                <xref ref-type="bibr" rid="ref3">Gerstberger 
                    <italic toggle="yes">et al</italic>. 2014</xref>; 
                <xref ref-type="bibr" rid="ref4">Gebauer 
                    <italic toggle="yes">et al.</italic> 2021</xref>). Eukaryotic genomes typically encode hundreds of RBPs. For example, over 1500 human RBPs involved in the maturation, transport, stability and translation of coding and non-coding RNA were recently characterised and manually curated (
                <xref ref-type="bibr" rid="ref3">Gerstberger 
                    <italic toggle="yes">et al.</italic> 2014</xref>). Each RBP can typically target hundreds of RNAs in a complex coordinated fashion (
                <xref ref-type="bibr" rid="ref6">Hogan 
                    <italic toggle="yes">et al.</italic> 2008</xref>). The general transcriptomic locations of thousands of RNA binding sites corresponding to hundreds of RBPs have been identified using a family of experimental techniques based on RBP CrossLinking, ImmunoPrecipitation and Sequencing (CLIP-Seq) (
                <xref ref-type="bibr" rid="ref1">Chi 
                    <italic toggle="yes">et al.</italic> 2009</xref>; 
                <xref ref-type="bibr" rid="ref8">Licatalosi 
                    <italic toggle="yes">et al.</italic> 2008</xref>; 
                <xref ref-type="bibr" rid="ref10">Moore 
                    <italic toggle="yes">et al.</italic> 2014</xref>; 
                <xref ref-type="bibr" rid="ref5">Hafner 
                    <italic toggle="yes">et al.</italic> 2010</xref>). A thorough exploration of tens of RBPs binding characteristics in vitro has shown that RBPs can differentiate their binding sites with context preferences beyond narrowly defined binding sequence motif and secondary structure often involving complex binding configurations (
                <xref ref-type="bibr" rid="ref2">Dominguez 
                    <italic toggle="yes">et al.</italic> 2018</xref>). Among the experimental techniques available to date, the enhanced CLIP (eCLIP) protocol (Van 
                <xref ref-type="bibr" rid="ref12">Nostrand 
                    <italic toggle="yes">et al.</italic> 2016</xref>) is particularly important, as it significantly reduces required amplification and increases specificity in identifying authentic binding sites. This improves the efficiency and accuracy of RBP binding site identification and allows for a deeper understanding of the role of RBPs in gene regulation.</p>
            <p>The availability of a large amount of data points produced from the same experimental technique can be very beneficial for applications such as machine learning, as it allows researchers to train and test models with more confidence. Here we present RBP-Tar (
                <xref ref-type="bibr" rid="ref11">Tom&#x00e1;&#x0161; Ra&#x010d;ek 2023</xref>), a centralised and searchable database of experimentally identified RBP binding sites that can significantly facilitate the study of the RBP mediated gene regulation. Using RBP-Tar, researchers can quickly and cleanly retrieve RBP binding sites constrained by both genomic location and associated RBP for hundreds of RBPs.</p>
        </sec>
        <sec id="sec2" sec-type="methods">
            <title>Methods</title>
            <sec id="sec3">
                <title>Implementation</title>
                <p>

                    <bold>Pipeline for reproducible data download and annotation</bold>
                </p>
                <p>We have developed a reproducible and easy-to-use pipeline for downloading and annotating RBP eCLIP data from the ENCODE (
                    <xref ref-type="bibr" rid="ref9">Luo 
                        <italic toggle="yes">et al</italic>. 2020</xref>) database.</p>
                <p>Metadata for eCLIP experiments, as well as additional files containing genomic coordinates are downloaded from the ENCODE database with the following parameters:</p>
                <disp-quote>
                    <p>

                        <italic toggle="yes">&#x201c;status= released&amp;</italic>
</p>
                    <p>

                        <italic toggle="yes">internal_tags=ENCORE&amp;</italic>
</p>
                    <p>

                        <italic toggle="yes">assay_title=eCLIP&amp;</italic>
</p>
                    <p>

                        <italic toggle="yes">biosample_ontology.term_name=K562&amp;</italic>
</p>
                    <p>

                        <italic toggle="yes">biosample_ontology.term_name=HepG2&amp;</italic>
</p>
                    <p>

                        <italic toggle="yes">files.file_type=bed+narrowPeak&amp;</italic>
</p>
                    <p>

                        <italic toggle="yes">type=Experiment</italic>
</p>
                    <p>

                        <italic toggle="yes">files.analyses.status=released&#x201d;.</italic>
                    </p>
                </disp-quote>
                <p>Following the download, information about the chromosome, start position, end position, strand and p values of reads are extracted for each RBP binding site. Binding sites are filtered by length, excluding ones shorter than 20 and longer than 100 nucleotides. As a last step, genomic sequences of binding sites are retrieved.</p>
                <p>The described pipeline is implemented as a set of python scripts and is freely available at 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip">GitHub</ext-link>.</p>
                <p>Using this pipeline, data for 168 RBPs on two cell lines (K562, HepG2) were downloaded. In total, 42 MB of data, representing more than 400 thousand binding sites, were thus processed.</p>
                <p>

                    <bold>Web server</bold>
                </p>
                <p>Here we present RBP-Tar, a web server that can access the above-curated dataset of RBP binding sites (
                    <ext-link ext-link-type="uri" xlink:href="https://rbp-tar.biodata.ceitec.cz/">https://rbp-tar.biodata.ceitec.cz/</ext-link>) and was built with Python (RRID:SCR_008394), the web development framework Flask, and a simple SQLite database (RRID:SCR_017672). The application&#x2019;s source codes can be found on the project&#x2019;s 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/krab1k/RBP-Tar">GitHub</ext-link> page, along with the requirements and instructions for the deployment if a user wants to run the application locally.</p>
                <p>The web user interface allows searching and filtering based on the start and end position of the binding site, strand, chromosome, p value and protein name. It offers the download of the filtered data based on the search done by the user. Due to the size of the dataset, the view is limited to 10,000 results. However, the whole dataset can be conveniently downloaded as a gzipped CSV (14 MB) (
                    <ext-link ext-link-type="uri" xlink:href="https://rbp-tar.biodata.ceitec.cz/download_all">https://rbp-tar.biodata.ceitec.cz/download_all</ext-link>).</p>
            </sec>
            <sec id="sec4">
                <title>Operation</title>
                <p>The RBP-Tar web server can take as input any of the following user-provided parameters: [Start min, Start max] denote the limits of the low genomic coordinate of the locus of interest. Similarly, [End min, End max] denote the limits of the high coordinate. [Strand] and [Chromosome] can be used to narrow down the search to only one strand and a specific chromosome and [-log10(p-value) min, -log10(p-value) max] allows for filtering based on the significance of binding sites. Using these combinations of parameters, a user can easily search for binding sites on their favourite gene, exon, or even a whole chromosome. The last parameter is [Protein name], which brings out a drop-down menu of all the RBPs in the database. If this parameter is not set, all RBPs are queried.</p>
                <p>Results are shown as a table with the [Chromosome, Start, End, Strand] genomic location of the binding site, followed by the [-log10(p-value)] p-value of the binding site, [Protein name] of the associated RBP and the [Sequence] of the binding site contains the genomic sequence of the binding site. Results can be seen online in a table format or downloaded as a CSV file with one button click. We expect most users to download the results and use them for further downstream analyses.</p>
            </sec>
        </sec>
        <sec id="sec5">
            <title>Use cases</title>
            <sec id="sec6">
                <title>Use case 1: all known RBP binding sites on the gene of interest</title>
                <p>The first and potentially most common use case would be the query of all known RBP binding sites on a gene of interest. For example, we can query our web server with the coordinates of the Fused in Sarcoma (FUS) gene (chr16: 31180139-31191605, +) and leave the protein field empty. After this search, all 345 known RBP binding sites on this gene are returned and can be easily downloaded in a CSV file for further analysis (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>).</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>
Figure 1. </label>
                    <caption>
                        <title>Extracting binding sites of all RNA-binding proteins on one locus (use case 1).</title>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/175053/e19a63fa-2b0b-4c9e-914d-279f2cd6ce69_figure1.gif"/>
                </fig>
                <p>In fact, the gene we used encodes the RBP FUS, which plays important roles, among others, in neurodegeneration and cancer progression. We can use the filter [Protein Name] to identify the potential self-targeting of FUS on itself. Indeed, we can thus identify 19 potential FUS self-targeting binding sites identified 
                    <italic toggle="yes">via</italic> eCLIP (
                    <xref ref-type="fig" rid="f2">Figure 2</xref>). Of course, the biological relevance of this type of finding is left to the users, as is the further validation of high-throughput derived RBP binding sites.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>
Figure 2. </label>
                    <caption>
                        <title>Fused in Sarcoma (FUS) self-targeting identified by filtering by gene location and protein name.</title>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/175053/e19a63fa-2b0b-4c9e-914d-279f2cd6ce69_figure2.gif"/>
                </fig>
            </sec>
            <sec id="sec7">
                <title>Use case 2: training/testing dataset for one RBP</title>
                <p>A potential user may want to develop an RBP binding site machine learning tool that would be able to predict binding sites based on a sequence. It is important to make sure that training and testing sets for their machine learning method are not overlapping. Using our web server, they can download all binding sites for a specific RBP, for example, on chromosome 1, and use them as a training set (
                    <xref ref-type="fig" rid="f3">
Figure 3</xref>). Then, do the same for chromosome 2 and use it as an independent testing set, thus ensuring that the training and testing sets do not overlap.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>
Figure 3. </label>
                    <caption>
                        <title>Extracting all binding sites of a single protein on a single chromosome (use case 2).</title>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/175053/e19a63fa-2b0b-4c9e-914d-279f2cd6ce69_figure3.gif"/>
                </fig>
            </sec>
        </sec>
        <sec id="sec8" sec-type="conclusion">
            <title>Conclusion</title>
            <p>Recent advancements in experimental techniques, such as eCLIP, have greatly expanded our understanding of RBP binding preferences and their role in gene regulation. The development of centralised and searchable databases of experimentally identified RBP binding sites allows researchers to access and analyse the binding preferences of RBPs easily. This information can be used to identify known binding sites on genes of interest and aid in training machine-learning models for RBP binding site prediction. This paper presents RBP-Tar, a centralised web server that can retrieve RBP Target sites with location and RBP constraints. RBP-Tar has been designed to be easily accessible by non-experts. It is still confined to a single source of data, which is helpful for avoiding experimental design effects, but makes its scope limited. We plan to expand the web server with other sources of data, as well as ways for the user to be able to take into account provenance and experimental variation.</p>
        </sec>
    </body>
    <back>
        <sec id="sec14" sec-type="data-availability">
            <title>Data availability</title>
            <sec id="sec15">
                <title>Source data</title>
                <p>All data used in RBP-Tar has been downloaded from ENCODE and Ensembl projects in November 2024. RBP eCLIP metadata were downloaded from 
                    <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/metadata/?status=released&amp;internal_tags=ENCORE&amp;assay_title=eCLIP&amp;biosample_ontology.term_name=K562&amp;biosample_ontology.term_name=HepG2&amp;files.file_type=bed+narrowPeak&amp;type=Experiment&amp;files.analyses.status=released">https://www.encodeproject.org/metadata/?status=released&amp;internal_tags=ENCORE
&amp;assay_title=eCLIP&amp;biosample_ontology.term_name=K562&amp;biosample_ontology.term_name=HepG2&amp;files.file_type=
bed+narrowPeak&amp;type=Experiment&amp;files.analyses.status=released</ext-link>. A list of all downloaded files containing genomic coordinates can be found here: 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/blob/main/csv/coord_links.txt">https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/blob/main/csv/coord_links.txt</ext-link>. The reference genome was downloaded from 
                    <ext-link ext-link-type="uri" xlink:href="http://ftp.ensembl.org/pub/release-97/fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna.toplevel.fa.gz">http://ftp.ensembl.org/pub/release-97/fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna.toplevel.fa.gz</ext-link>
                </p>
                <p>Columns &#x2018;chr&#x2019;, &#x2018;start&#x2019;, &#x2018;end&#x2019;, &#x2018;strand&#x2019; and &#x2018;pValue&#x2019; from downloaded files containing genomic coordinates can be found here: 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/tree/main/csv">https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/tree/main/csv</ext-link>. The whole dataset (curated and including sequence) can be downloaded from 
                    <ext-link ext-link-type="uri" xlink:href="https://rbp-tar.biodata.ceitec.cz/">https://rbp-tar.biodata.ceitec.cz/</ext-link> as a gzipped CSV (18 MB) (
                    <ext-link ext-link-type="uri" xlink:href="https://rbp-tar.biodata.ceitec.cz//download_all">https://rbp-tar.biodata.ceitec.cz//download_all</ext-link>).</p>
            </sec>
        </sec>
        <sec id="sec9">
            <title>Software availability</title>
            <sec id="sec10">
                <title>RBP-Tar
</title>
                <p>

                    <bold>Software:</bold> 
                    <ext-link ext-link-type="uri" xlink:href="https://rbp-tar.biodata.ceitec.cz/">https://rbp-tar.biodata.ceitec.cz/</ext-link>
                </p>
                <p>

                    <bold>Source code:</bold> 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/sb-ncbr/RBP-Tar/releases/tag/v1.0.2">https://github.com/sb-ncbr/RBP-Tar/releases/tag/v1.0.2</ext-link>
                </p>
                <p>

                    <bold>Archived source code at the time of publication:</bold> 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.14135388">https://doi.org/10.5281/zenodo.14135388</ext-link> (
                    <xref ref-type="bibr" rid="ref11">Ra&#x010d;ek 2024</xref>)</p>
                <p>

                    <bold>License:</bold> 
                    <ext-link ext-link-type="uri" xlink:href="https://opensource.org/license/mit/">MIT</ext-link>
                </p>
            </sec>
            <sec id="sec11">
                <title>eCLIP RBP</title>
                <p>

                    <bold>Source code:</bold> 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/tree/v1.0.3">https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/tree/v1.0.3</ext-link>
                </p>
                <p>

                    <bold>Archived source code at the time of publication:</bold> 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.14082502">https://doi.org/10.5281/zenodo.14082502</ext-link> (
                    <xref ref-type="bibr" rid="ref7">Gre&#x0161;ov&#x00e1; &amp; Ra&#x010d;ek 2024</xref>)</p>
                <p>

                    <bold>License:</bold> 
                    <ext-link ext-link-type="uri" xlink:href="https://www.apache.org/licenses/LICENSE-2.0">Apache 2.0</ext-link>
                </p>
            </sec>
        </sec>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chi</surname>
                            <given-names>SW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zang</surname>
                            <given-names>JB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mele</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Argonaute HITS-CLIP Decodes microRNA&#x2013;mRNA Interaction Maps.</article-title>
                    <source>

                        <italic toggle="yes">Nature.</italic>
</source>
                    <year>2009</year>;<volume>460</volume>:<fpage>479</fpage>&#x2013;<lpage>486</lpage>.
                    <pub-id pub-id-type="pmid">19536157</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature08170</pub-id>
                    <pub-id pub-id-type="pmcid">PMC2733940</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dominguez</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Freese</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Alexis</surname>
                            <given-names>MS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.</article-title>
                    <source>

                        <italic toggle="yes">Mol. Cell</italic>
</source>
                    <year>2018</year>;<volume>70</volume>(<issue>5</issue>):<fpage>854</fpage>&#x2013;<lpage>67.e9</lpage>.
                    <pub-id pub-id-type="pmid">29883606</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.molcel.2018.05.001</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6062212</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gerstberger</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hafner</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tuschl</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>A Census of Human RNA-Binding Proteins.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Rev. Genet.</italic>
</source>
                    <year>2014</year>;<volume>15</volume>(<issue>12</issue>):<fpage>829</fpage>&#x2013;<lpage>845</lpage>.
                    <pub-id pub-id-type="pmid">25365966</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrg3813</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gebauer</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schwarzl</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Valc&#x00e1;rcel</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A-binding proteins in human genetic disease.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Rev. Genet.</italic>
</source>
                    <year>2021</year>;<volume>22</volume>:<fpage>185</fpage>&#x2013;<lpage>198</lpage>.
                    <pub-id pub-id-type="doi">10.1038/s41576-020-00302-y</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gre&#x0161;ov&#x00e1;</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ra&#x010d;ek</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>&#x201c;ML-Bioinfo-CEITEC/rbp_encode_eclip: v1.0.3 (v1.0.3).&#x201d;</article-title>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.14082502</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hafner</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Landthaler</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Burger</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Transcriptome-Wide Identification of RNA-Binding Protein and microRNA Target Sites by PAR-CLIP.</article-title>
                    <source>

                        <italic toggle="yes">Cell.</italic>
</source>
                    <year>2010</year>;<volume>141</volume>(<issue>1</issue>):<fpage>129</fpage>&#x2013;<lpage>141</lpage>.
                    <pub-id pub-id-type="pmid">20371350</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.cell.2010.03.009</pub-id>
                    <pub-id pub-id-type="pmcid">PMC2861495</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hogan</surname>
                            <given-names>DJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Riordan</surname>
                            <given-names>DP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gerber</surname>
                            <given-names>AP</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Diverse RNA-Binding Proteins Interact with Functionally Related Sets of RNAs, Suggesting an Extensive Regulatory System.</article-title>
                    <source>

                        <italic toggle="yes">PLoS Biol.</italic>
</source>
                    <year>2008</year>;<volume>6</volume>(<issue>10</issue>):<fpage>e255</fpage>.
                    <pub-id pub-id-type="pmid">18959479</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pbio.0060255</pub-id>
                    <pub-id pub-id-type="pmcid">PMC2573929</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Licatalosi</surname>
                            <given-names>DD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mele</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fak</surname>
                            <given-names>JJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>HITS-CLIP Yields Genome-Wide Insights into Brain Alternative RNA Processing.</article-title>
                    <source>

                        <italic toggle="yes">Nature</italic>
</source>
                    <year>2008</year>;<volume>456</volume>(<issue>7221</issue>):<fpage>464</fpage>&#x2013;<lpage>469</lpage>.
                    <pub-id pub-id-type="pmid">18978773</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature07488</pub-id>
                    <pub-id pub-id-type="pmcid">PMC2597294</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Luo</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hitz</surname>
                            <given-names>BC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gabdank</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>New developments on the Encyclopedia of DNA Elements (ENCODE) data portal.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2020 Jan 8</year>;<volume>48</volume>(<issue>D1</issue>):<fpage>D882</fpage>&#x2013;<lpage>D889</lpage>.
                    <pub-id pub-id-type="pmid">31713622</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkz1062</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7061942</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Moore</surname>
                            <given-names>MJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gantman</surname>
                            <given-names>EC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Mapping Argonaute and Conventional RNA-Binding Protein Interactions with RNA at Single-Nucleotide Resolution Using HITS-CLIP and CIMS Analysis.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Protoc.</italic>
</source>
                    <year>2014</year>;<volume>9</volume>(<issue>2</issue>):<fpage>263</fpage>&#x2013;<lpage>293</lpage>.
                    <pub-id pub-id-type="doi">10.1038/nprot.2014.012</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nostrand</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Eric</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pratt</surname>
                            <given-names>GA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Robust Transcriptome-Wide Discovery of RNA-Binding Protein Binding Sites with Enhanced CLIP (eCLIP).</article-title>
                    <source>

                        <italic toggle="yes">Nat. Methods</italic>
</source>
                    <year>2016</year>;<volume>13</volume>(<issue>6</issue>):<fpage>508</fpage>&#x2013;<lpage>514</lpage>.
                    <pub-id pub-id-type="pmid">27018577</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmeth.3810</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4887338</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ra&#x010d;ek</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>&#x201c;sb-ncbr/RBP-Tar: v1.0.2 (v1.0.2).&#x201d;</article-title>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.14135388</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report354325">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.175053.r354325</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Liu</surname>
                        <given-names>Zhi-Ping</given-names>
                    </name>
                    <xref ref-type="aff" rid="r354325a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-7742-9161</uri>
                </contrib>
                <aff id="r354325a1">
                    <label>1</label>Shandong University, Jinan, Shandong, China</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>16</day>
                <month>1</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Liu ZP</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport354325" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In this paper, the authors present the development of RBP-Tar, a searchable database and web server dedicated to experimental RNA-binding protein (RBP) binding sites. RBP-Tar currently focuses on eCLIP data for 167 RBPs mapped on the human genome, though its scope has the potential to be expanded to include a broader range of CLIP data to enhance its utility.</p>
            <p> </p>
            <p> To address the current limitation of focusing solely on eCLIP data, the authors suggest exploring the inclusion of other CLIP data types in future updates. Additionally, to clarify the purpose and content of this paper, the authors recommend refining the title and abstract to more accurately reflect the scope and contributions of their work. Furthermore, to provide context for their work, the authors need include a discussion of existing databases similar to RBP-Tar. This section should summarize the features and specificity of RBP-Tar compared to these existing resources. If possible, a comparison of the overlapping information with these databases would be beneficial to highlight the unique aspects of RBP-Tar. For the web server component of RBP-Tar, the authors need consider incorporating submission and download options for original data and query results. These features would enhance the usability and accessibility of the server, making it a more comprehensive resource for researchers studying RNA-binding proteins.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Partly</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Bioinformatics, Systems Biology.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report354315">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.175053.r354315</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Nishanth</surname>
                        <given-names>M. J.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r354315a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r354315a1">
                    <label>1</label>St Joseph&#x2019;s University, Bengaluru, India</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>16</day>
                <month>1</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Nishanth MJ</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport354315" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have developed RBP-Tar, a database and tool that can be used to identify the known target motifs of RBPs on human genes or chromosomal loci. This would be a useful tool for scientific community to identify RBP target motifs. Following are the technical queries/suggestions for the authors:</p>
            <p> 1. The authors state that the target motifs &lt; 20 bases in length or &gt;100 bases are excluded from the result-list. Why are the motifs &lt; 20 bases excluded? The rationale for this parameter needs to be given.</p>
            <p> 2. It may be beneficial to add an 'evidence' column in the result/output table. This column could have the reference of the primary study/experiment which forms the evidence for the particular target site being displayed in the result table. This would be helpful for the users if further validation of the target motifs is needed.</p>
            <p> 3. The authors could consider adding a summary of the existing software tools in the realm of RBP target motif identification and related bioinformatic analyses. They could elaborate on the utility of RBP-Tar in the context of the existing software tools, which would give a broader perspective of the usefulness of RBP-Tar. A few of the existing relevant tools that could be discussed: RBP Map (https://rbpmap.technion.ac.il/), Transite (https://transite.mit.edu/), oRNAment (https://rnabiology.ircm.qc.ca/oRNAment/), and ATtRACT (https://attract.cnic.es/).</p>
            <p> 4. The authors need to detail the statistical tests and analyses on which the p-value calculations are based.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Partly</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Molecular biology, gene regulation</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report349169">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.175053.r349169</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Van Nostrand</surname>
                        <given-names>Eric</given-names>
                    </name>
                    <xref ref-type="aff" rid="r349169a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r349169a1">
                    <label>1</label>Baylor College of Medicine, Houston, TX, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>10</day>
                <month>1</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Van Nostrand E</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport349169" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This article by Gresova et al build a searchable web tool for the ENCODE eCLIP data that profiles RNA binding protein interactions.&#x00a0;</p>
            <p> </p>
            <p> Since this is a named review, I can specifically comment - as one other reviewer notes, some of the filtering choices here are done without really justifying why they are appropriate - i.e., why are peaks shorter than 20 or longer than 100nt excluded? That's something that should either be justified with analysis showing that it improves signal-to-noise, or not done. (In particular, the CLIP per peak caller used in ENCODE tends to err on the side of calling short peaks, so I highly suspect filtering peaks shorter than 20 is removing a significant amount of real signal)</p>
            <p> </p>
            <p> Similarly, we reported the fold-enrichment versus input because we found that to often be more informative than p-value (which is often more a function of transcript abundance rather than binding signal), but that is not reported here for an undescribed reason.</p>
            <p> </p>
            <p> Since the entire website is based off of the ENCODE eCLIP data, it seems appropriate to cite that manuscript (Van Nostrand EL, et al., 2020 - PMID&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/32728246/">32728246</ext-link>&#x00a0;[Ref-1]) rather than the broader ENCODE overview Luo et al publication.</p>
            <p> </p>
            <p> If the idea is to make it extremely user-friendly, it seems like having an ability to copy-paste a UCSC browser-style region (e.g. "chr1:xxxx-yyyy") would make it simpler to use</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Partly</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>eCLIP, RNA binding protein studies</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-349169-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>A large-scale binding and functional map of human RNA-binding proteins.</article-title>
                        <source>
                            <italic>Nature</italic>
                        </source>.<year>2020</year>;<volume>583</volume>(<issue>7818</issue>) :
                        <elocation-id>10.1038/s41586-020-2077-3</elocation-id>
                        <fpage>711</fpage>-<lpage>719</lpage>
                        <pub-id pub-id-type="pmid">32728246</pub-id>
                        <pub-id pub-id-type="doi">10.1038/s41586-020-2077-3</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report349176">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.175053.r349176</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Stock</surname>
                        <given-names>Michiel</given-names>
                    </name>
                    <xref ref-type="aff" rid="r349176a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r349176a1">
                    <label>1</label>Ghent University, Ghent, Belgium</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>9</day>
                <month>1</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Stock M</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport349176" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This article presents RBP-tar, a small database with a web interface that enables users to access RNA-binding proteins in humans. Users can select regions on different chromosomes and view a list of all binding proteins. Additionally, there are various filter options, such as filtering by protein name for self-binding. Users can also directly download a CSV file containing the entire database, which I always appreciate. The database is designed to facilitate the development of machine learning models.</p>
            <p> </p>
            <p> The short article appears to be well explained. However, I find one aspect lacking: the comparison and relationship with similar databases, such as RBP2GO or EuRBPDB. This should be addressed in a designated section.</p>
            <p> </p>
            <p> I would also like some information on how this will be maintained. Will more RBPs be added?</p>
            <p> </p>
            <p> Furthermore, I would appreciate some additional clarification regarding what the p-values refer to.</p>
            <p> </p>
            <p> minor: &#x00a0;</p>
            <p> - "p values" =&gt; "p-values" (clarify what the p-values are for). &#x00a0;</p>
            <p> - what is exactly the "&#x00a0;[-log10(p-value)] p-value of the binding site"?</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Partly</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Bioinformatics and machine learning.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report313335">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.169056.r313335</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Meyer</surname>
                        <given-names>Clifford A</given-names>
                    </name>
                    <xref ref-type="aff" rid="r313335a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r313335a1">
                    <label>1</label>Harvard T.H. Chan School of Public Health, Dana-Farber Cancer Institute, Boston, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>5</day>
                <month>9</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Meyer CA</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport313335" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The revision is not satisfactory as most of the reviewers' suggestions for improving the website were not implemented. We hope that these will be implemented in future versions.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Partly</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Epigenetics, Chromatin analysis, Gene regulation, Single cell</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment12848-313335">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Alexiou</surname>
                            <given-names>Panagiotis</given-names>
                        </name>
                        <aff>University of Malta, Malta</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>19</day>
                    <month>11</month>
                    <year>2024</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for the continued feedback on our manuscript and webserver resource. We appreciate the reviewer&#x2019;s commitment to ensuring that the resource becomes more robust and user-friendly. In response to the earlier suggestions, we have implemented the following improvements in our webserver, as outlined below: 
                    <list list-type="order">
                        <list-item>
                            <p>Inclusion of p-values: We have added a p-value column to our data, providing an additional layer of statistical context to aid users in evaluating the significance of RNA binding protein binding sites.</p>
                        </list-item>
                        <list-item>
                            <p>Filtering and sorting functionality by p-value: To enhance usability, we have introduced the functionality to filter and sort binding site data based on p-values. This improvement allows users to more easily interpret and prioritize results according to their significance.</p>
                        </list-item>
                    </list> We recognize that many of the enhancements suggested in previous rounds, such as expanding the dataset scope and incorporating broader quality control metrics, require a significant overhaul and are part of our longer-term development roadmap.</p>
                <p> Our current updates reflect initial, impactful steps toward addressing reviewer concerns, and we remain committed to further expanding the features and datasets in future versions.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report313334">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.169056.r313334</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Pan</surname>
                        <given-names>Xiaoyong</given-names>
                    </name>
                    <xref ref-type="aff" rid="r313334a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5010-464X</uri>
                </contrib>
                <aff id="r313334a1">
                    <label>1</label>Shanghai Jiao Tong University, Shanghai, Shanghai, China</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>23</day>
                <month>8</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Pan X</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport313334" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors addressed my comments.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Partly</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report224195">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.143817.r224195</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Meyer</surname>
                        <given-names>Clifford A</given-names>
                    </name>
                    <xref ref-type="aff" rid="r224195a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r224195a1">
                    <label>1</label>Harvard T.H. Chan School of Public Health, Dana-Farber Cancer Institute, Boston, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>18</day>
                <month>12</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Meyer CA</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport224195" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This paper describes a database of RNA binding protein binding sites based on 2 ENCODE cell lines. The website allows uses to search for data by querying a gene or genomic interval.</p>
            <p> </p>
            <p> The scope of the database is very limited. It is not clear why the data is restricted to eCLIP data and not CLIP data in general. Although the authors make the argument that eCLIP data is of good quality, quality will vary from sample to sample and there are likely to be high quality CLIP data sets as well as low quality eCLIP ones.</p>
            <p> </p>
            <p> It appears that the processed peaks are downloaded from ENCODE, but the methods used by ENCODE to identify these peaks are not described, and statistics such as p-values, false discovery rates and enrichment relative to background are not provided. Quality controls are also not provided, so it is not possible to know if the experiment worked.&#x00a0;</p>
            <p> </p>
            <p> As processed RBP data is downloaded from ENCODE&#x00a0; this resource does not add much to what is already available on the ENCODE website. A resource that processed the raw read data from other data resources, such as GEO, would be more valuable. There are over 2,000 eCLIP samples and over 6,000 CLIP samples in GEO.</p>
            <p> </p>
            <p> The search features are useful, although it is hard to interpret the results. Some ranking of the results from most significant to least is needed. It would also be helpful to enable filtering based on some binding sites statistics.</p>
            <p> </p>
            <p> Overall, the resource is limited in terms of the data included and the website features. It would be helpful to include quality control metrics to let users know which samples are more informative.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Partly</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Epigenetics, Chromatin analysis, Gene regulation, Single cell</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment12002-224195">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Alexiou</surname>
                            <given-names>Panagiotis</given-names>
                        </name>
                        <aff>University of Malta, Malta</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>11</day>
                    <month>7</month>
                    <year>2024</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <italic>This paper describes a database of RNA binding protein binding sites based on 2 ENCODE cell lines. The website allows uses to search for data by querying a gene or genomic interval.</italic>
                </p>
                <p>
                    <italic> The scope of the database is very limited. It is not clear why the data is restricted to eCLIP data and not CLIP data in general. Although the authors make the argument that eCLIP data is of good quality, quality will vary from sample to sample and there are likely to be high quality CLIP data sets as well as low quality eCLIP ones.</italic>
                </p>
                <p> </p>
                <p> We agree that the scope of the database is limited at this point and we aim to widen it in the future updates of the app. ENCODE/ENCORE has a comprehensive eCLIP experimental list, and only 3 iCLIP libraries (
                    <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/encore-matrix/?type=Experiment&amp;status=released&amp;internal_tags=ENCORE">
                        <underline>https://www.encodeproject.org/encore-matrix/?type=Experiment&amp;status=released&amp;internal_tags=ENCORE</underline>
                    </ext-link>). We chose eCLIP to keep the experimental procedure more consistent in case of different methods introducing different biases. In a future version, we will explore using different datasets from multiple sources.&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>It appears that the processed peaks are downloaded from ENCODE, but the methods used by ENCODE to identify these peaks are not described, and statistics such as p-values, false discovery rates and enrichment relative to background are not provided. Quality controls are also not provided, so it is not possible to know if the experiment worked.</italic>&#x00a0;</p>
                <p> </p>
                <p> Methods used by ENCODE to identify peaks from eCLIP data are described here: 
                    <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/eclip/">
                        <underline>https://www.encodeproject.org/eclip/</underline>
                    </ext-link> while the detailed description of data processing pipeline can be downloaded from here: 
                    <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/documents/739ca190-8d43-4a68-90ce-1a0ddfffc6fd/@@download/attachment/eCLIP_analysisSOP_v2.2.pdf">
                        <underline>https://www.encodeproject.org/documents/739ca190-8d43-4a68-90ce-1a0ddfffc6fd/@@download/attachment/eCLIP_analysisSOP_v2.2.pdf</underline>
                    </ext-link>. This pipeline ensures that only high-quality peaks are identified.</p>
                <p> </p>
                <p> 
                    <italic>As processed RBP data is downloaded from ENCODE&#x00a0; this resource does not add much to what is already available on the ENCODE website. A resource that processed the raw read data from other data resources, such as GEO, would be more valuable. There are over 2,000 eCLIP samples and over 6,000 CLIP samples in GEO.</italic>
                </p>
                <p> </p>
                <p> We agree that a resource that processed the raw read data from other data resources, such as GEO, would be more valuable, we will consider adding other data resources in the future updates of the webserver. The addition of our app to what is already available on the ENCODE website is advanced filtering of binding sites based on the genome position.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report224199">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.143817.r224199</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Pan</surname>
                        <given-names>Xiaoyong</given-names>
                    </name>
                    <xref ref-type="aff" rid="r224199a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5010-464X</uri>
                </contrib>
                <aff id="r224199a1">
                    <label>1</label>Shanghai Jiao Tong University, Shanghai, Shanghai, China</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>18</day>
                <month>12</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Pan X</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport224199" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.131014.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>
                <list list-type="order">
                    <list-item>
                        <p>&#x200b;&#x200b;&#x200b;&#x200b;&#x200b;In the paper, the author claim they download narrowPeak metadata from 
                            <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/metadata/?status=released&amp;internal_tags=ENCORE&amp;assay_title=eCLIP&amp;biosample_ontology.term_name=K562&amp;biosample_ontology.term_name=HepG2&amp;files.file_type=bed+narrowPeak&amp;type=Experiment&amp;files.analyses.status=released&amp;files.preferred_default=true">https://www.encodeproject.org/metadata/?status=released&amp;internal_tags=ENCORE&amp;assay_title=eCLIP&amp;biosample_ontology.term_name=K562&amp;biosample_ontology.term_name=HepG2&amp;files.file_type=bed+narrowPeak&amp;type=Experiment&amp;files.analyses.status=released&amp;files.preferred_default=true</ext-link>，and the link summary is provided in the file &#x00a0;
                            <ext-link ext-link-type="uri" xlink:href="https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/blob/main/csv/coord_links.txt">https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/blob/main/csv/coord_links.txt</ext-link>. In the ENCODE dataset, the majority of RNA-binding proteins (RBPs) have biological replicate(s) or technical replicate(s), with each replicate having an associated narrowPeak file. Additionally, there is another merged peak file containing reproducible peaks as determined by entropy-ordered peaks between two replicates. However, it is observed that only one file per RBP from K562 or HepG2, which is both the merged peak file. For eCLIP experiment on K562 against QKI, I just find ENCFF190XSX that is from replicate 1. A careful examination of the data is recommended to ensure accuracy and completeness.</p>
                    </list-item>
                    <list-item>
                        <p>After narrowPeak download, the author filter the peak by length, excluding ones shorter than 20 and longer than 100 nucleotides. It's important to note that must short (rarely more than 4&#x2013;6-nucleotide-long) &#x00a0;sequence elements within RNA targets that are recognized and bound by RNA-binding proteins (Gerstberger, S et al. (2014 
                            <sup>
                                <xref ref-type="bibr" rid="rep-ref-224199-1">1</xref>
                            </sup>). In my perspective, employing a filter based on fold change and p-value, in accordance with the eCLIP-seq Processing Pipeline of ENCODE (
                            <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/documents/739ca190-8d43-4a68-90ce-1a0ddfffc6fd/@@download/attachment/eCLIP_analysisSOP_v2.2.pdf)">https://www.encodeproject.org/documents/739ca190-8d43-4a68-90ce-1a0ddfffc6fd/@@download/attachment/eCLIP_analysisSOP_v2.2.pdf)</ext-link> is powerful.</p>
                    </list-item>
                    <list-item>
                        <p>If this job is for the benefit of biologists, why not use transcripts as a reference? Because RBP binds to transcripts. And it's too simplistic for systems biologists, the author can refer to the POSTAR3 data.</p>
                    </list-item>
                </list>
            </p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Partly</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>The authors need solve some issues before published</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-224199-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>A census of human RNA-binding proteins.</article-title>
                        <source>
                            <italic>Nat Rev Genet</italic>
                        </source>.<year>2014</year>;<volume>15</volume>(<issue>12</issue>) :
                        <elocation-id>10.1038/nrg3813</elocation-id>
                        <fpage>829</fpage>-<lpage>45</lpage>
                        <pub-id pub-id-type="pmid">25365966</pub-id>
                        <pub-id pub-id-type="doi">10.1038/nrg3813</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment12001-224199">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Alexiou</surname>
                            <given-names>Panagiotis</given-names>
                        </name>
                        <aff>University of Malta, Malta</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>11</day>
                    <month>7</month>
                    <year>2024</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer Comment:</bold>
                </p>
                <p> 
                    <italic>&#x200b;&#x200b;&#x200b;&#x200b;&#x200b;In the paper, the author claim they download narrowPeak metadata from
                        <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/metadata/?status=released&amp;internal_tags=ENCORE&amp;assay_title=eCLIP&amp;biosample_ontology.term_name=K562&amp;biosample_ontology.term_name=HepG2&amp;files.file_type=bed+narrowPeak&amp;type=Experiment&amp;files.analyses.status=released&amp;files.preferred_default=true"> 
                            <underline>https://www.encodeproject.org/metadata/?status=released&amp;internal_tags=ENCORE&amp;assay_title=eCLIP&amp;biosample_ontology.term_name=K562&amp;biosample_ontology.term_name=HepG2&amp;files.file_type=bed+narrowPeak&amp;type=Experiment&amp;files.analyses.status=released&amp;files.preferred_default=true</underline>
                        </ext-link>，and the link summary is provided in the file&#x00a0; 
                        <ext-link ext-link-type="uri" xlink:href="https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/blob/main/csv/coord_links.txt">
                            <underline>https://github.com/ML-Bioinfo-CEITEC/rbp_encode_eclip/blob/main/csv/coord_links.txt</underline>
                        </ext-link>. In the ENCODE dataset, the majority of RNA-binding proteins (RBPs) have biological replicate(s) or technical replicate(s), with each replicate having an associated narrowPeak file. Additionally, there is another merged peak file containing reproducible peaks as determined by entropy-ordered peaks between two replicates. However, it is observed that only one file per RBP from K562 or HepG2, which is both the merged peak file. For eCLIP experiment on K562 against QKI, I just find ENCFF190XSX that is from replicate 1. A careful examination of the data is recommended to ensure accuracy and completeness.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>
                        <bold>Author Response:</bold>
                    </italic> We would like to thank the reviewer for the insightful finding. We have carefully examined the downloaded metadata and adjusted the processing pipeline to ensure the merged peak file is used for every RBP. In the methods, section &#x201c;Pipeline for reproducible data download and annotation&#x201d;, we have added the parameter &#x201c;files.analyses.status=released&#x201d; that is ensuring all available files are downloaded. Updated data have been uploaded to the RBP-Tar webserver and the manuscript has been updated with the current number of binding sites and size of files. We have provided new figures with up-to-date views of the web server.</p>
                <p> </p>
                <p> 
                    <bold>Reviewer Comment:</bold>
                </p>
                <p> 
                    <italic>After narrowPeak download, the author filter the peak by length, excluding ones shorter than 20 and longer than 100 nucleotides. It's important to note that must short (rarely more than 4&#x2013;6-nucleotide-long)&#x00a0; sequence elements within RNA targets that are recognized and bound by RNA-binding proteins (Gerstberger, S et al. (2014
                        <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/articles/12-755#rep-ref-224199-1"> 
                            <underline>1</underline>
                        </ext-link>). In my perspective, employing a filter based on fold change and p-value, in accordance with the eCLIP-seq Processing Pipeline of ENCODE (
                        <ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/documents/739ca190-8d43-4a68-90ce-1a0ddfffc6fd/@@download/attachment/eCLIP_analysisSOP_v2.2.pdf)">
                            <underline>https://www.encodeproject.org/documents/739ca190-8d43-4a68-90ce-1a0ddfffc6fd/@@download/attachment/eCLIP_analysisSOP_v2.2.pdf)</underline>
                        </ext-link> is powerful.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>
                        <bold>Author Response:&#x00a0;</bold>
                    </italic>We appreciate the reviewer's perspective on filtering criteria. Our decision to filter based on length aimed to exclude potential artifacts and prioritize biologically relevant binding sites. While fold change and p-value are valuable parameters, our focus was on ensuring the inclusion of robust binding sites within the specified length range. We will consider incorporating additional filtering parameters in future iterations leading to a new version of the web server.</p>
                <p> </p>
                <p> 
                    <bold>Reviewer Comment:</bold>
                </p>
                <p> 
                    <italic>If this job is for the benefit of biologists, why not use transcripts as a reference? Because RBP binds to transcripts. And it's too simplistic for systems biologists, the author can refer to the POSTAR3 data.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>
                        <bold>Author Response:&#x00a0;</bold>
                    </italic>&#x00a0;Our choice to focus on genomic coordinates was driven by the goal of providing a comprehensive resource for accessing RBP binding sites across the genome. While transcripts offer valuable context, our approach allows for flexibility in analyzing binding sites within different genomic regions and across multiple transcripts. However, we recognize the importance of incorporating transcript-level information and will explore integrating such data in future updates of the webserver to enhance the utility of our tool for systems biologists.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
