<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.9973.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Bioinformatics</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>BgeeDB, an R package for retrieval of curated expression datasets and for gene list expression localization enrichment tests</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 2 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no" equal-contrib="yes">
                    <name>
                        <surname>Komljenovic</surname>
                        <given-names>Andrea</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no" equal-contrib="yes">
                    <name>
                        <surname>Roux</surname>
                        <given-names>Julien</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Wollbrett</surname>
                        <given-names>Julien</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Robinson-Rechavi</surname>
                        <given-names>Marc</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Bastian</surname>
                        <given-names>Frederic B.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland</aff>
                <aff id="a2">
                    <label>2</label>SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland</aff>
                <aff id="a3">
                    <label>3</label>Department of Biomedicine, University of Basel, Basel, Switzerland</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:frederic.bastian@unil.ch">frederic.bastian@unil.ch</email>
                </corresp>
                <fn id="fn1">
                    <p>*Contributed equally to the work.</p>
                </fn>
                <fn fn-type="con">
                    <p>AK and JR contributed equally to this work. AK developed the initial BgeeDB R package and made it available in Bioconductor. JR implemented the enrichment analyses, and refined the data download part. JW corrected some bugs, updated the package and the files used by the package, to use the latest Bgee release and for better compatibility between operating systems. FBB developed the server-side responses. MRR and FBB tested and commented on the package development. AK and JR wrote the manuscript. All authors discussed the results and implications and commented on the manuscript at all stages.</p>
                </fn>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>7</day>
                <month>8</month>
                <year>2018</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2016</year>
            </pub-date>
            <volume>5</volume>
            <elocation-id>2748</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>30</day>
                    <month>7</month>
                    <year>2018</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2018 Komljenovic A et al.</copyright-statement>
                <copyright-year>2018</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/5-2748/pdf"/>
            <abstract>
                <p>BgeeDB is a collection of functions to import into R re-annotated, quality-controlled and re-processed expression data available in the Bgee database. This includes data from thousands of wild-type healthy samples of multiple animal species, generated with different gene expression technologies (RNA-seq, Affymetrix microarrays, expressed sequence tags, and 
                    <italic toggle="yes">in situ</italic> hybridizations). BgeeDB facilitates downstream analyses, such as gene expression analyses with other Bioconductor packages. Moreover, BgeeDB includes a new gene set enrichment test for preferred localization of expression of genes in anatomical structures (&#x201c;TopAnat&#x201d;). Along with the classical Gene Ontology enrichment test, this test provides a complementary way to interpret gene lists.</p>
                <p>Availability: 
                    <ext-link ext-link-type="uri" xlink:href="https://www.bioconductor.org/packages/BgeeDB/">https://www.bioconductor.org/packages/BgeeDB/</ext-link>
                </p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Bioconductor</kwd>
                <kwd>R Package</kwd>
                <kwd>Collective Data Access</kwd>
                <kwd>Gene expression</kwd>
                <kwd>Gene Enrichment Analysis</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1">
                    <funding-source>Schweizerischer Nationalfonds zur F&#x00f6;rderung der Wissenschaftlichen Forschung</funding-source>
                    <award-id>31003A</award-id>
                </award-group>
                <award-group id="fund-2">
                    <funding-source>Schweizerischer Nationalfonds zur F&#x00f6;rderung der Wissenschaftlichen Forschung</funding-source>
                    <award-id>153341</award-id>
                </award-group>
                <funding-statement>This work was supported by SIB Swiss Institute of Bioinformatics project Bgee, Swiss National Science Foundation grant 31003A_153341, SystemsX.ch project AgingX, and Etat de Vaud.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>We thank the reviewers for their work, and we feel that, thanks to their comments, the manuscript has been greatly improved. We have updated considerably the Bgee database, and the corresponding documentation. This addresses the comments made by the reviewers about a lack of transparency of our data processing steps. Moreover, we have set up the documentation is such a manner as to insure that it remains updated with the progress of the database in the future. We have added information about the processing of gene expression data performed at the Bgee database. We notably now link to the source code of our pipeline for data processing. We have added some information about similar tools allowing to perform gene list expression localization enrichment analyses. We also have updated the examples and results based on the use of the latest Bgee release and BgeeDB package version. The code examples have been made more robust to potential future changes to the format of the files used by the package. We felt that it was necessary to link the revised publication of the BgeeDB package to the updated documentation. We have added an author, Julien Wollbrett, to the author list. Since this manuscript was first submitted, Julien Wollbrett made significant contributions to the development of the package described in this paper, and notably towards the aim of submitting this revised manuscript. Please also note that we have updated all figures and supplementary files, so that they are based on the latest releases of our database and R package.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Gene expression levels influence the behavior of cells, the functionality of tissues, and a wide range of processes from development and aging to physiology or behavior. It is of particular importance that researchers are able to take advantage of the vast amounts of publicly available gene expression datasets to reproduce and validate results, or to investigate new research questions
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>.</p>
            <p>To that purpose, one should be able to easily query and import gene expression datasets generated using different technologies, and their associated metadata. The R environment
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup> has now become a standard for bioinformatics and statistical analysis of gene expression data, through the Bioconductor framework and its many open source packages
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>,
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup>. It is thus desirable to provide an access to gene expression datasets programmatically and directly into R. For example, the Bioconductor packages 
                <ext-link ext-link-type="uri" xlink:href="https://www.bioconductor.org/packages/ArrayExpress/">ArrayExpress</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>, 
                <ext-link ext-link-type="uri" xlink:href="https://bioconductor.org/packages/GEOquery">GEOquery</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup> and 
                <ext-link ext-link-type="uri" xlink:href="https://bioconductor.org/packages/SRAdb">SRAdb</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup> provide access to the reference databases 
                <ext-link ext-link-type="uri" xlink:href="https://www.ebi.ac.uk/arrayexpress/">ArrayExpress</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-10">10</xref>
                </sup>, 
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/">GEO</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-11">11</xref>
                </sup> and 
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra">SRA</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup> respectively.</p>
            <p>However, such databases are primary archives aiming at comprehensiveness. They include gene expression datasets and other functional genomics data, generated from diverse experimental conditions, of diverse quality. The data provided are heterogeneous, with some datasets including only unprocessed raw data, and others including only data processed using specific analysis pipelines. For instance, over the 
                <ext-link ext-link-type="uri" xlink:href="http://www.ebi.ac.uk/arrayexpress/browse.html?keywords=processed%3Atrue&amp;exptype%5B%5D=%22rna+assay%22&amp;exptype%5B%5D=%22array+assay%22">44,481 RNA array assay experiments stored in ArrayExpress with processed data available</ext-link> as of June 2018, 
                <ext-link ext-link-type="uri" xlink:href="http://www.ebi.ac.uk/arrayexpress/search.html?query=raw%3Atrue+AND+processed%3Atrue&amp;exptype%5B%5D=%22rna+assay%22&amp;exptype%5B%5D=%22array+assay%22">7,544 do not include the raw data</ext-link>. Metadata are often provided as free-text information that is difficult to query. For instance, the GEO database encourages submitters of high-throughput sequencing experiments to provide MINSEQE elements, but does not enforce this practice (see, e.g., 
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/info/seq.html#intro">GEO submission guidelines</ext-link>, and 
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/info/examples/seq_template_v2.1.xls">GEO Excel template for submissions</ext-link>). Unless the user needs to retrieve a specific known dataset from its accession number, it can be difficult to identify relevant available datasets. This can ultimately constitute an obstacle to data reuse.</p>
            <p>One response to this diversity of primary archives is topical databases
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. They can be useful for researchers of specialized fields, and even more so if they propose an R package for data access. For example, the 
                <ext-link ext-link-type="uri" xlink:href="http://bioconductor.org/packages/BrainStars/">BrainStars Bioconductor package</ext-link> allows access to microarray data of mouse brain regions samples from the BrainStars project
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>,
                    <xref ref-type="bibr" rid="ref-14">14</xref>
                </sup>. The 
                <ext-link ext-link-type="uri" xlink:href="https://bioconductor.org/packages/ImmuneSpaceR">ImmuneSpaceR Bioconductor package</ext-link> allows access to the gene expression data generated by the Human Immunology Project Consortium
                <sup>
                    <xref ref-type="bibr" rid="ref-15">15</xref>
                </sup>. Such efforts allow a better control of the data and annotation quality, but by nature they include a limited number of conditions, which only fit the needs of specialized projects. Similarly, numerous &#x201c;ExperimentData&#x201d; packages are available on the Bioconductor repository, which each include a single curated and well-formatted expression dataset (see 
                <ext-link ext-link-type="uri" xlink:href="https://www.bioconductor.org/packages/release/BiocViews.html#___ExpressionData">https://www.bioconductor.org/packages/release/BiocViews.html#___ExpressionData</ext-link>). But these packages are rarely updated and are mostly meant to be used as examples in software packages vignettes, for teaching, or to provide supplementary data of publications. The package 
                <ext-link ext-link-type="uri" xlink:href="https://bioconductor.org/packages/ExperimentHub/">ExperimentHub</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup> also provides access to a central location where this type of single datasets can be retrieved, but it does not address the difficulty of integrating datasets annotated and processed in different ways.</p>
            <p>Finally, added-value databases aim at filtering, annotating, and possibly reprocessing all or some of the datasets available from the primary archives
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. For example, there is a 
                <ext-link ext-link-type="uri" xlink:href="https://www.bioconductor.org/packages/ExpressionAtlas/">Bioconductor package to access the Expression Atlas</ext-link>, which includes a selection of microarray and RNA-seq datasets from ArrayExpress that are re-annotated and reprocessed
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>,
                    <xref ref-type="bibr" rid="ref-18">18</xref>
                </sup>. Similarly, the 
                <ext-link ext-link-type="uri" xlink:href="http://bioconductor.org/packages/recount/">ReCount</ext-link> Bioconductor package provides access to a dataset of over 70,000 reanalyzed human RNA-seq samples from SRA (see 
                <ext-link ext-link-type="uri" xlink:href="https://jhubiostatistics.shinyapps.io/recount/">https://jhubiostatistics.shinyapps.io/recount/</ext-link>)
                <sup>
                    <xref ref-type="bibr" rid="ref-19">19</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-21">21</xref>
                </sup>.</p>
            <p>The Bgee database (
                <ext-link ext-link-type="uri" xlink:href="https://bgee.org/">https://bgee.org/</ext-link>)
                <sup>
                    <xref ref-type="bibr" rid="ref-22">22</xref>
                </sup> is another added-value database, which currently offers access to reprocessed gene expression datasets from 29 animal species. Bgee aims at comparisons of gene expression patterns across tissues, developmental stages, ages and species. It provides manually curated annotations to ontology terms, describing precisely the experimental conditions used. It integrates expression data generated with multiple technologies: RNA-Seq, Affymetrix microarrays, 
                <italic toggle="yes">in situ</italic> hybridization, and expressed sequence tags (ESTs) in release 14. An important characteristic of Bgee is that all datasets are manually curated to retain only &#x201c;normal&#x201d; healthy wild-type samples, i.e., excluding gene knock-out, treatments, or diseases. Finally, Bgee datasets are carefully checked for quality issues, and reprocessed to produce normalized expression level, calls of presence/absence of expression, and of differential expression. Bgee thus provides a reference of high-quality and reusable gene expression datasets that are relevant for biological insights into normal conditions of gene expression. The release 14.0 of Bgee covers 29 animal species, and includes 5,745 RNA-seq libraries, 12,996 Affymetrix chips, 360,653 results from 49,241 
                <italic toggle="yes">in situ</italic> hybridization experiments, and 3,335 EST libraries. This includes 4,860 human RNA-Seq libraries from the GTEx project
                <sup>
                    <xref ref-type="bibr" rid="ref-23">23</xref>,
                    <xref ref-type="bibr" rid="ref-24">24</xref>
                </sup>.</p>
            <p>Until 2016 the Bgee database lacked a programmatic access to data through a R package, a shortcoming that we have addressed with the release of the BgeeDB Bioconductor package, available at 
                <ext-link ext-link-type="uri" xlink:href="https://www.bioconductor.org/packages/BgeeDB/">https://www.bioconductor.org/packages/BgeeDB/</ext-link>. The package provides functions for fast extraction of data and metadata. The data structures used in the package can be easily incorporated with other Bioconductor packages, offering a wide range of possibilities for downstream analyses.</p>
            <p>Moreover, we introduce in BgeeDB the possibility to run TopAnat analyses, i.e., anatomical expression enrichment tests on gene lists provided by the user. This functionality is based on the 
                <ext-link ext-link-type="uri" xlink:href="https://bioconductor.org/packages/topGO/">topGO</ext-link> package
                <sup>
                    <xref ref-type="bibr" rid="ref-25">25</xref>,
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>, modified to use Bgee data (A. Alexa, personal communication). TopAnat is similar to the widely used Gene Ontology enrichment test
                <sup>
                    <xref ref-type="bibr" rid="ref-27">27</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-29">29</xref>
                </sup>. But in our case, the enrichment test is applied to terms from an anatomical ontology, mapped to genes by expression patterns. The reference set of genes in a given species consists of all genes for which at least one "present" expression call is available in Bgee. The expression calls are propagated to parent anatomical structures by part_of and is_a relations, using the Uberon anatomical ontology
                <sup>
                    <xref ref-type="bibr" rid="ref-30">30</xref>,
                    <xref ref-type="bibr" rid="ref-31">31</xref>
                </sup> (e.g., a gene expressed in the "hindbrain" is also considered expressed in the parent structure "brain"). Different algorithms, from TopGO, are available in TopAnat to account for the non-independence of anatomical structures, and avoid the over-representation of lowly-informative top-level terms. Enrichment of expression is tested for each anatomical structure independently with a Fisher exact test, and the resulting p-values for all anatomical structures are then corrected using a FDR correction
                <sup>
                    <xref ref-type="bibr" rid="ref-32">32</xref>
                </sup>. As a result, TopAnat allows to discover the tissues where a set of genes is preferentially expressed. This feature is available as a web-tool at 
                <ext-link ext-link-type="uri" xlink:href="https://bgee.org/?page=top_anat">https://bgee.org/?page=top_anat</ext-link>, but the R package offers more flexibility in the choice of input data and analysis parameters, and possibilities of inclusion within programs or pipelines.</p>
            <p>There exist few other tools allowing to perform anatomical expression enrichment tests. For instance, the web-application 
                <ext-link ext-link-type="uri" xlink:href="http://genetics.wustl.edu/jdlab/tsea/">Tissue Specific Expression Analysis</ext-link> (TSEA
                <sup>
                    <xref ref-type="bibr" rid="ref-33">33</xref>
                </sup> based on methods from refs
                <sup>
                    <xref ref-type="bibr" rid="ref-34">34</xref>,
                    <xref ref-type="bibr" rid="ref-35">35</xref>
                </sup>) allows to perform such analyses, but only in human and mouse, while TopAnat can be used for any species integrated in Bgee (29 species as of Bgee release 14.0). For human, TSEA is based on the Genotype-Tissue Expression (GTEx) RNA-Seq dataset, while Bgee integrates GTEx data, but also other RNA-Seq datasets, and datasets from different data types, providing a higher diversity of anatomical structures. TSEA was last updated on March 2014. The database wormbase also proposes a 
                <ext-link ext-link-type="uri" xlink:href="https://www.wormbase.org/tools/enrichment/tea/tea.cgi">similar tool</ext-link>, but for analyses only in C. elegans
                <sup>
                    <xref ref-type="bibr" rid="ref-36">36</xref>
                </sup>. There exists 
                <ext-link ext-link-type="uri" xlink:href="http://regulatorycircuits.org/">another application</ext-link> for expression enrichment analyses, but focused on analyzing gene regulatory networks in human
                <sup>
                    <xref ref-type="bibr" rid="ref-37">37</xref>
                </sup>.</p>
            <p>The pipeline to process the data accessible through the BgeeDB package is documented in detail at 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/bgee_pipeline">https://github.com/BgeeDB/bgee_pipeline</ext-link>. In brief, for RNA-seq experiments: data present in SRA
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup> are selected and annotated using information from GEO
                <sup>
                    <xref ref-type="bibr" rid="ref-11">11</xref>
                </sup> or from papers, or provided by the Model Organism Database Wormbase
                <sup>
                    <xref ref-type="bibr" rid="ref-38">38</xref>
                </sup>. GTF annotation files and genome sequence fasta files are retrieved from Ensembl and Ensembl Genomes Metazoa
                <sup>
                    <xref ref-type="bibr" rid="ref-39">39</xref>,
                    <xref ref-type="bibr" rid="ref-40">40</xref>
                </sup>. After quality control steps, the Kallisto software is used to perform a pseudo-mapping of the reads to the transcriptome
                <sup>
                    <xref ref-type="bibr" rid="ref-41">41</xref>
                </sup>. TMM normalization
                <sup>
                    <xref ref-type="bibr" rid="ref-42">42</xref>
                </sup> is used to normalize TPM and F/RPKM values within each experiment independently. Present/absent expression calls are produced for each library by comparing the level of expression of each gene to the background transcriptional noise in the library (estimated by using the level of expression of intergenic regions; Roux J., Rosikiewicz M., Wollbrett J., Robinson-Rechavi M., Bastian F.B.; in preparation). In brief, for Affymetrix experiments: data present in ArrayExpress and GEO are selected and annotated using the information available in these repositories, or in papers, or provided by the Model Organism Database Wormbase. Mappings of probesets to genes are retrieved from Ensembl and Ensembl Genomes Metazoa. Quality controls are performed to remove low quality chips and redundant chips
                <sup>
                    <xref ref-type="bibr" rid="ref-43">43</xref>,
                    <xref ref-type="bibr" rid="ref-44">44</xref>
                </sup>. When raw data are available, they are normalized using gcRMA (using version 2.42.0 for Bgee release 14.0) within each experiment independently
                <sup>
                    <xref ref-type="bibr" rid="ref-45">45</xref>
                </sup>. Present/absent expression calls are generated either from the MAS5 processed data
                <sup>
                    <xref ref-type="bibr" rid="ref-46">46</xref>
                </sup>, based on the perfect match/mismatch probesets, or using the raw data when available, by comparing the signal of a probeset to a subset of weakly expressed probesets
                <sup>
                    <xref ref-type="bibr" rid="ref-47">47</xref>
                </sup>.</p>
            <p>The BgeeDB package information is available on the Bioconductor website at 
                <ext-link ext-link-type="uri" xlink:href="https://bioconductor.org/packages/BgeeDB/">https://bioconductor.org/packages/BgeeDB/</ext-link>. The source code is available at 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R">https://github.com/BgeeDB/BgeeDB_R</ext-link>. The preferred location for filing bug reports and suggestions is the issue tracker on GitHub.</p>
            <p>In the following sections we provide some typical examples of usage of the BgeeDB package.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <sec>
                <title>Requirements</title>
                <p>To reproduce the results of examples in this paper, based on Bgee release 14.0:</p>
                <list list-type="bullet">
                    <list-item>
                        <p>R &gt;= 3.5</p>
                    </list-item>
                    <list-item>
                        <p>Bioconductor &gt;= 3.7</p>
                    </list-item>
                    <list-item>
                        <p>BgeeDB package version &gt;= 2.6.2</p>
                    </list-item>
                    <list-item>
                        <p>edgeR = 3.22.2</p>
                    </list-item>
                    <list-item>
                        <p>Mfuzz = 2.40.0</p>
                    </list-item>
                    <list-item>
                        <p>biomaRt &gt;= 2.36.1 (with Ensembl release 84 accessible)</p>
                    </list-item>
                    <list-item>
                        <p>Working internet connection</p>
                    </list-item>
                </list>
                <p>Please note that an earlier version of Bioconductor and of R (&gt;= 3.3) could be used, but would require to clone our GitHub repository and to use the 
                    <monospace>R</monospace> 
                    <monospace>CMD</monospace> 
                    <monospace>BUILD</monospace> command to build the package.</p>
            </sec>
            <sec>
                <title>Package installation</title>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;">source("https://bioconductor.org/biocLite.R")
biocLite("BgeeDB")
# load the library
library(BgeeDB)</styled-content>
                    </preformat>
                </p>
            </sec>
        </sec>
        <sec>
            <title>Use cases</title>
            <sec>
                <title>Data download and import of normalized expression levels</title>
                <p>The first step of data retrieval is to initialize a new 
                    <monospace>Bgee</monospace> reference class object, for a targeted species and data type. Normalized expression levels are currently available in the BgeeDB package for two data types: Affymetrix microarrays and Illumina RNA-seq. The list of species available in the Bgee database for each data type, along with their NCBI taxonomy IDs and common names can be obtained with the 
                    <monospace>listBgeeSpecies()</monospace> function. By default, data will be downloaded from the latest Bgee release, but this can be changed with the 
                    <monospace>release</monospace> argument.</p>
                <p>Next, the functions 
                    <monospace>getAnnotation()</monospace>, 
                    <monospace>getData()</monospace>, and 
                    <monospace>formatData()</monospace> can be called to respectively download the annotations of datasets, download the actual expression data, and reformat the expression data for more convenient use. Of note, BgeeDB creates a directory to store the downloaded annotation files and datasets, by default in the user&#x2019;s R working directory, but this can be changed with the 
                    <monospace>pathToData</monospace> argument. These versioned cached files make it faster for the user to return to previously used data and allow to work offline.</p>
                <p>
                    <bold>
                        <italic toggle="yes">Microarray dataset retrieval.</italic>
                    </bold> In the following example, we will look for a microarray dataset in mouse (
                    <italic toggle="yes">Mus musculus</italic>), spanning multiple early developmental stages, including zygote.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># specify species and data type
# the examples in this paper are based on Bgee release 14.0
# the following line targets the latest Bgee release.
# In order to target specifically the release 14.0,
# add the parameter 'release="14.0"'

bgee.affymetrix &lt;- Bgee$new(species="Mus_musculus", dataType="affymetrix")

# retrieve annotation of all mouse affymetrix datasets in Bgee
annotation.bgee.mouse.affymetrix &lt;- getAnnotation(bgee.affymetrix)
</styled-content>
                    </preformat>
                </p>
                <p>This creates a list of two data frames, one including the annotation of experiments, and the other including the annotation of each individual sample, i.e., hybridized microarray chip. For mouse, there are 698 Affymetrix experiments and 6,095 samples available in Bgee release 14.0. Anatomical structures and developmental stages are annotated using the Uberon ontology
                    <sup>
                        <xref ref-type="bibr" rid="ref-30">30</xref>,
                        <xref ref-type="bibr" rid="ref-31">31</xref>
                    </sup>. Sex and strain information is also provided. Below, we are selecting the experiments for which at least one sample is annotated to the zygote stage (
                    <monospace>UBERON:0000106</monospace>).</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># retrieve annotations of samples and experiments
sample.annotation &lt;- annotation.bgee.mouse.affymetrix$sample.annotation
experiment.annotation &lt;- annotation.bgee.mouse.affymetrix$experiment.annotation

# list experiments including a zygote sample 
selected.experiments &lt;- unique(sample.annotation$Experiment.ID[sample.annotation$Stage.ID == "UBERON:0000106"])
experiment.annotation[experiment.annotation$Experiment.ID %in% selected.experiments,]

# stages sampled in each of these experiments
unique(sample.annotation[sample.annotation$Experiment.ID %in% selected.experiments, c("Experiment.ID", "Stage.name")])</styled-content>
                    </preformat>
                </p>
                <p>This yields three microarray experiments, with accessions 
                    <monospace>GSE1749</monospace>, 
                    <monospace>E-MEXP-51</monospace> and 
                    <monospace>GSE18290</monospace>. Among these, the accession 
                    <monospace>E-MEXP-51</monospace>, submitted to ArrayExpress by Wang and colleagues
                    <sup>
                        <xref ref-type="bibr" rid="ref-48">48</xref>
                    </sup>, includes samples from more developmental stages than the other two, so we will choose this one for the next steps. For this experiment, raw data were available from ArrayExpress, so samples were fully normalized with gcRMA
                    <sup>
                        <xref ref-type="bibr" rid="ref-49">49</xref>
                    </sup> trough the Bgee pipeline.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># List all samples from E-MEXP-51 in Bgee
sample.annotation[sample.annotation$Experiment.ID == "E-MEXP-51",]</styled-content>
                    </preformat>
                </p>
                <p>The experiment includes 35 samples that passed Bgee quality controls. They originate from 12 developmental stages: primary and secondary oocyte, zygote, early, mid and late 2-cells embryo, 4-cells embryo, 8-cells embryo, 16-cells embryo, early, mid and late blastocyst. The developmental stage ontology used is not precise enough yet to differentiate some of these conditions: the early, mid and late 2-cells stages are annotated as Theiler stage 2 embryo, and the 4-cells and 8-cells stages are annotated as Theiler stage 3 embryo. All samples were hybridized to the 
                    <monospace>Affymetrix</monospace> 
                    <monospace>GeneChip</monospace> 
                    <monospace>Murine</monospace> 
                    <monospace>Genome</monospace> 
                    <monospace>U74Av2</monospace> microarray. Let us download the normalized probesets intensities measured for all samples.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;">data.E.MEXP.51 &lt;- getData(bgee.affymetrix, experimentId="E-MEXP-51")
head(data.E.MEXP.51)</styled-content>
                    </preformat>
                </p>
                <p>The resulting data frame lists for each sample (column &#x201c;Chip.ID&#x201d;), the 8,954 probesets on the microarray (column &#x201c;Probeset.ID&#x201d;), their mapping to Ensembl gene IDs (column &#x201c;Gene.ID&#x201d;), their logged normalized intensities (column &#x201c;Log.of.normalized.signal.intensity&#x201d;), and a presence/absence call and quality (columns &#x201c;Detection.flag&#x201d; and &#x201c;Detection.quality&#x201d;).</p>
                <p>As this format might not be the most convenient for downstream processing of an expression dataset, we offer the 
                    <monospace>formatData()</monospace> function, which creates an 
                    <monospace>ExpressionSet</monospace> object including the expression data matrix, the probesets annotation to Ensembl genes and the samples' anatomical structure and stage annotation into (
                    <monospace>assayData</monospace>, 
                    <monospace>featureData</monospace> and 
                    <monospace>phenoData</monospace> slots respectively). This object class is of standard use in numerous Bioconductor packages.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;">data.E.MEXP.51.formatted &lt;- formatData(bgee.affymetrix, data.E.MEXP.51,
callType="all", stats="intensities")
data.E.MEXP.51.formatted
# matrix of expression intensities
head(exprs(data.E.MEXP.51.formatted))
# annotation of samples
pData(data.E.MEXP.51.formatted)
# annotation of probesets
head(fData(data.E.MEXP.51.formatted))</styled-content>
                    </preformat>
                </p>
                <p>The 
                    <monospace>callType</monospace> option of the 
                    <monospace>formatData()</monospace> function could alternatively be set to 
                    <monospace>present</monospace> or 
                    <monospace>present high quality</monospace> to display only the intensities of probesets detected as actively expressed.</p>
                <p>The result is a nicely formatted Bioconductor object including expression data and their annotations, ready to be used for downstream analysis with other Bioconductor packages.</p>
                <p>
					
                    <bold>
						
                        <italic toggle="yes">RNA-seq dataset retrieval.</italic>
					</bold> We will now search Bgee for a RNA-seq dataset sampling brain and liver tissues (Uberon Ids 
                    <monospace>UBERON:0000955</monospace> and 
                    <monospace>UBERON:0002107</monospace> respectively) in macaque (
                    <italic toggle="yes">Macaca mulatta</italic>), and including multiple biological replicates for each tissue. As for Affymetrix data, Bgee RNA-seq annotations provide information about anatomical structure, developmental stage, sex, and strain.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># specify species and data type
# the examples in this paper are based on Bgee release 14.0
# the following line targets the latest Bgee release. In order
# to target specifically the release 14.0, add the parameter
# 'release="14.0"'
bgee.rnaseq &lt;- Bgee$new(species="Macaca_mulatta", dataType="rna_seq")

# retrieve annotations of RNA-seq samples and experiments
annotation.bgee.macaque.rna.seq &lt;- getAnnotation(bgee.rnaseq)
sample.annotation &lt;- annotation.bgee.macaque.rna.seq$sample.annotation
experiment.annotation &lt;- annotation.bgee.macaque.rna.seq$experiment.annotation

# list experiments including both brain and liver samples
selected.experiments &lt;- intersect(unique(sample.annotation$Experiment.ID[sample.annotation$Anatomical.entity.ID == "UBERON:0000955"]),
unique(sample.annotation$Experiment.ID[sample.annotation$Anatomical.entity.ID == "UBERON:0002107"]))
experiment.annotation[experiment.annotation$Experiment.ID %in% selected.experiments,]

# check whether experiments include biological replicates
sample.annotation[sample.annotation$Experiment.ID %in%
selected.experiments &amp; (sample.annotation$Anatomical.entity.ID == "UBERON:0000955" 
| sample.annotation$Anatomical.entity.ID == "UBERON:0002107"), c("Experiment.ID","Library.ID","Anatomical.entity.ID", "Anatomical.entity.name","Stage.ID")]</styled-content>
                    </preformat>
                </p>
                <p>Accessions 
                    <monospace>GSE41637</monospace>
                    <sup>
                        <xref ref-type="bibr" rid="ref-50">50</xref>
                    </sup> and 
                    <monospace>GSE30352</monospace>
                    <sup>
                        <xref ref-type="bibr" rid="ref-51">51</xref>
                    </sup> both include biological replicates for brain and liver. We will focus on 
                    <monospace>GSE41637</monospace> for the next steps since it includes three replicates of each tissue, vs. only two for 
                    <monospace>GSE30352</monospace>. We will download the dataset and reformat it to obtain an 
                    <monospace>ExpressionSet</monospace> including counts of mapped reads on each Ensembl gene for each sample.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;">data.GSE41637 &lt;- getData(bgee.rnaseq, experimentId="GSE41637")
data.GSE41637.formatted &lt;- formatData(bgee.rnaseq, data.GSE41637, callType="all", stats="counts")
data.GSE41637.formatted</styled-content>
                    </preformat>
                </p>
                <p>Instead of mapped read counts, it is also possible to fill the data matrix with expression levels in F/RPKMs (fragments/reads per kilobase per million reads) or in TPM (transcript per million)
                    <sup>
                        <xref ref-type="bibr" rid="ref-52">52</xref>,
                        <xref ref-type="bibr" rid="ref-53">53</xref>
                    </sup>, using the option stats="fpkm" or stats="tpm".</p>
                <p>
					
                    <bold>
						
                        <italic toggle="yes">Presence/absence calls retrieval.</italic>
					</bold> It is often difficult to compare expression levels across species
                    <sup>
                        <xref ref-type="bibr" rid="ref-54">54</xref>
                    </sup>, and even within species, across datasets generated by different experimenters or laboratories
                    <sup>
                        <xref ref-type="bibr" rid="ref-55">55</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref-57">57</xref>
                    </sup>. Batch effects have indeed been shown to impact extensively gene expression levels, confounding biological signal differences.</p>
                <p>Encoding gene expression as present or absent in a sample allows a more robust comparison across such conditions. In addition to retrieving RNA-seq and Affymetrix quantitative expression levels, BgeeDB also allows to retrieve calls of presence or absence of expression computed in the Bgee database for each gene (RNA-seq) or probeset (Affymetrix), in the column &#x201c;Detection.flag&#x201d; of the 
                    <monospace>data.E.MEXP.51</monospace> and 
                    <monospace>data.GSE41637</monospace> objects created above. And interestingly, expression calls are also available in Bgee for ESTs and 
                    <italic toggle="yes">in situ</italic> hybridization data, as well as for the consensus of the four data types for each combination &#x201c;gene / tissue / developmental stage / sex / strain&#x201d;.</p>
                <p>A powerful use of these expression calls is the anatomical expression enrichment test &#x201c;TopAnat&#x201d;. TopAnat uses a similar approach to Gene Ontology enrichment tests
                    <sup>
                        <xref ref-type="bibr" rid="ref-27">27</xref>
                    </sup>, but genes are associated to the anatomical structures where they display expression, instead of to their functional classification. These tests allow discovering where a set of genes is preferentially expressed as compared to a background universe (Roux J., Seppey M., Sanjeev K., Rech de Laval V., Moret P., Artimo P., Duvaud S., Ioannidis V., Stockinger H., Robinson-Rechavi M., Bastian F.B.; in preparation). We show an example of such an analysis in the section &#x201c;Anatomical expression enrichment analysis&#x201d; below.</p>
                <p>Of note, the expression calls imported from BgeeDB can also be used for other downstream analyses. For example, when studying protein-protein interaction datasets, it might be biologically relevant to retain only interactions for which both members are expressed in the same tissues
                    <sup>
                        <xref ref-type="bibr" rid="ref-58">58</xref>,
                        <xref ref-type="bibr" rid="ref-59">59</xref>
                    </sup>.</p>
            </sec>
            <sec>
                <title>Downstream analysis examples</title>
                <p>
					
                    <bold>
						
                        <italic toggle="yes">Clustering analysis.</italic>
					</bold> A variety of downstream analyses can be performed on the imported expression data. Below we detail an example of gene expression clustering analysis on the developmental time-series microarray experiment imported above. The analysis, performed with the 
                    <monospace>Mfuzz</monospace> package
                    <sup>
                        <xref ref-type="bibr" rid="ref-60">60</xref>,
                        <xref ref-type="bibr" rid="ref-61">61</xref>
                    </sup> (version 2.40.0 for this paper), aims at uncovering genes with similar expression profiles across development. We can readily start with the 
                    <monospace>ExpressionSet</monospace> object previously created.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># for simplicity, keep only one sample per condition
data.E.MEXP.51.formatted &lt;- data.E.MEXP.51.formatted[,!duplicated(pData(data.E.MEXP.51.formatted)[
c("Anatomical.entity.ID","Anatomical.entity.name","Stage.ID","Stage.name")])]

# order developmental stages
stages &lt;- c("GVoocyte1","MIIoocyte1","Zygote1","Early2-cell1","4Cell1","16cell1","EarlyBlastocyst1","MidBlastocyst1",
"LateBlastocyst1")
data.E.MEXP.51.formatted &lt;- data.E.MEXP.51.formatted[, stages]

# filter out rows with no variance
data.E.MEXP.51.formatted &lt;-
data.E.MEXP.51.formatted[apply(exprs(data.E.MEXP.51.formatted), 1, sd) != 0, ]

# Mfuzz clustering
biocLite("Mfuzz")
library(Mfuzz)
# standardize matric of expression data
z.mat &lt;- standardise(data.E.MEXP.51.formatted)
# cluster data into 16 clusters
clusters &lt;- mfuzz(z.mat, centers=16, m=1.25)

# visualizing clusters
mfuzz.plot2(z.mat, cl=clusters, mfrow=c(4,4), colo="fancy",
time.labels=row.names(pData(z.mat)), las=2, xlab="", ylab="Standardized expression level", x11=FALSE)</styled-content>
                    </preformat>
                </p>
                <p>The resulting plot can be seen in 
                    <xref ref-type="fig" rid="f1">Figure 1</xref>.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Standardized expression levels of 16 groups of microarray probesets, clustered according to their expression during mouse early development.</title>
                        <p>The x-axis displays sample names (column &#x201c;Chip.ID&#x201d; of the 
                            <monospace>data.E.MEXP.51</monospace> object).</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/16883/7331c1e9-9b7c-46e0-a203-07d71c10f795_figure1.gif"/>
                </fig>
                <p>
					
                    <bold>
						
                        <italic toggle="yes">Differential expression analysis.</italic>
					</bold> Below, we detail a differential expression analysis, with the package 
                    <monospace>edgeR</monospace>
                    <sup>
                        <xref ref-type="bibr" rid="ref-62">62</xref>,
                        <xref ref-type="bibr" rid="ref-63">63</xref>
                    </sup> (version 3.22.2 for this paper), on the previously imported RNA-seq dataset of macaque tissues. We aim at isolating genes differentially expressed between brain and liver.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># differential expression analysis with edgeR
biocLite("edgeR")
library(edgeR)

# subset the dataset to brain and liver
brain.liver &lt;- data.GSE41637.formatted[, pData(data.GSE41637.formatted)$Anatomical.entity.name %in%
c("brain", "liver")]  

# filter out very lowly expressed genes
brain.liver.filtered &lt;- brain.liver[rowSums(cpm(brain.liver) &gt; 1) &gt; 3, ]

# create edgeR DGElist object
dge &lt;- DGEList(counts=brain.liver.filtered,
group=pData(brain.liver.filtered)$Anatomical.entity.name)
dge &lt;- calcNormFactors(dge)
dge &lt;- estimateCommonDisp(dge)
dge &lt;- estimateTagwiseDisp(dge)
de &lt;- exactTest(dge, pair=c("brain","liver"))
de.genes &lt;- topTags(de, n=nrow(de))$table

# MA plot with DE genes highlighted
plotSmear(dge, de.tags=rownames(de.genes)[de.genes$FDR &lt; 0.01], cex=0.3)</styled-content>
                    </preformat>
                </p>
                <p>The resulting plot can be seen in 
                    <xref ref-type="fig" rid="f2">Figure 2</xref>.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Mean-average (MA) plot of differential gene expression between brain and liver in macaque based on RNA-seq data.</title>
                        <p>Significantly differentially expressed genes (FDR &lt; 1%) are highlighted in red.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/16883/7331c1e9-9b7c-46e0-a203-07d71c10f795_figure2.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Anatomical expression enrichment analysis</title>
                <p>The 
                    <monospace>loadTopAnatData()</monospace> function loads the names of anatomical structures, and relationships between them, from the Uberon anatomical ontology (based on parent-child &#x201c;is_a&#x201d; and &#x201c;part_of&#x201d; relationships). It also loads a mapping from genes to anatomical structures, based on the presence calls of the genes in the targeted species. These calls come from a consensus of all data types specified in the input Bgee class object. We recommend to use all available data types (in Bgee 14, RNA-seq, Affymetrix, EST and 
                    <italic toggle="yes">in situ</italic> hybridization) for both genomic coverage and anatomical precision, which is the default behavior if no 
                    <monospace>dataType</monospace> argument is specified when the Bgee class object is created.</p>
                <p>By default, presence calls of both silver and gold quality are used, which can be changed with the 
                    <monospace>confidence</monospace> argument of the 
                    <monospace>loadTopAnatData()</monospace> function (in releases of Bgee up to 13, "high" and "low" confidence were used). Finally, it is possible to specify the developmental stage under consideration, with the 
                    <monospace>stage</monospace> argument. By default expression calls generated from samples of all developmental stages are used, which is equivalent to specifying 
                    <monospace>stage="UBERON:0000104"</monospace> (&#x201c;life cycle&#x201d;, the root of the stage ontology). Data are stored in versioned tab-separated cached files that will be read again if a query with the exact same parameters is launched later, to save time and server resources, and to work offline.</p>
                <p>In this example, we will use expression calls for zebrafish genes using all sources of expression data.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># the examples in this paper are based on Bgee
# release 14.0
# the following line targets the latest Bgee release. In order to target
# specifically the release 14.0, add the parameter 'release="14.0"'
bgee.topanat &lt;- Bgee$new(species="Danio_rerio")
top.anat.data &lt;- loadTopAnatData(bgee.topanat)</styled-content>
                    </preformat>
                </p>
                <p>We will look at the expression localization of the genes with an annotated phenotype related to pectoral fin (i.e., genes which upon knock-out or knock-down led to abnormal phenotypes of pectoral fin or its components). Zebrafish phenotypic data are available from the ZFIN database
                    <sup>
                        <xref ref-type="bibr" rid="ref-64">64</xref>
                    </sup> and integrated into the Ensembl database
                    <sup>
                        <xref ref-type="bibr" rid="ref-39">39</xref>
                    </sup>. We will thus retrieve the targeted genes using the 
                    <monospace>biomaRt</monospace>
                    <sup>
                        <xref ref-type="bibr" rid="ref-65">65</xref>
                    </sup> Bioconductor package (version 2.36.1 for this paper).</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;">biocLite("biomaRt")
library(biomaRt)

# zebrafish data in Ensembl 84, that Bgee 14.0 uses (stable link)
ensembl &lt;- useMart("ENSEMBL_MART_ENSEMBL",
dataset="drerio_gene_ensembl", host="mar2016.archive.ensembl.org")

# get the mapping of Ensembl genes to phenotypes
genes.to.phenotypes &lt;- getBM(filters=c("phenotype_source"), value=c("ZFIN"),
attributes=c("ensembl_gene_id","phenotype_description"), mart=ensembl)

# select phenotypes related to pectoral fin
Phenotypes &lt;- grep("pectoral fin", unique(genes.to.phenotypes$phenotype_description), value=T)

# select the genes annotated to select phenotypes
genes &lt;- unique(genes.to.phenotypes$ensembl_gene_id[
genes.to.phenotypes$phenotype_description %in% phenotypes])</styled-content>
                    </preformat>
                </p>
                <p>This gives a list of 147 zebrafish genes implicated in the development and function of pectoral fin. The next step of the analysis relies on the 
                    <monospace>topGO</monospace> Bioconductor package. We will prepare a modified 
                    <monospace>topGOdata</monospace> object allowing to handle the Uberon anatomical ontology instead of the Gene Ontology, and perform a GO-like enrichment test for anatomical terms. As for a classical 
                    <monospace>topGO</monospace> analysis, we need to prepare a vector including all background genes, and with values 0 or 1 depending if genes are part of the foreground or not. The choice of background is very important since the wrong background can lead to spurious results in enrichment tests
                    <sup>
                        <xref ref-type="bibr" rid="ref-66">66</xref>
                    </sup>. Here we choose as background all zebrafish Ensembl genes with an annotated phenotype from ZFIN.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># prepare the gene list vector 
gene.list  &lt;- factor(as.integer(unique(genes.to.phenotypes$ensembl_gene_id) %in% genes))
names(gene.list) &lt;- unique(genes.to.phenotypes$ensembl_gene_id)
summary(gene.list)

# prepare the topAnat object based on topGO
top.anat.object  &lt;- topAnat(top.anat.data, gene.list)
top.anat.object</styled-content>
                    </preformat>
                </p>
                <p>At this step, expression calls are propagated through the whole ontology (e.g., expression in the forebrain will also be counted as expression in the brain, the nervous system, etc). This can take some time, especially if the gene list is large.</p>
                <p>Finally, we can launch an enrichment test for anatomical terms. The functions of the 
                    <monospace>topGO</monospace> package can directly be used at this step. See the vignette of this package for more details
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup>. Here we will use a Fisher test, coupled with the &#x201c;weight&#x201d; decorrelation algorithm.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;">results &lt;- runTest(top.anat.object, algorithm='weight', statistic='fisher')
results</styled-content>
                    </preformat>
                </p>
                <p>Finally, we implemented a function to display results in a formatted table. By default anatomical structures are sorted by their test 
                    <italic toggle="yes">p</italic>-value, which is displayed along with the associated false discovery rate (FDR
                    <sup>
                        <xref ref-type="bibr" rid="ref-32">32</xref>
                    </sup>) and the enrichment fold. Sorting on other columns of the table (e.g., on decreasing enrichment folds) is possible with the 
                    <monospace>ordering</monospace> argument. Of note, it is debated whether a FDR correction is relevant on such enrichment test results, since tests on different terms of the ontologies are not independent. An interesting discussion can be found in the vignette of the 
                    <monospace>topGO</monospace> package.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># retrieve anatomical structures enriched at a 1% FDR threshold
table.Over &lt;- makeTable(top.anat.data, top.anat.object, results, cutoff=0.01)
head(table.over)</styled-content>
                    </preformat>
                </p>
                <p>The 27 anatomical structures displaying a significant enrichment at a FDR threshold of 1% are show in 
                    <xref ref-type="table" rid="T1">Table 1</xref>. The first term is &#x201c;pectoral fin&#x201d;, and the second &#x201c;paired limb/fin bud&#x201d;. Other terms in the list, especially those with high enrichment folds, are clearly related to pectoral fins (e.g., &#x201c;pectoral appendage field&#x201d;), or substructures of fins (e.g., &#x201c;fin bone&#x201d;). This analysis shows that genes with phenotypic effects on pectoral fins are specifically expressed in or next to these structures. More generally, it demonstrates the relevance of TopAnat analysis for the characterization of lists of genes.</p>
                <p>Of note, it is possible to retrieve for a particular tissue the significant genes that were mapped to it.</p>
                <p>
                    <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
                        <styled-content style="font-size:15px;"># In order to retrieve significant genes mapped to the term "paired limb/fin bud"
term &lt;- "UBERON:0004357"
termStat(top.anat.object, term) 

# 198 genes mapped to this term for Bgee 14.0 and Ensembl 84
genesInTerm(top.anat.object, term)
# 48 significant genes mapped to this term for Bgee 14.0
# and Ensembl 84
annotated &lt;- genesInTerm(top.anat.object,
term)[["UBERON:0004357"]]
annotated[annotated %in% sigGenes(top.anat.object)]</styled-content>
                    </preformat>
                </p>
                <table-wrap id="T1" orientation="portrait" position="anchor">
                    <label>Table 1. </label>
                    <caption>
                        <title>Zebrafish anatomical structures showing a significant enrichment in expression of genes with a pectoral fin phenotype (FDR &lt; 1%).</title>
                        <p>The &#x201c;weight&#x201d; algorithm of the topGO package was used to decorrelate the structure of the ontology.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">organId</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">organName</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">annotated</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">significant</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">expected</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">foldEnrichment</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">pValue</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">FDR</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0000151</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">pectoral fin</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">439</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">79</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">21.48</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.68</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.36E-27</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.47E-24</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0004357</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">paired limb/fin bud</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">198</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">48</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">9.69</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.95</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.19E-23</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.80E-20</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:2000040</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">median fin fold</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">59</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">20</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.89</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6.92</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">9.37E-13</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.38E-10</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0003051</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">ear vesicle</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">391</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">49</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">19.13</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.56</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.50E-11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.49E-08</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0005729</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">pectoral
                                    <break/>appendage field</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">20</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.98</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11.22</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.05E-10</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6.60E-08</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0004376</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">fin bone</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">34</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.66</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.23</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.60E-08</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.69E-06</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0011004</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">pharyngeal arch
                                    <break/>cartilage</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">66</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">16</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.23</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.95</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.96E-08</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.65E-06</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0003406</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">cartilage of
                                    <break/>respiratory system</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">52</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">14</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.54</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.51</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">8.61E-08</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.16E-05</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0004756</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">dermal skeletal
                                    <break/>element</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">55</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">15</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.69</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.58</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">9.14E-07</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.10E-04</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0003108</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">suspensorium</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">56</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">13</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.74</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.74</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.66E-06</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.79E-04</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0004375</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">bone of free limb
                                    <break/>or fin</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">27</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">9</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.32</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6.82</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.77E-06</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.72E-04</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0001042</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">chordate pharynx</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">417</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">44</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">20.40</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.16</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.38E-06</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.04E-04</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0006068</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">bone of tail</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.54</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11.11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.67E-06</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.89E-04</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0003128</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">cranium</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">334</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">37</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">16.34</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.26</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.25E-06</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.05E-04</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:4000170</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">median fin skeleton</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">26</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">8</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.27</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6.30</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.00E-05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.44E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0004117</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">pharyngeal pouch</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">51</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.49</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.42</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.32E-05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.57E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0002533</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">post-anal tail bud</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1447</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">95</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">70.79</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.34</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.77E-05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.76E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0012275</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">meso-epithelium</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1616</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">104</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">79.05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.32</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.53E-05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.32E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0002514</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">intramembranous
                                    <break/>bone</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">23</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.13</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6.19</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.36E-05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.01E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0001708</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">jaw skeleton</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">108</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">24</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.28</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.55</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.77E-05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.01E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0001003</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">skin epidermis</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">112</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">16</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.48</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.92</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.79E-05</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.01E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0002541</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">germ ring</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">117</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">16</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.72</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.80</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.33E-04</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6.54E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0010188</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">protuberance</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">598</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">68</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">29.25</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.32</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.51E-04</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.01E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:4000163</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">anal fin</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.59</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">8.47</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.57E-04</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.01E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0010363</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">endochondral
                                    <break/>element</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">58</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">13</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.84</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.58</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.62E-04</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.01E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:2000555</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">opercular flap</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">26</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.27</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.51</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.74E-04</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.25E-03</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">UBERON:0007812</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">post-anal tail</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1452</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">96</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">71.03</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.35</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.03E-04</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">8.13E-03</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
            </sec>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusion</title>
            <p>In summary, the BgeeDB package serves as a bridge between curated data from the Bgee database and the R/Bioconductor environment, facilitating the access to high-quality curated and re-analyzed gene expression datasets, and significantly reducing time for downstream analyses of the datasets. Moreover, it provides access to TopAnat, a new enrichment tool allowing to make sense of lists of genes, by uncovering their preferential localization of expression in anatomical structures. The TopAnat workflow is straightforward; for users already using topGO in their analysis pipelines, performing a TopAnat analysis on the same gene list only requires 6 additional lines of code.</p>
        </sec>
        <sec>
            <title>Software and data availability</title>
            <p>Software available from: 
                <ext-link ext-link-type="uri" xlink:href="http://www.bioconductor.org/packages/BgeeDB/">http://www.bioconductor.org/packages/BgeeDB/</ext-link>
			</p>
            <p>Latest source code: 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R">https://github.com/BgeeDB/BgeeDB_R</ext-link>
			</p>
            <p>Archived source code as at the time of publication: 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.1293418">https://doi.org/10.5281/zenodo.1293418</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-67">67</xref>
                </sup>
			</p>
        </sec>
    </body>
    <back>
        <sec id="SM1" sec-type="supplementary-material">
            <title>Supplementary material</title>
            <p id="SF1">File S1. R markdown file including code from the paper.</p>
            <p>
				
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/9973/d95b5e76-8b20-4a59-8d0d-36a28d6d7a21.Rmd">Click here to access the data</ext-link>.</p>
            <p id="SF2">File S2. PDF file including the results of execution of the code from File S1.</p>
            <p>
				
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/9973/734673b9-f41e-4294-b339-62acfabdd616.pdf">Click here to access the data</ext-link>.</p>
        </sec>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rung</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Brazma</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Reuse of public genome-wide gene expression data.</article-title>
                    <source>

                        <italic toggle="yes">Nat Rev Genet.</italic>
</source>
                    <year>2013</year>;<volume>14</volume>(<issue>2</issue>):<fpage>89</fpage>&#x2013;<lpage>99</lpage>.
                    <pub-id pub-id-type="pmid">23269463</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrg3394</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ioannidis</surname>
                            <given-names>JP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Allison</surname>
                            <given-names>DB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ball</surname>
                            <given-names>CA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Repeatability of published microarray gene expression analyses.</article-title>
                    <source>

                        <italic toggle="yes">Nat Genet.</italic>
</source>
                    <year>2009</year>;<volume>41</volume>(<issue>2</issue>):<fpage>149</fpage>&#x2013;<lpage>55</lpage>.
                    <pub-id pub-id-type="pmid">19174838</pub-id>
                    <pub-id pub-id-type="doi">10.1038/ng.295</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wan</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pavlidis</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Sharing and reusing gene expression profiling data in neuroscience.</article-title>
                    <source>

                        <italic toggle="yes">Neuroinformatics.</italic>
</source>
                    <year>2007</year>;<volume>5</volume>(<issue>3</issue>):<fpage>161</fpage>&#x2013;<lpage>75</lpage>.
                    <pub-id pub-id-type="pmid">17917127</pub-id>
                    <pub-id pub-id-type="doi">10.1007/s12021-007-0012-5</pub-id>
                    <pub-id pub-id-type="pmcid">2980754</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="book">
                    <collab>R Development Core Team</collab>:
                    <article-title>R: A Language and Environment for Statistical Computing.</article-title>Vienna, Austria: R Foundation for Statistical Computing;<year>2007</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://www.r-project.org/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Huber</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Carey</surname>
                            <given-names>VJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gentleman</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Orchestrating high-throughput genomic analysis with Bioconductor.</article-title>
                    <source>

                        <italic toggle="yes">Nat Methods.</italic>
</source>
                    <year>2015</year>;<volume>12</volume>(<issue>2</issue>):<fpage>115</fpage>&#x2013;<lpage>21</lpage>.
                    <pub-id pub-id-type="pmid">25633503</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmeth.3252</pub-id>
                    <pub-id pub-id-type="pmcid">4509590</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gentleman</surname>
                            <given-names>RC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Carey</surname>
                            <given-names>VJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bates</surname>
                            <given-names>DM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Bioconductor: open software development for computational biology and bioinformatics.</article-title>
                    <source>

                        <italic toggle="yes">Genome Biol.</italic>
</source>
                    <year>2004</year>;<volume>5</volume>(<issue>10</issue>):<fpage>R80</fpage>.
                    <pub-id pub-id-type="pmid">15461798</pub-id>
                    <pub-id pub-id-type="doi">10.1186/gb-2004-5-10-r80</pub-id>
                    <pub-id pub-id-type="pmcid">545600</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kauffmann</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rayner</surname>
                            <given-names>TF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Parkinson</surname>
                            <given-names>H</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Importing ArrayExpress datasets into R/Bioconductor.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2009</year>;<volume>25</volume>(<issue>16</issue>):<fpage>2092</fpage>&#x2013;<lpage>4</lpage>.
                    <pub-id pub-id-type="pmid">19505942</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btp354</pub-id>
                    <pub-id pub-id-type="pmcid">2723004</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Davis</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Meltzer</surname>
                            <given-names>PS</given-names>
                        </name>
</person-group>:
                    <article-title>GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2007</year>;<volume>23</volume>(<issue>14</issue>):<fpage>1846</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">17496320</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btm254</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stephens</surname>
                            <given-names>RM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Meltzer</surname>
                            <given-names>PS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>SRAdb: query and use public next-generation sequencing data from within R.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2013</year>;<volume>14</volume>(<issue>1</issue>):<fpage>19</fpage>.
                    <pub-id pub-id-type="pmid">23323543</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-14-19</pub-id>
                    <pub-id pub-id-type="pmcid">3560148</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kolesnikov</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hastings</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Keays</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>ArrayExpress update--simplifying data submissions.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2015</year>;<volume>43</volume>(<issue>Database issue</issue>):<fpage>D1113</fpage>&#x2013;<lpage>6</lpage>.
                    <pub-id pub-id-type="pmid">25361974</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gku1057</pub-id>
                    <pub-id pub-id-type="pmcid">4383899</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Barrett</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wilhite</surname>
                            <given-names>SE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ledoux</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>NCBI GEO: archive for functional genomics data sets--update.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2013</year>;<volume>41</volume>(<issue>Database issue</issue>):<fpage>D991</fpage>&#x2013;<lpage>5</lpage>.
                    <pub-id pub-id-type="pmid">23193258</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gks1193</pub-id>
                    <pub-id pub-id-type="pmcid">3531084</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kodama</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shumway</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Leinonen</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The Sequence Read Archive: explosive growth of sequencing data.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2012</year>;<volume>40</volume>(<issue>Database issue</issue>):<fpage>D54</fpage>&#x2013;<lpage>D6</lpage>.
                    <pub-id pub-id-type="pmid">22009675</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkr854</pub-id>
                    <pub-id pub-id-type="pmcid">3245110</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="book">
                    <article-title>BrainStars Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.BrainStars</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kasukawa</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Masumoto</surname>
                            <given-names>KH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nikaido</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Quantitative expression profile of distinct functional regions in the adult mouse brain.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2011</year>;<volume>6</volume>(<issue>8</issue>):<fpage>e23228</fpage>.
                    <pub-id pub-id-type="pmid">21858037</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0023228</pub-id>
                    <pub-id pub-id-type="pmcid">3155528</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="book">
                    <article-title>ImmuneSpaceR Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.ImmuneSpaceR</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <collab>Bioconductor Package Maintainer</collab>:
                    <article-title>ExperimentHub: Client to access ExperimentHub resources</article-title>. R package version 1.6.0.<year>2018</year>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.ExperimentHub</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="book">
                    <article-title>ExpressionAtlas Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.ExpressionAtlas</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Petryszak</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Keays</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tang</surname>
                            <given-names>YA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Expression Atlas update--an integrated database of gene and protein expression in humans, animals and plants.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2016</year>;<volume>44</volume>(<issue>D1</issue>):<fpage>D746</fpage>&#x2013;<lpage>52</lpage>.
                    <pub-id pub-id-type="pmid">26481351</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkv1045</pub-id>
                    <pub-id pub-id-type="pmcid">4702781</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Collado-Torres</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nellore</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kammers</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>recount: A large-scale resource of analysis-ready RNA-seq expression data.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2016</year>.
                    <pub-id pub-id-type="doi">10.1101/068478</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Frazee</surname>
                            <given-names>AC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Langmead</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Leek</surname>
                            <given-names>JT</given-names>
                        </name>
</person-group>:
                    <article-title>ReCount: a multi-experiment resource of analysis-ready RNA-seq gene count datasets.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2011</year>;<volume>12</volume>(<issue>1</issue>):<fpage>449</fpage>.
                    <pub-id pub-id-type="pmid">22087737</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-12-449</pub-id>
                    <pub-id pub-id-type="pmcid">3229291</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <article-title>recount Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.recount</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bastian</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Parmentier</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Roux</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Bgee: Integrating and Comparing Heterogeneous Transcriptome Data Among Species.</article-title>
                    <source>

                        <italic toggle="yes">Data Integr Life Sci.</italic>
</source>
                    <year>2008</year>;<fpage>124</fpage>&#x2013;<lpage>31</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-3-540-69828-9_12</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <collab>GTEx Consortium</collab>:
                    <article-title>Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans.</article-title>
                    <source>

                        <italic toggle="yes">Science.</italic>
</source>
                    <year>2015</year>;<volume>348</volume>(<issue>6235</issue>):<fpage>648</fpage>&#x2013;<lpage>60</lpage>.
                    <pub-id pub-id-type="pmid">25954001</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.1262110</pub-id>
                    <pub-id pub-id-type="pmcid">4547484</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mel&#x00e9;</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ferreira</surname>
                            <given-names>PG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Reverter</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Human genomics. The human transcriptome across tissues and individuals.</article-title>
                    <source>

                        <italic toggle="yes">Science.</italic>
</source>
                    <year>2015</year>;<volume>348</volume>(<issue>6235</issue>):<fpage>660</fpage>&#x2013;<lpage>5</lpage>.
                    <pub-id pub-id-type="pmid">25954002</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.aaa0355</pub-id>
                    <pub-id pub-id-type="pmcid">4547472</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Alexa</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rahnenf&#x00fc;hrer</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lengauer</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>Improved scoring of functional groups from gene expression data by decorrelating GO graph structure.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2006</year>;<volume>22</volume>(<issue>13</issue>):<fpage>1600</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">16606683</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btl140</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="book">
                    <article-title>topGO Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.topGO</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rhee</surname>
                            <given-names>SY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wood</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dolinski</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Use and misuse of the gene ontology annotations.</article-title>
                    <source>

                        <italic toggle="yes">Nat Rev Genet.</italic>
</source>
                    <year>2008</year>;<volume>9</volume>(<issue>7</issue>):<fpage>509</fpage>&#x2013;<lpage>15</lpage>.
                    <pub-id pub-id-type="pmid">18475267</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrg2363</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ashburner</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ball</surname>
                            <given-names>CA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Blake</surname>
                            <given-names>JA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.</article-title>
                    <source>

                        <italic toggle="yes">Nat Genet.</italic>
</source>
                    <year>2000</year>;<volume>25</volume>(<issue>1</issue>):<fpage>25</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">10802651</pub-id>
                    <pub-id pub-id-type="doi">10.1038/75556</pub-id>
                    <pub-id pub-id-type="pmcid">3037419</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-29">
                <label>29</label>
                <mixed-citation publication-type="book">
                    <article-title>The Gene Ontology Handbook.</article-title>Dessimoz C, &#x0160;kunca N, editors: Humana Press;<year>2016</year>; XII, 305.
                    <pub-id pub-id-type="doi">10.1007/978-1-4939-3743-1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Haendel</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Balhoff</surname>
                            <given-names>JP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bastian</surname>
                            <given-names>FB</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Unification of multi-species vertebrate anatomy ontologies for comparative biology in Uberon.</article-title>
                    <source>

                        <italic toggle="yes">J Biomed Semantics.</italic>
</source>
                    <year>2014</year>;<volume>5</volume>(<issue>1</issue>):<fpage>21</fpage>.
                    <pub-id pub-id-type="pmid">25009735</pub-id>
                    <pub-id pub-id-type="doi">10.1186/2041-1480-5-21</pub-id>
                    <pub-id pub-id-type="pmcid">4089931</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Mungall</surname>
                            <given-names>CJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Torniai</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Gkoutos</surname>
                            <given-names>GV</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Uberon, an integrative multi-species anatomy ontology.</article-title>
                    <source>
						
                        <italic toggle="yes">Genome Biol.</italic>
					</source>
                    <year>2012</year>;<volume>13</volume>(<issue>1</issue>):<fpage>R5</fpage>.
                    <pub-id pub-id-type="pmid">22293552</pub-id>
                    <pub-id pub-id-type="doi">10.1186/gb-2012-13-1-r5</pub-id>
                    <pub-id pub-id-type="pmcid">3334586</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-32">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Benjamini</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Hochberg</surname>
                            <given-names>Y</given-names>
                        </name>
					</person-group>:
                    <article-title>Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing.</article-title>
                    <source>
						
                        <italic toggle="yes">J R Stat Soc Series B Stat Methodol.</italic>
					</source>
                    <year>1995</year>;<volume>57</volume>(<issue>1</issue>):<fpage>289</fpage>&#x2013;<lpage>300</lpage>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.stat.purdue.edu/~doerge/BIOINFORM.D/FALL06/Benjamini%20and%20Y%20FDR.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-33">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <article-title>Tissue Specific Expression Analysis (TSEA) version 1</article-title>.  [cited 2018 June 14].
                    <ext-link ext-link-type="uri" xlink:href="http://genetics.wustl.edu/jdlab/tsea/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-34">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Dougherty</surname>
                            <given-names>JD</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Schmidt</surname>
                            <given-names>EF</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Nakajima</surname>
                            <given-names>M</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Analytical approaches to RNA profiling data for the identification of genes enriched in specific cells.</article-title>
                    <source>
				
                        <italic toggle="yes">Nucleic Acids Res.</italic>
			</source>
                    <year>2010</year>;<volume>38</volume>(<issue>13</issue>):<fpage>4218</fpage>&#x2013;<lpage>30</lpage>.
                    <pub-id pub-id-type="pmid">20308160</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkq130</pub-id>
                    <pub-id pub-id-type="pmcid">2910036</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-35">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Xu</surname>
                            <given-names>X</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Wells</surname>
                            <given-names>AB</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>O'Brien</surname>
                            <given-names>DR</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Cell type-specific expression analysis to identify putative cellular mechanisms for neurogenetic disorders.</article-title>
                    <source>
				
                        <italic toggle="yes">J Neurosci.</italic>
			</source>
                    <year>2014</year>;<volume>34</volume>(<issue>4</issue>):<fpage>1420</fpage>&#x2013;<lpage>31</lpage>.
                    <pub-id pub-id-type="pmid">24453331</pub-id>
                    <pub-id pub-id-type="doi">10.1523/JNEUROSCI.4488-13.2014</pub-id>
                    <pub-id pub-id-type="pmcid">3898298</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-36">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Angeles-Albores</surname>
                            <given-names>D</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>N Lee</surname>
                            <given-names>RY</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Chan</surname>
                            <given-names>J</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Tissue enrichment analysis for C. elegans genomics.</article-title>
                    <source>
				
                        <italic toggle="yes">BMC Bioinformatics.</italic>
			</source>
                    <year>2016</year>;<volume>17</volume>(<issue>1</issue>):<fpage>366</fpage>.
                    <pub-id pub-id-type="pmid">27618863</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12859-016-1229-9</pub-id>
                    <pub-id pub-id-type="pmcid">5020436</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-37">
                <label>37</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Marbach</surname>
                            <given-names>D</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Lamparter</surname>
                            <given-names>D</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Quon</surname>
                            <given-names>G</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases.</article-title>
                    <source>
				
                        <italic toggle="yes">Nat Methods.</italic>
			</source>
                    <year>2016</year>;<volume>13</volume>(<issue>4</issue>):<fpage>366</fpage>&#x2013;<lpage>70</lpage>.
                    <pub-id pub-id-type="pmid">26950747</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmeth.3799</pub-id>
                    <pub-id pub-id-type="pmcid">4967716</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-38">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>RYN</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Howe</surname>
                            <given-names>KL</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Harris</surname>
                            <given-names>TW</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>WormBase 2017: molting into a new stage.</article-title>
                    <source>
				
                        <italic toggle="yes">Nucleic Acids Res.</italic>
			</source>
                    <year>2018</year>;<volume>46</volume>(<issue>D1</issue>):<fpage>D869</fpage>&#x2013;<lpage>D874</lpage>.
                    <pub-id pub-id-type="pmid">29069413</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkx998</pub-id>
                    <pub-id pub-id-type="pmcid">5753391</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-39">
                <label>39</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Zerbino</surname>
                            <given-names>DR</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Achuthan</surname>
                            <given-names>P</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Akanni</surname>
                            <given-names>W</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Ensembl 2018.</article-title>
                    <source>
				
                        <italic toggle="yes">Nucleic Acids Res.</italic>
			</source>
                    <year>2018</year>;<volume>46</volume>(<issue>D1</issue>):<fpage>D754</fpage>&#x2013;<lpage>D761</lpage>.
                    <pub-id pub-id-type="pmid">29155950</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkx1098</pub-id>
                    <pub-id pub-id-type="pmcid">5753206</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-40">
                <label>40</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Kersey</surname>
                            <given-names>PJ</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Allen</surname>
                            <given-names>JE</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Allot</surname>
                            <given-names>A</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.</article-title>
                    <source>
				
                        <italic toggle="yes">Nucleic Acids Res.</italic>
			</source>
                    <year>2018</year>;<volume>46</volume>(<issue>D1</issue>):<fpage>D802</fpage>&#x2013;<lpage>D808</lpage>.
                    <pub-id pub-id-type="pmid">29092050</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkx1011</pub-id>
                    <pub-id pub-id-type="pmcid">5753204</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-41">
                <label>41</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Bray</surname>
                            <given-names>NL</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Pimentel</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Melsted</surname>
                            <given-names>P</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Near-optimal probabilistic RNA-seq quantification.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Biotechnol.</italic>
					</source>
                    <year>2016</year>;<volume>34</volume>(<issue>5</issue>):<fpage>525</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">27043002</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.3519</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-42">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Robinson</surname>
                            <given-names>MD</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Oshlack</surname>
                            <given-names>A</given-names>
                        </name>
			</person-group>:
                    <article-title>A scaling normalization method for differential expression analysis of RNA-seq data.</article-title>
                    <source>
				
                        <italic toggle="yes">Genome Biol.</italic>
			</source>
                    <year>2010</year>;<volume>11</volume>(<issue>3</issue>):<fpage>R25</fpage>.
                    <pub-id pub-id-type="pmid">20196867</pub-id>
                    <pub-id pub-id-type="doi">10.1186/gb-2010-11-3-r25</pub-id>
                    <pub-id pub-id-type="pmcid">2864565</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-43">
                <label>43</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Rosikiewicz</surname>
                            <given-names>M</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Comte</surname>
                            <given-names>A</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Niknejad</surname>
                            <given-names>A</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Uncovering hidden duplicated content in public transcriptomics data.</article-title>
                    <source>
				
                        <italic toggle="yes">Database (Oxford).</italic>
			</source>
                    <year>2013</year>;<volume>2013</volume>:<fpage>bat010</fpage>.
                    <pub-id pub-id-type="pmid">23487185</pub-id>
                    <pub-id pub-id-type="doi">10.1093/database/bat010</pub-id>
                    <pub-id pub-id-type="pmcid">3595988</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-44">
                <label>44</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Rosikiewicz</surname>
                            <given-names>M</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Robinson-Rechavi</surname>
                            <given-names>M</given-names>
                        </name>
			</person-group>:
                    <article-title>IQRray, a new method for Affymetrix microarray quality control, and the homologous organ conservation score, a new benchmark method for quality control metrics.</article-title>
                    <source>
				
                        <italic toggle="yes">Bioinformatics.</italic>
			</source>
                    <year>2014</year>;<volume>30</volume>(<issue>10</issue>):<fpage>1392</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">24451627</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btu027</pub-id>
                    <pub-id pub-id-type="pmcid">4016700</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-45">
                <label>45</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>Z</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Irizarry</surname>
                            <given-names>RA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Gentleman</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>A Model-Based Background Adjustment for Oligonucleotide Expression Arrays.</article-title>
                    <source>
						
                        <italic toggle="yes">J Am Stat Assoc.</italic>
					</source>
                    <year>2004</year>;<volume>99</volume>(<issue>468</issue>):<fpage>909</fpage>&#x2013;<lpage>17</lpage>.
                    <pub-id pub-id-type="doi">10.1198/016214504000000683</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-46">
                <label>46</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Hubbell</surname>
                            <given-names>E</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>WM</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Mei</surname>
                            <given-names>R</given-names>
                        </name>
			</person-group>:
                    <article-title>Robust estimators for expression analysis.</article-title>
                    <source>
				
                        <italic toggle="yes">Bioinformatics.</italic>
			</source>
                    <year>2002</year>;<volume>18</volume>(<issue>12</issue>):<fpage>1585</fpage>&#x2013;<lpage>92</lpage>.
                    <pub-id pub-id-type="pmid">12490442</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/18.12.1585</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-47">
                <label>47</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Schuster</surname>
                            <given-names>EF</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Blanc</surname>
                            <given-names>E</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Partridge</surname>
                            <given-names>L</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Correcting for sequence biases in present/absent calls.</article-title>
                    <source>
				
                        <italic toggle="yes">Genome Biol.</italic>
			</source>
                    <year>2007</year>;<volume>8</volume>(<issue>6</issue>):<fpage>R125</fpage>.
                    <pub-id pub-id-type="pmid">17594492</pub-id>
                    <pub-id pub-id-type="doi">10.1186/gb-2007-8-6-r125</pub-id>
                    <pub-id pub-id-type="pmcid">2394774</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-48">
                <label>48</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>QT</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Piotrowska</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Ciemerych</surname>
                            <given-names>MA</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>A genome-wide study of gene activity reveals developmental signaling pathways in the preimplantation mouse embryo.</article-title>
                    <source>
						
                        <italic toggle="yes">Dev Cell.</italic>
					</source>
                    <year>2004</year>;<volume>6</volume>(<issue>1</issue>):<fpage>133</fpage>&#x2013;<lpage>44</lpage>.
                    <pub-id pub-id-type="pmid">14723853</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S1534-5807(03)00404-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-49">
                <label>49</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>Z</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Irizarry</surname>
                            <given-names>RA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Gentleman</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>A Model-Based Background Adjustment for Oligonucleotide Expression Arrays.</article-title>
                    <source>
						
                        <italic toggle="yes">J Am Stat Assoc.</italic>
					</source>
                    <year>2004</year>;<volume>99</volume>(<issue>468</issue>):<fpage>909</fpage>&#x2013;<lpage>17</lpage>.
                    <pub-id pub-id-type="doi">10.1198/016214504000000683</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-50">
                <label>50</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Merkin</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Russell</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>P</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Evolutionary dynamics of gene and isoform regulation in Mammalian tissues.</article-title>
                    <source>
						
                        <italic toggle="yes">Science.</italic>
					</source>
                    <year>2012</year>;<volume>338</volume>(<issue>6114</issue>):<fpage>1593</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">23258891</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.1228186</pub-id>
                    <pub-id pub-id-type="pmcid">3568499</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-51">
                <label>51</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Brawand</surname>
                            <given-names>D</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Soumillon</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Necsulea</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>The evolution of gene expression levels in mammalian organs.</article-title>
                    <source>
						
                        <italic toggle="yes">Nature.</italic>
					</source>
                    <year>2011</year>;<volume>478</volume>(<issue>7369</issue>):<fpage>343</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="pmid">22012392</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature10532</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-52">
                <label>52</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wagner</surname>
                            <given-names>GP</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kin</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lynch</surname>
                            <given-names>VJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples.</article-title>
                    <source>
						
                        <italic toggle="yes">Theory Biosci.</italic>
					</source>
                    <year>2012</year>;<volume>131</volume>(<issue>4</issue>):<fpage>281</fpage>&#x2013;<lpage>5</lpage>.
                    <pub-id pub-id-type="pmid">22872506</pub-id>
                    <pub-id pub-id-type="doi">10.1007/s12064-012-0162-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-53">
                <label>53</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>B</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Dewey</surname>
                            <given-names>CN</given-names>
                        </name>
					</person-group>:
                    <article-title>RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.</article-title>
                    <source>
						
                        <italic toggle="yes">BMC Bioinformatics.</italic>
					</source>
                    <year>2011</year>;<volume>12</volume>:<fpage>323</fpage>.
                    <pub-id pub-id-type="pmid">21816040</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-12-323</pub-id>
                    <pub-id pub-id-type="pmcid">3163565</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-54">
                <label>54</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Roux</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Rosikiewicz</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Robinson-Rechavi</surname>
                            <given-names>M</given-names>
                        </name>
					</person-group>:
                    <article-title>What to compare and how: Comparative transcriptomics for Evo-Devo.</article-title>
                    <source>
						
                        <italic toggle="yes">J Exp Zool B Mol Dev Evol.</italic>
					</source>
                    <year>2015</year>;<volume>324</volume>(<issue>4</issue>):<fpage>372</fpage>&#x2013;<lpage>82</lpage>.
                    <pub-id pub-id-type="pmid">25864439</pub-id>
                    <pub-id pub-id-type="doi">10.1002/jez.b.22618</pub-id>
                    <pub-id pub-id-type="pmcid">4949521</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-55">
                <label>55</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Gilad</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Mizrahi-Man</surname>
                            <given-names>O</given-names>
                        </name>
					</person-group>:
                    <article-title>A reanalysis of mouse ENCODE comparative gene expression data [version 1; referees: 3 approved, 1 approved with reservations].</article-title>
                    <source>
						
                        <italic toggle="yes">F1000Res.</italic>
					</source>
                    <year>2015</year>;<volume>4</volume>:<fpage>121</fpage>.
                    <pub-id pub-id-type="pmid">26236466</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.6536.1</pub-id>
                    <pub-id pub-id-type="pmcid">4516019</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-56">
                <label>56</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Leek</surname>
                            <given-names>JT</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Scharpf</surname>
                            <given-names>RB</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bravo</surname>
                            <given-names>HC</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Tackling the widespread and critical impact of batch effects in high-throughput data.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Rev Genet.</italic>
					</source>
                    <year>2010</year>;<volume>11</volume>(<issue>10</issue>):<fpage>733</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">20838408</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrg2825</pub-id>
                    <pub-id pub-id-type="pmcid">3880143</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-57">
                <label>57</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Akey</surname>
                            <given-names>JM</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Biswas</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Leek</surname>
                            <given-names>JT</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>On the design and analysis of gene expression studies in human populations.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Genet.</italic>
					</source>
                    <year>2007</year>;<volume>39</volume>(<issue>7</issue>):<fpage>807</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="pmid">17597765</pub-id>
                    <pub-id pub-id-type="doi">10.1038/ng0707-807</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-58">
                <label>58</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Deane</surname>
                            <given-names>CM</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Salwi&#x0144;ski</surname>
                            <given-names>&#x0141;</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Xenarios</surname>
                            <given-names>I</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Protein Interactions: Two Methods for Assessment of the Reliability of High Throughput Observations.</article-title>
                    <source>
						
                        <italic toggle="yes">Mol Cell Proteomics.</italic>
					</source>
                    <year>2002</year>;<volume>1</volume>(<issue>5</issue>):<fpage>349</fpage>&#x2013;<lpage>56</lpage>.
                    <pub-id pub-id-type="pmid">12118076</pub-id>
                    <pub-id pub-id-type="doi">10.1074/mcp.M100037-MCP200</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-59">
                <label>59</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Kotlyar</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Pastrello</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Sheahan</surname>
                            <given-names>N</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Integrated interactions database: tissue-specific view of the human and model organism interactomes.</article-title>
                    <source>
						
                        <italic toggle="yes">Nucleic Acids Res.</italic>
					</source>
                    <year>2016</year>;<volume>44</volume>(<issue>D1</issue>):<fpage>D536</fpage>&#x2013;<lpage>D41</lpage>.
                    <pub-id pub-id-type="pmid">26516188</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkv1115</pub-id>
                    <pub-id pub-id-type="pmcid">4702811</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-60">
                <label>60</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Futschik</surname>
                            <given-names>ME</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Carlisle</surname>
                            <given-names>B</given-names>
                        </name>
					</person-group>:
                    <article-title>Noise-robust soft clustering of gene expression time-course data.</article-title>
                    <source>
						
                        <italic toggle="yes">J Bioinform Comput Biol.</italic>
					</source>
                    <year>2005</year>;<volume>3</volume>(<issue>4</issue>):<fpage>965</fpage>&#x2013;<lpage>88</lpage>.
                    <pub-id pub-id-type="pmid">16078370</pub-id>
                    <pub-id pub-id-type="doi">10.1142/S0219720005001375</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-61">
                <label>61</label>
                <mixed-citation publication-type="journal">
                    <article-title>Mfuzz Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.Mfuzz</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-62">
                <label>62</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Robinson</surname>
                            <given-names>MD</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>McCarthy</surname>
                            <given-names>DJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Smyth</surname>
                            <given-names>GK</given-names>
                        </name>
					</person-group>:
                    <article-title>edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.</article-title>
                    <source>
						
                        <italic toggle="yes">Bioinformatics.</italic>
					</source>
                    <year>2010</year>;<volume>26</volume>(<issue>1</issue>):<fpage>139</fpage>&#x2013;<lpage>40</lpage>.
                    <pub-id pub-id-type="pmid">19910308</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btp616</pub-id>
                    <pub-id pub-id-type="pmcid">2796818</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-63">
                <label>63</label>
                <mixed-citation publication-type="journal">
                    <article-title>edgeR Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.edgeR</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-64">
                <label>64</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Howe</surname>
                            <given-names>DG</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bradford</surname>
                            <given-names>YM</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Conlin</surname>
                            <given-names>T</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>ZFIN, the Zebrafish Model Organism Database: increased support for mutants and transgenics.</article-title>
                    <source>
						
                        <italic toggle="yes">Nucleic Acids Res.</italic>
					</source>
                    <year>2013</year>;<volume>41</volume>(<issue>Database issue</issue>):<fpage>D854</fpage>&#x2013;<lpage>60</lpage>.
                    <pub-id pub-id-type="pmid">23074187</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gks938</pub-id>
                    <pub-id pub-id-type="pmcid">3531097</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-65">
                <label>65</label>
                <mixed-citation publication-type="journal">
                    <article-title>biomaRt Bioconductor package</article-title>.
                    <pub-id pub-id-type="doi">10.18129/B9.bioc.biomaRt</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-66">
                <label>66</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Timmons</surname>
                            <given-names>JA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Szkop</surname>
                            <given-names>KJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Gallagher</surname>
                            <given-names>IJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Multiple sources of bias confound functional enrichment analysis of global -omics data.</article-title>
                    <source>
						
                        <italic toggle="yes">Genome Biol.</italic>
					</source>
                    <year>2015</year>;<volume>16</volume>(<issue>1</issue>):<fpage>186</fpage>.
                    <pub-id pub-id-type="pmid">26346307</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s13059-015-0761-7</pub-id>
                    <pub-id pub-id-type="pmcid">4561415</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-67">
                <label>67</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Komljenovic</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Roux</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Robinson-Rechavi</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>BgeeDB/BgeeDB_R: Bgee R package release 2.6.2.</article-title>
                    <source>
						
                        <italic toggle="yes">Zenodo.</italic>
					</source>
                    <year>2018</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.5281/zenodo.1293418">http://www.doi.org/10.5281/zenodo.1293418</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report36881">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.16883.r36881</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Collado-Torres</surname>
                        <given-names>Leonardo</given-names>
                    </name>
                    <xref ref-type="aff" rid="r36881a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2140-308X</uri>
                </contrib>
                <aff id="r36881a1">
                    <label>1</label>Lieber Institute for Brain Development, Baltimore, MD, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>28</day>
                <month>8</month>
                <year>2018</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2018 Collado-Torres L</copyright-statement>
                <copyright-year>2018</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport36881" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9973.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors (including the new author J. Wollbrett) have addressed all my comments from version 1 very throughly or plan to address some of the more technical ones in the future.&#x00a0;</p>
            <p> </p>
            <p> I want to highlight all the work the authors&#x00a0;did ensuring that their paper is reproducible (they mention versions of packages throughout the text) and in clarifying their work (mostly updates on the introduction). I was able to download supplementary file 1 (an .Rmd file) and execute it without any problems or modifications at all using R 3.5.1 with Bioconductor 3.7 on a&#x00a0;Mac (BgeeDB 2.6.2, MFuzz 2.40.0, biomaRt 2.36.1). The authors did a great job with&#x00a0;https://github.com/BgeeDB/bgee_pipeline which I believe includes all the information needed for anyone who is interested in the Bgee pipeline. If not, the authors seem responsive on&#x00a0;https://github.com/BgeeDB/bgee_pipeline/issues and https://github.com/BgeeDB/BgeeDB_R/issues which is a great sign. Thanks to their changes in the introduction I now have a better understanding of their anatomical expression enrichment test.&#x00a0;</p>
            <p> </p>
            <p> I look forward to seeing how the community uses BgeeDB in the future and the use cases they apply it to.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17980">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10748.r17980</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Collado-Torres</surname>
                        <given-names>Leonardo</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17980a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2140-308X</uri>
                </contrib>
                <aff id="r17980a1">
                    <label>1</label>Lieber Institute for Brain Development, Baltimore, MD, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>16</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Collado-Torres L</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17980" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9973.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>reject</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In this manuscript the authors describe the BgeeDB Bioconductor package and show how to use it (as of Bioconductor 3.4) to interact with Bgee
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-17980-1">1</xref>
                </sup>&#x00a0;in order to get the data from Bgee into your R session. This allows users to then perform differential expression analyses and integrate Bgee with other data sets such as unpublished data. The manuscript includes code that shows how to use BgeeDB and showcases it's different features including their unique anatomical expression enrichment analysis method.</p>
            <p> </p>
            <p> I find interesting that you can use BgeeDB to get data from different platforms and from different organisms. Most of this can be done using other packages such as GEOquery, but BgeeDB makes it so the user doesn't have to do all the processing of the data and standarization over multiple projects.</p>
            <p> </p>
            <p> My main concern with the manuscript in its current form and the BgeeDB package itself is the lack of clarity on how the data has been processed and how the anatomical expression test works. That is, it could potentially become a black box that produces interesting output but hides information that could be important.</p>
            <p> </p>
            <p> For example, I'm sure some of the Affymetrix data could be downloaded with other packages and I do not know what would be the differences between the raw data and the data downloaded via BgeeDB. Is the data in BgeeDB normalized? If so, how? The help pages of BgeeDB, the package vignette, the original Bgee publication
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-17980-1">1</xref>
                </sup> and 
                <ext-link ext-link-type="uri" xlink:href="http://bgee.org/?page=doc">http://bgee.org/?page=doc</ext-link>&#x00a0;did not help me fully answer these questions (I might have missed the information). Maybe the functions in BgeeDB could print a message describing the main steps of how a given data set was processed or this could be added to the help pages. I currently ignore if all data sets were treated the same. For instance, is all the Affymetrix data normalized with the same method and same parameters? I assume that the answer is yes but I don't know. I suggest that the authors describe in more detail the data available in Bgee. The authors might want to consider making the processing code public at&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB">https://github.com/BgeeDB</ext-link>&#x00a0;or citable via 
                <ext-link ext-link-type="uri" xlink:href="https://figshare.com/">figshare</ext-link>.</p>
            <p> </p>
            <p> With the anatomical expression test it's not clear to me how to interpret the results from BgeeDB::makeTable(). I understand that the authors will describe the details of how their anatomical test works in a future publication, which they did before with Bgee
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-17980-1">1</xref>
                </sup> and Homolanto
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-17980-2">2</xref>
                </sup>. Ideally, the anatomical expression test would have been described first followed by BgeeDB. Without hindering the current plan, I believe that the authors could provide a summary of how TopAnat works. Then they can explain it fully in the planned future TopAnat publication. I am also curious on how users could use their own data to improve the TopAnat results, although that could be work for the TopAnat paper or future work.</p>
            <p> </p>
            <p> I think that the manuscript is overall well written and will be more appealing if the data and main features (TopAnat) are described in more detail. I hope that the authors are not discouraged by my report.&#x00a0;</p>
            <p> </p>
            <p> Best,</p>
            <p> Leonardo</p>
            <p> </p>
            <p> </p>
            <p> Minor comments: 
                <list list-type="bullet">
                    <list-item>
                        <p>I'm an author of recount
                            <sup>
                                <xref ref-type="bibr" rid="rep-ref-17980-3">3</xref>
                            </sup>&#x00a0;which is incorrectly cited here. The pre-print version of&#x00a0;
                            <ext-link ext-link-type="uri" xlink:href="https://jhubiostatistics.shinyapps.io/recount/">https://jhubiostatistics.shinyapps.io/recount/</ext-link>&#x00a0;had data from 2040 different projects which together made up more than 60,000 RNA-seq samples. The current version has data from over 70,000 Illumina human RNA-seq samples from SRA, GTEx and TCGA.</p>
                    </list-item>
                    <list-item>
                        <p>I don't think that it makes sense to include the str() calls in the paper. They do make sense in the supplementary material (the html and pdf rendered versions of the paper code) since those include the output. Also, while str() shows all the details of an object, it can encourage users to write code that depends on the internal structure of the object. You might want to consider adding accessor functions.</p>
                    </list-item>
                    <list-item>
                        <p>If you added indentation the code that runs over multiple lines would be easier to read. You can use the Bioconductor standard of using 4 spaces at the start of the line. Also make sure that object names don't get split into multiple lines. For example check the line after the "list experiments including both brain and liver samples" comment where "Anatomical.entity.ID" gets split into "Anatomical.e" and "ntity.ID" in the html version of the paper. Copy pasting works fine, but if someone prints the paper they might introduce errors can be avoided with better formatting.&#x00a0;F1000Research's team should be able to tell you what is the character limit per line to use so that the PDF and HTML versions look great. The 
                            <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/web/packages/formatR/index.html">formatR</ext-link> package might be useful here.</p>
                    </list-item>
                    <list-item>
                        <p>I would not use numerical indexes in the code since the results could change with time in such a way that the current code would not work in the future or worse, it might run without error but change the results in a way a new user would not notice. For example, change the code on the line after the "order developmental stages" comment which currently reads:</p>
                        <p> </p>
                        <p> data.E.MEXP.51.formatted &lt;- data.E.MEXP.51.formatted[, c(5,8,9,3,2,1,4,7,6)]</p>
                    </list-item>
                    <list-item>
                        <p>The comment that reads with "retrieve anatomical&#x00a0;structures enriched&#x00a0;at a 1% FDR threshold" is mixed with the code. That is, you are missing a new line character.</p>
                    </list-item>
                    <list-item>
                        <p>Reference 46 is incorrect. It's edgeR, not eedgeR.</p>
                    </list-item>
                    <list-item>
                        <p>The package's vignette is missing a title as currently shown at&#x00a0;
                            <ext-link ext-link-type="uri" xlink:href="http://bioconductor.org/packages/release/bioc/html/BgeeDB.html">http://bioconductor.org/packages/release/bioc/html/BgeeDB.html</ext-link>.</p>
                    </list-item>
                    <list-item>
                        <p>I recommend adding internal R links to your manual pages. For example, ?topAnat mentions&#x00a0;loadTopAnatData(). Those links make it easier for a user to browse the help pages.</p>
                    </list-item>
                </list> </p>
            <p> I was able to run all the code without any edits (beyond that new line issue I already mentioned) using Bioconductor 3.4 (current Bioc-release) on R 3.3.1. Here are my session details:</p>
            <p> </p>
            <p> &gt; options(width = 120)</p>
            <p> &gt; devtools::session_info()</p>
            <p> Session info -----------------------------------------------------------------------------------------------------------</p>
            <p> &#x00a0;setting &#x00a0;value &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;&#x00a0;</p>
            <p> &#x00a0;version &#x00a0;R version 3.3.1 (2016-06-21)</p>
            <p> &#x00a0;system &#x00a0; x86_64, mingw32 &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;&#x00a0;</p>
            <p> &#x00a0;ui &#x00a0; &#x00a0; &#x00a0; RStudio (0.99.902) &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;</p>
            <p> &#x00a0;language (EN) &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;</p>
            <p> &#x00a0;collate &#x00a0;English_United States.1252 &#x00a0;</p>
            <p> &#x00a0;tz &#x00a0; &#x00a0; &#x00a0; America/Mexico_City &#x00a0; &#x00a0; &#x00a0; &#x00a0;&#x00a0;</p>
            <p> &#x00a0;date &#x00a0; &#x00a0; 2016-12-15 &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;</p>
            <p> </p>
            <p> Packages ---------------------------------------------------------------------------------------------------------------</p>
            <p> &#x00a0;package &#x00a0; &#x00a0; &#x00a0; * version &#x00a0;date &#x00a0; &#x00a0; &#x00a0; source &#x00a0; &#x00a0; &#x00a0; &#x00a0;</p>
            <p> &#x00a0;AnnotationDbi * 1.36.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;assertthat &#x00a0; &#x00a0; &#x00a0;0.1 &#x00a0; &#x00a0; &#x00a0;2013-12-06 CRAN (R 3.3.1)</p>
            <p> &#x00a0;BgeeDB &#x00a0; &#x00a0; &#x00a0; &#x00a0;* 2.0.0 &#x00a0; &#x00a0;2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;Biobase &#x00a0; &#x00a0; &#x00a0; * 2.34.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;BiocGenerics &#x00a0;* 0.20.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;BiocInstaller * 1.24.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;biomaRt &#x00a0; &#x00a0; &#x00a0; * 2.30.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;bitops &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;1.0-6 &#x00a0; &#x00a0;2013-08-17 CRAN (R 3.3.1)</p>
            <p> &#x00a0;class &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; 7.3-14 &#x00a0; 2015-08-30 CRAN (R 3.3.1)</p>
            <p> &#x00a0;data.table &#x00a0; &#x00a0; &#x00a0;1.10.0 &#x00a0; 2016-12-03 CRAN (R 3.3.2)</p>
            <p> &#x00a0;DBI &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; 0.5-1 &#x00a0; &#x00a0;2016-09-10 CRAN (R 3.3.1)</p>
            <p> &#x00a0;devtools &#x00a0; &#x00a0; &#x00a0; &#x00a0;1.12.0 &#x00a0; 2016-06-24 CRAN (R 3.3.1)</p>
            <p> &#x00a0;digest &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;0.6.10 &#x00a0; 2016-08-02 CRAN (R 3.3.1)</p>
            <p> &#x00a0;dplyr &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; 0.5.0 &#x00a0; &#x00a0;2016-06-24 CRAN (R 3.3.1)</p>
            <p> &#x00a0;DynDoc &#x00a0; &#x00a0; &#x00a0; &#x00a0;* 1.52.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;e1071 &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 1.6-7 &#x00a0; &#x00a0;2015-08-05 CRAN (R 3.3.1)</p>
            <p> &#x00a0;edgeR &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 3.16.4 &#x00a0; 2016-11-27 Bioconductor &#x00a0;</p>
            <p> &#x00a0;GO.db &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 3.4.0 &#x00a0; &#x00a0;2016-10-22 Bioconductor &#x00a0;</p>
            <p> &#x00a0;graph &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 1.52.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;IRanges &#x00a0; &#x00a0; &#x00a0; * 2.8.1 &#x00a0; &#x00a0;2016-11-08 Bioconductor &#x00a0;</p>
            <p> &#x00a0;lattice &#x00a0; &#x00a0; &#x00a0; &#x00a0; 0.20-34 &#x00a0;2016-09-06 CRAN (R 3.3.1)</p>
            <p> &#x00a0;limma &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 3.30.6 &#x00a0; 2016-11-29 Bioconductor &#x00a0;</p>
            <p> &#x00a0;locfit &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;1.5-9.1 &#x00a0;2013-04-20 CRAN (R 3.3.1)</p>
            <p> &#x00a0;magrittr &#x00a0; &#x00a0; &#x00a0; &#x00a0;1.5 &#x00a0; &#x00a0; &#x00a0;2014-11-22 CRAN (R 3.3.1)</p>
            <p> &#x00a0;matrixStats &#x00a0; &#x00a0; 0.51.0 &#x00a0; 2016-10-09 CRAN (R 3.3.1)</p>
            <p> &#x00a0;memoise &#x00a0; &#x00a0; &#x00a0; &#x00a0; 1.0.0 &#x00a0; &#x00a0;2016-01-29 CRAN (R 3.3.1)</p>
            <p> &#x00a0;Mfuzz &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 2.34.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;R6 &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;2.2.0 &#x00a0; &#x00a0;2016-10-05 CRAN (R 3.3.1)</p>
            <p> &#x00a0;Rcpp &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;0.12.8 &#x00a0; 2016-11-17 CRAN (R 3.3.2)</p>
            <p> &#x00a0;RCurl &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; 1.95-4.8 2016-03-01 CRAN (R 3.3.1)</p>
            <p> &#x00a0;rsconnect &#x00a0; &#x00a0; &#x00a0; 0.6 &#x00a0; &#x00a0; &#x00a0;2016-11-21 CRAN (R 3.3.2)</p>
            <p> &#x00a0;RSQLite &#x00a0; &#x00a0; &#x00a0; &#x00a0; 1.1-1 &#x00a0; &#x00a0;2016-12-10 CRAN (R 3.3.2)</p>
            <p> &#x00a0;S4Vectors &#x00a0; &#x00a0; * 0.12.1 &#x00a0; 2016-12-01 Bioconductor &#x00a0;</p>
            <p> &#x00a0;SparseM &#x00a0; &#x00a0; &#x00a0; * 1.74 &#x00a0; &#x00a0; 2016-11-10 CRAN (R 3.3.2)</p>
            <p> &#x00a0;tibble &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;1.2 &#x00a0; &#x00a0; &#x00a0;2016-08-26 CRAN (R 3.3.1)</p>
            <p> &#x00a0;tidyr &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 0.6.0 &#x00a0; &#x00a0;2016-08-12 CRAN (R 3.3.1)</p>
            <p> &#x00a0;tkWidgets &#x00a0; &#x00a0; &#x00a0; 1.52.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;topGO &#x00a0; &#x00a0; &#x00a0; &#x00a0; * 2.26.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;widgetTools &#x00a0; * 1.52.0 &#x00a0; 2016-10-18 Bioconductor &#x00a0;</p>
            <p> &#x00a0;withr &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; 1.0.2 &#x00a0; &#x00a0;2016-06-20 CRAN (R 3.3.1)</p>
            <p> &#x00a0;XML &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; 3.98-1.5 2016-11-10 CRAN (R 3.3.2)</p>
            <p> </p>
            <p> </p>
            <p> Regarding Virag Sharma's peer review report
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-17980-4">4</xref>
                </sup>, I assume that Virag was using an earlier R version (and thus an earlier Bioconductor version). The current development version of BgeeDB uses "dataType" and not "datatype", just like the release version. Check&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://github.com/Bioconductor-mirror/BgeeDB/search?utf8=%E2%9C%93&amp;q=datatype">https://github.com/Bioconductor-mirror/BgeeDB/search?utf8=%E2%9C%93&amp;q=datatype</ext-link>. Hopefully the authors won't change the spelling of arguments in the future since that's confusing for users, although that's certainly doable following the deprecated/defunct code cycle.</p>
            <p> </p>
            <p> Regarding&#x00a0;Daniel S. Himmelstein's peer review report
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-17980-5">5</xref>
                </sup>, there is no need to add a license file when the license is specified in the DESCRIPTION file of an R package. See&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://github.com/Bioconductor-mirror/BgeeDB/blob/master/DESCRIPTION#L14">https://github.com/Bioconductor-mirror/BgeeDB/blob/master/DESCRIPTION#L14</ext-link>&#x00a0;where they state that the license is GPL-2. Although the authors should make sure that they correctly specify which license their software is released on: GPL-2 or GPLv3 as Daniel mentioned. Regarding where to place bug reports, the authors could resolve this by specifying the "BugReports" field in their DESCRIPTION file. For example see&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://github.com/Bioconductor-mirror/recount/blob/master/DESCRIPTION#L63">https://github.com/Bioconductor-mirror/recount/blob/master/DESCRIPTION#L63</ext-link>. I also agree with Daniel that currently BgeeDB has a bit of a messy download structure. I would prefer if the files were downloaded in a single directory (say "bgee_downloads") instead of the current working directory.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-17980-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Bgee: Integrating and Comparing Heterogeneous Transcriptome Data Among Species</article-title>.<year>2008</year>;
                        <elocation-id>10.1007/978-3-540-69828-9_12</elocation-id>
                        <fpage>124</fpage>-<lpage>131</lpage>
                        <pub-id pub-id-type="doi">10.1007/978-3-540-69828-9_12</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-17980-2">
                    <label>2</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Homolonto: generating homology relationships by pairwise alignment of ontologies and application to vertebrate anatomy.</article-title>
                        <source>
                            <italic>Bioinformatics</italic>
                        </source>.<year>2010</year>;<volume>26</volume>(<issue>14</issue>) :
                        <elocation-id>10.1093/bioinformatics/btq283</elocation-id>
                        <fpage>1766</fpage>-<lpage>71</lpage>
                        <pub-id pub-id-type="pmid">20519284</pub-id>
                        <pub-id pub-id-type="doi">10.1093/bioinformatics/btq283</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-17980-3">
                    <label>3</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>recount: A large-scale resource of analysis-ready RNA-seq expression data</article-title>.<year>2016</year>;
                        <elocation-id>10.1101/068478</elocation-id>
                        <pub-id pub-id-type="doi">10.1101/068478</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-17980-4">
                    <label>4</label>
                    <mixed-citation>
                        <person-group person-group-type="author"/>:
                        <article-title>Referee Report For: BgeeDB, an R package for retrieval of curated expression datasets and for gene list expression localization enrichment tests [version 1; referees: 1 approved, 1 approved with reservations]</article-title>.
                        <source>
                            <italic>F1000Research</italic>
                        </source>.<year>2016</year>;<volume>5</volume>(<issue>2748</issue>) :
                        <elocation-id>10.5256/f1000research.10748.r17925</elocation-id>
                        <pub-id pub-id-type="doi">10.5256/f1000research.10748.r17925</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-17980-5">
                    <label>5</label>
                    <mixed-citation>
                        <person-group person-group-type="author"/>:
                        <article-title>Referee Report For: BgeeDB, an R package for retrieval of curated expression datasets and for gene list expression localization enrichment tests [version 1; referees: 1 approved, 1 approved with reservations]</article-title>.<year>2016</year>;<volume>5</volume>(<issue>2748</issue>) :
                        <elocation-id>10.5256/f1000research.10748.r18221</elocation-id>
                        <pub-id pub-id-type="doi">10.5256/f1000research.10748.r18221</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment3775-17980">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Bastian</surname>
                            <given-names>Frederic</given-names>
                        </name>
                        <aff>Swiss Institute of Bioinformatics - University of Lausanne, Switzerland</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>6</month>
                    <year>2018</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <italic>My main concern with the manuscript in its current form and the BgeeDB package itself is the lack of clarity on how the data has been processed and how the anatomical expression test works. That is, it could potentially become a black box that produces interesting output but hides information that could be important.</italic>
                </p>
                <p>
                    <italic> For example, I'm sure some of the Affymetrix data could be downloaded with other packages and I do not know what would be the differences between the raw data and the data downloaded via BgeeDB. Is the data in BgeeDB normalized? If so, how? The help pages of BgeeDB, the package vignette, the original Bgee publication1 and http://bgee.org/?page=doc did not help me fully answer these questions (I might have missed the information).</italic>
                </p>
                <p> We have made public the Bgee pipeline source code at https://github.com/BgeeDB/bgee_pipeline. We also have added a paragraph at the end of the "Introduction" section, pointing to the relevant part of the documentation for RNA-Seq and Affymetrix analyses, and describing them in brief.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>&#x00a0;Maybe the functions in BgeeDB could print a message describing the main steps of how a given data set was processed or this could be added to the help pages.</italic>
                </p>
                <p> We have opened an issue on our tracker related to this point, see https://github.com/BgeeDB/BgeeDB_R/issues/22. We will add a function pointing to the relevant documentation in a future release.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>&#x00a0;I currently ignore if all data sets were treated the same. For instance, is all the Affymetrix data normalized with the same method and same parameters? I assume that the answer is yes but I don't know.</italic>
                </p>
                <p> The Affymetrix data are not treated in the same way depending on whether the raw data were available, or only the data processed by using the MAS5 software. This is clarified at the end of the "Introduction" section. Also, in the package, this information about raw data availability can be retrieved in the annotation data frame.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>I suggest that the authors describe in more detail the data available in Bgee. The authors might want to consider making the processing code public at https://github.com/BgeeDB or citable via figshare.</italic>
                </p>
                <p> We have made public the Bgee pipeline source code at https://github.com/BgeeDB/bgee_pipeline.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>With the anatomical expression test it's not clear to me how to interpret the results from BgeeDB::makeTable(). I understand that the authors will describe the details of how their anatomical test works in a future publication, which they did before with Bgee and Homolanto. Ideally, the anatomical expression test would have been described first followed by BgeeDB. Without hindering the current plan, I believe that the authors could provide a summary of how TopAnat works. Then they can explain it fully in the planned future TopAnat publication.</italic>
                </p>
                <p> We have added a brief description of how TopAnat works in the "Introduction" section.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>I am also curious on how users could use their own data to improve the TopAnat results, although that could be work for the TopAnat paper or future work.</italic>
                </p>
                <p> This represents an advanced use of TopAnat that we don't find suitable for the paper. But users can override the association file, mapping genes to anatomical structures in the BgeeDB directory, to use their own data. Also, since the source code of the package is public, users can also modify the mapping files used by modifying the source code.</p>
                <p> </p>
                <p> ---</p>
                <p> </p>
                <p> 
                    <italic>&#x00a0;I'm an author of recount which is incorrectly cited here. The pre-print version of https://jhubiostatistics.shinyapps.io/recount/ had data from 2040 different projects which together made up more than 60,000 RNA-seq samples. The current version has data from over 70,000 Illumina human RNA-seq samples from SRA, GTEx and TCGA.</italic>
                </p>
                <p> We have updated the number in our paper. We apologize for the mistake.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>&#x00a0;I don't think that it makes sense to include the str() calls in the paper. They do make sense in the supplementary material (the html and pdf rendered versions of the paper code) since those include the output. Also, while str() shows all the details of an object, it can encourage users to write code that depends on the internal structure of the object. You might want to consider adding accessor functions.</italic>
                </p>
                <p> We have removed str() calls from the paper. For the future, we will think of adding accessor functions, although several are already available thanks to the topGO package.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>&#x00a0;If you added indentation the code that runs over multiple lines would be easier to read. You can use the Bioconductor standard of using 4 spaces at the start of the line. Also make sure that object names don't get split into multiple lines. For example check the line after the "list experiments including both brain and liver samples" comment where "Anatomical.entity.ID" gets split into "Anatomical.e" and "ntity.ID" in the html version of the paper. Copy pasting works fine, but if someone prints the paper they might introduce errors can be avoided with better formatting. F1000Research's team should be able to tell you what is the character limit per line to use so that the PDF and HTML versions look great. The formatR package might be useful here.</italic>
                </p>
                <p> We didn't know about the formatR package and will have a look at it. In the meantime, we have split such offending lines, as identified by the reviewer.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>&#x00a0;I would not use numerical indexes in the code since the results could change with time in such a way that the current code would not work in the future or worse, it might run without error but change the results in a way a new user would not notice. For example, change the code on the line after the "order developmental stages" comment which currently reads:</italic>
                </p>
                <p>
                    <italic> &#x00a0;data.E.MEXP.51.formatted &lt;- data.E.MEXP.51.formatted[, c(5,8,9,3,2,1,4,7,6)]</italic>
                </p>
                <p> We have replaced all lines using numerical indexes, with use of column names.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>The comment that reads with "retrieve anatomical structures enriched at a 1% FDR threshold" is mixed with the code. That is, you are missing a new line character.</italic>
                </p>
                <p> This was fixed.</p>
                <p> </p>
                <p> ---</p>
                <p> &#x00a0;
                    <italic>Reference 46 is incorrect. It's edgeR, not eedgeR.</italic>
                </p>
                <p> This was fixed.</p>
                <p> </p>
                <p> ---</p>
                <p> &#x00a0;
                    <italic>The package's vignette is missing a title as currently shown at 
                        <ext-link ext-link-type="uri" xlink:href="http://bioconductor.org/packages/release/bioc/html/BgeeDB.html">http://bioconductor.org/packages/release/bioc/html/BgeeDB.html</ext-link>.</italic>
                </p>
                <p> This was added.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>I recommend adding internal R links to your manual pages. For example, ?topAnat mentions loadTopAnatData(). Those links make it easier for a user to browse the help pages.</italic>
                </p>
                <p> We thank the reviewer for the suggestion, and we will implement this in a future release.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>I was able to run all the code without any edits (beyond that new line issue I already mentioned) using Bioconductor 3.4 (current Bioc-release) on R 3.3.1. Here are my session details:</italic>
                </p>
                <p>
                    <italic> &gt; options(width = 120)</italic>
                </p>
                <p>
                    <italic> &gt; devtools::session_info()</italic>
                </p>
                <p>
                    <italic> [...]</italic>
                </p>
                <p>
                    <italic> Regarding Virag Sharma's peer review report4, I assume that Virag was using an earlier R version (and thus an earlier Bioconductor version). The current development version of BgeeDB uses "dataType" and not "datatype", just like the release version. Check https://github.com/Bioconductor-mirror/BgeeDB/search?utf8=%E2%9C%93&amp;q=datatype. Hopefully the authors won't change the spelling of arguments in the future since that's confusing for users, although that's certainly doable following the deprecated/defunct code cycle.</italic>
                </p>
                <p> This is indeed a change that we introduced in an earlier version, in an effort to name all our arguments in a consistent manner. We will try not to change this in the future.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>Regarding Daniel S. Himmelstein's peer review report5, there is no need to add a license file when the license is specified in the DESCRIPTION file of an R package. See https://github.com/Bioconductor-mirror/BgeeDB/blob/master/DESCRIPTION#L14 where they state that the license is GPL-2. Although the authors should make sure that they correctly specify which license their software is released on: GPL-2 or GPLv3 as Daniel mentioned.</italic>
                </p>
                <p> We have updated the DESCRIPTION file in the development branch of Bioconductor. The package is now released under the GPL-3.0 license.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>Regarding where to place bug reports, the authors could resolve this by specifying the "BugReports" field in their DESCRIPTION file. For example see 
                        <ext-link ext-link-type="uri" xlink:href="https://github.com/Bioconductor-mirror/recount/blob/master/DESCRIPTION#L63">https://github.com/Bioconductor-mirror/recount/blob/master/DESCRIPTION#L63</ext-link>.</italic>
                </p>
                <p> This was done.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>I also agree with Daniel that currently BgeeDB has a bit of a messy download structure. I would prefer if the files were downloaded in a single directory (say "bgee_downloads") instead of the current working directory.</italic>
                </p>
                <p> While another directory can be specified by using the "pathToData" argument, it is true that the solution proposed by the reviewer would be convenient, and we will try to update the package accordingly in the future.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report18221">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10748.r18221</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Himmelstein</surname>
                        <given-names>Daniel S.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r18221a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-3012-7446</uri>
                </contrib>
                <aff id="r18221a1">
                    <label>1</label>Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>14</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Himmelstein DS</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport18221" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9973.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This study describes the BgeeDB R package, which provides a programmatic interface for accessing Bgee gene expression data. Bgee is a valuable resource because it integrates gene expression results across many experiments. 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.15363/thinklab.d124">Previously, I've used Bgee</ext-link> for its&#x00a0;presence/absence of expression calls and its differential expression calls.</p>
            <p> </p>
            <p> In my opinion, Bgee's ability to provide a genome-wide profile of expression for a given species, developmental stage, and anatomical structure is its most powerful capability. It was not clear to me whether BgeeDB provides this functionality. For example, can the user retrieve the&#x00a0;normalized expression level across several experiments for the same species-stage-anatomy combination? In general, I think users will be more interested in this high-level functionality than the low level access&#x00a0;BgeeDB currently provides. An example here would likely clear things up for me.</p>
            <p> </p>
            <p> Is it possible to integrate expression levels across Affymetrix and RNA-Seq experiments?</p>
            <p> </p>
            <p> The 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.163768">Zenodo archive</ext-link> of the source code specifies 
                <ext-link ext-link-type="uri" xlink:href="https://opensource.org/licenses/gpl-3.0.html">GPLv3</ext-link> as the license. This is great, but it's ideal to also 
                <ext-link ext-link-type="uri" xlink:href="https://help.github.com/articles/adding-a-license-to-a-repository/">add a LICENSE file</ext-link> to the GitHub.</p>
            <p> </p>
            <p> It looks like there are at least two potential places where bug reports should be filed: on 
                <ext-link ext-link-type="uri" xlink:href="https://support.bioconductor.org/p/new/post/?tag_val=BgeeDB">Bioconductor Support</ext-link>&#x00a0;and 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R/issues">GitHub Issues</ext-link>. It would be nice to clarify the preferred location for filing&#x00a0;bug reports go and opening pull requests.</p>
            <p> </p>
            <p> Currently, the GitHub repository&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R">BgeeDB/BgeeDB_R</ext-link> mentioned in the manuscript is forked from&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://github.com/wirawara/BgeeDB">wirawara/BgeeDB</ext-link>. I expect this may cause some confusion, as BgeeDB/BgeeDB_R should be the upstream repository that users fork and contribute back to. If you make&#x00a0;wirawara/BgeeDB private, this 
                <ext-link ext-link-type="uri" xlink:href="https://help.github.com/articles/what-happens-to-forks-when-a-repository-is-deleted-or-changes-visibility/#changing-a-public-repository-to-a-private-repository">should break</ext-link> the relationship. @wirawara&#x00a0;can then fork&#x00a0;BgeeDB/BgeeDB_R to continue contributions if desired.</p>
            <p> </p>
            <p> Finally, I created some GitHub issues as part of this review: 
                <list list-type="bullet">
                    <list-item>
                        <p>
                            <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R/issues/5">Sample annotation variable names</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <p>
                            <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R/issues/4">A less messy default download directory</ext-link>
                        </p>
                    </list-item>
                </list>
            </p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-18221-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>A quick guide to software licensing for the scientist-programmer.</article-title>
                        <source>
                            <italic>PLoS Comput Biol</italic>
                        </source>.<year>2012</year>;<volume>8</volume>(<issue>7</issue>) :
                        <elocation-id>10.1371/journal.pcbi.1002598</elocation-id>
                        <fpage>e1002598</fpage>
                        <pub-id pub-id-type="pmid">22844236</pub-id>
                        <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002598</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment3776-18221">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Bastian</surname>
                            <given-names>Frederic</given-names>
                        </name>
                        <aff>Swiss Institute of Bioinformatics - University of Lausanne, Switzerland</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>6</month>
                    <year>2018</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <italic>In my opinion, Bgee's ability to provide a genome-wide profile of expression for a given species, developmental stage, and anatomical structure is its most powerful capability. It was not clear to me whether BgeeDB provides this functionality. For example, can the user retrieve the normalized expression level across several experiments for the same species-stage-anatomy combination? In general, I think users will be more interested in this high-level functionality than the low level access BgeeDB currently provides. An example here would likely clear things up for me.</italic>
                </p>
                <p> Indeed there is currently no easy way to do this. As mentioned in 
                    <underline>
                        <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R/issues/7">https://github.com/BgeeDB/BgeeDB_R/issues/7</ext-link>
                    </underline>, it would be nice to have a getDataByCondition function that would return all processed data for chips / libraries matching a queried organ/stage/(sex)/(strain). But it was hard to set priorities for the initial development (should the package complement the web interface, or be orthogonal to it?), and we will likely implement it in the near future.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>Is it possible to integrate expression levels across Affymetrix and RNA-Seq experiments?</italic>
                </p>
                <p> If the reviewer means to integrate present/absent expression calls, it is relatively easy to get all the genes expressed in one tissue and all sub-tissues from Affymetrix and RNA-Seq data, although a dedicated method could be added, for instance:</p>
                <p> library(BgeeDB)</p>
                <p> bgee_human &lt;- Bgee$new(species='Homo_sapiens', dataType=c('rna_seq', 'affymetrix'))</p>
                <p> my_data &lt;- loadTopAnatData(bgee_human)</p>
                <p> calls_by_tissue &lt;- reverseSplit(my_data$gene2anatomy)</p>
                <p> # pick you favorite tissue, for example liver</p>
                <p> calls_by_tissue[["UBERON:0002107"]]</p>
                <p> And this can be limited by stage too, for example:</p>
                <p> my_data &lt;- loadTopAnatData(bgee_human, stage="UBERON:0000068")</p>
                <p> We have noted in the issue 7 mentioned above to add a direct function to do this.</p>
                <p> If the reviewer means to integrate levels of expression, it is then not the aim of Bgee: Bgee integrate different data types and different experiments, processed and normalized independently (but in a consistent manner).</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>The Zenodo archive of the source code specifies GPLv3 as the license. This is great, but it's ideal to also add a LICENSE file to the GitHub.</italic>
                </p>
                <p> We have added the LICENSE file to GitHub (GPL 3.0).</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>It looks like there are at least two potential places where bug reports should be filed: on Bioconductor Support and GitHub Issues. It would be nice to clarify the preferred location for filing bug reports go and opening pull requests.</italic>
                </p>
                <p> We have added the preferred location for filing bug reports at the end of the "Introduction" section (GitHub), and in the DESCRIPTION file of the source code.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>Currently, the GitHub repository BgeeDB/BgeeDB_R mentioned in the manuscript is forked from wirawara/BgeeDB. I expect this may cause some confusion, as BgeeDB/BgeeDB_R should be the upstream repository that users fork and contribute back to. If you make wirawara/BgeeDB private, this should break the relationship. @wirawara can then fork BgeeDB/BgeeDB_R to continue contributions if desired.</italic>
                </p>
                <p> We thank the reviewer for the suggestion, we have now made wirawara/BgeeDB private.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>Finally, I created some GitHub issues as part of this review:</italic>
                </p>
                <p>
                    <italic> &#x00a0;Sample annotation variable names</italic>
                </p>
                <p>
                    <italic> 
                        <ext-link ext-link-type="uri" xlink:href="https://github.com/BgeeDB/BgeeDB_R/issues/5">https://github.com/BgeeDB/BgeeDB_R/issues/5</ext-link>
                    </italic>
                </p>
                <p> We have replied on the issue. Our answer was that is a bit of a controversial topic. For example Google's R Style Guide (https://google.github.io/styleguide/Rguide.xml#identifiers) advise against the use of underscores (although they do not justify why, and we agree that the "words separated with dots" convention can be disturbing for python users).</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>&#x00a0;A less messy default download directory</italic>
                </p>
                <p> This point was discussed in https://github.com/BgeeDB/BgeeDB_R/issues/4. We notably mention that another directory can be specified by using the "pathToData" argument. This parameter is mentioned at the end of the section "Data download and import of normalized expression levels". We agree that a default directory should be used in future releases.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17925">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10748.r17925</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Sharma</surname>
                        <given-names>Virag</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17925a1">1</xref>
                    <xref ref-type="aff" rid="r17925a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r17925a1">
                    <label>1</label>Max Planck Institute of Molecular Cell Biology and Genetics&#x00a0;(MPI-CBG), Dresden, Germany</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>7</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Sharma V</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17925" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9973.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In the manuscript, Komljenovic et al. present BgeeDB which is an R package for retrieval of expression datasets which have been curated. Additionally, they also provide a method (TopAnat) to determine tissue-specific enrichments for a given list of genes and species.</p>
            <p> </p>
            <p> The former is a very useful resource because there is clearly a need for a database that provides gene expression datasets which are homogenous in nature and are of comparable quality. The BgeeDB database contains gene expression datasets from 17 species across different tissues and developmental stages, which is impressive. The fact that the database can be queried via a Bioconductor package should ensure that the database will be used &#x00a0;by both - wet-lab biologists and computational scientists.</p>
            <p> Similarly, the TopAnat method also provides a useful functionality to determine anatomical expression enrichment on a user specified list.</p>
            <p> </p>
            <p> I have a few minor comments regarding the manuscript: 
                <list list-type="order">
                    <list-item>
                        <p>The authors should include some details about how they have reprocessed the gene expression datasets that are a part of BgeeDB. At the moment, it is rather unclear how this was achieved. I assume that the authors have an automated pipeline in place but it would be beneficial for readers to know how this was done.</p>
                    </list-item>
                    <list-item>
                        <p>The authors state that &#x201c;TopAnat allows for discovery of tissues where a set of genes is preferentially expressed&#x201d;. Is TopAnat the only tool that offers such a functionality? A brief background of similar tools that are currently available will be useful for the readers.</p>
                    </list-item>
                    <list-item>
                        <p>I was not able to run the workflow that the authors have included in the Supplementary material:</p>
                        <p> </p>
                        <p> See below:&#x00a0;</p>
                        <p> </p>
                        <p> source("https://bioconductor.org/biocLite.R")</p>
                        <p> biocLite("BgeeDB")</p>
                        <p> biocLite(c("edgeR", "Mfuzz", "biomaRt"))</p>
                        <p> library(BgeeDB)</p>
                        <p> listBgeeSpecies()</p>
                        <p> </p>
                        <p> bgee_affymetrix &lt;- Bgee$new(species="Mus_musculus", dataType="affymetrix", release="13.2")</p>
                        <p> Error in envRefSetField(.Object, field, classDef, selfEnv, elements[[field]]) :</p>
                        <p> 'dataType' is not a field in class "Bgee"</p>
                        <p> </p>
                        <p> ## Turns out that I need to use "datatype" instead of "dataType"</p>
                        <p> bgee_affymetrix &lt;- Bgee$new(species="Mus_musculus", datatype="affymetrix") &#x00a0;</p>
                        <p> bgee_affymetrix &lt;- Bgee$new(species="Mus_musculus", datatype="affymetrix", release="13.2")</p>
                        <p> Error in envRefSetField(.Object, field, classDef, selfEnv, elements[[field]]) :</p>
                        <p> 'release' is not a field in class "Bgee"</p>
                        <p> </p>
                        <p> ####</p>
                        <p> </p>
                        <p> At this moment, I did not try further.</p>
                        <p> The authors need to clearly state what version of BgeeDB was used to create this workflow. If something has changed, then this needs to be appropriately addressed. I tried using &#x201c;release=13.2&#x201d; but it did not work.</p>
                    </list-item>
                    <list-item>
                        <p>I did manage to run an enrichment test for anatomical terms though with some tweaking</p>
                        <p> </p>
                        <p> ## Again an error message</p>
                        <p> bgee_topanat &lt;- loadTopAnatData(species="Danio_rerio")</p>
                        <p> Error in loadTopAnatData(species = "Danio_rerio") :</p>
                        <p> Problem: the specified speciesId is not among the list of species in Bgee.</p>
                        <p> </p>
                        <p> ## This works though</p>
                        <p> myTopAnatData &lt;- loadTopAnatData(species="7955")</p>
                        <p> </p>
                        <p> ####</p>
                        <p> </p>
                        <p> The rest of the work-flow went smoothly and I was able to get a list of anatomical structures sorted by their p-value</p>
                        <p> </p>
                        <p> head(tableOver)</p>
                        <p> &#x00a0; &#x00a0; &#x00a0; &#x00a0; organId &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;organName annotated significant</p>
                        <p> 12 UBERON:0004357 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;paired limb/fin bud 144 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;41</p>
                        <p> 2 &#x00a0;UBERON:0000151 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;pectoral fin 420 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;70</p>
                        <p> 22 UBERON:2000040 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;median fin fold 51 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;18</p>
                        <p> 9 &#x00a0;UBERON:0003051 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;ear vesicle 304 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;41</p>
                        <p> 15 UBERON:0005729 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;pectoral appendage field 16 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;10</p>
                        <p> 16 UBERON:0007390 pectoral appendage cartilage tissue 17 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;9</p>
                        <p> </p>
                        <p> &#x00a0;expected foldEnrichment &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;pValue &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;FDR</p>
                        <p> 12 &#x00a0;&#x00a0;&#x00a0;&#x00a0;7.15 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;5.734266 1.622480e-22 1.445630e-19</p>
                        <p> 2 &#x00a0;&#x00a0;&#x00a0;&#x00a0;20.85 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;3.357314 1.037552e-18 4.622296e-16</p>
                        <p> 22 &#x00a0;&#x00a0;&#x00a0;&#x00a0;2.53 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;7.114625 7.171001e-12 2.129787e-09</p>
                        <p> 9 &#x00a0;&#x00a0;&#x00a0;&#x00a0;15.09 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;2.717031 3.135769e-10 6.984926e-08</p>
                        <p> 15 &#x00a0;&#x00a0;&#x00a0;&#x00a0;0.79 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;12.658228 4.004917e-10 7.136762e-08</p>
                        <p> 16 &#x00a0;&#x00a0;&#x00a0;&#x00a0;0.84 &#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;10.714286 2.411891e-08 3.581659e-06</p>
                        <p> </p>
                        <p> </p>
                        <p> It would be useful if the authors could include a feature that allows the TopAnat method to print the 41 genes which represent the paired limb/fin bud. At some point, the users might want to revisit their gene lists and tag their genes based on the different anatomical structures.</p>
                        <p> </p>
                        <p> Other tools that perform Enrichment tests, for example Enrichr
                            <sup>
                                <xref ref-type="bibr" rid="rep-ref-17925-1">1</xref>
                            </sup>, have this feature and this is extremely useful, in my opinion.</p>
                    </list-item>
                </list>
            </p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-17925-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.</article-title>
                        <source>
                            <italic>Nucleic Acids Res</italic>
                        </source>.<year>2016</year>;<volume>44</volume>(<issue>W1</issue>) :
                        <elocation-id>10.1093/nar/gkw377</elocation-id>
                        <fpage>W90</fpage>-<lpage>7</lpage>
                        <pub-id pub-id-type="pmid">27141961</pub-id>
                        <pub-id pub-id-type="doi">10.1093/nar/gkw377</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment3777-17925">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Bastian</surname>
                            <given-names>Frederic</given-names>
                        </name>
                        <aff>Swiss Institute of Bioinformatics - University of Lausanne, Switzerland</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>6</month>
                    <year>2018</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <italic>&#x00a0;The authors should include some details about how they have reprocessed the gene expression datasets that are a part of BgeeDB. At the moment, it is rather unclear how this was achieved. I assume that the authors have an automated pipeline in place but it would be beneficial for readers to know how this was done.</italic>
                </p>
                <p> </p>
                <p> There is now a complete and updated documentation for the Bgee pipeline: https://github.com/BgeeDB/bgee_pipeline</p>
                <p> We have included this information in the manuscript, as well as a brief outline of the analyses we perform, see "Introduction" section.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>The authors state that &#x201c;TopAnat allows for discovery of tissues where a set of genes is preferentially expressed&#x201d;. Is TopAnat the only tool that offers such a functionality? A brief background of similar tools that are currently available will be useful for the readers.</italic>
                </p>
                <p> </p>
                <p> We have added a paragraph describing similar tools, see end of the "Introduction" section.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>I was not able to run the workflow that the authors have included in the Supplementary material:</italic>
                </p>
                <p>
                    <italic> [...]</italic>
                </p>
                <p>
                    <italic> &#x00a0;At this moment, I did not try further.</italic>
                </p>
                <p>
                    <italic> &#x00a0;The authors need to clearly state what version of BgeeDB was used to create this workflow. If something has changed, then this needs to be appropriately addressed. I tried using &#x201c;release=13.2&#x201d; but it did not work.</italic>
                </p>
                <p> </p>
                <p> We suspect that the reviewer did not use the latest version of the package (maybe the Bioconductor release itself needs to be updated first). The reviewer could maybe uninstall the BgeeDB package and rerun the following steps:</p>
                <p> source("https://bioconductor.org/biocLite.R")</p>
                <p> biocLite("BgeeDB")</p>
                <p> sessionInfo()</p>
                <p> With a package version &gt;= 2.6.2, the errors should disappear. The R, Bioconductor, and BgeeDB package version requirements are listed at the beginning of the "Methods" section.</p>
                <p> If the problem persists, could the reviewer post the sessionInfo() results?</p>
                <p> Of note, the &#x201c;release&#x201d; argument is used to specify a particular Bgee release, but this is independent of the package version.</p>
                <p> </p>
                <p> ---</p>
                <p> 
                    <italic>I did manage to run an enrichment test for anatomical terms though with some tweaking</italic>
                </p>
                <p>
                    <italic> &#x00a0;## Again an error message</italic>
                </p>
                <p>
                    <italic> &#x00a0;bgee_topanat &lt;- loadTopAnatData(species="Danio_rerio")</italic>
                </p>
                <p>
                    <italic> &#x00a0;Error in loadTopAnatData(species = "Danio_rerio") :</italic>
                </p>
                <p>
                    <italic> &#x00a0;Problem: the specified speciesId is not among the list of species in Bgee.</italic>
                </p>
                <p>
                    <italic> &#x00a0;## This works though</italic>
                </p>
                <p>
                    <italic> &#x00a0;myTopAnatData &lt;- loadTopAnatData(species="7955")</italic>
                </p>
                <p>
                    <italic> &#x00a0;####</italic>
                </p>
                <p> Again, this should be solved by updating to the last BgeeDB version</p>
                <p> </p>
                <p> ---</p>
                <p> &#x00a0;
                    <italic>The rest of the work-flow went smoothly and I was able to get a list of anatomical structures sorted by their p-value</italic>
                </p>
                <p>
                    <italic> [...]</italic>
                </p>
                <p>
                    <italic> &#x00a0;It would be useful if the authors could include a feature that allows the TopAnat method to print the 41 genes which represent the paired limb/fin bud. At some point, the users might want to revisit their gene lists and tag their genes based on the different anatomical structures. Other tools that perform Enrichment tests, for example Enrichr, have this feature and this is extremely useful, in my opinion.</italic>
                </p>
                <p> This is a good point. It is possible to cross the 
                    <italic>geneList</italic> vector with the expression mapping present in the 
                    <italic>myTopAnatData</italic> object. Another approach is to use functions that are inherited from the 
                    <italic>topGO</italic> package. For the &#x201c;paired limb/fin bud&#x201d; term:</p>
                <p> myTerm &lt;- "UBERON:0004357"</p>
                <p> termStat(myTopAnatObject, myTerm)</p>
                <p> # 198 genes mapped to this term for Bgee 14.0 and Ensembl 84</p>
                <p> genesInTerm(myTopAnatObject, myTerm)</p>
                <p> # 48 significant genes mapped to this term for Bgee 14.0 and Ensembl 84</p>
                <p> annotated &lt;- genesInTerm(myTopAnatObject, myTerm)[["UBERON:0004357"]]</p>
                <p> annotated[annotated %in% sigGenes(myTopAnatObject)]</p>
                <p> We have added this example at the end of the "Anatomical expression enrichment analysis" section.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
