<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="methods-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.18061.4</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Method Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Introducing high school students to the Gene Ontology classification system</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 4; peer review: 3 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Dedhia</surname>
                        <given-names>Mehek</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Kohetuk</surname>
                        <given-names>Kenneth</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1503-5735</uri>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Crusio</surname>
                        <given-names>Wim E.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-6638-202X</uri>
                    <xref ref-type="aff" rid="a3">3</xref>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Delprato</surname>
                        <given-names>Anna</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-6134-2560</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>BioScience Project, Wakefield, MA, 01880, USA</aff>
                <aff id="a2">
                    <label>2</label>Saint Dominic Savio Catholic High School, Austin, TX, 78717, USA</aff>
                <aff id="a3">
                    <label>3</label>Institut de Neurosciences Cognitives et Int&#x00e9;gratives d'Aquitaine (UMR 5287), Pessac, France</aff>
                <aff id="a4">
                    <label>4</label>University of Bordeaux (UMR 5287), Pessac, France</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:adelprato@bioscienceproject.org">adelprato@bioscienceproject.org</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>6</day>
                <month>8</month>
                <year>2019</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2019</year>
            </pub-date>
            <volume>8</volume>
            <elocation-id>241</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>31</day>
                    <month>7</month>
                    <year>2019</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Dedhia M et al.</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/8-241/pdf"/>
            <abstract>
                <p>We present a tutorial that introduces high school students to the Gene Ontology classification system which is widely used in genomics and systems biology studies to characterize large sets of genes based on functional and structural information. This classification system is a valuable and standardized method used to identify genes that act in similar processes and pathways and also provides insight into the overall architecture and distribution of genes and gene families associated with a particular tissue or disease. By means of this tutorial, students learn how the classification system works through analyzing a gene set using DAVID the Database for Annotation, Visualization and Integrated Discovery that incorporates the Gene Ontology system into its suite of analysis tools. This method of analyzing genes is used by our high school student interns to categorize gene expression data related to behavioral neuroscience. Students will get a feel for working with genes and gene sets, acquire vocabulary, obtain an understanding of how a database is structured and gain an awareness of the vast amount of information that is known about genes as well as the online analysis tools to manage this information that is nowadays available. Based on survey responses, students intellectually benefit from learning about the Gene Ontology System and using the DAVID tools, they are better prepared for future database use and they also find it enjoyable.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>gene ontology</kwd>
                <kwd>high school students</kwd>
                <kwd>genomics</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 3</title>
                <p>In this 4th version of the manuscript, we have added an optional exercise for a gene enrichment analysis with a step by step instructional video and a corresponding figure. There are also changes to the introduction which have been added for clarification based upon reviewer comments.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Genomics is the branch of biology concerned with the study of genes and their functions (see the 
                <ext-link ext-link-type="uri" xlink:href="https://www.genome.gov/19016904/">National Institutes of Health Frequently Asked Questions about Genetic and Genomic Science</ext-link>). Genomics arose from the acceleration of genetic research which was fueled by the development of rapid and affordable DNA sequencing technologies (
                <xref ref-type="bibr" rid="ref-8">Shendure 
                    <italic toggle="yes">et al.</italic>, 2017</xref>). This opened the door to the sequencing of entire genomes. Presently, the DNA codes for thousands of genomes from diverse species have been sequenced and studied (see 
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/genome/browse/#!/overview/">the National Center for Biotechnology Information Genome database</ext-link>).</p>
            <p>The goals in genomics research are to address all genes and their inter-relationships in order to understand the combined influence on the function of an organism. With this newfound knowledge of the staggering number of genes that make up an organism, the 
                <ext-link ext-link-type="uri" xlink:href="http://www.geneontology.org/">Gene Ontology</ext-link> (GO) classification system was created by the Gene Ontology Consortium to organize genes by their similarities and differences (see 
                <ext-link ext-link-type="uri" xlink:href="http://www.geneontology.org/page/about">Gene Ontology Consortium &#x2018;About&#x2019; page</ext-link>). &#x201c;Ontology&#x201d; is not a commonly encountered term and there are several definitions that are related to philosophical concepts.</p>
            <p>In the context of information science, as described here, &#x201c;ontology&#x201d; is concerned with the representation, formal naming and classification system with the purpose of describing the relationship categories and properties of the data. This is similar to Wikipedia which is also based on a controlled vocabulary, categories to group material by like subject matter, and parent-child terms.</p>
            <p>The Gene Ontology information is curated, collected, validated, and annotated by the Gene Ontology Consortium in collaboration with their partners which consist of research groups, research communities and other databases (see 
                <ext-link ext-link-type="uri" xlink:href="http://geneontology.org/docs/go-consortium/">http://geneontology.org/docs/go-consortium/</ext-link>
            </p>
            <p>This GO classification system provides the scientific community with a structured vocabulary for defining genes (
                <xref ref-type="bibr" rid="ref-1">Ashburner 
                    <italic toggle="yes">et al.</italic>, 2000</xref>; 
                <xref ref-type="bibr" rid="ref-5">du Plessis 
                    <italic toggle="yes">et al</italic>., 2011</xref>; 
                <xref ref-type="bibr" rid="ref-7">Hastings, 2017</xref>; 
                <xref ref-type="bibr" rid="ref-9">Thomas, 2017</xref>). GO terms are commonly used in most, if not all, databases and analysis tools relevant to bioinformatics, systems biology (
                <xref ref-type="bibr" rid="ref-10">Wanjek, 2011</xref>), and genomics studies (
                <xref ref-type="bibr" rid="ref-5">du Plessis 
                    <italic toggle="yes">et al</italic>., 2011</xref>). GO terms are species specific and are updated monthly as biological knowledge is obtained (
                <xref ref-type="bibr" rid="ref-6">Gaudet 
                    <italic toggle="yes">et al</italic>., 2017</xref>). GO terms describe how a gene functions at the molecular level, its location within the cell, and what biological programs it is involved with. Each GO annotation is associated with an evidence code that is comprised of six categories: experimental evidence, phylogenetic evidence, computational evidence, author statements, curatorial statements, and automatically generated annotations (
                <ext-link ext-link-type="uri" xlink:href="http://geneontology.org/docs/guide-go-evidence-codes/">http://geneontology.org/docs/guide-go-evidence-codes/</ext-link>)</p>
            <p>The importance of the GO term system becomes apparent when analyzing the organization of genomes and coding regions, the distribution of genes involved in specific processes and the conservation of genes across species (
                <xref ref-type="bibr" rid="ref-6">Gaudet 
                    <italic toggle="yes">et al</italic>., 2017</xref>; 
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5821137/">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5821137/</ext-link>). This classification system is also quite powerful when analyzing data from large scale gene expression studies (
                <xref ref-type="bibr" rid="ref-5">du Plessis 
                    <italic toggle="yes">et al</italic>., 2011</xref>) that consider co-expression data from specific tissues obtained under defined circumstances such as treatment with pharmaceutical agents, or with neurodevelopmental disorders, cancer, or diabetes as examples. GO terms are instrumental for understanding the functions of these genes.</p>
            <p>Introducing GO terms and the gene classification system to high school students will bring them up to speed on a commonly used research tool in current genomics methods and expose them to the vast amounts of data that have been derived from genomics and systems biology studies.</p>
            <p>In the subsequent sections we show an example of how to extract information about a gene from its associated GO terms and then provide instruction for a practical exercise which will enable students to profile a list of genes using GO terms in the bioinformatics resource 
                <ext-link ext-link-type="uri" xlink:href="https://david.ncifcrf.gov/">DAVID</ext-link>, The 
                <underline>D</underline>atabase for 
                <underline>A</underline>nnotation, 
                <underline>V</underline>isualization and 
                <underline>I</underline>ntegrated 
                <underline>D</underline>iscovery. This is a protocol that we teach to our high school student interns when they are evaluating gene expression data for their summer projects (
                <xref ref-type="bibr" rid="ref-3">Crusio 
                    <italic toggle="yes">et al</italic>., 2017</xref>, see 
                <ext-link ext-link-type="uri" xlink:href="http://www.bioscienceproject.org/student-posters">BioScience Project student posters</ext-link>). The student research internship projects are in the context of behavioral neuroscience. Students typically work with gene expression data associated with a  specific brain region or brain disorder. As an example for projects related to learning and memory, gene expression data for the hippocampus would be used. Concerning a neurodevelopmental disorder like Schizophrenia, gene expression data for the prefrontal cortex would be considered.  There are many online databases that have freely available gene expression data and this could be a way to expand the scope of this tutorial. We use the Allen Brain Atlas for our primary source of gene expression data in the student internship projects.</p>
        </sec>
        <sec sec-type="methods">
            <title>Procedure</title>
            <sec>
                <title>Overview of gene function with GO</title>
                <p>The overall structure of GO is hierarchical and is based on parent-child terms where the parent term is broader and child term is more specialized.</p>
                <p>GO terms group genes according to 3 categories, each of which are considered a distinct ontology: Molecular Function (MF, molecular-level activities performed by gene products), Biological Process (BP, the larger processes, or biological programs accomplished by multiple molecular activities), and Cellular Component (CC, the locations relative to cellular structures in which a gene product performs a function).</p>
                <p>As an example, consider the GO term classification for the 
                    <italic toggle="yes">RAB5A</italic> gene (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>). 
                    <italic toggle="yes">RAB5A</italic> belongs to a family of genes called RAB GTPases that are key regulators of intracellular membrane trafficking. Rabs are involved in the formation of transport vesicles and their fusion with membranes. They are enzymes and mediate their function by cycling between a GDP bound inactive and a GTP bound active state. Because of their fundamental and ubiquitous role, this family of genes are associated with many biological processes and diseases.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>DAVID output for the 
                            <italic toggle="yes">RAB5A</italic> gene.</title>
                        <p>Screenshot of the DAVID results for the 
                            <italic toggle="yes">RAB5A</italic> gene Top left (blue bar): Gene Symbol identifier. Center (blue bar): full gene name. Labels (left): GO Term BP, GO Term CC, and GO Term MF descriptors. Note that these descriptors are clickable.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure1.gif"/>
                </fig>
                <p>The GO term classification for the 
                    <italic toggle="yes">RAB5A</italic> gene gives:</p>
                <p>

                    <bold>GOTERM_BP</bold>: endocytosis, phagocytosis, small GTPase mediated signal transduction, blood coagulation, protein transport, regulation of endocytosis, synaptic vesicle recycling, viral RNA genome replication, early endosome to late endosome transport, positive regulation of exocytosis, regulation of endosome size, regulation of filopodium assembly, receptor internalization involved in canonical Wnt signaling pathway, regulation of synaptic vesicle exocytosis, regulation of autophagosome assembly</p>
                <p>

                    <bold>GOTERM_CC</bold>: ruffle, intracellular, cytoplasm, endosome, early endosome, cytosol, plasma membrane, synaptic vesicle, endosome membrane, actin cytoskeleton, endocytic vesicle, axon, dendrite, phagocytic vesicle membrane, somatodendritic compartment, melanosome, neuronal cell body, terminal bouton, axon terminus, membrane raft, phagocytic vesicle, extracellular exosome, cytoplasmic side of early endosome membrane.</p>
                <p>

                    <bold>GOTERM_MF</bold>: GTPase activity, protein binding, GTP binding, GDP binding</p>
                <p>From the 
                    <italic toggle="yes">RAB5A</italic> related GO terms, we get the overall impression that this gene encodes an enzyme that is involved in signaling, transport and vesicle dynamics and is associated with cell membranes. How do we arrive at this description?</p>
                <p>In this example, the information obtained from the 
                    <bold>MF</bold> category is that the protein product of the 
                    <italic toggle="yes">RAB5A</italic> gene binds to guanine nucleotides: GTP and GDP (Guanosine tri and di phosphate, respectively) and that it is an enzyme. This is evident by the &#x201c;GTPase activity&#x201d; term. Whenever the suffix &#x201c;ase&#x201d; is used in the context of a gene or protein, it refers to an enzyme, something that catalyzes a chemical reaction. For the 
                    <bold>BP</bold> category, there are several terms associated with intracellular transport, signaling, and endocytosis. Finally, the terms associated with 
                    <bold>CC</bold> include endosome and endosome-like organelles (melanosomes, synaptic vesicles, phagocytic vesicles), as well as membrane structures (ruffles, rafts).</p>
            </sec>
            <sec>
                <title>Gene Analysis in DAVID</title>
                <p>DAVID is a database with a suite of analysis tools that groups genes based on different criteria related to GO terms. DAVID also links to other databases that contain primary source information like The Gene Ontology as well as complementary information related to pathways and human disease. In this exercise, students will use the sample gene lists (DEMOLIST1 or DEMOLIST2) that are accessible from the DAVID database to see how the Gene Ontology classification partitions a set of genes based on GO Terms. Screenshots and videos are provided for step by step instruction. We also provide a video to instruct students on analyzing a gene list in DAVID obtained from a random gene list generator.</p>
            </sec>
            <sec>
                <title>Protocol</title>
                <p>Screenshot 1 (
                    <xref ref-type="fig" rid="f2">Figure 2</xref>). DAVID landing page. The start analysis link is accessed here and is circled in red in this image. (Video 1, 
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>)</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>DAVID landing page.</title>
                        <p>Screenshot of the DAVID landing page containing a brief description of the site and links for available tools. The &#x201c;start analysis&#x201d; button is circled in red and is located at the top left side of the page. This is the first step in submitting a geneset for analysis with GO Terms in DAVID.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure2.gif"/>
                </fig>
                <p>Screenshot 2 (
                    <xref ref-type="fig" rid="f3">Figure 3</xref>). Submitting a gene list. Select either DEMOLIST 1 or DEMOLIST 2 (left panel). The identifier will come up automatically because this is a demonstration list. If you are submitting your own gene list then, the identifier will have to be specified from the dropdown menu (Video 2, 
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>). Typically the identifier is the &#x201c;Official Gene Symbol&#x201d;. Click &#x201c;Gene List&#x201d;, then &#x201c;Submit List&#x201d; (Video 1; (
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>)).</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>Submitting a geneset.</title>
                        <p>Screenshot of the page where a geneset can be submitted &#x201c;Step1: Enter Gene List&#x201d;, there are options to copy and paste a geneset or upload a file. There is also an option to use either of two sample lists provided by the DAVID site. Under the pull down menu, &#x201c;Step 2 Select Identifier&#x201d;, there are many types of identification designations for the same gene. For the sample lists, the identifier and species will be recognized automatically. When submitting a gene list from the geneset generator, the identifier is &#x201c;Official Gene Symbol&#x201d; which is used in most cases. If the identifier that you choose does not match the identifiers in the submitted gene set, you will receive a message stating this and an option to convert to the correct identifier.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure3.gif"/>
                </fig>
                <p>Screenshot 3 (
                    <xref ref-type="fig" rid="f4">Figure 4</xref>). Species selection. You will see a notice: &#x201c;Multiple Species, have been Detected&#x201d;, Highlight &#x201c;Homo Sapiens&#x201d; in the window, Select &#x201c;Homo Sapiens&#x201d; below the window (Example - DEMOLIST 1: 149 genes, highlighted in grey, left panel). Next, you will see the message &#x201c;Submission Successful&#x201d; (Video 1; 
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>).</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Species selection.</title>
                        <p>Screenshot of a successfully submitted gene set. Here it is necessary to select the species. Again, note that in this instance for the sample genesets provided by DAVID, &#x201c;Homo sapiens&#x201d; is already highlighted along with the number of genes that are recognized by the site and for which information is available. Once the species is highlighted, it is necessary to click the &#x201c;Select Species&#x201d; button in order to limit the output to just the desired species.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure4.gif"/>
                </fig>
                <p>Screenshot 4 (
                    <xref ref-type="fig" rid="f5">Figure 5</xref>). Obtaining the results. Select &#x201c;Functional Annotation Tool&#x201d;, beneath the blue arrow. Next, select &#x201c;Functional Annotation Table&#x201d;, Bottom of the page (Video 1, 
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>).</p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>Figure 5. </label>
                    <caption>
                        <title>Obtaining the results.</title>
                        <p>Screenshot of the different types of analysis options provided by DAVID for a given geneset. For the purpose of this tutorial, the relevant output is the &#x201c;Functional Annotation Table&#x201d; located at the bottom of the page.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure5.gif"/>
                </fig>
                <p>Screenshot 5 (
                    <xref ref-type="fig" rid="f6">Figure 6</xref>). Reading the output. The gene ID and the full gene name are shown in the blue bars above each entry. The GO Term BP (Biological Process), GO Term CC (Cellular Component), and GO Term MF (Molecular Function), terms are clickable descriptors and link to the Gene Ontology website. See above for a complete description of the GO categories (Video 1, 
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>).</p>
                <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                    <label>Figure 6. </label>
                    <caption>
                        <title>Interpreting the output.</title>
                        <p>Screenshot of the DAVID &#x201c;Functional Annotation Table&#x201d; results for a geneset Top left: (blue bar): Gene Symbol identifier. Center (blue bar): full gene name. Labels (left): GO Term BP, GO Term CC, and GO Term MF descriptors. Note that these descriptors are clickable.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure6.gif"/>
                </fig>
                <p>Screenshot 6 (
                    <xref ref-type="fig" rid="f7">Figure 7</xref>). Keyword search. When selecting terms for a keyword search, a more complete outcome is achieved if just a few letters are specified. For example, -&#x201d;neur&#x201d; will capture terms both starting with neuro and neural (Video 1, 
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>). DAVID output can be searched for genes related to other process and diseases as well. Have students evaluate the gene list based on their interest. They can identify genes related to a particular process. Students may work individually or in groups.</p>
                <fig fig-type="figure" id="f7" orientation="portrait" position="float">
                    <label>Figure 7. </label>
                    <caption>
                        <title>Keyword search.</title>
                        <p>Screenshot of a keyword search of the DAVID output to identify genes that are relevant to a given biological function. In this case the search term is &#x201c;neur&#x201d; (highlighted in yellow), to identify genes related to neurological processes. The keyword search is done via the general search function on your computer. For larger gene sets, a computer program may be used.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure7.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Optional exercise</title>
                <p>Students may wish to try this with their own gene lists. This 
                    <ext-link ext-link-type="uri" xlink:href="http://www.molbiotools.com/randomgenesetgenerator.html">online gene list generator</ext-link> will enable students to generate a random list of genes for evaluation. (See also Video 2 for instruction; 
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al.</italic>, 2019c</xref>)</p>
            </sec>
            <sec>
                <title>Protocol</title>
                <p>Step 1. Specify species: Human is the default</p>
                <p>Step 2. Specify list length: 200&#x2013;500 is a good representative number. Note that DAVID will not evaluate lists with more than 2000 genes. An error message stating this will be received.</p>
                <p>Step 3. Select &#x201c;Generate&#x201d;</p>
                <p>Step 4. Copy the gene list using the &#x201c;Select All&#x201d; option and paste the list directly into DAVID for evaluation as described above. Make sure to select &#x201c;Official Gene Symbol&#x201d; as the identifier when submitting the gene list.</p>
            </sec>
            <sec>
                <title>Optional exercise 2</title>
                <p>Students may perform a gene enrichment analysis exercise based on GO annotation of their gene list directly at the Gene Ontology site. (Video 3). The results can be viewed as a pie chart which makes for a great visual representation of the GO categories (
                    <xref ref-type="fig" rid="f8">Figure 8</xref>). Clicking on the individual sections of the pie will result in more inclusive and specific annotation. Clicking on the legends will redirect to a table that contains the gene names associated with a specific GO category (For a more in depth explanation of the results page,  see 
                    <ext-link ext-link-type="uri" xlink:href="http://geneontology.org/docs/go-enrichment-analysis/">http://geneontology.org/docs/go-enrichment-analysis/</ext-link>).</p>
                <fig fig-type="figure" id="f8" orientation="portrait" position="float">
                    <label>Figure 8. </label>
                    <caption>
                        <title>Gene enrichment.</title>
                        <p>Screenshot of a pie chart generated at the Gene Ontology website showing a gene enrichment analysis for a sample gene list along with the associated category legends. The enrichment analysis finds GO terms that are over-represented (or under-represented) for a gene set. In this example, the annotation for Biological Process is shown for 72 genes.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/22130/cd0a06b1-7618-40b0-a3d9-814894350d97_figure8.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Protocol</title>
                <p>Step 1. Paste a gene list in the window of the Gene Ontology landing page (
                    <ext-link ext-link-type="uri" xlink:href="http://geneontology.org/">http://geneontology.org/</ext-link>).</p>
                <p>Step 2. Specify the ontology (MF, CC, BP) that you would like to analyze from the drop down menu. This can be specified later as well on the results page. (Biological Process is the default).</p>
                <p>Step 3. Select species (
                    <italic toggle="yes">Homo sapiens</italic> is the default).</p>
                <p>Step 4. Select &#x201c;Launch&#x201d;.</p>
                <p>Step 5: Select the clickable number that represents your list of genes relative to the reference list to which is compared.</p>
                <p>Step 6. Click on the pie chart above the table and specify the desired ontology from the drop down menu. Pie chart personalization and options for extended information are described in the text above the pie chart. Pie chart and legends may be saved as screenshots.</p>
            </sec>
            <sec>
                <title>Learning assessment</title>
                <p>We polled 12 student interns from our summer program for feedback on their experience working with the GO system and DAVID. The survey consists of 5 direct yes or no response questions and two open ended questions. The responses are show in 
                    <xref ref-type="table" rid="T1">Table 1</xref>, 
                    <xref ref-type="table" rid="T2">Table 2</xref>, and 
                    <xref ref-type="table" rid="T3">Table 3</xref>.</p>
                <table-wrap id="T1" orientation="portrait" position="anchor">
                    <label>Table 1. </label>
                    <caption>
                        <title>DAVID and GO Student Survey.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="1" rowspan="1">Questions</th>
                                <th align="center" colspan="3" rowspan="1">Responses</th>
                            </tr>
                            <tr>
                                <th colspan="1" rowspan="1"/>
                                <th align="center" colspan="1" rowspan="1">Yes</th>
                                <th align="center" colspan="1" rowspan="1">No</th>
                                <th align="center" colspan="1" rowspan="1">Other</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">1. Have you ever heard of or did you have any experience working with the DAVID database prior to the summer
                                    <break/>internship with BioScience Project?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">2. Have you ever heard of the Gene Ontology Classification (GO Terms) System prior to the summer internship with
                                    <break/>BioScience Project?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">10</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">3. Did you benefit intellectually from working with the DAVID tools and learning about the Gene Ontology System?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">4. Did you enjoy working with the DAVID tools and learning about gene analysis with the Gene Ontology System?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">5. If applicable, are you better able to navigate other genomic databases as a result of working in DAVID?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <table-wrap id="T2" orientation="portrait" position="anchor">
                    <label>Table 2. </label>
                    <caption>
                        <title>Student Short Answer Responses to question 3a.</title>
                        <p>(Question 3: Did you benefit intellectually from working with the DAVID tools and learning about the Gene Ontology System?) 3a: If yes, please explain how?.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Through this internship using DAVID, I was exposed for the first time to the field of bioinformatics and the gene ontology system. I
                                    <break/>learned more about the role genes play in neurological disorders in humans. I also got a glimpse of what biomedical researchers and
                                    <break/>neuroscientists actually do and the type of resources that they work with.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I learned about current methods used by researchers and was able to apply those techniques to my own project. The exposure and
                                    <break/>experience I gained with bioinformatics gave me a better understanding of gene interactions and how researchers analyze them. The
                                    <break/>tools I used during the internship furthered my understanding of gene profiling and provided me with insight on the importance of
                                    <break/>research.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Rather than just learning about genetics as I had in biology classes in the past, I had the opportunity to partake in hands-on learning with
                                    <break/>the DAVID tools. Through using DAVID and the Gene Ontology system, I gained a broader understanding of the biological functions of
                                    <break/>genes by seeing and being exposed to such a large variety. Being able to apply what I learned from DAVID and the results I gathered to
                                    <break/>my own independent project helped to further my understanding of genetics while exposing me to the field of bioinformatics.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I learned different ways genes are classified and analyzed (kegg pathway, etc.). I can use this database for future research projects to get
                                    <break/>a sense of different pathways these genes are involved in.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I learnt a lot about genomics and how interconnected the different genes in our body are. It also helped me work on my project on
                                    <break/>Alzheimer&#x2019;s and the APOE 4 allele.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I was able to learn more about PTSD and identifying candidate genes for PTSD. I performed this project with DAVID and learned a lot
                                    <break/>about the Gene Ontology System.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">The DAVID tools helped me cluster annotation terms based on keywords associated with neurological disorders. Also, the KEGG Pathway
                                    <break/>map allowed me to visualize genes and how they interact.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">This was the first time that I worked with either of these tools and I think I benefited mostly because I got an idea of what working in the
                                    <break/>field of Biology/Neuroscience would be like.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I was able to learn more about the GO System, different databases such as the Allen Brain Atlas and StringDB, which I used in
                                    <break/>conjunction with DAVID and learned about how DAVID analyzes the data.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I found that working with the DAVID tools and Gene Ontology System I was exposed to a real world experience in science that gave
                                    <break/>me a better understanding of where we are in research now, and what is still to be done. The benefits I found in myself were a capital in
                                    <break/>scientific nomenclature, new skills in analysis of data, and a wholesome exposure into the field of genomics research.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I learned a lot about gene databases and how to look at specific sets of data while ignoring information that may be necessary.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I felt like working with the DAVID tools exposed me to an area of research that I wasn&#x2019;t previously familiar with in an engaging and
                                    <break/>fascinating way. Learning how to find specific information about gene interactions/gene functions through an online database is a
                                    <break/>valuable tool that I believe will benefit me in my future research.</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <table-wrap id="T3" orientation="portrait" position="anchor">
                    <label>Table 3. </label>
                    <caption>
                        <title>Student Short Answer Responses to question 4a.</title>
                        <p>(Question 4. Did you enjoy working with the DAVID tools and learning about gene analysis/profiling with the Gene Ontology System?) 4a:  If yes, please explain why?.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Working with DAVID and the other databases gave me a chance to learn about the field of bioinformatics and genetics, beyond our
                                    <break/>school curriculum. It was extremely interesting to learn about the hundreds of genes in the human genome, their various functions both
                                    <break/>at the molecular and biological level and how they affect the neurological characteristics of human beings. The databases were also very
                                    <break/>interactive and allowed me to explore the other parts of the database myself.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Although it was initially difficult to navigate the DAVID tools, I found the experience rewarding in the end. The process became easier as I
                                    <break/>persisted in using the database, and I enjoyed being able to explore the realm of gene profiling.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I enjoyed working with the DAVID tools and learning about gene profiling because from a biological point of view, DAVID is very good at
                                    <break/>finding relevant information, like other correlates, to the keyword I'm looking for.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I felt like what I was learning about through DAVID and the Gene Ontology system was applicable to my project and I was able to utilize
                                    <break/>and apply my knowledge of these tools effectively, making me more excited to use it.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I gained insight into gene analysis and gained knowledge that I can use in other situations.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I got practical hands-on experience working with a scientific database which was quite different from the textbook learning taught in
                                    <break/>schools. I found this quite refreshing and enjoyable.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Although it was pretty confusing for me at the time, I think what made it enjoyable was that I found using these tools interesting especially
                                    <break/>since this was all new to me! Also it was not too difficult or overwhelming since the provided instructions for the internship walked me
                                    <break/>through each step. I&#x2019;m actually now studying Neuroscience at BU and I&#x2019;m actually hoping to get back to relearning how to use these tools
                                    <break/>again now that I have a better understanding.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I enjoyed working with the DAVID tools and learning about gene profiling with the GO System because it was very interesting to see how
                                    <break/>different genes were connected to each other and how far reaching the effects of certain genes are.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">Because the breadth and depth of information felt like a million different rabbit holes that I could fall into and learn something new from.
                                    <break/>However, these tools required navigational help and direction from the supervisor and fellow interns for me to truly reach this point of
                                    <break/>knowing how to immerse myself in it out of mere curiosity, because of how complex felt at first, and I still have so much more to learn, but
                                    <break/>overall enjoyed working with these tools once I was comfortable with them.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">It was a really interesting and valuable experience to have, and I feel like I learned a lot about how different genes may be connected to
                                    <break/>each other and what is important to consider and look for in gene profiling.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I thought it was interesting to be able to visualize some of the molecular pathways through the diagrams provided. In completing my
                                    <break/>research, it was helpful to have all of the biological processes and molecular functions of certain genes all in one place</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1">I really enjoyed working on a topic that was interesting to me. I could learn about PTSD while learning more about biology and gene
                                    <break/>profiling.</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>For the direct response questions, 100% of the students had not worked or heard of the DAVID database (Ques1, 
                    <xref ref-type="table" rid="T1">Table 1</xref>.) 83% of the students had not heard of the GO system. Two students had heard of the GO system in an advanced placement biology class but had not explored it further (Ques2, 
                    <xref ref-type="table" rid="T1">Table1</xref>). 100% of the students responded that they benefited intellectually from working with the GO system and DAVID tools (Ques3, 
                    <xref ref-type="table" rid="T1">Table1</xref>) and also that they had enjoyed the experience (Ques 4, 
                    <xref ref-type="table" rid="T1">Table 1</xref>). 92% of the students thought that the experience would better prepare them for future database use. One student indicated that this was not applicable (Quest 5, 
                    <xref ref-type="table" rid="T1">Table 1</xref>).</p>
                <p>For the open ended survey questions 3a and 4a (
                    <xref ref-type="table" rid="T2">Table 2</xref> and 
                    <xref ref-type="table" rid="T3">Table 3</xref>), students were asked to explain how they benefited from working with the databases and tools and what about the experience they had enjoyed. A two step process was used to analyze their answers. First, the responses to each question were grouped and an online text analyzer 
                    <ext-link ext-link-type="uri" xlink:href="http://textalyser.net/">text analyzer</ext-link> was used to assess the words occurring with the highest frequency (Workbooks 1 and 2) (
                    <xref ref-type="bibr" rid="ref-11">Delprato 
                        <italic toggle="yes">et al</italic>., 2019a</xref>; 
                    <xref ref-type="bibr" rid="ref-12">Delprato 
                        <italic toggle="yes">et al</italic>., 2019b</xref>). Words of 2 or fewer characters were not considered in the analysis. In a subsequent step, a spreadsheet for coding open ended survey questions was used to organize the results from the text analysis (Workbooks 1 and 2).</p>
                <p>Response category words chosen represent replies to the question asked. Words containing the same root such as learn, learned, learnt, learning were grouped. The response categories selected for question 3a that may provide insight into why students believe they have benefited intellectually are: expose (5), analyze (4) understand (3), help (3), experience (2), and gained (2) (Workbook 1).</p>
                <p>For question 4a, the response categories chosen which may provide insight into why students enjoyed the experience are: interesting (6), gained (2), new (2) and explore (2). Other adjectives used in single responses were: interactive, rewarding, and refreshing (Workbook 2).</p>
            </sec>
            <sec>
                <title>Entry and exit tickets</title>
                <p>A basic entry and exit ticket method is suggested to determine what students know about genomics and genes before the lesson as well as what they have learned: main points, questions they may have and what they found most interesting. Sample questions are provided in what follows.</p>
            </sec>
            <sec>
                <title>Entry ticket questions and answers</title>
                <list list-type="bullet">
                    <list-item>
                        <label>1. </label>
                        <p>What is a gene?</p>
                    </list-item>
                </list>
                <p>

                    <italic toggle="yes">A sequence of DNA or RNA which codes for a molecule that has a function.</italic> A gene is the basic physical and functional unit of heredity</p>
                <list list-type="bullet">
                    <list-item>
                        <label>2. </label>
                        <p>What is genomics?</p>
                    </list-item>
                </list>
                <p>

                    <italic toggle="yes">Study of the full set of the genes and DNA in an organism</italic>
</p>
                <list list-type="bullet">
                    <list-item>
                        <label>3. </label>
                        <p>How many protein coding genes does a human have?</p>
                    </list-item>
                </list>
                <p>

                    <italic toggle="yes">~20,000</italic>
</p>
                <list list-type="bullet">
                    <list-item>
                        <label>4. </label>
                        <p>Do humans all have the same genes?</p>
                    </list-item>
                </list>
                <p>

                    <italic toggle="yes">Yes, but people have different alleles. Alleles are the variation of a gene resulting from mutations. As an example consider eye color. We all have the gene for eye color but some of us have brown, blue or green eyes and there are different shades and hues within those categories.</italic>
</p>
                <list list-type="bullet">
                    <list-item>
                        <label>5. </label>
                        <p>Do genes work together?</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022; </label>
                        <p>Yes</p>
                    </list-item>
                    <list-item>
                        <label>6. </label>
                        <p>If yes, provide an example</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022; </label>
                        <p>Transcription factors</p>
                    </list-item>
                    <list-item>
                        <label>7. </label>
                        <p>Have you ever worked with a biological database?</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022; </label>
                        <p>Subjective question</p>
                    </list-item>
                </list>
            </sec>
            <sec>
                <title>Exit ticket questions</title>
                <list list-type="bullet">
                    <list-item>
                        <label>1. </label>
                        <p>What were the main points of the lesson?</p>
                    </list-item>
                    <list-item>
                        <label>2. </label>
                        <p>Do you have any questions?</p>
                    </list-item>
                    <list-item>
                        <label>3. </label>
                        <p>What aspect of this lesson did you find most interesting?</p>
                    </list-item>
                    <list-item>
                        <label>4. </label>
                        <p>Make up a gene and describe it using Gene Ontology classifiers for the 3 categories Molecular Function (MF), Biological Process (BP), and Cellular Component CC)</p>
                    </list-item>
                    <list-item>
                        <label>5. </label>
                        <p>Why do some genes have many classifiers while others do not?</p>
                    </list-item>
                    <list-item>
                        <label>6. </label>
                        <p>For the Biological Process &#x2013; BP category, what are the classifiers based on, i.e., how are they derived?</p>
                    </list-item>
                    <list-item>
                        <label>7. </label>
                        <p>How do you think you could use this database in a high school research project?</p>
                    </list-item>
                </list>
            </sec>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusions</title>
            <p>We describe a procedure for students to become acquainted with the Gene Ontology classification system which is widely used in genomics and systems biology research to characterize gene function. Grouping genes with GO Terms and the DAVID database is based on a protocol that we use with our summer interns to profile gene expression data related to behavioral neuroscience studies (
                <xref ref-type="bibr" rid="ref-3">Crusio 
                    <italic toggle="yes">et al</italic>., 2017</xref>). Grouping genes in this way identifies genes that function in like processes and also provides information about the overall distribution of a set of genes associated with a particular tissue or process. This tutorial will familiarize early stage students with a biological database and teach them how to mine it and extract useful information from a sample list of genes.</p>
            <p>Survey response data from twelve students indicate that they believe they have benefited intellectually from this work and that they enjoy this type of learning experience. Based on coding of the open ended survey responses, the underlying reasons are because they have learned something beneficial and that they find it interesting. The majority of the students (92%) also state that as a result of this experience, they are better prepared for future database use. Entry and exit ticket questions designed to assess student prior and post knowledge as well as stimulate ideas on how these databases and tools may be used in a research project, are included as a formative assessment strategy.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <sec>
                <title>Underlying data</title>
                <p>Figshare: Workbook 1. Coding of student responses to question 3a. 
                    <ext-link ext-link-type="uri" xlink:href="https://dx.doi.org/10.6084/m9.figshare.8166611.v1">https://doi.org/10.6084/m9.figshare.8166611.v1</ext-link> (
                    <xref ref-type="bibr" rid="ref-11">Delprato 
                        <italic toggle="yes">et al</italic>., 2019a</xref>)</p>
                <p>This project contains the following underlying data:</p>
                <list list-type="bullet">
                    <list-item>
                        <p>Coding of student responses to question 3a.xlsx (Coding of responses to open ended student survey question 3a)</p>
                    </list-item>
                </list>
                <p>Figshare: Workbook 2. Coding of student responses to question 4a. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.8166650.v1">https://doi.org/10.6084/m9.figshare.8166650.v1</ext-link> (
                    <xref ref-type="bibr" rid="ref-12">Delprato 
                        <italic toggle="yes">et al</italic>., 2019b</xref>)</p>
                <p>This project contains the following underlying data:</p>
                <list list-type="bullet">
                    <list-item>
                        <p>Coding of student responses to question 4a.xlsx (Coding of responses to open ended student survey question 4a)</p>
                    </list-item>
                </list>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International license</ext-link> (CC-BY 4.0).</p>
            </sec>
            <sec>
                <title>Extended data</title>
                <p>Extended data is available from figshare</p>
                <p>Figshare: Extended data 1. Video 1: GeneSetProfiling Analysis Instructional video for using DAVID to obtain Gene Ontology classifiers for a sample geneset which is provided by the DAVID site 
                    <ext-link ext-link-type="uri" xlink:href="https://dx.doi.org/10.6084/m9.figshare.8166650.v1">https://doi.org/10.6084/m9.figshare.8166650.v1</ext-link> (
                    <xref ref-type="bibr" rid="ref-2">Delprato 
                        <italic toggle="yes">et al</italic>., 2019c</xref>)</p>
                <p>Figshare: Extended data 2. Video 2. UploadGeneSet Instructional video for generating a random geneset and submitting this geneset to DAVID for Gene Ontology classification, 
                    <ext-link ext-link-type="uri" xlink:href="https://dx.doi.org/10.6084/m9.figshare.7649231.v1">https://doi.org/10.6084/m9.figshare.7649231.v1</ext-link> (
                    <xref ref-type="bibr" rid="ref-4">Delprato 
                        <italic toggle="yes">et al</italic>., 2019d</xref>)</p>
                <p>Figshare: Extended data 3. Video 3. GeneEnrichment Instructional video for performing a gene enrichment exercise at the Gene Ontology site (
                    <ext-link ext-link-type="uri" xlink:href="http://geneontology.org/">http://geneontology.org/</ext-link>) 
                    <ext-link ext-link-type="uri" xlink:href="https://dx.doi.org/10.6084/m9.figshare.9172121">https://doi.org/10.6084/m9.figshare.9172121</ext-link> (
                    <xref ref-type="bibr" rid="ref-30">Delprato 
                        <italic toggle="yes">et al</italic>., 2019e</xref>)</p>
            </sec>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Ashburner</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Ball</surname>
                            <given-names>CA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Blake</surname>
                            <given-names>JA</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Genet.</italic>
					</source>
                    <year>2000</year>;<volume>25</volume>(<issue>1</issue>):<fpage>25</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">10802651</pub-id>
                    <pub-id pub-id-type="doi">10.1038/75556</pub-id>
                    <pub-id pub-id-type="pmcid">3037419</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Crusio</surname>
                            <given-names>WE</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Rubino</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Delprato</surname>
                            <given-names>A</given-names>
                        </name>
					</person-group>:
                    <article-title>Engaging high school students in neuroscience research -through an e-internship program [version 2; referees: 3 approved].</article-title>
                    <source>
						
                        <italic toggle="yes">F1000Res.</italic>
					</source>
                    <year>2017</year>;<volume>6</volume>:<fpage>20</fpage>.
                    <pub-id pub-id-type="pmid">28299187</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.10570.2</pub-id>
                    <pub-id pub-id-type="pmcid">5302143</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Delprato</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kohutek</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Crusio</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Workbook 1. Coding of student responses to question 3a.</article-title>
                    <source>
						
                        <italic toggle="yes">figshare.</italic>
					</source>
                    <year>2019a</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://www.doi.org/10.6084/m9.figshare.8166611.v1">https://www.doi.org/10.6084/m9.figshare.8166611.v1</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Delprato</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kohutek</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Crusio</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Workbook 2. Coding of student responses to question 4a.</article-title>
                    <source>
						
                        <italic toggle="yes">figshare.</italic>
					</source>
                    <year>2019b</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://www.doi.org/10.6084/m9.figshare.8166650.v1">https://www.doi.org/10.6084/m9.figshare.8166650.v1</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Delprato</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Dedhia</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Crusio</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Video_1 GeneSetProfiling.mp4.</article-title>
                    <source>
						
                        <italic toggle="yes">figshare.</italic>
					</source>Media.<year>2019c</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.6084/m9.figshare.7649225.v1">http://www.doi.org/10.6084/m9.figshare.7649225.v1</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Delprato</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Crusio</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Dedhia</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Video_2.UploadGeneSist.</article-title>
                    <source>
						
                        <italic toggle="yes">figshare.</italic>
					</source>Media.<year>2019d</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.6084/m9.figshare.7649231.v1">http://www.doi.org/10.6084/m9.figshare.7649231.v1</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-30">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Delprato</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Crusio</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Dedhia</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Video_3.UploadGeneEnrichment.</article-title>
                    <source>
						
                        <italic toggle="yes">figshare.</italic>
					</source>Media.<year>2019e</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.6084/m9.figshare.9172121">http://www.doi.org/10.6084/m9.figshare.9172121</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>du Plessis</surname>
                            <given-names>L</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Skunca</surname>
                            <given-names>N</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Dessimoz</surname>
                            <given-names>C</given-names>
                        </name>
					</person-group>:
                    <article-title>The what, where, how and why of gene ontology--a primer for bioinformaticians.</article-title>
                    <source>
						
                        <italic toggle="yes">Brief Bioinform.</italic>
					</source>
                    <year>2011</year>;<volume>12</volume>(<issue>6</issue>):<fpage>723</fpage>&#x2013;<lpage>35</lpage>.
                    <pub-id pub-id-type="pmid">21330331</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bib/bbr002</pub-id>
                    <pub-id pub-id-type="pmcid">3220872</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Gaudet</surname>
                            <given-names>P</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>&#x0160;kunca</surname>
                            <given-names>N</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>JC</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Primer on the Gene Ontology</article-title>. In: Dessimoz C, &#x0160;kunca N. (eds)
                    <italic toggle="yes">The Gene Ontology Handbook. Methods Mol Biol.</italic>Humana Press, New York, NY.<year>2017</year>;
                    <bold>1446</bold>:<fpage>25</fpage>&#x2013;<lpage>37</lpage>.
                    <pub-id pub-id-type="pmid">27812933</pub-id>
                    <pub-id pub-id-type="doi">10.1007/978-1-4939-3743-1_3</pub-id>
                    <pub-id pub-id-type="pmcid">6377150</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Hastings</surname>
                            <given-names>J</given-names>
                        </name>
					</person-group>:
                    <article-title>Primer on Ontologies</article-title>.  In: Dessimoz C, &#x0160;kunca N. (eds)
                    <italic toggle="yes">The Gene Ontology Handbook. Methods Mol Biol.</italic>Humana Press, New York, NY.<year>2017</year>;
                    <bold>1446</bold>:<fpage>3</fpage>&#x2013;<lpage>13</lpage>.
                    <pub-id pub-id-type="pmid">27812931</pub-id>
                    <pub-id pub-id-type="doi">10.1007/978-1-4939-3743-1_1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Shendure</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Balasubramanian</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Church</surname>
                            <given-names>GM</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>DNA sequencing at 40: past, present and future.</article-title>
                    <source>
						
                        <italic toggle="yes">Nature.</italic>
					</source>
                    <year>2017</year>;<volume>550</volume>(<issue>7676</issue>):<fpage>345</fpage>&#x2013;<lpage>353</lpage>.
                    <pub-id pub-id-type="pmid">29019985</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature24286</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Thomas</surname>
                            <given-names>PD</given-names>
                        </name>
					</person-group>:
                    <article-title>The Gene Ontology and the Meaning of Biological Function</article-title>. In: Dessimoz C, &#x0160;kunca N. (eds)
                    <italic toggle="yes">The Gene Ontology Handbook. Methods Mol Biol.</italic>Humana Press, New York, NY.<year>2017</year>;
                    <bold>1446</bold>:<fpage>15</fpage>&#x2013;<lpage>24</lpage>.
                    <pub-id pub-id-type="pmid">27812932</pub-id>
                    <pub-id pub-id-type="doi">10.1007/978-1-4939-3743-1_2</pub-id>
                    <pub-id pub-id-type="pmcid">6438694</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wanjek</surname>
                            <given-names>C</given-names>
                        </name>
					</person-group>:
                    <article-title>Systems Biology as Defined by NIH.</article-title>
                    <source>
						
                        <italic toggle="yes">The NIH Catalyst.</italic>
					</source>
                    <year>2011</year>;<volume>19</volume>(<issue>6</issue>): Date accessed Dec 27, 2018.
                    <ext-link ext-link-type="uri" xlink:href="https://irp.nih.gov/catalyst/v19i6/systems-biology-as-defined-by-nih">Reference Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report52147">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.22130.r52147</article-id>
            <title-group>
                <article-title>Reviewer response for version 4</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Gaudet</surname>
                        <given-names>Pascale</given-names>
                    </name>
                    <xref ref-type="aff" rid="r52147a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1813-6857</uri>
                </contrib>
                <aff id="r52147a1">
                    <label>1</label>SIB Swiss Institute of Bioinformatics, Geneva, Switzerland</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>14</day>
                <month>8</month>
                <year>2019</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Gaudet P</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport52147" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.18061.4"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>My comments have been addressed.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Partly</p>
            <p>Is the description of the method technically sound?</p>
            <p>No</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Biocuration, gene ontology.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report50035">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.21308.r50035</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Gaudet</surname>
                        <given-names>Pascale</given-names>
                    </name>
                    <xref ref-type="aff" rid="r50035a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1813-6857</uri>
                </contrib>
                <aff id="r50035a1">
                    <label>1</label>SIB Swiss Institute of Bioinformatics, Geneva, Switzerland</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>29</day>
                <month>7</month>
                <year>2019</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Gaudet P</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport50035" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.18061.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The method presented in this manuscript is of interest to the education community, in that it describes concisely how to use DAVID, a widely-used tool for analysis using GO. However, some points should be addressed to make the method presented here more realistically usable by other schools wishing to introduce this tutorial: 
                <list list-type="bullet">
                    <list-item>
                        <p>How are the Gene Ontology, and ontologies in general, introduced to the students? Clearly what is presented in the manuscript is not sufficient. Can the authors provide easily accessible introductory tutorials?</p>
                    </list-item>
                    <list-item>
                        <p>I am surprised by how much high school students are expected to know about genes and their function, for example when the activity of RAB5A is described. Can the authors provide recommended pre-requisites for this tutorial?</p>
                    </list-item>
                    <list-item>
                        <p>Likewise, the authors intend to demonstrate how to do a gene profiling, but don't describe what is the purpose of this test.</p>
                    </list-item>
                    <list-item>
                        <p>Nowhere do authors mention 'annotation' or 'curation'. It is important to explain this concept &#x2013; how genes come about to be linked to GO terms.&#x00a0;</p>
                    </list-item>
                </list> There are many sentences in the text that are unclear or imprecise: 
                <list list-type="bullet">
                    <list-item>
                        <p>The authors state "&#x201c;Ontology&#x201d; is not a commonly encountered term and there are several definitions that are related to philosophical concepts.", and then go on to explain only one of these concepts: "In the context of information science, as described here, &#x201c;ontology&#x201d; is concerned with the representation, formal naming and classification system with the purpose of describing the relationship categories and properties of the data."</p>
                        <p> In this context, I am not clear what the authors mean by a "classification system".</p>
                    </list-item>
                    <list-item>
                        <p>The importance of the GO term system becomes apparent when analyzing the organization of genomes (...)": As formulated, this information is incorrect.</p>
                    </list-item>
                    <list-item>
                        <p>Section "Making sense of a gene": that title isn't clear; change title to something along the lines of "Overview of gene function with GO".</p>
                    </list-item>
                    <list-item>
                        <p>This section doesn't explain how to get this information from the DAVID site. This is essential if this method is meant to be used by other groups.</p>
                    </list-item>
                    <list-item>
                        <p>Section "Gene profiling in DAVID": No tools on the DAVID website are named "Gene Profiling"; on the DAVID website I get 0 results when I search on "profiling". The authors should either rename the section with the name of the DAVID tool being described, or at the very least explain what&#x00a0;they mean by gene profiling.</p>
                    </list-item>
                    <list-item>
                        <p>The first sentence of that section is "DAVID is primarily a clustering program that groups genes based on different criteria related to GO terms." However, the protocol, demonstrated in Video 1, only shows how to obtain all the functional annotations of all genes from the list inputted, there is no clustering analysis here. The graph that provides clustering is the "Functional Annotation Clustering", but I couldn't get it to work; I get an error message stating that there are too many genes on my list (3000), which by the way was incorrect.</p>
                    </list-item>
                    <list-item>
                        <p>The authors state that "DAVID links to other databases that contain complementary information like The Gene Ontology". This is incorrect: the Gene Ontology is the source of the data (annotations) presented in DAVID. The Gene Ontology therefore does not contain complementary information; it contains the source data. (Note also that DAVID is much out of date with respect to the GO data; the last update was in 2018, and the previous one in 2009. This should not impact the ability to use this tool for demonstration purposes, but for research project, as those mentioned by the students in their comments, the gene enrichment analysis tool from the GO website should be favored, as its data is updated monthly).</p>
                    </list-item>
                    <list-item>
                        <p>Screenshot 3 describing Figure 4: The authors state " You will see a notice: &#x201c;Multiple Species, have been Detected&#x201d;"; I don't see this text in the page. I do see "Select to limit annotations by one or more species"; maybe this is what the authors are referring to? In that case, for the DEMOLIST sets, there are not multiple species, there is 'human' and 'unknown'. It looks like the 'unknowns' are genes that DAVID cannot map (if you follow the link to Ensembl you will notice it is not there). Students should understand this limitation if they are faced with it: identifiers are unstable, some identifiers may disappear. It's unfortunate that DAVID provides examples containing these unknown identifiers, but if one does the analysis with the 'human list', you are excluding the genes in the 'unknown' list.</p>
                    </list-item>
                    <list-item>
                        <p>Screenshot 6 describing Figure 7: Keyword search, only demonstrates how to use the built-in functionality of a browser to search text in a web page. While this is good knowledge to have to successfully navigate web-based resources, this is not relevant here.</p>
                    </list-item>
                    <list-item>
                        <p>There is a mistake in Video 2: When the list is selected from the Random Gene List generator tool, the button 'Copy' should be used, not 'Paste'. In this case, it worked because probably the list was already in the clipboard. Again, this is pretty basic web browsing or even text management knowledge, I don't think it needs to be part of this tutorial.</p>
                    </list-item>
                    <list-item>
                        <p>There are no answers to Entry ticket questions 5, 6, and 7.</p>
                    </list-item>
                </list> Typos and errors: 
                <list list-type="bullet">
                    <list-item>
                        <p>Figure 2 'David' should be DAVID.</p>
                    </list-item>
                    <list-item>
                        <p>Question 4 Table 1 should be 92% (11/12) and not 96%.</p>
                    </list-item>
                    <list-item>
                        <p>Lists are inconsistent: sometimes lines end with a period, sometimes not.</p>
                    </list-item>
                </list> I strongly recommend that these points are addressed before the manuscript is indexed.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Partly</p>
            <p>Is the description of the method technically sound?</p>
            <p>No</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Biocuration, gene ontology.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment4795-50035">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Delprato</surname>
                            <given-names>Anna</given-names>
                        </name>
                        <aff>BioScience Project, Wakefield, USA</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>31</day>
                    <month>7</month>
                    <year>2019</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer 4</bold>
                </p>
                <p> The method presented in this manuscript is of interest to the education community, in that it describes concisely how to use DAVID, a widely-used tool for analysis using GO. However, some points should be addressed to make the method presented here more realistically usable by other schools wishing to introduce this tutorial:</p>
                <p> How are the Gene Ontology, and ontologies in general, introduced to the students? Clearly what is presented in the manuscript is not sufficient. Can the authors provide easily accessible introductory tutorials?</p>
                <p> </p>
                <p> 
                    <italic>We use video conferencing, screen sharing, screenshots, and instructional videos to teach our student interns about the Gene Ontology system. The screenshots and videos that we use for teaching are included in this manuscript and the extended data. We would be willing to provide a tutorial geared to instructors via videoconferencing and screen sharing either one on one or in small groups.</italic>
                </p>
                <p> I am surprised by how much high school students are expected to know about genes and their function, for example when the activity of RAB5A&#x00a0;s described. Can the authors provide recommended pre-requisites for this tutorial?</p>
                <p> 
                    <italic>Students are not expected to have prior knowledge about genes and their function. This is what they learn about in the internship through direct discussion with the instructor and while working on their projects. We do not use pre-requisites for this tutorial.</italic>
                </p>
                <p> Likewise, the authors intend to demonstrate how to do a gene profiling, but don't describe what is the purpose of this test.</p>
                <p> 
                    <italic>By gene profiling, we simply mean analysis to understand function. We have changed the word &#x201c;profiling&#x201d; to &#x201c;analyses&#x201d; for clarification.</italic>
                </p>
                <p> Nowhere do authors mention 'annotation' or 'curation'. It is important to explain this concept &#x2013; how genes come about to be linked to GO terms.&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>We agree and as stated above, we have now discussed this point in the introduction.&#x00a0;</italic>
                </p>
                <p> There are many sentences in the text that are unclear or imprecise:</p>
                <p> The authors state "&#x201c;Ontology&#x201d; is not a commonly encountered term and there are several definitions that are related to philosophical concepts.", and then go on to explain only one of these concepts: "In the context of information science, as described here, &#x201c;ontology&#x201d; is concerned with the representation, formal naming and classification system with the purpose of describing the relationship categories and properties of the data."</p>
                <p> In this context, I am not clear what the authors mean by a "classification system".</p>
                <p> </p>
                <p> 
                    <italic>By classification system we mean the organization and retrieval of information. We have attempted to clarify this point by using an analogy with Wikipedia which also uses a structured and controlled vocabulary like GO. We have added this to the introduction.</italic>
                </p>
                <p> The importance of the GO term system becomes apparent when analyzing the organization of genomes (...)": As formulated, this information is incorrect.</p>
                <p> </p>
                <p> 
                    <italic>We apologize, but we fail to see what is incorrect about this phrase</italic>.</p>
                <p> Section "Making sense of a gene": that title isn't clear; change title to something along the lines of "Overview of gene function with GO".</p>
                <p> </p>
                <p> 
                    <italic>The section title has been changed.</italic>
                </p>
                <p> This section doesn't explain how to get this information from the DAVID site. This is essential if this method is meant to be used by other groups. Section "Gene profiling in DAVID": No tools on the DAVID website are named "Gene Profiling"; on the DAVID website I get 0 results when I search on "profiling". The authors should either rename the section with the name of the DAVID tool being described, or at the very least explain what they mean by gene profiling.</p>
                <p> </p>
                <p> 
                    <italic>We apologize for the confusion. We have changed the word &#x201c;profiling&#x201d; to &#x201c;analyses&#x201d; for clarification.</italic>
                </p>
                <p> The first sentence of that section is "DAVID is primarily a clustering program that groups genes based on different criteria related to GO terms." However, the protocol, demonstrated in Video 1, only shows how to obtain all the functional annotations of all genes from the list inputted, there is no clustering analysis here. The graph that provides clustering is the "Functional Annotation Clustering", but I couldn't get it to work; I get an error message stating that there are too many genes on my list (3000), which by the way was incorrect.</p>
                <p> </p>
                <p> 
                    <italic>We apologize for the confusion. We have omitted the word &#x201c;clustering&#x201d; from the sentence.</italic>
                </p>
                <p> The authors state that "DAVID links to other databases that contain complementary information like The Gene Ontology". This is incorrect: the Gene Ontology is the source of the data (annotations) presented in DAVID. The Gene Ontology therefore does not contain complementary information; it contains the source data.&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>We have removed the word &#x201c;complimentary&#x201d; from the sentence.&#x00a0;</italic>
                </p>
                <p> </p>
                <p> (Note also that DAVID is much out of date with&#x00a0;with respect to the GO data; the last update was in 2018, and the previous one in 2009. This should not impact the ability to use this tool for demonstration purposes, but for research project, as those mentioned by the students in their comments, the gene enrichment analysis tool from the GO website should be favored, as its data is updated monthly).</p>
                <p> </p>
                <p> 
                    <italic>Students do in fact use the Gene Ontology site in addition to DAVID for their research projects. Based on a suggestion made by Reviewer 3, we have now added an optional gene enrichment exercise using the Gene Ontology&#x00a0;</italic>tools.</p>
                <p> Screenshot 3 describing Figure 4: The authors state " You will see a notice: &#x201c;Multiple Species, have been Detected&#x201d;"; I don't see this text in the page. I do see "Select to limit annotations by one or more species"; maybe this is what the authors are referring to? In that case, for the DEMOLIST sets, there are not multiple species, there is 'human' and 'unknown'. It looks like the 'unknowns' are genes that DAVID cannot map (if you follow the link to Ensembl you will notice it is not there). Students should understand this limitation if they are faced with it: identifiers are unstable, some identifiers may disappear. It's&#x00a0;unfortunate that DAVID provides examples containing these unknown identifiers, but if one does the analysis with the 'human list', you are excluding the genes in the 'unknown' list.</p>
                <p> </p>
                <p> 
                    <italic>We have attempted to clarify this point in the figure legend.</italic>
                </p>
                <p> There are no answers to Entry ticket questions 5, 6, and 7.</p>
                <p> </p>
                <p> 
                    <italic>We have now provided answers to Entry ticket questions 5 and 6. Question 7, is subjective (Have you ever worked with a biological database?) and we have not included an answer but have indicated that it is a subjective question.</italic>
                </p>
                <p> Typos and errors:</p>
                <p> </p>
                <p> Figure 2 'David' should be DAVID.</p>
                <p> 
                    <italic>Corrected</italic>
                </p>
                <p> </p>
                <p> Question 4 Table 1 should be 92% (11/12) and not 96%.</p>
                <p> 
                    <italic>Corrected</italic>
                </p>
                <p> </p>
                <p> Lists are inconsistent: sometimes lines end with a period, sometimes not.</p>
                <p> 
                    <italic>Corrected</italic>
                </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> </p>
                <p> &#x200b;&#x200b;&#x200b;&#x200b;&#x200b;&#x200b;&#x200b;</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report50598">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.21308.r50598</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Lovering</surname>
                        <given-names>Ruth</given-names>
                    </name>
                    <xref ref-type="aff" rid="r50598a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-9791-0064</uri>
                </contrib>
                <aff id="r50598a1">
                    <label>1</label>Functional Gene Annotation, Institute of Cardiovascular Science, University College London, London, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>9</day>
                <month>7</month>
                <year>2019</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Lovering R</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport50598" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.18061.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This article describes the resources used to teach students about the Gene Ontology and one of the many functional analysis tools that are available. As someone who provides Gene Ontology (GO) annotations and teaches MSc students about bioinformatics resources, I found this article very interesting. I am surprised that high school students were engaged and understood sufficient biology to appreciate the application of GO and DAVID. An aspect not apparently covered by this course is the source of the annotations included in these resources. Biocuration is a poorly appreciated aspect of annotation resources, the majority of scientists have no idea how the data from scientific papers is being summarised and incorporated into annotation files. This is an ongoing process and it would be useful to discuss this as well, for example the similarity of the GO resource with Wikipedia (which also includes GO annotations). Also to point out that the annotation files are not static but updated on a monthly basis. My final question is whether the authors considered using any other tool? DAVID does not regularly update the annotations used in the analysis (last updated in 2016) and it is difficult to see how the same genes are enriched with similar terms. DAVID also has a complicated system of GO FAT etc, which requires the user to understand the ontology structure. But this does help to limit view of the GO terms retrieved. However, the figures only seem to show the list of GO (and other) terms associated with each gene. A figure showing the enriched terms for the full list would be useful.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Biocuration, Gene Ontology, molecular genetics, functional gene analysis.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-50598-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Improving Interpretation of Cardiac Phenotypes and Enhancing Discovery With Expanded Knowledge in the Gene Ontology.</article-title>
                        <source>
                            <italic>Circ Genom Precis Med</italic>
                        </source>.<volume>11</volume>(<issue>2</issue>) :
                        <elocation-id>10.1161/CIRCGEN.117.001813</elocation-id>
                        <fpage>e001813</fpage>
                        <pub-id pub-id-type="pmid">29440116</pub-id>
                        <pub-id pub-id-type="doi">10.1161/CIRCGEN.117.001813</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment4794-50598">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Delprato</surname>
                            <given-names>Anna</given-names>
                        </name>
                        <aff>BioScience Project, Wakefield, USA</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests to declare.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>31</day>
                    <month>7</month>
                    <year>2019</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer 3</bold>
                </p>
                <p> This article describes the resources used to teach students about the Gene Ontology and one of the many functional analysis tools that are available. As someone who provides Gene Ontology (GO) annotations and teaches MSc students about bioinformatics resources, I found this article very interesting. I am surprised that high school students were engaged and understood sufficient biology to appreciate the application of GO and DAVID.</p>
                <p> </p>
                <p> An aspect not apparently covered by this course is the source of the annotations included in these resources. Biocuration is a poorly appreciated aspect of annotation resources, the majority of scientists have no idea how the data from scientific papers is being summarised and incorporated into annotation files. This is an ongoing process and it would be useful to discuss this as well, for example the similarity of the GO resource with Wikipedia (which also includes GO annotations). Also to point out that the annotation files are not static but updated on a monthly basis.</p>
                <p> </p>
                <p> We have now discussed these important points in the introduction of the revised article - version 4.</p>
                <p> My final question is whether the authors considered using any other tool? DAVID does not regularly update the annotations used in the analysis (last updated in 2016) and it is difficult to see how the same genes are enriched with similar terms. DAVID also has a complicated system of GO FAT etc, which requires the user to understand the ontology structure. But this does help to limit view of the GO terms retrieved. However, the figures only seem to show the list of GO (and other) terms associated with each gene. A figure showing the enriched terms for the full list would be useful.</p>
                <p> We use DAVID because the site and its output is well organized and highly informative for students. In addition to the Gene Ontology classification&#x00a0;system and its structure as part of the tutorial. In addition to DAVID we use the Gene Ontology site as well.</p>
                <p> </p>
                <p> We have included an optional gene enrichment exercise along with a step by step video and an additional figure (Figure 8) showing the results of the gene enrichment.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report47231">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.20696.r47231</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ufnar</surname>
                        <given-names>Jennifer A.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r47231a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r47231a1">
                    <label>1</label>Department of Teaching and Learning, Vanderbilt University, Nashville, TN, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>2</day>
                <month>5</month>
                <year>2019</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Ufnar JA</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport47231" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.18061.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have provided some data in a table to show that the students liked the database method and felt that it was helpful in their learning. I feel that the authors need to dig more deeply into the data, though, to explain this to readers. The results and conclusions sections were very short, and did not explain the data in enough detail. As an educator or someone wanting to follow this new method, I would want more data to show that it does work and why/how it is working. Since the results are qualitative, more work with the data is needed to show that the qualitative data actually shows the reader that this method works (i.e. coding of open ended survey responses).</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>No</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>No</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Molecular microbiology; STEM outreach.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report47230">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.20696.r47230</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Grisham</surname>
                        <given-names>William</given-names>
                    </name>
                    <xref ref-type="aff" rid="r47230a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r47230a1">
                    <label>1</label>University of California, Los Angeles (UCLA), Department of Psychology, Los Angeles, CA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>24</day>
                <month>4</month>
                <year>2019</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Grisham W</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport47230" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.18061.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>All of my concerns have been met, and the article seem fine as it now stands.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Neuroscience and pedagogy</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report45178">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.19752.r45178</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ufnar</surname>
                        <given-names>Jennifer A.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r45178a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r45178a1">
                    <label>1</label>Department of Teaching and Learning, Vanderbilt University, Nashville, TN, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>26</day>
                <month>3</month>
                <year>2019</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Ufnar JA</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport45178" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.18061.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This article presents an interesting way to introduce gene ontologies and related vocabulary to high school students to enhance biological education. While the proposal describes the method in detail, there is no data presented to show that the method actually works. While this is an interesting protocol, it is not a fully fleshed out methods article. The authors will need to present data to show the viability of the method, or resubmit potentially as a protocol.&#x00a0;</p>
            <p> </p>
            <p> A few more specific comments about the article are listed below: 
                <list list-type="order">
                    <list-item>
                        <p>The abstract is worded as more of a proposal rather than a paper.&#x00a0;</p>
                    </list-item>
                    <list-item>
                        <p>The protocol seems very much like following a set of directions (not very thought-provoking or inquiry-based), without relating it to the neuroscience. I think the paper would be much stronger if the authors were to relate it back to the behavioral neuroscience projects that are mentioned in the abstract.&#x00a0;</p>
                    </list-item>
                </list>
            </p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>No</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>No</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Molecular microbiology; STEM outreach</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report45177">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.19752.r45177</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Grisham</surname>
                        <given-names>William</given-names>
                    </name>
                    <xref ref-type="aff" rid="r45177a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r45177a1">
                    <label>1</label>University of California, Los Angeles (UCLA), Department of Psychology, Los Angeles, CA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>21</day>
                <month>3</month>
                <year>2019</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2019 Grisham W</copyright-statement>
                <copyright-year>2019</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport45177" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.18061.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This article is aimed at describing an easily searchable database that can be used by students to search by gene ontologies. The point of the article is to provide a guide to use this tool in education at the high school level, but the article could provide a good starting point for college level instructors and even industry trainers. The article describes the underlying question well and chooses a gene, RAB5A, whose function should be readily understood by most biologists.</p>
            <p> The submitted article is timely because the DAVID tool described is not very intuitive. Thus, an easy to follow guide such as the article provides is most welcome. Notably, database tools are becoming more elaborate, and their complexity has created a barrier for effectively use in teaching, particularly at lower levels. The article does a good job of describing this tool, DAVID, in a fashion that will be intelligible to high school and college level students. Lavish illustrations are provided as are links to videos to aid in instruction.</p>
            <p> The use of Entry Ticket and Exit Ticket questions is an excellent pedagogical idea. The questions, however, seem a bit simplistic.</p>
            <p> A bigger problem with the article is that there are no&#x00a0;data showing&#x00a0;how using the DAVID tool in teaching has produced changes in measurable ways. Do students do better on tests of content knowledge? Are students more sophisticated in their thinking as a function of using the DAVID tool? Are they better able to navigate other genomic tools? Do they simply enjoy the lessons more than with traditional instruction? Many of these questions can be answered with a simple pre- and posttest quiz or even comparing current scores on tests/evaluations to historical scores. This addition would make this article of much more value to educators who could then see if incorporating this tool into their pedagogy is worthwhile.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Neuroscience and pedagogy</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
</article>
