<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="editorial" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.24904.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Editorial</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Making the most of genomic data with OMA</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: not peer reviewed]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Glover</surname>
                        <given-names>Natasha M.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1811-4340</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Computational Biology, University of Lausanne, Lausanne, 1015, Switzerland</aff>
                <aff id="a2">
                    <label>2</label>Swiss Institute of Bioinformatics, Lausanne, 1015, Switzerland</aff>
                <aff id="a3">
                    <label>3</label>Center for Integrative Genomics, Lausanne, 1015, Switzerland</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:natasha.glover@unil.ch">natasha.glover@unil.ch</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>1</day>
                <month>7</month>
                <year>2020</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2020</year>
            </pub-date>
            <volume>9</volume>
            <elocation-id>665</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>24</day>
                    <month>6</month>
                    <year>2020</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2020 Glover NM</copyright-statement>
                <copyright-year>2020</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/9-665/pdf"/>
            <abstract>
                <p>In this editorial, I describe OMA Collection, a resource for users of Orthologous Matrix. In this collection, we provide tutorials and protocols on how to leverage the tools provided by OMA to analyse your data. Here, I explain the motivation for this collection and its published works thus far.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>OMA</kwd>
                <kwd>Orthologous Matrix</kwd>
                <kwd>collection</kwd>
                <kwd>orthologs</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1">
                    <funding-source>Swiss Institute of Bioinformatics</funding-source>
                </award-group>
                <funding-statement>Supported by Service and Infrastructure grant from the Swiss Institute of Bioinformatics.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec>
            <title/>
            <p>Next generation sequencing has become commonplace, and we are now entering an age where whole genome sequences are a &#x201c;dime a dozen.&#x201d; Thousands of different eukaryotic species&#x2019; genomes have been sequenced to date, with certain species, such as humans, sequenced tens of thousands of times over. But this is just the tip of the iceberg! For example, the 
                <ext-link ext-link-type="uri" xlink:href="https://www.earthbiogenome.org/">Earth BioGenome Project</ext-link> aims to sequence all 1.5 million known eukaryotic species in 10 years. The &#x2018;-omics&#x2019; data presents layers of complexity in the form of gene expression, regulation, network interaction, epigenetics, structural, functional, and comparative genomics and more. With all this data comes a wealth of potential biological knowledge, but there must be efficient and smart ways to make sense out of all these genes and genomes.</p>
            <p>One fundamental way to relate genomes is by orthology, or the relationship between genes of different species which originated from a single gene in the common ancestor of those species. It is commonly contrasted with paralogy, or the relationship between genes which originated by duplication. By tracing the evolutionary history of genes and their relationships between each other, we can start to understand the complexity of the biological processes underlying the evolution of life forms.</p>
            <p>Indeed, there are many applications of orthology, such as: 

                <list list-type="bullet">
                    <list-item>
                        <p>Prediction of gene function for uncharacterized proteins.</p>
                    </list-item>
                    <list-item>
                        <p>Elucidating gene losses, duplications, or gains (i.e. taxonomically restricted genes) to study evolution of gene families and species.</p>
                    </list-item>
                    <list-item>
                        <p>Finding the best model systems for study based on a particular physiological process.</p>
                    </list-item>
                    <list-item>
                        <p>Phylogenetic profiling, or correlating ortholog presence or absence among many species to detect biologically related processes.</p>
                    </list-item>
                    <list-item>
                        <p>Studying the positional conservation of genes, which can aid in genome assembly, homology detection, and provide insight into structural evolution of chromosomes.</p>
                    </list-item>
                    <list-item>
                        <p>Phylogenomics, among others.</p>
                    </list-item>
                </list>
</p>
            <p>One particular tool for inferring orthologs is OMA (Orthologous MAtrix), which is a method and database for the inference of orthologs among complete genomes
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. Covering over 2000 species from a broad phylogenetic range, some distinctive features of the OMA browser include a feature-rich web interface, availability of data in a wide range of formats and interfaces, and twice yearly update schedule. As part of the 
                <ext-link ext-link-type="uri" xlink:href="https://lab.dessimoz.org/">Dessimoz lab</ext-link>, I have been working on the OMA project for the past 6 years, as well as being a user of the online browser and standalone software
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>.</p>
            <p>There are many orthology inference methods, software, and databases at one&#x2019;s disposal. From working the past decade in computational biology, I have found that the bottlenecks for effectively using many bioinformatics tools are: 1) choosing the appropriate tool to fit your needs; 2) understanding the relevant information in the black box of how the tool works; and 3) efficiently running the tool and understanding the output.</p>
            <p>Thus, the aim of this F1000 Research Collection is to provide a resource for users of OMA to help them with their analysis needs. I hope to make it as hassle-free as possible to use OMA and the many supplementary analysis tools currently provided. In this direction, we have written several Tutorials, Guides, and Protocols on how to use OMA to get the most out of one&#x2019;s biological data.</p>
            <p>So far, this collection contains four papers: 
                <list list-type="bullet">
                    <list-item>
                        <p>Identifying orthologs with OMA: A primer (
                            <italic toggle="yes">Software Tool article</italic>)
                            <sup>
                                <xref ref-type="bibr" rid="ref-3">3</xref>
                            </sup>
                        </p>
                    </list-item>
                    <list-item>
                        <p>How to build phylogenetic species trees with OMA (
                            <italic toggle="yes">Method article</italic>)
                            <sup>
                                <xref ref-type="bibr" rid="ref-4">4</xref>
                            </sup>
                        </p>
                    </list-item>
                    <list-item>
                        <p>A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL (
                            <italic toggle="yes">Method article</italic>)
                            <sup>
                                <xref ref-type="bibr" rid="ref-5">5</xref>
                            </sup>
                        </p>
                    </list-item>
                    <list-item>
                        <p>Expanding the Orthologous Matrix (OMA) programmatic interfaces: REST API and the OmaDB packages for R and Python (
                            <italic toggle="yes">Software Tool article</italic>)
                            <sup>
                                <xref ref-type="bibr" rid="ref-6">6</xref>
                            </sup>
                        </p>
                    </list-item>
                </list>
</p>
            <p>All the aforementioned protocols use publicly available software, and we provide scripts, code snippets, practical examples, and plenty of explanations in order to facilitate the use of OMA in user analyses. We have several more tutorials planned for this collection and aim to continually add more resources to this collection in order to help our users.</p>
            <p>Furthermore, with the goal of providing real-world examples on how OMA can be used, any research, commentaries, conference posters/slides, or other published work that uses OMA are welcome in this collection. Hopefully, the OMA Collection will prove to be a valuable resource for making the most of genomics data.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>No data is associated with this article.</p>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Altenhoff</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Glover</surname>
                            <given-names>NM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Train</surname>
                            <given-names>CM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2018</year>;<volume>46</volume>(<issue>D1</issue>):<fpage>D477</fpage>&#x2013;<lpage>85</lpage>.
                    <pub-id pub-id-type="pmid">29106550</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkx1019</pub-id>
                    <pub-id pub-id-type="pmcid">5753216</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Altenhoff</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Levy</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zarowiecki</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>OMA standalone: orthology inference among public and custom genomes and transcriptomes.</article-title>
                    <source>

                        <italic toggle="yes">Genome Res.</italic>
</source>
                    <year>2019</year>;<volume>29</volume>(<issue>7</issue>):<fpage>1152</fpage>&#x2013;<lpage>63</lpage>.
                    <pub-id pub-id-type="pmid">31235654</pub-id>
                    <pub-id pub-id-type="doi">10.1101/gr.243212.118</pub-id>
                    <pub-id pub-id-type="pmcid">6633268</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zahn-Zabal</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dessimoz</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Glover</surname>
                            <given-names>NM</given-names>
                        </name>
</person-group>:
                    <article-title>Identifying orthologs with OMA: A primer [version 1; peer review: 2 approved].</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2020</year>;<volume>9</volume>:<fpage>27</fpage>.
                    <pub-id pub-id-type="pmid">32089838</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.21508.1</pub-id>
                    <pub-id pub-id-type="pmcid">7014581</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dylus</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nevers</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Altenhoff</surname>
                            <given-names>AM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>How to build phylogenetic species trees with OMA [version 1; peer review: awaiting peer review].</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2020</year>;<volume>9</volume>:<fpage>511</fpage>.
                    <pub-id pub-id-type="doi">10.12688/f1000research.23790.1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sima</surname>
                            <given-names>AC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dessimoz</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stockinger</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL [version 1; peer review: 2 approved with reservations].</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2019</year>;<volume>8</volume>:<fpage>1822</fpage>.
                    <pub-id pub-id-type="doi">10.12688/f1000research.21027.1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kaleb</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vesztrocy</surname>
                            <given-names>AW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Altenhoff</surname>
                            <given-names>AM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Expanding the Orthologous Matrix (OMA) programmatic interfaces: REST API and the 
                        <italic toggle="yes">OmaDB</italic> packages for R and Python [version 2; peer review: 2 approved].</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2019</year>;<volume>8</volume>:<fpage>42</fpage>.
                    <pub-id pub-id-type="pmid">31001419</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.17548.2</pub-id>
                    <pub-id pub-id-type="pmcid">6464060</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
</article>
