<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.144515.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Opinion Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Establishing the ELIXIR Microbiome Community</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 2 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Finn</surname>
                        <given-names>Robert D.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-8626-2148</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Balech</surname>
                        <given-names>Bachir</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-4419-0729</uri>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Burgin</surname>
                        <given-names>Josephine</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Chua</surname>
                        <given-names>Physilia</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-7229-4480</uri>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Corre</surname>
                        <given-names>Erwan</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-6354-2278</uri>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Cox</surname>
                        <given-names>Cymon J.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Donati</surname>
                        <given-names>Claudio</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a6">6</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>dos Santos</surname>
                        <given-names>Vitor Martins</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a7">7</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Fosso</surname>
                        <given-names>Bruno</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a8">8</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Hancock</surname>
                        <given-names>John</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a9">9</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Heil</surname>
                        <given-names>Katharina F.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3341-3736</uri>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Ishaque</surname>
                        <given-names>Naveed</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-8426-901X</uri>
                    <xref ref-type="aff" rid="a10">10</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Kale</surname>
                        <given-names>Varsha</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Kunath</surname>
                        <given-names>Benoit J.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a11">11</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>M&#x00e9;digue</surname>
                        <given-names>Claudine</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a12">12</xref>
                    <xref ref-type="aff" rid="a13">13</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Nogueira</surname>
                        <given-names>Teresa</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-0059-5177</uri>
                    <xref ref-type="aff" rid="a14">14</xref>
                    <xref ref-type="aff" rid="a15">15</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Pafilis</surname>
                        <given-names>Evangelos</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a16">16</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Pesole</surname>
                        <given-names>Graziano</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3663-0859</uri>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a8">8</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Richardson</surname>
                        <given-names>Lorna</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-3655-5660</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Santamaria</surname>
                        <given-names>Monica</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a17">17</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Strepis</surname>
                        <given-names>Nikolaos</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a18">18</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Van Den Bossche</surname>
                        <given-names>Tim</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-5916-2587</uri>
                    <xref ref-type="aff" rid="a19">19</xref>
                    <xref ref-type="aff" rid="a20">20</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Vizca&#x00ed;no</surname>
                        <given-names>Juan Antonio</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-3905-4335</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Zafeiropoulos</surname>
                        <given-names>Haris</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-4405-6802</uri>
                    <xref ref-type="aff" rid="a16">16</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Willassen</surname>
                        <given-names>Nils P.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a21">21</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Pelletier</surname>
                        <given-names>Eric</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-4228-1712</uri>
                    <xref ref-type="corresp" rid="c2">b</xref>
                    <xref ref-type="aff" rid="a12">12</xref>
                    <xref ref-type="aff" rid="a22">22</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Batut</surname>
                        <given-names>B&#x00e9;r&#x00e9;nice</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9852-1987</uri>
                    <xref ref-type="corresp" rid="c3">c</xref>
                    <xref ref-type="aff" rid="a13">13</xref>
                    <xref ref-type="aff" rid="a23">23</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>European Bioinformatics Institute, European Molecular Biology Laboratory, Hinxton, UK</aff>
                <aff id="a2">
                    <label>2</label>Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, Bari, Italy</aff>
                <aff id="a3">
                    <label>3</label>ELIXIR Hub, Hixton, UK</aff>
                <aff id="a4">
                    <label>4</label>Station Biologique de Roscoff, CNRS/Sorbonne Universit&#x00e9;, Roscoff, France</aff>
                <aff id="a5">
                    <label>5</label>Centro de Ci&#x00ea;ncias do Mar, Universidade do Algarve, Faro, Portugal</aff>
                <aff id="a6">
                    <label>6</label>Edmund Mach Foundation Research and Innovation Centre, San Michele all'Adige, Trentino-South Tyrol, Italy</aff>
                <aff id="a7">
                    <label>7</label>Systems and Synthetic Biology, Wageningen University &amp; Research, Wageningen, Gelderland, The Netherlands</aff>
                <aff id="a8">
                    <label>8</label>Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Bari, Italy</aff>
                <aff id="a9">
                    <label>9</label>Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia</aff>
                <aff id="a10">
                    <label>10</label>Berlin Institute of Health Charit&#x00e9;, Universit&#x00e4;tsmedizin Berlin, Berlin, Germany</aff>
                <aff id="a11">
                    <label>11</label>Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg</aff>
                <aff id="a12">
                    <label>12</label>Metabolic Genomics, Genoscope, Institut Fran&#x00e7;ois-Jacob / CEA / CNRS / Universit&#x00e9; Evry / Universit&#x00e9; Paris-Saclay, Evry, France</aff>
                <aff id="a13">
                    <label>13</label>IFB-core, Institut Fran&#x00e7;ais de Bioinformatique (IFB), CNRS, INSERM, INRAE, CEA, 94800, Villejuif, France</aff>
                <aff id="a14">
                    <label>14</label>INIAV&#x2014;National Institute for Agrarian and Veterinary Research, 4485-655, Vair&#x00e3;o, Portugal</aff>
                <aff id="a15">
                    <label>15</label>CE3c - Centre for Ecology, Evolution and Environmental Changes &amp; CHANGE - Global Change and Sustainability Institute, Faculdade de Ci&#x00ea;ncias da Universidade de Lisboa, 1749-016, Lisboa, Portugal</aff>
                <aff id="a16">
                    <label>16</label>Institute of Marine Biology, Biotechnology and Aquaculture, Hellenic Centre for Marine Research, Heraklion, Greece</aff>
                <aff id="a17">
                    <label>17</label>Department of Soil, Plant and Food Sciences (Di.S.S.P.A.), University of Bari, Bari, Italy</aff>
                <aff id="a18">
                    <label>18</label>Department of Pathology and Clinical Bioinformatics, Erasmus MC Cancer Institute, Erasmus MC, Rotterdam, The Netherlands</aff>
                <aff id="a19">
                    <label>19</label>Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent, Belgium</aff>
                <aff id="a20">
                    <label>20</label>VIB, UGent Center for Medical Biotechnology, Ghent, Belgium</aff>
                <aff id="a21">
                    <label>21</label>UiT The Arctic University of Norway, Troms&#x00f8;, Norway</aff>
                <aff id="a22">
                    <label>22</label>Research Federation for the study of Global Ocean Systems Ecology and Evolution, CNRS, Paris, France</aff>
                <aff id="a23">
                    <label>23</label>Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, Germany</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:rdf@ebi.ac.uk">rdf@ebi.ac.uk</email>
                </corresp>
                <corresp id="c2">
                    <label>b</label>
                    <email xlink:href="mailto:eric.pelletier@genoscope.fr">eric.pelletier@genoscope.fr</email>
                </corresp>
                <corresp id="c3">
                    <label>c</label>
                    <email xlink:href="mailto:berenice.batut@gmail.com">berenice.batut@gmail.com</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>8</day>
                <month>9</month>
                <year>2025</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2024</year>
            </pub-date>
            <volume>13</volume>
            <elocation-id>ELIXIR-50</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>16</day>
                    <month>7</month>
                    <year>2025</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Finn RD et al.</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/13-50/pdf"/>
            <abstract>
                <p>Microbiome research has grown substantially over the past decade in terms of the range of biomes sampled, identified taxa, and the volume of data derived from the samples. In particular, experimental approaches such as metagenomics, metabarcoding, metatranscriptomics and metaproteomics have provided profound insights into the vast, hitherto unknown, microbial biodiversity. The ELIXIR Marine Metagenomics Community, initiated amongst researchers focusing on marine microbiomes, has concentrated on promoting standards around microbiome-derived sequence analysis, as well as understanding the gaps in methods and reference databases, and identifying solutions to the computational overheads of performing such analyses. Nevertheless, the methods used and the challenges faced are not confined to marine microbiome studies, but are broadly applicable to other biomes. Thus, expanding this Marine Metagenomics Community to a more inclusive ELIXIR Microbiome Community will enable it to encompass a broader range of biomes and link expertise across &#x2018;omics technologies. Furthermore, engaging with a large number of researchers will improve the efficiency and sustainability of bioinformatics infrastructure and resources for microbiome research (standards, data, tools, workflows, training), which will enable a deeper understanding of the function and taxonomic composition of the different microbial communities.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Microbiome</kwd>
                <kwd>ELIXIR Community</kwd>
                <kwd>White Paper</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/501100001871">
                    <funding-source>Funda&#x00e7;&#x00e3;o para a Ci&#x00ea;ncia e a Tecnologia</funding-source>
                    <award-id>UIDB/04326/2020</award-id>
                    <award-id>UIDP/04326/2020</award-id>
                    <award-id>LA/P/0101/2020</award-id>
                </award-group>
                <award-group id="fund-2">
                    <funding-source>ELIXIRxNextGenIT</funding-source>
                    <award-id>IR0000010</award-id>
                </award-group>
                <award-group id="fund-3">
                    <funding-source>Research Foundation Flanders</funding-source>
                    <award-id>1286824N</award-id>
                </award-group>
                <award-group id="fund-4" xlink:href="http://dx.doi.org/10.13039/501100000268">
                    <funding-source>Biotechnology and Biological Sciences Research Council</funding-source>
                    <award-id>BB/V01868X/1</award-id>
                </award-group>
                <award-group id="fund-5">
                    <funding-source>CnrBiomics</funding-source>
                    <award-id>PIR01_00017</award-id>
                </award-group>
                <award-group id="fund-6">
                    <funding-source>National Research Fund Luxembourg</funding-source>
                    <award-id>C19/BM/13684739</award-id>
                </award-group>
                <funding-statement>CJC received Portuguese national funds from the Foundation for Science and Technology (FCT) through projects UIDB/04326/2020, UIDP/04326/2020, and LA/P/0101/2020. T.V.D.B. acknowledges funding from the Research Foundation Flanders (FWO) [1286824N].  GP acknowledges funding from MUR (Italy), CnrBiomics (grant number PIR01_00017) and ELIXIRxNextGenIT (grant number IR0000010). JUV received the C19/BM/13684739 grant, funded by National Research Fund Luxembourg (FNR). VK was supported by a Biotechnology and Biological Sciences Research Council [BB/V01868X/1]. LR and RDF were supported by EMBL core funds.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>In response to insightful feedback from our reviewers, this revised version of our article has undergone significant updates and refinements to enhance clarity, accuracy, and comprehensiveness. We have meticulously addressed the comments provided, ensuring that each section of the manuscript is thoroughly revised to reflect the latest developments and insights in the field. One of the most notable changes in this version is the substantial reworking of Tables 2 and 3. Table 2, which provides an overview of existing and planned interactions with ELIXIR Communities, has been expanded and updated to include new collaborations and initiatives that have emerged since the previous publication. This table now offers a more detailed and current perspective on how our work integrates with and supports the broader ELIXIR infrastructure. Similarly, Table 3, which describes a selection of current European national and pan-European efforts aimed at microbiome research, has been comprehensively revised. We have added new entries and updated existing ones to reflect the latest advancements and their relevance to the ELIXIR Microbiome Community. This table now provides a more accurate and up-to-date snapshot of the European microbiome research landscape. In addition to these updates, the text throughout the article has been carefully revised to improve readability and coherence. We have also included new data and references to recent studies to ensure that our discussion is grounded in the most current research. Overall, this revised version aims to provide a more accurate, comprehensive, and timely overview of our work and its implications for the ELIXIR Microbiome Community and the broader field of microbiome research.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>1. Introduction</title>
            <p>The term &#x201c;microbiome&#x201d; is a description of an entire habitat that encompasses all the microbes (bacteria, archaea, eukaryotes, and viruses), their composition (genomes, proteins and various molecules they produce), and the environment they are found in.
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>
                </sup> The microbiome is experimentally characterised by the application of one or more &#x2018;omics techniques, especially metabarcoding, metagenomics and metatranscriptomics, but also metaproteomics and metabolomics, combined with contextual metadata about the surrounding environment, be it a geographic location (e.g. ocean), host-associated (e.g. human gut) or engineered (e.g. wastewater treatment plant). Over the past decade, scientists have become increasingly aware of the role performed by microbes in the health (or maintenance) of the environment, and that dysbiosis of the microbial community can lead to dysregulation and/or negative outcomes. Furthermore, there can be complex compositional modulation of microbiomes. For example, viruses that infect bacteria are found ubiquitously in all environments and play critical roles in community dynamics. Microbial communities can be very diverse and heterogeneous in composition across geospatial and temporal scales, and the culture-independent methods for identifying species with the microbiome often reveal hitherto unknown microbes. Despite methodological difficulties, understanding the taxonomic and functional composition of a microbiome, how compositional differences relate to phenotypes, and how these communities may be manipulated to restore a community close to a natural composition are key current research questions. Given that most academic institutions have access to dedicated sequencing facilities or equivalent commercial facilities, coupled with diminishing costs of DNA sequencing and other &#x2018;omics technologies, it is relatively easy to generate large datasets. Therefore, there are now millions of microbiome-derived sequence datasets, many of which are large (gigabytes to terabytes) and complex (thousands of related and/or diverse samples), but it can require significant computational resources to store and analyse the data. Additionally, datasets from other &#x2018;omics techniques such as metaproteomics and metabolomics are being increasingly generated, alone or in combination with metagenomics and/or metatranscriptomics data coming from the same samples. A key challenge facing the microbiome research community is how to: appropriately store the data; informatically process, integrate, compare and interpret microbiome-derived data; and how to make the data findable, accessible, interoperable and reusable, i.e. FAIR.
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup>
            </p>
            <p>Such steps are vital to ensure reuse of data and scientific reproducibility. For example, when wishing to contextualise the results between similar experiments, the way a dataset has been produced and processed must be transparent to establish whether it is comparable (e.g. amplified sequence variants (ASVs) can only be compared to those with the same amplified region). Similarly, ensuring that taxonomic and functional assertions are placed in the context of the original sample/sequencing effort and associated contextual metadata is crucial for understanding compositional microbiome changes, such as those in health and disease or longitudinal datasets. Ensuring data is FAIR is typically necessary to comply with most scientific funding sources open access data requirements. Finally, describing how a dataset has been produced allows verification and replication of scientific experiments.</p>
            <p>

                <ext-link ext-link-type="uri" xlink:href="https://elixir-europe.org/">ELIXIR</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>
                </sup> is a distributed infrastructure bringing together experts from across Europe to enable life science researchers throughout the world to access and analyse life science data. ELIXIR is formed by member states each with a national Node composed of one or more centres of excellence in bioinformatics. Each Node coordinates services, standards and resources, and collaborates with experts in other Nodes to create a sustainable Europe-wide infrastructure for biological data. ELIXIR Platforms bring together experts from Nodes to develop ELIXIR&#x2019;s vision and coordinate activities in defined areas. The five Platforms are Data, Tools, Interoperability, Compute and Training. ELIXIR Communities bring together experts across ELIXIR Nodes and external partners to coordinate activities within specific life science domains. During the establishment of ELIXIR, the ELIXIR Marine Metagenomics Community acted as a biome-specific network of researchers for the identification and organisation of domain-specific reference resources, development of reproducible workflows and the proposal of best practices. However, there is no underlying reason to restrict these activities to just the marine environment, with most of the aforementioned efforts broadly applicable to analysis of microbiome-derived sequence data from any biome. Furthermore, there is the need to extend the activities of the Marine Metagenomics Community to integrate expertise and knowledge about other &#x2018;omics technologies, such as metatranscriptomics, metaproteomics and metabolomics, which are increasingly used in microbiome studies. Thus, this white paper outlines some of the historical aspects of the Marine Metagenomics Community and the aims of the broader community, especially in the context of the other ELIXIR Communities and infrastructure platforms.</p>
        </sec>
        <sec id="sec2">
            <title>2. From marine metagenomics to a more inclusive Community</title>
            <p>The ELIXIR Marine Metagenomics Community, established in 2015 as part of the European Commission funded ELIXIR EXCELERATE project (grant number 676559), was one of the first four ELIXIR Communities created as &#x201c;Use Cases&#x201d;.
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup> During the EXCELERATE project, these ELIXIR &#x201c;Use Cases&#x201d; were expanded and renamed to Communities, with a unified aim of bringing European specialists together to provide sustainable data resources, benchmark different tools and workflows, provide access to computing and storage, improve interoperability, and develop training resources within their research domains. These activities were conducted in collaboration with the ELIXIR Platforms, to ensure harmonisation of the outputs. As such, the Marine Metagenomics Community focused on metagenomics analysis pipelines, addressing the lack of reference databases and promoting the best practices for the research community. Highlights include the incorporation of new tools and resources into the MGnify
                <sup>
                    <xref ref-type="bibr" rid="ref5">5</xref>
                </sup> and MetaPIPE
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup> analytical pipelines (e.g. MAPseq,
                <sup>
                    <xref ref-type="bibr" rid="ref7">7</xref>
                </sup> ITSOneDB
                <sup>
                    <xref ref-type="bibr" rid="ref8">8</xref>
                </sup>), the formal description of the MGnify pipeline using the common workflow language (CWL
                <sup>
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup>) to promote interoperability, the establishment of marine metagenomics data (e.g. Marine Metagenomics Portal, MARdb,
                <sup>
                    <xref ref-type="bibr" rid="ref10">10</xref>
                </sup> METDB,
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup> and the Ocean Gene Atlas
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>
                </sup>) and a community paper (beyond ELIXIR) promoting best practices advocating the use of community standards for contextual provenance and metadata at all stages of the research data life cycle.
                <sup>
                    <xref ref-type="bibr" rid="ref13">13</xref>
                </sup> Capacity building has also been an important activity since the establishment of the Marine Metagenomics Community, and many hands-on workshops and training courses have been developed and completed to build competence and expertise in a broader marine academic community.</p>
            <p>However, the popularity of metagenomics has continued to grow, with current approaches providing greater genome-resolved insights into the community composition and the functions performed by the microbial constituents, with annotations spanning viruses, bacteria, archaea and microbial eukaryotes.
                <sup>
                    <xref ref-type="bibr" rid="ref14">14</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref18">18</xref>
                </sup> Furthermore, metagenomic-like approaches are increasingly being applied to untangle complex holobiont genomes such as lichens, where both the primary symbionts and secondary non-obligate microbes are captured.
                <sup>
                    <xref ref-type="bibr" rid="ref19">19</xref>
                </sup> Finally, multi-omics datasets are now being more routinely produced to understand not only the genetic potential, but also the actively produced transcripts, proteins and/or metabolites, with a view to establishing the links between genotype and phenotype. When a host organism is involved, such datasets can also be augmented with genetic data from the host, such as genome, single nucleotide polymorphisms and transcriptomic data. The collective data facilitate a hologenomic approach
                <sup>
                    <xref ref-type="bibr" rid="ref20">20</xref>
                </sup> to understanding host phenotypes, in the context of their environment and microbiome. Given this increasing complexity of study designs, and the broad applicability of microbiome research, we advocate expanding the Marine Metagenomics Community to include other areas of microbiome research. In particular, we highlight the need for an ELIXIR Microbiome Community to develop and promote standards and research infrastructures that enable the sharing of efforts, concepts, and best practices, while benefiting from the synergistic interplay with other ELIXIR Communities.</p>
            <sec id="sec3">
                <title>2.1 The scope of the ELIXIR Microbiome Community</title>
                <p>The term metagenomics is often colloquially applied to many different areas of microbiome research (see 
                    <xref ref-type="table" rid="T1">
Table 1</xref>), regularly (incorrectly) used to encompass both shotgun metagenomics (indiscriminate sequencing of DNA from an environmental sample) and metabarcoding approaches (the sequencing of a specific amplified marker gene), as exemplified by the thousands of mislabeled datasets in International Nucleotide Sequence Database Collaboration (INSDC). Depending on the nature of the scientific question being addressed and/or the environment, metagenomic analysis may also involve assembly, and potentially the generation of metagenome assembled genomes (MAGs).
                    <sup>
                        <xref ref-type="bibr" rid="ref21">21</xref>
                    </sup> Equally applicable is the analysis of unassembled raw-read data sets that can be used for taxonomic classification (e.g. Kraken,
                    <sup>
                        <xref ref-type="bibr" rid="ref22">22</xref>
                    </sup> MetaPhlan,
                    <sup>
                        <xref ref-type="bibr" rid="ref23">23</xref>
                    </sup> mOTUs
                    <sup>
                        <xref ref-type="bibr" rid="ref24">24</xref>
                    </sup>) and functional profiling approaches that are especially effective when extensive reference databases are available. Sequencing technologies such as long-read sequencing methodologies and the associated adaptive sequencing techniques,
                    <sup>
                        <xref ref-type="bibr" rid="ref25">25</xref>
                    </sup> together with changing protocols such as host material depletion protocols (e.g. 
                    <sup>
                        <xref ref-type="bibr" rid="ref26">26</xref>
                    </sup>), are facilitating the analysis of a wide-range of differing communities. However, the applicability of certain downstream processing and/or analysis tools changes fundamentally in these different contexts. Similarly with metagenomic data, metatranscriptomic data can be processed in different ways, and with an associated metagenomic dataset from the same sample, enables the estimation of both the genetic potential and actively transcribed fraction. Additionally, metaproteomics and metabolomics are technologies that are increasingly being used in microbiome research, involving the study of proteins expressed or small molecules produced by microbial communities in a given environment, which require quite different methodologies for their analysis.</p>
                <p>Fundamentally, the ELIXIR Microbiome Community is about providing the necessary infrastructures required to perform analysis of nucleotide sequence data derived from a microbiome, especially the reproducibility of the results, the archiving and discovery of analyses and the interoperability of tools and data. The Microbiome Community will work with other ELIXIR communities to determine how microbiome-derived data coming from different &#x2018;omics approaches, may be processed and integrated.</p>
                <table-wrap id="T1" orientation="portrait" position="float">
                    <label>
Table 1. </label>
                    <caption>
                        <title>Overview of the terms and techniques used to study microbiome samples.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Term</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Definition</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Metabarcoding</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Amplification and sequencing of diagnostic marker gene(s) found in a microbial community</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Metagenomics</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Random sequencing of the total DNA found in a microbial community</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Metatranscriptomics</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">As with metagenomics, but the sequencing of the total RNA</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Metabolomics (non-targeted)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Indiscriminate study of small molecules and products of metabolism</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Metaproteomics</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Identification and quantification of proteins and their interactions found in a microbial community
</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>A fundamental challenge for the Microbiome Community is to address the provision of infrastructures that are sufficiently adaptable to permit the most appropriate informatics analysis, depending on the environment sampled and the experiments conducted. Finally, the ELIXIR Microbiome Community will gather a variety of researchers wishing to undertake microbiome research, spanning clinicians aiming to understand the role of the human microbiome in disease aetiology, ecologists wanting to understand the changing landscape of biodiversity, the agritechnology sector wishing to enhance animal and crop production, to biotechnology scientists looking for novel enzymes, among others.</p>
            </sec>
            <sec id="sec4">
                <title>2.2 The context within ELIXIR</title>
                <p>Given the breadth of the aforementioned applications of microbiome research, it is unsurprising that there are many links to other current and future ELIXIR activities. 
                    <xref ref-type="fig" rid="f1">
Figure 1</xref> presents a schematic layout of the experimental design of a multi-omic analysis of a microbiome sample. Even in this very high-level representation, it can be easily observed that the new ELIXIR Microbiome Community has many potential interactions with other ELIXIR Communities and Platforms along the experimental workflow. Thus, the Microbiome Community represents a showcase of the essence of ELIXIR by bringing together diverse informatics infrastructures that can be coupled together (interoperate) to achieve complex data analyses (on compute infrastructures) that have the appropriate provenance, with data adequately archived in the relevant ELIXIR core data resources. At all levels in ELIXIR, it will be essential to coordinate activities to ensure functional harmony between ELIXIR Communities using Platform-devised solutions.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>
Figure 1. </label>
                    <caption>
                        <title>A schematic of how a microbiome sample (i.e. community in the environment) may be analysed using different &#x2018;omics approaches, with the main steps indicated in green.</title>
                        <p>Underpinning these analyses will be the metagenomic and metatranscriptomic data, which will be used as a framework for the metaproteomic and metabolomic interpretation. Highlights in this figure are connections with the ELIXIR platforms (orange boxes) and other ELIXIR communities (dark blue boxes).</p>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185171/cfa9a9a4-8dab-4afb-8bbf-020b991bc28b_figure1.gif"/>
                </fig>
                <p>

                    <italic toggle="yes">

                        <bold>2.2.1 Interactions with other ELIXIR Communities</bold>
</italic>
                </p>
                <p>Some of the key areas of interactions, both ongoing and foreseen, with other Communities are listed in 
                    <xref ref-type="table" rid="T2">
Table 2</xref>. As indicated in 
                    <xref ref-type="fig" rid="f1">
Figure 1</xref>, the interaction with other ELIXIR Communities, specifically those concerned with environmental sampling, begins at the start of the data lifecycle, concerning the sample acquisition and characterisation of the microbial communities. For example, the Food and Nutrition Community aims to understand the relationship between food choices and human health. While microbiome analysis forms part of this Community&#x2019;s activities, the aim of the Food and Nutrition Community is to integrate microbiome data within the context of food and nutrition data, host genotype and phenotype information, and develop interventions that may impact disease.
                    <sup>
                        <xref ref-type="bibr" rid="ref27">27</xref>
                    </sup> Thus, in the case of the Food and Nutrition Community the microbiome is only a small part of the overall research program, and restricted to human microbiome research. Members of the existing ELIXIR Microbiome Community are already engaged with the Food and Nutrition Community, and have helped to provide microbiome sequence analysis services. Similarly the Biodiversity Community has multiple overlapping activities, but with a distinct remit. For example, computational infrastructures and tools borne out of metagenomics research are now being applied for pathogen and biodiversity surveillance. Furthermore, taxonomic inventories resulting from analysis of metagenomic/metabarcoding data are commonly accepted as biodiversity resources and biodiversity resources such as 
                    <ext-link ext-link-type="uri" xlink:href="https://www.gbif.org">GBIF</ext-link> and 
                    <ext-link ext-link-type="uri" xlink:href="https://obis.org/">OBIS</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref28">32</xref>
                    </sup> routinely incorporate data from both MGnify and 
                    <ext-link ext-link-type="uri" xlink:href="https://www.insdc.org/">INSDC</ext-link>. Similarly, many of the biodiversity approaches use barcoding methods for studying environmental DNA (eDNA). While this can overlap with the metabarcoding approaches used in the Microbiome Community, eDNA analysis extends to marker genes such as Cytochrome c oxidase subunit I (Cox1) that is specific to macro-organisms and, thus, out of scope for the Microbiome Community and falls into the realm of the ELIXIR Biodiversity Community.</p>
                <table-wrap id="T2" orientation="portrait" position="float">
                    <label>
Table 2. </label>
                    <caption>
                        <title>Overview of existing and planned interactions with ELIXIR Communities.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">ELIXIR Community</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Existing and planned interaction(s)</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">3D BioInfo</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Improve the organisation, quality control and presentations of protein models (e.g. ESMAtlas
                                    <sup>
                                        <xref ref-type="bibr" rid="ref32">28</xref>
                                    </sup>) for proteins predicted from metagenomic and metatranscriptomic assemblies. Improve functional annotations through structure-function-sequence relationships.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Biodiversity</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Connect biodiversity/observation resources with &#x2018;omics data/analysis. Identify overlap between analysis pipelines (e.g. barcodes, genome annotations) and promote best practices.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Federated human data</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Evaluate the landscape concerning human microbiome and national legislation concerning data sharing. If appropriate, investigate solutions from the Federated human data Community for sharing sensitive data.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Food &amp; Nutrition</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Share metagenomics workflows to improve our understanding of the role of the gut microbiome in unlocking nutrients in food.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Galaxy</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">In collaboration with the Galaxy Community, continue to tailor and expand tools, workflows and training materials applicable to the Microbiome Community, directed by the needs identified by an ongoing evaluation study.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Metabolomics</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Develop methods and tools to connect microbiome sequence data (metagenomics and metatranscriptomics) to link functions to metabolites.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Microbial Biotechnology</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Improve the identification of valuable enzymatic activities from environmental metagenomics data to identify bioactives (e.g. enzyme, small molecule) of interest for: bioeconomy; applications in food preservation; agriculture; chemistry; or medicine.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Plant Science</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Develop a greater understanding of the needs of the Plant Science Community for microbiome-based solutions to improve plant resilience to pathogens, as well as understand how plants maintain their microbial communities across generations, and if so, potential mechanisms for doing so.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Proteomics</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Enable the production of tailored reference databases (e.g. biome and/or other contextual metadata) for interpretation of (meta-)proteomics MS2 data. Develop methods to enhance the integration of metagenomic, metatranscriptomics and metaproteomics results.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Single Cell Omics</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Explore areas of overlap in data standards
                                    <sup>
                                        <xref ref-type="bibr" rid="ref89">29</xref>
                                    </sup> and annotation pipelines concerning single amplified genomes (SAGs). Increase knowledge within the Microbiome Community concerning spatial single cell data with respect to improving the quality of MAGs/SAGs
                                    <sup>
                                        <xref ref-type="bibr" rid="ref82">30</xref>
                                    </sup> and the identification of microbes in tissues and tumours.
                                    <sup>
                                        <xref ref-type="bibr" rid="ref83">31</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Systems Biology</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Empower a better integration of multi-omics datasets to describe and understand how different community members interoperate to achieve processes. More specifically, ensure that different multi-omics data types from the same sample are appropriately connected across different archive databases, and improve methods for linking metabolomics data to sequence (protein and nucleotide).
</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>With the growing number of multi-omics datasets, establishing strong ties with the ELIXIR Metabolomics and Proteomics communities
                    <sup>
                        <xref ref-type="bibr" rid="ref29">33</xref>
                    </sup> will be essential for understanding how metagenomic and metatranscriptomic data may be utilised by these Communities (e.g. the production of reference databases for the interpretation of the metaproteomics), and the nature of the data types produced by these other &#x2018;omics technologies, their limitations and how the data could be integrated. For example, overlaying metabolomics results on metagenomic data is currently non-trivial due to the scarcity of small molecule annotations that can be linked to functional annotations. Ongoing work with the Microbial Biotechnology and Systems Biology Communities has identified the need to augment the functional annotation of metagenomic and metatranscriptomic data with chemical reaction information from resources such as Rhea.
                    <sup>
                        <xref ref-type="bibr" rid="ref30">34</xref>
                    </sup> While this will improve the discovery of new industrial applications, there is still the need to expand the protein functional annotations of the, so-called, microbial dark matter. The advent of new structural modelling software
                    <sup>
                        <xref ref-type="bibr" rid="ref32">28</xref>
                    </sup>
                    <sup>,</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref31">35</xref>
                    </sup> and data resources
                    <sup>
                        <xref ref-type="bibr" rid="ref33">36</xref>
                    </sup> means that there are now structural models for millions of proteins that currently lack functional annotations, yet appear structurally related to functionally characterised proteins. Connections to the 3D BioInfo Community will aid how we store and organise this structural model information, reuse software components for visualisation and leverage their training materials on how to interpret structural model data. This will allow the Microbiome Community to assess the merits and limitations of this data type.</p>
                <p>Furthermore, microbiome research has many translational aspects, ranging from the discovery of biomarkers associated with health (of organisms or environments) and disease, to industrial applications such as using enzymes from microbes or the microbes themselves for performing bioremediation and/or replacing chemical processes. One topic that is an area of intensive research is the discovery of enzymes capable of degrading plastics, typically polyethylene terephthalate (commonly known as PET).
                    <sup>
                        <xref ref-type="bibr" rid="ref34">37</xref>
                    </sup> While metagenomic assembly and analysis is providing a rich source of potential new enzymes, the informatics at the core of the Microbiome Community will not provide the information why one enzyme should be assayed in preference to another, how these alpha-beta hydrolases have adapted to utilising PET, or why one enzyme performs better than another. Such answers will come from the collaborative efforts that bridge across Communities, such as microbial biotechnology and 3D BioInfo and, of course, the wider research community.</p>
                <p>In summary, there are many synergies and connections between the Microbiome Community and the other existing ELIXIR Communities, but none of these Communities are focused on the core issues concerning microbiome-derived sequence analysis, infrastructure provision, data standards and best practices. Moreover, there are key global societal challenges within the One Health concept defined by the World Health Organization, such as food safety, pathogen tracking, climate changes, antimicrobial resistance (and new therapeutics) and pandemic preparedness, in which microbiome research plays a critical role. Yet each one of these areas is too complex to be tackled individually and therefore requires the collective outputs from more than one ELIXIR Community, and reach far beyond informatics research (
                    <xref ref-type="table" rid="T3">
Table 3</xref>).</p>
                <table-wrap id="T3" orientation="portrait" position="float">
                    <label>
Table 3. </label>
                    <caption>
                        <title>Description of a selection of current European national and pan-European efforts aimed at microbiome research, and their relevance to the ELIXIR Microbiome Community.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Initiative</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Reach</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Aims and relevance</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">M&#x200b;utualised&#x200b; &#x200b;Digital&#x200b; Spaces&#x200b; &#x200b;For&#x200b; Life&#x200b; &#x200b;Sciences (MuDIS4LS)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">National (FR)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To develop a framework that will connect national and regional data centres to enable the control of biological data from their origin (data-producing national infrastructures) to their public release, while ensuring data security during the intermediate phases of analysis and exploitation.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: Guide data management best practices, especially when dealing with sensitive microbiome related data.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Secured computing spaces for the data access and analysis project of the France 2030 programme &#x00ab; Food Systems, Microbiome and Health &#x00bb;(Cloud4SAMS)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">National (FR)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To deploy a distributed digital infrastructure enabling researchers to exploit microbiome and health data in a secure computing environment, collating software tools and workflows for processing these data, computing and storage platforms suitable for processing microbiome data and matching them with health data, while respecting data access rules.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: This project will define deployment recipes describing all the procedures to instantiate a virtual machine in a secure cloud, install software and transfer datasets. Knowledge developed will guide the deploying of similar distributed infrastructures within ELIXIR.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">National Research Center in Bioinformatics for Omics Sciences (CNRBiOmics)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">National
                                    <break/>(IT)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To enhance the ELIXIR Italian node infrastructure, through the establishment of a &#x201c;centre of excellence&#x201d; for multi-omics data production, management, and analysis. In addition, establish a higher education training platform to develop skills required to use the infrastructure.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: Microbiome multi-omics data produced by the infrastructure will provide example use cases, as well as being a test bed for multi-omics data integration solutions developed by the community.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Consolidation of the Italian Infrastructure for Omics Data and Bioinformatics (ELIXIRxNextGenIT)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">National
                                    <break/>(IT)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To consolidate the ELIXIR-IT infrastructure for omics and bioinformatics, focused on data production, computational analysis, facilities improvement and training, with a view to strengthening the national ELIXIR infrastructure.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: Overlaps with training and sharing of microbiome related pipelines.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">NFDI4Microbiota</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">National (DE)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To: (i) promote FAIR principles in the microbiological community; (ii) provide a comprehensive training program; (iii) enhance data resources for microbiology community; (iv) support high-quality research data management; (v) increase data value by standardising and systematically collecting rich metadata and building tools for querying; (vi) make research more reproducible by standardising data processing and analysis; (vii) provide computational tools and infrastructure for the translation of data into new knowledge.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: This national programme shares many of the objectives of the Microbiome Community, so it will be important to synergise activities.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">European Reference Genomes Atlas (ERGA)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">European</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To generate eukaryotic reference genomes of European species and create a powerful resource for the understanding of biodiversity.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: This collection will also include microbial eukaryotes that can be used to enhance eukaryote genome analysis in microbiome research.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">European e-Science Infrastructure for biodiversity and ecosystem research
                                    <break/>(LifeWatch ERIC)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">European</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To accelerate the sharing, integration and analysis of open-data and its Virtual Research Environments (VREs) to enable studies on biodiversity structure and conservation related to multiple drivers.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: Metagenomics data will be used to develop ecological models, while reference genome data will improve analysis pipelines.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Metaproteomics Initiative</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">European/
                                    <break/>International</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To promote dissemination of metaproteomics fundamentals, advancements, and applications through collaborative networking in microbiome research.
                                    <break/>

                                    <italic toggle="yes">Relevance:</italic> The central information hub and open meeting place will allow members of the Microbiome Community to interact with metaproteomics experts. Will help the Microbiome Community aim to standardise and methodologies in this field.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">microGalaxy</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">European/
                                    <break/>International</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">To: (i) develop and sustain microbial data analysis in Galaxy, (ii) implement standardised &#x201c;best practices&#x201d;, (iii) expand documentation and training, (iv) coordinate efforts in tools, workflows and training development.
                                    <break/>

                                    <italic toggle="yes">Relevance</italic>: Usable and standardised workflows for the Microbiome Community.
</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>

                    <bold>

                        <italic toggle="yes">2.2.2 Interaction with ELIXIR Platforms</italic>
</bold>
                </p>
                <p>Similar to the collaborations with the ELIXIR Communities, there are multiple ongoing and future interactions with the ELIXIR Platforms. In the following sections the connections between the past Marine Metagenomics Community or the future Microbiome Community and each of the Platforms will be highlighted.</p>
                <p>

                    <italic toggle="yes">2.2.2.1 Data</italic>
                </p>
                <p>The aim of the ELIXIR Data Platform is to promote the use, re-use and value of life science data. A key part of this activity has been the establishment of the Core Data Resources (CDR). Underpinning sequenced-based microbiome research is the INSDC, especially the European Nucleotide Archive (ENA) in the context of ELIXIR. Alongside the archived sequence data, users can access comprehensive metadata that is important to contextualise where the data originated. Throughout the lifetime of the ELIXIR Marine Metagenomics Community there have been extensive efforts to increase the standardisation of derived sequence products from metagenomic short-read datasets, particularly increasing the availability of assemblies
                    <sup>
                        <xref ref-type="bibr" rid="ref5">5</xref>
                    </sup> and the introduction of the deposition layers to support the increase in the numbers of MAGs being generated.
                    <sup>
                        <xref ref-type="bibr" rid="ref35">38</xref>
                    </sup> In the new Microbiome Community we will continue to promote and develop these layers to accommodate Eukaryotic MAGs (see below), viral sequences and complex coassembly, as well as incorporating the latest community standards as they are approved by authoritative bodies. The work undertaken to generate the MAR databases highlighted that many marine samples in ENA lack key metadata fields. Through extensive curation efforts, using literature as well as contacting the original data submitters, much of this missing data was retrieved and added to the MAR database. While ENA (or any of the INSDC partners) can not add this metadata to the original sequence record, an ELIXIR sponsored initiative led to the establishment of the Contextual Data Clearinghouse (
                    <ext-link ext-link-type="uri" xlink:href="https://www.ebi.ac.uk/ena/clearinghouse/api/">CDCH</ext-link>). The CDCH facilitates the capture of additional metadata using controlled vocabularies including a description of how this data was generated (e.g. manual assertion, computationally derived), so that they can be associated with an INSDC record. Longer term, this data will be incorporated into BioSamples.</p>
                <p>In other non-sequenced based &#x2018;omics fields, microbiome data archiving and analysis is supported by data-type specific resources. In the case of metaproteomics, the PRIDE database repository (also an ELIXIR CDR) enables archiving and re-analysis of (meta) proteomics data, and now also encourages researchers to upload their metadata in SDRF-format.
                    <sup>
                        <xref ref-type="bibr" rid="ref36">39</xref>
                    </sup>
                    <sup>,</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref37">40</xref>
                    </sup> PRIDE is the leading resource of the International ProteomeXchange Consortium of proteomics data resources, involving additional databases in the USA, Japan and China, in addition to PRIDE. Similarly, in the case of metabolites the data can be deposited in the MetaboLights repository
                    <sup>
                        <xref ref-type="bibr" rid="ref38">41</xref>
                    </sup> or similar resources. A current challenge facing the field is connecting different multi-omics data that have been derived from the same sample.</p>
                <p>The Data Platform also promotes the linkage between Europe PMC
                    <sup>
                        <xref ref-type="bibr" rid="ref39">42</xref>
                    </sup> and other CDR databases. This is critical for the Microbiome Community as additional contextual metadata can often be found in the literature,
                    <sup>
                        <xref ref-type="bibr" rid="ref40">43</xref>
                    </sup>
                    <sup>,</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref41">44</xref>
                    </sup> providing crucial overarching context to the experiment, which can be important for re-analyses or meta-analyses. We will continue to promote such approaches, enriching metadata wherever possible.</p>
                <p>Last but not least, new activities will be promoted aimed at the integration of microbiome data coming from different &#x2018;omics approaches. In this context, recently, the PRIDE and MGnify teams developed and implemented new pipelines in both platforms for the re-analysis and integration of metagenomic and metaproteomic data, allowing the re-analysis of metaproteomics datasets from PRIDE using sequence databases generated from MGnify, and contextualising the results back into the MGnify web interface in terms of assembly annotations (
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/PRIDE-reanalysis/MetaPUF">https://github.com/PRIDE-reanalysis/MetaPUF</ext-link>). The ELIXIR Microbiome Community will also work to move the Marine Metagenomics domain in the ELIXIR Research Data Management Kit (RDMKit) towards a more general Microbiome domain.</p>
                <p>

                    <italic toggle="yes">2.2.2.2 Tools</italic>
                </p>
                <p>Microbiome data analysis employs a large number of tools which are used to perform basic quality control on the sequence data, with separate tools (and reference databases) typically used for taxonomic and functional profiling. Installing and managing dependencies has been eased by the use of package management systems such as Conda, or through the use of containers, e.g. Singularity. The ELIXIR Microbiome Community will increase their use of 
                    <ext-link ext-link-type="uri" xlink:href="https://biocontainers.pro/">BioContainers</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref84">45</xref>
                    </sup> to promote the packaging, containerisation and deployment of tools relevant to microbiome research.</p>
                <p>In order to make tools findable by the end users, the Microbiome Community will work on improving their annotation by (i) expanding the EDAM ontology
                    <sup>
                        <xref ref-type="bibr" rid="ref42">46</xref>
                    </sup> to include microbiome-specific keywords, (ii) performing periodic reviews of tools and their associated annotations in the 
                    <ext-link ext-link-type="uri" xlink:href="https://bio.tools/">bio.tools</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref85">47</xref>
                    </sup> catalogue. These annotations will subsequently be used to build a catalogue of tools for microbiome data analysis and their availability for different platforms, e.g. Galaxy,
                    <sup>
                        <xref ref-type="bibr" rid="ref86">48</xref>
                    </sup> or as workflow descriptions (e.g. Snakemake,
                    <sup>
                        <xref ref-type="bibr" rid="ref43">49</xref>
                    </sup> CWL,
                    <sup>
                        <xref ref-type="bibr" rid="ref9">9</xref>
                    </sup> Nextflow
                    <sup>
                        <xref ref-type="bibr" rid="ref44">50</xref>
                    </sup>), which can be readily combined to make new annotation workflows. Additionally, the Microbiome Community will develop and maintain cloud-deployable and FAIR analysis pipelines using state of the art tools and following best open science practices by: (i) using workflow descriptions; (ii) documenting the workflows and depositing them in 
                    <ext-link ext-link-type="uri" xlink:href="https://workflowhub.eu/">WorkflowHub</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref87">51</xref>
                    </sup> for easy discovery, re-use and assessment; (iii) making them available for the Microbiome Community via platforms such as MGnify and Galaxy.</p>
                <p>As an integral part of the Tools platform, Galaxy has integration with OpenEBench, WorkflowHub, EDAM, bio.tools and follows all Software Best Practices. A current joint effort between the Microbiome and Galaxy Communities is running an evaluation of tool requirements for microbiome data analysis in the Galaxy ecosystem. This evaluation will lead to a shared roadmap between both Communities for tool integration and standardised workflow development for microbiome data analysis.</p>
                <p>A key part of understanding the applicability of a tool/workflow is its benchmarking, and has been an aim of the Tools platform. Very few analyses in microbiome research employ a single tool, with the norm being the coupling of multiple tools and reference databases to achieve a comprehensive analysis that includes both taxonomic and functional results. Even relatively simple workflows that perform metagenomics assembly are computationally heavy. This combination of workflow complexity and typical computational overheads has always made the routine benchmarking tools for microbiome informatics research burdensome. Nevertheless, where two or more tools perform equivalent tasks, it can be relatively simple to modify existing formally described workflows to evaluate their respective performances, but that ease often depends on where they occur in the overall workflow and the metrics used to evaluate the tool. Many efforts have tried to compare the outputs of tools and workflows (e.g. 
                    <sup>
                        <xref ref-type="bibr" rid="ref85">47</xref>
                    </sup>
                    <sup>,</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref87">51</xref>
                    </sup>
                    <sup>&#x2013;</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref49">54</xref>
                    </sup>), with the Critical Assessment of Microbiome Interpretations (CAMI) having become an internationally recognised benchmarking effort.
                    <sup>
                        <xref ref-type="bibr" rid="ref50">55</xref>
                    </sup>
                    <sup>&#x2013;</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref53">58</xref>
                    </sup> The CAMI challenges have established a range of benchmarking datasets for evaluating different categories of tools. Importantly, the organisers of CAMI have engaged data generators to provide data, such that truly independent benchmarking can be undertaken. However, these benchmark datasets can become outdated over time, as the underlying data enters the reference database. The Galaxy Community has already investigated implementing benchmarking infrastructure using CAMI datasets, and increasing the awareness of this infrastructure will be a key effort across Communities and Platforms. As the Microbiome Community establishes, we will develop a broader understanding of the requirements of the wider microbiome research community needs, and feed these requirements to the Tools Platform, as well as seek opportunities to interact with the Tools Platform to capture the diversity of tools and their utility via such benchmarking activities.</p>
                <p>

                    <italic toggle="yes">2.2.2.3 Compute</italic>
                </p>
                <p>Depending on the analysis being performed, the computational requirements can be very different. For example, metagenomic assembly typically requires small numbers of cores on a large memory machine, whereas some forms of raw-read analysis require many cores (hundreds) with a small memory footprint. As such, microbiome researchers need to understand the likely computational costs, and their options for deploying them on high performance computing (HPC) and cloud environments. Efforts such as 
                    <ext-link ext-link-type="uri" xlink:href="https://blue-cloud.org/">Blue Cloud</ext-link> have helped reduce some of the barriers to using the European Open Science Cloud (EOSC) for marine research through the delivery of a collaborative virtual environment, but the range of services is limited. While such efforts help, there are still many barriers to accessing compute resources and deploying complex metagenomic pipelines in a distributed or even hybrid fashion. Working with the Compute Platform, the ELIXIR Microbiome Community will continue to investigate solutions that facilitate the execution of workflows within such distributed and/or hybrid environments, e.g. using Pulsar network, the distributed compute network offered by the Galaxy Community, and provide guidance of the likely costs of using compute infrastructures.</p>
                <p>

                    <italic toggle="yes">2.2.2.4 Interoperability</italic>
                </p>
                <p>Previous work by the Marine Metagenomics Community has leveraged many of the ELIXIR Interoperability Platform solutions, especially the use of workflow languages for the formal description of pipelines, improving the provenance of the data outputs. As such, both the MetaPIPE and MGnify pipelines have been described using the Common Workflow Language (CWL). This effort was paralleled by MG-RAST,
                    <sup>
                        <xref ref-type="bibr" rid="ref54">59</xref>
                    </sup> which also allowed MGnify and MG-RAST to exchange pipelines and establish that the biological signatures reported by the respective pipelines were very similar, yet confounded by different reference databases and methodologies for assigning function.
                    <sup>
                        <xref ref-type="bibr" rid="ref55">60</xref>
                    </sup> Since then, MGnify has published their workflows in WorkflowHub, further promoting their discovery and reuse. As an example of reuse, the MGnify pipeline has been used as the basis for the newly developed metaGOflow pipeline,
                    <sup>
                        <xref ref-type="bibr" rid="ref57">61</xref>
                    </sup> to be used by the Marine Genomic Observatories. Moreover, this work also employed Research Object Crate (
                    <ext-link ext-link-type="uri" xlink:href="https://www.researchobject.org/ro-crate/">RO-crate
</ext-link>)
                    <sup>
                        <xref ref-type="bibr" rid="ref88">62</xref>
                    </sup> to package relevant metadata about the sample and the bioinformatics analysis applied and the data products. RO-crate offers new opportunities for sharing or federating the metagenomics analysis workload. In parallel, Galaxy, which supports the Tool Registry Service (TRS) protocol to exchange and run workflows between the WorkflowHub and Galaxy, gained support for RO-Crate (version 23.0) to export complete data analysis as a structured and FAIR digital object, supporting the Global Alliance for Genomics and Health (GA4GH) standards, and is in the process of applying to be an ELIXIR Recommended Interoperability Resource.</p>
                <p>The Microbiome Community will continue to work with the Interoperability Platform to make wider use of RO-crate, with a view to federate data analysis between resources. For example, future work by the new Microbiome Community will enable the MGnify workflows to be made deployable on Galaxy, with the RO-crate to be transferred, verified and ingested into MGnify. Additional work needs to be undertaken to understand how universal this approach is, so that MGnify could become a hub for a range of additional analyses, thereby reducing the duplication of effort that currently exists in the community.</p>
                <p>Finally, we will work on the development of novel mechanisms to integrate and link data coming from multi-omic approaches using different tools and data resources. This will require the development of new data Interoperability layers for data resources that are not normally used in metagenomics-centred Microbiome data, such as the PRIDE database in the case of metaproteomics data.</p>
                <p>

                    <italic toggle="yes">2.2.2.5 Training</italic>
                </p>
                <p>One of the key areas commonly highlighted by national and international reports on the potential of microbiome research is the need for training, especially in the area of bioinformatics. As already highlighted, microbiome analysis is an emerging and evolving research field by itself, with plenty of challenges still to be addressed. Combined with this complexity, the increasing number of researchers using such methods makes the need for continuous training and re-training a challenge on its own. Researchers need to become familiar with modern computing technologies, such as HPC and cloud computing, and follow the constant updates on experimental approaches, algorithms (new and updates) and pipeline developments. As new pipelines are established and existing pipelines improved through the incorporation of new tools and/or reference databases, this adds further complexity to the tool and data output landscape associated with microbiome research, and sets the need for well defined and consistent training modules dedicated to computational approaches to microbiome analysis.</p>
                <p>Platforms such as MGnify support large-scale services for most steps of a microbiome study, meaning the distribution of raw-data, production of assemblies, their analysis, and their potential use for meta-analysis, have proved of great benefit. Nevertheless, these analyses should be considered just the starting point for further downstream analysis, which requires the specific domain expertise of the researchers involved in undertaking the study. One approach can be the use of cloud-based initiatives such as Galaxy supporting graphical interfaces and allowing the users to choose more specific tools, while tuning their parameters and reference databases according to their environment being studied. Such infrastructures attempt to fill the gap between researchers without experience in computer science and their needs for FAIR and quality microbiome analysis. Despite both solutions being readily available, there remains knowledge gaps and/or reticence about using such resources, often due to a lack of training.</p>
                <p>To upskill microbiome scientists and keep them up-to-date in microbiome data analysis and standards, the ELIXIR Microbiome Community will work in coordination with the ELIXIR Training Platform to offer scalable and FAIR training. The Microbiome Community will continue to: (i) annotate training materials with appropriate metadata to create a comprehensive training portfolio; (ii) FAIRify the training content, making it open-access; (iii) register training material, national and international providers and events in ELIXIR&#x2019;s Training Portal TeSS
                    <sup>
                        <xref ref-type="bibr" rid="ref89">29</xref>
                    </sup>; (iv) assist the Training platform in the development of annual training gap surveys; and (v) develop materials and design learning paths specific to different community needs (e.g. biomes or data types).</p>
                <p>To enable access to training resources and deliver this training, face-to-face and online workshops will be organised and videos will be recorded for &#x201c;on demand&#x201d; learning. The technical infrastructure for training, in particular the computational environment setup and software installation challenge will be addressed in coordination with the ELIXIR Tools and Compute Platforms, with the aim of promoting the use of Conda environments, containers, digital notebooks or platforms like Galaxy which mitigate many of the current obstacles. In order to make these aspirations possible, the Community will increase its training capacity by working with training communities on practices, organising Train the Trainers events and building a community of microbiome research trainers, with areas of expertise covering different environments, &#x2018;omics approaches and data analysis strategies. Ensuring these trainers maintain their knowledge with the evolving informatics landscape is, arguably, a key challenge that is yet to be addressed and something this Community will strive to solve in collaboration with the ELIXIR Training Platform.</p>
            </sec>
        </sec>
        <sec id="sec12">
            <title>3. Context with other international initiatives</title>
            <p>We have highlighted the need for promoting best practices and standards throughout this article. However, it is also important that the Microbiome Community continues to build upon engagement with organisations such as the Genomics Standards Consortium (GSC
                <sup>
                    <xref ref-type="bibr" rid="ref58">63</xref>
                </sup>). The GSC has become critical for establishing many of the standards that underpin genomic research, and more recently metabolomic. Examples of GSC established standards that are particularly pertinent to the microbiome domain include: minimal information about any sequence (MIxS
                <sup>
                    <xref ref-type="bibr" rid="ref59">64</xref>
                </sup>), the Biological Observation Matrix (BIOM) format
                <sup>
                    <xref ref-type="bibr" rid="ref60">65</xref>
                </sup>; and the Minimum Information about a Metagenome-Assembled Genome (MIMAG
                <sup>
                    <xref ref-type="bibr" rid="ref89">29</xref>
                </sup>).</p>
            <p>There are also other ELIXIR Node-specific initiatives that the Microbiome Community connects with to ensure that the respective efforts are synergised. Examples of projects with ELIXIR Node involvement directly related to the ELIXIR Microbiome Community are presented in 
                <xref ref-type="table" rid="T3">
Table 3</xref>, which cover a diverse range of topics. The engagement needs to be bi-directional to ensure that the needs of Nodes are well understood and that solutions developed at national levels can be spread across the ELIXIR Microbiome Community, and vice versa. In this context, the ELIXIR Microbiome Community leads will undertake coordinating roles, engaging with the project representatives, inviting them to relevant ELIXIR events and promoting active participation in relevant ELIXIR Communities.</p>
            <p>MicrobiomeSupport, formerly a European Commission funded coordination and support action (CSA) program aimed at improving microbiome research and innovation, highlighted in their final report
                <sup>
                    <xref ref-type="bibr" rid="ref62">66</xref>
                </sup> that there was &#x201c;limited connectedness&#x201d; in microbiome research conducted on different environments/systems, and that during the course of this program the lack of connectedness did not improve. This independent finding reinforces the need for broadening the ELIXIR Marine Metagenomics Community to a more generalist Microbiome Community. Since the end of the CSA project, this MicrobiomeSupport has transitioned to the MicrobiomeSupport Association to continue the engagement activities within the microbiome research community and will provide a key dissemination route for the ELIXIR Microbiome Community outcomes.</p>
            <p>It will also be important to showcase the ELIXIR Microbiome Community to European countries that are yet to join ELIXIR. For example, Romania has a thriving microbiome research community, but is faced with the same set of informatics challenges. Sharing knowledge beyond ELIXIR, will not be the primary goal, but will nevertheless be important to harmonise the activities internationally and promote the benefits of participation in ELIXIR. Beyond Europe, there are parallel organisations that strive to achieve similar goals to ELIXIR in other locations. For example, Australia BioCommons aims to promote bioinformatics and bioscience data infrastructures at a national level. Given the strength of microbiome research in Australia (see below), we will explore opportunities for international collaboration.</p>
            <p>In addition, it will be important to showcase the ELIXIR Microbiome Community to communities (within and outside Europe) that are not yet familiarised with ELIXIR activities. For example, the 
                <ext-link ext-link-type="uri" xlink:href="https://www.metaproteomics.org">Metaproteomics Initiative</ext-link> is an international community that promotes dissemination of metaproteomics fundamentals, advancements, and applications through collaborative networking in microbiome research.
                <sup>
                    <xref ref-type="bibr" rid="ref63">67</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref64">68</xref>
                </sup> For example, recently, they benchmarked metaproteomics workflows and bioinformatics methods in the field in the first multi-lab benchmark study in metaproteomics (called CAMPI), showcasing the robustness of metaproteomics data analysis workflows.
                <sup>
                    <xref ref-type="bibr" rid="ref63">67</xref>
                </sup>
            </p>
            <p>Finally, the National Microbiome Data Collaborative (NMDC),
                <sup>
                    <xref ref-type="bibr" rid="ref65">69</xref>
                </sup> a US led initiative, is developing a 
                <ext-link ext-link-type="uri" xlink:href="https://data.microbiomedata.org">unified data portal</ext-link> to support microbiome multi-omics data integration and analysis through an integrated, distributed framework. Many of the governing principles associated with this portal are common with those described here, especially with the desire to have containerised, reusable computational workflows, as well as trying to make the data compliant with the FAIR principles. Sharing experiences and best practices between NMDC and the ELIXIR Microbiome Community (and others) will improve the global standardisation of microbiome research.</p>
        </sec>
        <sec id="sec13">
            <title>4. Interaction with other key data resources beyond ELIXIR</title>
            <p>Microbiome research is global, so it is also key that European microbiome research infrastructures are coordinated with other international resources. Below we highlight a small selection of widely used resources that are produced outside Europe, and place them in context of the ELIXIR Microbiome Community. Some of the most utilised tools and resources used by the current Microbiome Community are CheckM,
                <sup>
                    <xref ref-type="bibr" rid="ref66">70</xref>
                </sup> the Genome Taxonomy Database (GTDB) and the associated GTDB toolkit.
                <sup>
                    <xref ref-type="bibr" rid="ref67">71</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref68">72</xref>
                </sup> CheckM is widely used to assess the completeness and contamination of prokaryotic MAGs, and is part of the GSC reporting standard. The GTDB resources is a genome based taxonomy of prokaryotes, and the associated GTDB-tk facilitates the classification of other prokaryotic genomes against this framework, more often than not, to determine novelty. These are currently made available via the Australian research groups, who face similar challenges in maintaining resources. Other key resources are based in the US, with MG-RAST
                <sup>
                    <xref ref-type="bibr" rid="ref54">59</xref>
                </sup> and a range of different resources produced by the Joint Genome Institute (JGI). MG-RAST facilitates the analysis of raw-reads and assemblies (metabarcoding, metagenomics and metatranscriptomics), but does not perform assembly nor offer any form or long-term archiving assurance. The JGI IMG/M resource
                <sup>
                    <xref ref-type="bibr" rid="ref69">73</xref>
                </sup> has many parallels with MGnify, offering a wide range of data analyses focused on assembly and MAG generation, but IMG/M does not deal with metabarcoding. Notably, JGI also produces IMG/VR,
                <sup>
                    <xref ref-type="bibr" rid="ref70">74</xref>
                </sup> a globally unique collection of viruses, many of which have been determined from metagenomic and metatranscriptomics. Any future effort in Europe focused on viruses must aim to minimise the duplication of effort and content with IMG/VR. Recently it has also produced IMG/PR: a database of plasmids from genomes and metagenomes with rich annotations and metadata.
                <sup>
                    <xref ref-type="bibr" rid="ref71">75</xref>
                </sup> Coordinating with these global initiatives is key to ensure the future availability of the tools and resources, ensuring interoperability between the resources, maintaining uniform standards and sharing of the informatics/computational burden.</p>
        </sec>
        <sec id="sec14">
            <title>5. Specific challenges and objectives of the ELIXIR Microbiome Community</title>
            <p>A key early challenge in developing the ELIXIR Microbiome Community is to establish a detailed understanding of the current approaches and databases used for the analysis of different microbiomes. For example, it is widely accepted that current short-read assembly-based methods do not generally work as well for soil microbiomes due to the diversity of the microbial community typically present (the sequence depths are insufficient to build useful contigs or the datasets are so large, that they are computationally intractable). This current limitation has led and will continue to lead to the development of new experimental methods, from sampling to nucleic acid sequencing and informatics analysis. In this section, some of the key challenges associated with microbiome research are highlighted below, together with how these challenges will be addressed by the new ELIXIR Microbiome Community. 
                <xref ref-type="table" rid="T4">
Table 4</xref> lists the key thematic areas and objectives that the Microbiome Community will address, split into short-term and longer-term objectives to provide a high-level overview of the proposed activities.</p>
            <table-wrap id="T4" orientation="portrait" position="float">
                <label>
Table 4. </label>
                <caption>
                    <title>Objectives of the ELIXIR Microbiome Community.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Area</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Objective</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="2" rowspan="1" valign="middle">
                                <bold>Near-term
</bold> (2 years timeframe)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="4" valign="middle">Community Expansion</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Survey the needs, key datasets, data analysis approaches, &#x2018;omics data types and biome specific specialisation</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Identify key experts involved in viral, prokaryotic and eukaryotic analysis</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Establish and share a strategic technical roadmap with the Communities and Platforms, highlighting key contacts</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Identify relevant funding calls, with the aid of building microbiome research informatics capacities and connecting to key experts in other &#x2018;omics (e.g. metaproteomics)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="3" valign="middle">Training</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Increase awareness of microbiome tools, resources, and their applicability to different microbiomes</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Address knowledge gaps in generating and adopting data analysis workflows</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Teach advanced containerisation and cloud deployment</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="4" valign="middle">Co-ordinate
</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Increase rates of data archival deposition, with rich contextual metadata. Establish a mapping between biome and checklists</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Promote data analysis through the use of ELIXIR services</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Share of ideas on the design and implementation of workflows for microbiome research, promoting the use of best practices</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Organise in-person and virtual meetings for the Microbiome Community</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Industry connection</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Use ELIXIR and Node forums to understand pharmaceutical and biotechnological demands and current limitations impacting this sector.</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="2" rowspan="1" valign="middle">
                                <bold>Longer-term
</bold> (~3-5 years)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="4" valign="middle">Training</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Design targeted training for different microbiome communities</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Addressing the issue of maintaining &#x201c;Train the Trainer&#x201d;</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Organise hackathon to improve integration of ELIXIR services providing microbiome data</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Establish a rich set of training materials, appropriately tagged to aid find ability</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="3" valign="middle">Federated data analysis</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Enable the execution of MGnify pipelines in Galaxy and/or other data management workflows, and submission of results to MGnify</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Establish routine mechanisms for federating microbiome analysis (e.g. RO-Crates, resources)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Demonstrate approaches to multi&#x2019;omics integration, through collaborative, cross-Community initiatives</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="5" valign="middle">Promoting new approaches</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Establish new standards for microbiome research, particularly with respect to data analysis reporting and contextual metadata reporting in conjunction with GSC</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Leverage new data-types and experimental approaches to improve the scope and/or quality of microbiome analysis</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Enhance existing or establish new reference databases in response to the Microbiome Community demand and capacity</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Establish new methods for across study comparisons, mitigating against confounding factors to enhance discovery</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Provide a mechanism for estimating the cost/benefit of performing different types of analysis in the context of different microbiomes</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="3" valign="middle">International harmonisation</td>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Represent the Microbiome Community at international conferences, promoting the Community/ELIXIR outputs and solutions</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Foster international collaborations between other resources providers and databases to ensure global harmonisation of e-infrastructures for microbiome research</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="middle">Leverage the CAMI initiative to facilitate benchmarking of tools and workflows
</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>The Microbiome Community will also provide a mechanism for sharing knowledge about new approaches for microbiome research, be it experimental or informatics-based techniques. For example, there is an increasing number of metagenomics datasets that are produced using long-read sequence technologies. While long-read sequencing technologies can require larger quantities of DNA or may be more error prone compared to third-generation short-read sequencing technologies &#x2013; which can limit their use &#x2013; the long-reads can mitigate the computational burden of metagenomic assembly and increase the confidence in analysis results (e.g. MAGs produced by long-reads can have high contiguity and therefore less prone to contamination). The long-reads can be paired with short-read sequences, which can then be used in different ways (e.g. sequence error correction). Increasing the awareness of these long-read and hybrid-sequencing approaches, the workflows that support their analysis and when and where they could be applied will be a key output of the Microbiome Community. Similarly, there are other experimental approaches such as single amplified genomes (SAGs), which have increased in popularity. The Microbiome Community will also be important for assessing the utility of emerging sequencing approaches, such as adaptive sequencing approaches. In this case, the methods can access low abundance microbes, although such methods will not facilitate the generation of abundance profiles. Bringing these data types alongside the ubiquitous short-read datasets will require new standards and data integration approaches to be developed by the Microbiome Community.</p>
            <p>There has been a paradigm-shift in metagenomic analysis with a common goal now being the generation of environmental genomes (MAGs), which has not only allowed the identification of thousands of specific functions, but facilitated them to be assigned to specific organisms. As such, this has started the development of specific MAG deposition layers,
                <sup>
                    <xref ref-type="bibr" rid="ref72">76</xref>
                </sup> and the development of MAG specific resources. The new Microbiome Community will promote the use of MAG deposition, and provide guidelines and software to aid their deposition. Workflows that encompass both MAG generation and quality verification will be developed that include the capture of both prokaryotic and eukaryotic MAGs. The Microbiome Community will help establish best practices for eukaryotic MAG discovery, as well as develop new standards for removing redundancy and methods for assigning taxonomy, which are recognised gaps in the area of eukaryotic MAG discovery. While prokaryotic MAG recovery methods are more mature and standardised, it is anticipated that there will be continuous improvements in both experimental and computational methods for generating longer contigs, and more datasets that enable different approaches to enhance the detection of contamination and/or misassembly. The ELIXIR Microbiome Community will also evaluate methods and establish best practices for the identification of sub-species/strains in metagenomic datasets. To do so, we will engage with efforts such as the CAMI
                <sup>
                    <xref ref-type="bibr" rid="ref50">55</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref73">77</xref>
                </sup> to identify tools that can scalably and accurately classify MAGs at a finer grain taxonomic level than species.</p>
            <p>Finally, the classification and naming of MAGs is going to be paramount, so that the novel biodiversity can be understood and more easily referenced by the scientific community. Currently, the Microbiome Community has widely adopted the GTDB
                <sup>
                    <xref ref-type="bibr" rid="ref67">71</xref>
                </sup> and the associated GTDB-tk
                <sup>
                    <xref ref-type="bibr" rid="ref68">72</xref>
                </sup> for classifying MAGs against a reference tree. However, the taxonomy of GTDB differs from the more widely-used NCBI taxonomy, and there is a need to increase the interoperability between these two taxonomies. The ELIXIR Microbiome Community will work on addressing the current issues associated with MAGs and taxonomy. Additionally, another key area of development of taxonomy will be increasing the linkage between genomic resources and marker genes, such as the ribosomal small subunit (SSU) RNA.</p>
            <p>In addition to cellular microbes, another area for the ELIXIR Microbiome Community to address is the development of the infrastructure and resources for identifying and cataloguing viruses in metagenomic and metatranscriptomic data.
                <sup>
                    <xref ref-type="bibr" rid="ref74">78</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref77">81</xref>
                </sup> Viral genomes are incredibly diverse in terms of composition and organisation. However, there are three challenges associated with viral microbiomes: (i) there is no universal marker gene covering all viruses; (ii) viral taxonomic frameworks are incomplete; (iii) there is no centralised database collecting the millions of viral sequences; and (iv) metagenomics informatics often only produces fragments of viruses, which causes ambiguities concerning their classification and functions. It will be critical for the ELIXIR Microbiome Community to engage with established viral infrastructures and organisations, such as the European Virus Bioinformatics Center, to establish methods, standards and resources for improving the analysis of viruses found in microbiome sequence data, and how best to overcome the current fragmented organisation of viral datasets.</p>
            <p>The increase in metagenomic assemblies has resulted in a parallel increase in the number of predicted protein sequences that have been identified, with sets of non-redundant proteins now in the billions. There is huge potential for discovery in these protein datasets, as well as de novo designs fit for purpose, e.g. carbonic anhydrases
                <sup>
                    <xref ref-type="bibr" rid="ref78">82</xref>
                </sup> and a key aim for the new ELIXIR Microbiome Community will be ensuring that these data are annotated, both as individual sequences or as higher order grouping (e.g. pathways, biosynthetic gene clusters). This will involve the evaluation of emerging tools, as well as harnessing structural models to allow the detection of relationships that are undetectable by current sequence based methods. The Microbiome Community will need to work together to shed light on the functions of the so-called &#x2018;Dark Matter&#x2019;, develop standards for functional labelling that encapsulate both the mechanisms and confidence of the annotation, and develop new infrastructural frameworks for accessing slices of the data based or adequate representatives based on the requirements.</p>
            <p>As identified by the ELIXIR Marine Metagenomics Community, experimental and contextual metadata is critical to comparative metagenomics. The absence of rich contextual and experimental metadata limits data reuse and the production of downstream data products, such as assemblies and MAGs. With the Microbiome Community, we will identify areas where metadata standards need to be improved, with biome specific contextual metadata being the most likely source of specific metadata checklist. The Microbiome Community will develop training promoting the need for metadata, checking compliance against standards, how the metadata can be captured and submitted to accompany the sequence data, and potentially other &#x2018;omics data types. Within the Microbiome Community, we will promote and develop standards regarding the analysis provenance, and how the collective corpus of metadata can be used to improve meta-analysis and the identification of confounding factors when comparing different research projects.</p>
            <p>Another key challenge that the Microbiome Community needs to address is ensuring that compute resources are accessible for performing data analysis that can be associated with microbiome derived sequence data. Previously, we have highlighted the need for interaction with the ELIXIR Compute, Interoperability and Training platforms, as well as ELIXIR Communities such as the Galaxy Community. This requires that analysis pipelines are readily discoverable and deployable, and that key issues regarding both compute processing and storage requirements are well understood. Additionally, given that microbiome associated data analysis has such computational overheads, it is vital that models for data archiving and/or sharing are developed by the Microbiome Community to increase the capacity of microbiome research within Europe. This may require the extensions to existing databases or development of new ones, but it requires an agreement from the research community to adopt them. Achieving this will involve both communication and training of the microbiome research community.</p>
            <p>While there are data resources such as MGnify that provide access to consistent analyses pertaining to different metabarcoding, metatranscriptomics and metagenomics datasets from a variety of biomes, it is fundamental to remember that these data outputs do not represent the end of the analysis pathway. Typically studies require comparison between different cohort groups (disease vs health, treatment vs non-treatment). Furthermore, as the biological signal from meta&#x2019;omics datasets can be extremely noisy, there can often be the need to combine datasets to boost statistical significance of the biological signal. Similarly, the combination of studies can also be used to: (i) contextualise against previous studies (e.g. similar studies on the same diseases); (ii) understand the distribution of microbes or functional features (e.g. antimicrobial resistance genes) between different geographical locations; and/or (iii) study the relationship between biomes (e.g. studies adopting a One Health approach). To enable such large, complex studies there needs to be a greater understanding of the approaches suitable for cross study comparisons, and their limitations. Thus, a major objective for the Microbiome Community will be to include those researchers that are developing methods that can identify and mitigate experimental and informatic confounding factors, which currently limit data reuse. Existing approaches often rely on correlating contextual and experimental metadata with statistically significant factors identified in the datasets. There is also the need to develop and promote methods for performing robust statistical analysis of microbiome derived data, thereby enabling biological signals to be extracted from cross-sample/project datasets. Currently, there is a tendency to analyse the different &#x2018;omics datasets independently, and then correlate the derived signals. However, statistical methods are being developed to facilitate the analysis of integrated multi-omics datasets, and it will be important that the Microbiome Community determines the applicability of these approaches for microbiome research.</p>
            <p>In the context of other &#x2018;omics approaches, there are also some major challenges in metaproteomics.
                <sup>
                    <xref ref-type="bibr" rid="ref79">83</xref>
                </sup> One of the major challenges is the construction of tailored protein sequence databases which are needed to identify proteins in complex microbial communities. Metaproteomics aims to elucidate the functional and taxonomic interplay of proteins in microbiomes, but the diversity and vast number of unknown and uncharacterized proteins present in these communities makes database creation and accurate protein identification difficult. As microbial communities are highly dynamic and their protein expression can vary significantly, conventional protein sequence databases might not cover the entire diversity, leading to potential limitations in accurate protein identification (e.g. the use of de novo sequencing). Other challenges in metaproteomics includes how to report proteins which cannot be explained by a corresponding metagenomic study, so called unidentified proteins of unknown function, methods for protein inference and quantification that are comparable across studies, the dynamic range of protein sizes analysed, as well as how researchers handle non-specific peptide-spectrum matches, compared to specific matches. Addressing these challenges is crucial for improving the reliability and confidence of metaproteomic analysis and obtaining comprehensive insights into the functional roles of proteins in complex microbiomes.</p>
            <p>As metagenomic methods have become a more routine method for studying microbial communities, metagenomics has been and will continue to be paired with more and more diverse sets of measurements of the microbiome. Examples of non-omics data collected alongside metagenomics data include geochemical (e.g. PANGAEA
                <sup>
                    <xref ref-type="bibr" rid="ref80">84</xref>
                </sup>) measurements, meteorological, image data and even acoustics. While methods are already emerging for the integration of &#x2018;omics datatypes (e.g. MOFA,
                <sup>
                    <xref ref-type="bibr" rid="ref81">85</xref>
                </sup> 
                <ext-link ext-link-type="uri" xlink:href="https://microbiome.github.io/mia/">MIA</ext-link> (
                <ext-link ext-link-type="uri" xlink:href="https://github.com/microbiome/mia">https://github.com/microbiome/mia</ext-link>)), integration of these additional non-omics data types will enable a broader understanding of microbiomes in context. For the new Microbiome Community, it will be essential to identify the appropriate archives for these data types, and establish the methods to facilitate navigating between datasets from the same samples. Only through achieving this, can new data visualisation schemas that enable the combination of environmental, geospatial and temporal data, in addition to biological data (taxonomy/function), be developed.</p>
        </sec>
        <sec id="sec15" sec-type="conclusions">
            <title>6. Conclusions</title>
            <p>The overarching aim of the Microbiome Community is to develop a sustainable bioinformatics infrastructure for microbiome resources (data, tools, workflows, standards, training) which will enable a deep understanding of the function and taxonomy of the entire microbial fraction. The aim is to be biome-agnostic and, balanced in supporting the analysis and interpretation of data from different environments. We aim to highlight the very best approaches for the analysis and integration of different data types (e.g. sequences, metabolites, proteomics, and images) and their visualisation. By broadening the Marine Metagenomics Community we will engage many more researchers and aspire to have a greater representation of scientists from different disciplines, such as ecologists and clinicians, complementing the strong molecular biology and genomics backgrounds already represented in the Microbiome Community. The Microbiome Community will have key roles in engaging with policy makers (e.g. access and benefit sharing, climate change impact assessment), as well as the industrial sector, which is increasing the translation of basic research to microbiome-based products (e.g. 
                <ext-link ext-link-type="uri" xlink:href="https://ktn-uk.org/wp-content/uploads/2021/02/Microbiome_Strategic_Roadmap_FINAL.pdf">UK Microbiome Strategic Roadmap for Innovation</ext-link>). Such a strong microbiome infrastructure as envisaged by the Microbiome Community is essential to maximise the impact that European research programs have in the field of microbiome research, and to facilitate the exploitation of microbiome-based solutions in a range of settings, from clinical to industrial processes, thereby addressing key societal challenges and needs.</p>
        </sec>
    </body>
    <back>
        <sec id="sec18" sec-type="data-availability">
            <title>Data availability</title>
            <p>No data is associated with this article
                <italic toggle="yes">.</italic>
            </p>
        </sec>
        <ack>
            <title>Acknowledgments</title>
            <p>We would like to thank the reviewers of the manuscript for their insightful comments, particularly Reviewer 1 for providing a very thorough review, with many constructive comments that have been incorporated into the final manuscript.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Marchesi</surname>
                            <given-names>JR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ravel</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>The vocabulary of microbiome research: a proposal.</article-title>
                    <source>

                        <italic toggle="yes">Microbiome.</italic>
</source>
                    <year>2015 Jul 30</year>;<volume>3</volume>:<fpage>31</fpage>.
                    <pub-id pub-id-type="pmid">26229597</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s40168-015-0094-5</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4520061</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wilkinson</surname>
                            <given-names>MD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dumontier</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Aalbersberg</surname>
                            <given-names>IJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The FAIR Guiding Principles for scientific data management and stewardship.</article-title>
                    <source>

                        <italic toggle="yes">Scientific Data.</italic>
</source>
                    <year>2016 Mar 15</year>;<volume>3</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>9</lpage>.</mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Harrow</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Drysdale</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Smith</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>ELIXIR: providing a sustainable infrastructure for life science data at European scale.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2021 Jun 27</year>;<volume>37</volume>(<issue>16</issue>):<fpage>2506</fpage>&#x2013;<lpage>2511</lpage>.
                    <pub-id pub-id-type="pmid">34175941</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btab481</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8388016</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Robertsen</surname>
                            <given-names>EM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Denise</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mitchell</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>ELIXIR pilot action: Marine metagenomics &#x2013; towards a domain specific set of sustainable services.</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2017 Jan 23</year>;<volume>6</volume>(<issue>70</issue>):<fpage>70</fpage>.
                    <pub-id pub-id-type="pmid">28620454</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.10443.1</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5461914</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Richardson</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Allen</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Baldi</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MGnify: the microbiome sequence data analysis resource in 2023.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2022 Dec 7</year>;<volume>51</volume>(<issue>D1</issue>):<fpage>D753</fpage>&#x2013;<lpage>D759</lpage>.
                    <pub-id pub-id-type="doi">10.1093/nar/gkac1080</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Agafonov</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mattila</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tuan</surname>
                            <given-names>CD</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>META-pipe cloud setup and execution.</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2017 Nov 29</year>;<volume>6</volume>:<fpage>2060</fpage>.
                    <pub-id pub-id-type="pmid">31069047</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.13204.3</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6480938</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Matias Rodrigues</surname>
                            <given-names>JF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schmidt</surname>
                            <given-names>TSB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tackmann</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2017 Aug 14</year>;<volume>33</volume>(<issue>23</issue>):<fpage>3808</fpage>&#x2013;<lpage>3810</lpage>.
                    <pub-id pub-id-type="pmid">28961926</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btx517</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5860325</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Santamaria</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fosso</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Licciulli</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>ITSoneDB: a comprehensive collection of eukaryotic ribosomal RNA Internal Transcribed Spacer 1 (ITS1) sequences.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2017 Sep 25</year>;<volume>46</volume>(<issue>D1</issue>):<fpage>D127</fpage>&#x2013;<lpage>D132</lpage>.
                    <pub-id pub-id-type="doi">10.1093/nar/gkx855</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Crusoe</surname>
                            <given-names>MR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Abeln</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Iosup</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Methods included: standardizing computational reuse and portability with the Common Workflow Language.</article-title>
                    <source>

                        <italic toggle="yes">Commun. ACM.</italic>
</source>
                    <year>2022 May 20</year>;<volume>65</volume>(<issue>6</issue>):<fpage>54</fpage>&#x2013;<lpage>63</lpage>.
                    <pub-id pub-id-type="doi">10.1145/3486897</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Klemetsen</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Raknes</surname>
                            <given-names>IA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fu</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The MAR databases: development and implementation of databases specific for marine metagenomics.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2018 Jan 4</year>;<volume>46</volume>(<issue>D1</issue>):<fpage>D692</fpage>&#x2013;<lpage>D699</lpage>.
                    <pub-id pub-id-type="pmid">29106641</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkx1036</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5753341</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Niang</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hoebeke</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Meng</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>METdb: A Genomic Reference Database For Marine Species.</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
                    </source>
                    <year>2020 Jun 6</year>;<volume>9</volume>(ELIXIR):<fpage>564</fpage>(poster).
                    <pub-id pub-id-type="doi">10.7490/f1000research.1118000.1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Vernette</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lecubin</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>S&#x00e1;nchez</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The Ocean Gene Atlas v2.0: online exploration of the biogeography and phylogeny of plankton genes.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2022 Jul 5</year>;<volume>50</volume>(<issue>W1</issue>):<fpage>W516</fpage>&#x2013;<lpage>W526</lpage>.
                    <pub-id pub-id-type="pmid">35687095</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkac420</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9252727</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ten Hoopen</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Finn</surname>
                            <given-names>RD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bongo</surname>
                            <given-names>LA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The metagenomic data life-cycle: standards and best practices.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2017 Aug 1</year>;<volume>6</volume>(<issue>8</issue>):<fpage>1</fpage>&#x2013;<lpage>11</lpage>.
                    <pub-id pub-id-type="pmid">28637310</pub-id>
                    <pub-id pub-id-type="doi">10.1093/gigascience/gix047</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>J&#x00e9;gousse</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vannier</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Groben</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A total of 219 metagenome-assembled genomes of microorganisms from Icelandic marine waters.</article-title>
                    <source>

                        <italic toggle="yes">PeerJ.</italic>
</source>
                    <year>2021 Apr 2</year>;<volume>9</volume>:<fpage>e11112</fpage>.
                    <pub-id pub-id-type="pmid">33859876</pub-id>
                    <pub-id pub-id-type="doi">10.7717/peerj.11112</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8020865</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>D&#x00e1;vila-Ramos</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Castel&#x00e1;n-S&#x00e1;nchez</surname>
                            <given-names>HG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mart&#x00ed;nez-&#x00c1;vila</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A Review on Viral Metagenomics in Extreme Environments.</article-title>
                    <source>

                        <italic toggle="yes">Front. Microbiol.</italic>
</source>
                    <year>2019 Oct 18</year>;<volume>10</volume>:<fpage>472040</fpage>.
                    <pub-id pub-id-type="doi">10.3389/fmicb.2019.02403</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wong</surname>
                            <given-names>HL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>MacLeod</surname>
                            <given-names>FI</given-names>
                        </name>

                        <name name-style="western">
                            <surname>White</surname>
                            <given-names>RA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Microbial dark matter filling the niche in hypersaline microbial mats.</article-title>
                    <source>

                        <italic toggle="yes">Microbiome.</italic>
</source>
                    <year>2020 Sep 16</year>;<volume>8</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>14</lpage>.
                    <pub-id pub-id-type="doi">10.1186/s40168-020-00910-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Obiol</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Giner</surname>
                            <given-names>CR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>S&#x00e1;nchez</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A metagenomic assessment of microbial eukaryotic diversity in the global ocean.</article-title>
                    <source>

                        <italic toggle="yes">Mol. Ecol. Resour.</italic>
</source>
                    <year>2020 May 1</year>;<volume>20</volume>(<issue>3</issue>):<fpage>718</fpage>&#x2013;<lpage>731</lpage>.
                    <pub-id pub-id-type="pmid">32065492</pub-id>
                    <pub-id pub-id-type="doi">10.1111/1755-0998.13147</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <label>18</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Delmont</surname>
                            <given-names>TO</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gaia</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hinsinger</surname>
                            <given-names>DD</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Functional repertoire convergence of distantly related eukaryotic plankton lineages revealed by genome-resolved metagenomics.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2020.10.15.341214</year>.
                    <pub-id pub-id-type="doi">10.1101/2020.10.15.341214v2.abstract</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <label>19</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tagirdzhanova</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Saary</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cameron</surname>
                            <given-names>ES</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Evidence for a core set of microbial lichen symbionts from a global survey of metagenomes.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2023.02.02.524463</year>.
                    <pub-id pub-id-type="doi">10.1101/2023.02.02.524463v1.abstract</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Alberdi</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Andersen</surname>
                            <given-names>SB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Limborg</surname>
                            <given-names>MT</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Disentangling host-microbiota complexity through hologenomics.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Rev. Genet.</italic>
</source>
                    <year>2022 May</year>;<volume>23</volume>(<issue>5</issue>):<fpage>281</fpage>&#x2013;<lpage>297</lpage>.
                    <pub-id pub-id-type="pmid">34675394</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41576-021-00421-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nielsen</surname>
                            <given-names>HB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Almeida</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Juncker</surname>
                            <given-names>AS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Biotechnol.</italic>
</source>
                    <year>2014 Aug</year>;<volume>32</volume>(<issue>8</issue>):<fpage>822</fpage>&#x2013;<lpage>828</lpage>.
                    <pub-id pub-id-type="pmid">24997787</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.2939</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lu</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rincon</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wood</surname>
                            <given-names>DE</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Metagenome analysis using the Kraken software suite.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Protoc.</italic>
</source>
                    <year>2022 Dec</year>;<volume>17</volume>(<issue>12</issue>):<fpage>2815</fpage>&#x2013;<lpage>2839</lpage>.
                    <pub-id pub-id-type="pmid">36171387</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41596-022-00738-y</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9725748</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Beghini</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>McIver</surname>
                            <given-names>LJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Blanco-M&#x00ed;guez</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3.</article-title>
                    <source>

                        <italic toggle="yes">elife.</italic>
</source>
                    <year>2021 May 4</year>;<volume>10</volume>.
                    <pub-id pub-id-type="pmid">33944776</pub-id>
                    <pub-id pub-id-type="doi">10.7554/eLife.65088</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8096432</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ruscheweyh</surname>
                            <given-names>HJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Milanese</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Paoli</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Cultivation-independent genomes greatly expand taxonomic-profiling capabilities of mOTUs across various environments.</article-title>
                    <source>

                        <italic toggle="yes">Microbiome.</italic>
</source>
                    <year>2022 Dec 5</year>;<volume>10</volume>(<issue>1</issue>):<fpage>212</fpage>.
                    <pub-id pub-id-type="pmid">36464731</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s40168-022-01410-z</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9721005</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Martin</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Heavens</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lan</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Nanopore adaptive sampling: a tool for enrichment of low abundance species in metagenomic samples.</article-title>
                    <source>

                        <italic toggle="yes">Genome Biol.</italic>
</source>
                    <year>2022 Jan 24</year>;<volume>23</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>27</lpage>.
                    <pub-id pub-id-type="doi">10.1186/s13059-021-02582-x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nelson</surname>
                            <given-names>MT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pope</surname>
                            <given-names>CE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Marsh</surname>
                            <given-names>RL</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Human and Extracellular DNA Depletion for Metagenomic Analysis of Complex Clinical Infection Samples Yields Optimized Viable Microbiome Profiles.</article-title>
                    <source>

                        <italic toggle="yes">Cell Rep.</italic>
</source>
                    <year>2019 Feb 19</year>;<volume>26</volume>(<issue>8</issue>):<fpage>2227</fpage>&#x2013;<lpage>40.e5</lpage>.
                    <pub-id pub-id-type="pmid">30784601</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.celrep.2019.01.091</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6435281</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Balech</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Brennan</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Carrillo de Santa Pau</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The future of food and nutrition in ELIXIR.</article-title>
                    <source>F1000Res.</source>
                    <year>2022</year>
                    <volume>11</volume>(ELIXIR): 978.
                    <pub-id pub-id-type="doi">10.12688/f1000research.51747.1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref32">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Akin</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rao</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Evolutionary-scale prediction of atomic-level protein structure with a language model.</article-title>
                    <source>

                        <italic toggle="yes">Science.</italic>
</source>
                    <year>2023 Mar 17</year>;<volume>379</volume>(<issue>6637</issue>):<fpage>1123</fpage>&#x2013;<lpage>1130</lpage>.
                    <pub-id pub-id-type="pmid">36927031</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.ade2574</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref89">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Beard</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bacall</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nenadic</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thurston</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>Carole A Goble, Susanna-Assunta Sansone, Teresa K Attwood, TeSS: a platform for discovering life-science training opportunities.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2020</year>;<volume>36</volume>(<issue>10</issue>):<fpage>3290</fpage>&#x2013;<lpage>3291</lpage>.
                    <pub-id pub-id-type="pmid">32044952</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa047</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7214044</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref82">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Arikawa</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ide</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kogawa</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Recovery of strain-resolved genomes from human microbiome through an integration framework of single-cell genomics and metagenomics.</article-title>
                    <source>

                        <italic toggle="yes">Microbiome.</italic>
</source>
                    <year>2021 Oct 12</year>;<volume>9</volume>(<issue>1</issue>):<fpage>202</fpage>.
                    <pub-id pub-id-type="pmid">34641955</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s40168-021-01152-4</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8507239</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref83">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ghaddar</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Biswas</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Harris</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Tumor microbiome links cellular programs and immunity in pancreatic cancer.</article-title>
                    <source>

                        <italic toggle="yes">Cancer Cell</italic>.</source>
                    <year>2022 Oct</year>;<volume>40</volume>(<issue>10</issue>):<fpage>1240</fpage>&#x2013;<lpage>1253.e5</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.ccell.2022.09.009</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9556978</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref28">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Heberling</surname>
                            <given-names>JM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Miller</surname>
                            <given-names>JT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Noesgaard</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Data integration enables global biodiversity synthesis.</article-title>
                    <source>

                        <italic toggle="yes">PNAS.</italic>
</source>
                    <year>2021 Feb 9</year>;<volume>118</volume>(<issue>6</issue>):<fpage>e2018093118</fpage>.
                    <pub-id pub-id-type="pmid">33526679</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.2018093118</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8017944</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Vizca&#x00ed;no</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Walzer</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jim&#x00e9;nez</surname>
                            <given-names>RC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A community proposal to integrate proteomics activities in ELIXIR.</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2017</year>;<volume>6</volume>:<fpage>875</fpage>.
                    <pub-id pub-id-type="pmid">28713550</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.11751.1</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5499783</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bansal</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Morgat</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Axelsen</surname>
                            <given-names>KB</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Rhea, the reaction knowledgebase in 2022.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2022 Jan 7</year>;<volume>50</volume>(<issue>D1</issue>):<fpage>D693</fpage>&#x2013;<lpage>D700</lpage>.
                    <pub-id pub-id-type="pmid">34755880</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkab1016</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8728268</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref31">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jumper</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Evans</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pritzel</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Highly accurate protein structure prediction with AlphaFold.</article-title>
                    <source>

                        <italic toggle="yes">Nature.</italic>
</source>
                    <year>2021 Aug</year>;<volume>596</volume>(<issue>7873</issue>):<fpage>583</fpage>&#x2013;<lpage>589</lpage>.
                    <pub-id pub-id-type="pmid">34265844</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41586-021-03819-2</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8371605</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref33">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Varadi</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Anyango</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Deshpande</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2021 Nov 17</year>;<volume>50</volume>(<issue>D1</issue>):<fpage>D439</fpage>&#x2013;<lpage>D444</lpage>.
                    <pub-id pub-id-type="doi">10.1093/nar/gkab1061</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref34">
                <label>37</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yoshida</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hiraga</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Takehana</surname>
                            <given-names>T</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Response to Comment on &#x201c;A bacterium that degrades and assimilates poly (ethylene terephthalate).&#x201d;.</article-title>
                    <source>

                        <italic toggle="yes">Science.</italic>
</source>
                    <year>2016 Aug 19</year>;<volume>353</volume>(<issue>6301</issue>):<fpage>759</fpage>.
                    <pub-id pub-id-type="pmid">27540160</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.aaf8625</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref35">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gurbich</surname>
                            <given-names>TA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Almeida</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Beracochea</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MGnify Genomes: A Resource for Biome-specific Microbial Genome Catalogues.</article-title>
                    <source>

                        <italic toggle="yes">J. Mol. Biol.</italic>
</source>
                    <year>2023 Jul 15</year>;<volume>435</volume>(<issue>14</issue>):<fpage>168016</fpage>.
                    <pub-id pub-id-type="pmid">36806692</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.jmb.2023.168016</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10318097</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <label>39</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Claeys</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Van Den Bossche</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Perez-Riverol</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>lesSDRF is more: maximizing the value of proteomics data through streamlined metadata annotation.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Commun.</italic>
</source>
                    <year>2023 Oct 24</year>;<volume>14</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>4</lpage>.
                    <pub-id pub-id-type="doi">10.1038/s41467-023-42543-5</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <label>40</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dai</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>F&#x00fc;llgrabe</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pfeuffer</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A proteomics sample metadata representation for multi-omics integration and big data analysis.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Commun.</italic>
</source>
                    <year>2021 Oct 6</year>;<volume>12</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="doi">10.1038/s41467-021-26111-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref38">
                <label>41</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Haug</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cochrane</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nainala</surname>
                            <given-names>VC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MetaboLights: a resource evolving in response to the needs of its scientific community.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2019 Nov 6</year>;<volume>48</volume>(<issue>D1</issue>):<fpage>D440</fpage>&#x2013;<lpage>D444</lpage>.
                    <pub-id pub-id-type="doi">10.1093/nar/gkz1019</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref39">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <collab>The Europe PMC Consortium</collab>:
                    <article-title>Europe PMC: a full-text literature database for the life sciences and platform for innovation.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2014 Nov 6</year>;<volume>43</volume>(<issue>D1</issue>):<fpage>D1042</fpage>&#x2013;<lpage>D1048</lpage>.
                    <pub-id pub-id-type="doi">10.1093/nar/gku1061</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref40">
                <label>43</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nassar</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rogers</surname>
                            <given-names>AB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Talo&#x2019;</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A machine learning framework for discovery and enrichment of metagenomics metadata from open access publications.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2022 Aug 11</year>;<volume>11</volume>.
                    <pub-id pub-id-type="pmid">35950838</pub-id>
                    <pub-id pub-id-type="doi">10.1093/gigascience/giac077</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9366992</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref41">
                <label>44</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zafeiropoulos</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Paragkamian</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ninidakis</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types.</article-title>
                    <source>

                        <italic toggle="yes">Microorganisms.</italic>
</source>
                    <year>2022 Jan 26</year>;<volume>10</volume>(<issue>2</issue>).
                    <pub-id pub-id-type="pmid">35208748</pub-id>
                    <pub-id pub-id-type="doi">10.3390/microorganisms10020293</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8879827</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref84">
                <label>45</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gruening</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sallou</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Moreno</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Recommendations for the packaging and containerizing of bioinformatics software.</article-title>
                    <source>

                        <italic toggle="yes">F1000Research.</italic>
</source>
                    <year>2018</year>;<volume>7</volume>.
                    <pub-id pub-id-type="doi">10.12688/f1000research.15140.1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref42">
                <label>46</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ison</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kalas</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jonassen</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2013 May 15</year>;<volume>29</volume>(<issue>10</issue>):<fpage>1325</fpage>&#x2013;<lpage>1332</lpage>.
                    <pub-id pub-id-type="pmid">23479348</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btt113</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3654706</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref85">
                <label>47</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ison</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rapacki</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>M&#x00e9;nager</surname>
                            <given-names>H</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Tools and data services registry: a community effort to document bioinformatics resources.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2015</year>;<volume>44</volume>:<fpage>D38</fpage>&#x2013;<lpage>D47</lpage>.
                    <pub-id pub-id-type="doi">10.1093/nar/gkv1116</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref86">
                <label>48</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <collab>The Galaxy Community</collab>
</person-group>:
                    <article-title>The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2024</year>;<volume>52</volume>(<issue>W1</issue>):<fpage>W83</fpage>&#x2013;<lpage>W94</lpage>.
                    <pub-id pub-id-type="pmid">38769056</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkae410</pub-id>
                    <pub-id pub-id-type="pmcid">PMC11223835</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <label>49</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>M&#x00f6;lder</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jablonski</surname>
                            <given-names>KP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Letcher</surname>
                            <given-names>B</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sustainable data analysis with Snakemake.</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2021</year>;<volume>10</volume>:<fpage>33</fpage>.
                    <pub-id pub-id-type="pmid">34035898</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.29032.2</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8114187</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref44">
                <label>50</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Di Tommaso</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chatzou</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Floden</surname>
                            <given-names>EW</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Nextflow enables reproducible computational workflows.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Biotechnol.</italic>
</source>
                    <year>2017 Apr</year>;<volume>35</volume>(<issue>4</issue>):<fpage>316</fpage>&#x2013;<lpage>319</lpage>.
                    <pub-id pub-id-type="pmid">28398311</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.3820</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref87">
                <label>51</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gustafsson</surname>
                            <given-names>OJR</given-names>
                        </name> 
                        <etal/>
</person-group>:
                    <article-title>WorkflowHub: a registry for computational workflows.</article-title>
                    <source>

                        <italic toggle="yes">Sci. Data</italic>
</source>
                    <year>2025</year>;<volume>12</volume>:<fpage>837</fpage>.
                    <pub-id pub-id-type="pmid">40399296</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41597-025-04786-3</pub-id>
                    <pub-id pub-id-type="pmcid">PMC12095652</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref45">
                <label>52</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lindgreen</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Adair</surname>
                            <given-names>KL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gardner</surname>
                            <given-names>PP</given-names>
                        </name>
</person-group>:
                    <article-title>An evaluation of the accuracy and speed of metagenome analysis tools.</article-title>
                    <source>

                        <italic toggle="yes">Sci. Rep.</italic>
</source>
                    <year>2016 Jan 18</year>;<volume>6</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>14</lpage>.
                    <pub-id pub-id-type="doi">10.1038/srep19233</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref48">
                <label>53</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>O&#x2019;Sullivan</surname>
                            <given-names>DM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Doyle</surname>
                            <given-names>RM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Temisak</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>An inter-laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities.</article-title>
                    <source>

                        <italic toggle="yes">Sci. Rep.</italic>
</source>
                    <year>2021 May 19</year>;<volume>11</volume>(<issue>1</issue>):<fpage>10590</fpage>.
                    <pub-id pub-id-type="pmid">34012005</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41598-021-89881-2</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8134577</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref49">
                <label>54</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Poussin</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khachatryan</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sierro</surname>
                            <given-names>N</given-names>
                        </name>
                        <etal/>
</person-group>:
                    <article-title>Crowdsourced benchmarking of taxonomic metagenome profilers: lessons learned from the sbv IMPROVER Microbiomics challenge.</article-title>
                    <source>

                        <italic toggle="yes">BMC Genomics.</italic>
</source>
                    <year>2022 Aug 30</year>;<volume>23</volume>(<issue>1</issue>):<fpage>624</fpage>.
                    <pub-id pub-id-type="pmid">36042406</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12864-022-08803-2</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9429340</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref50">
                <label>55</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sczyrba</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hofmann</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Belmann</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Methods.</italic>
</source>
                    <year>2017 Nov</year>;<volume>14</volume>(<issue>11</issue>):<fpage>1063</fpage>&#x2013;<lpage>1071</lpage>.
                    <pub-id pub-id-type="pmid">28967888</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmeth.4458</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5903868</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref51">
                <label>56</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fritz</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hofmann</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Majda</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>CAMISIM: simulating metagenomes and microbial communities.</article-title>
                    <source>

                        <italic toggle="yes">Microbiome.</italic>
</source>
                    <year>2019 Feb 8</year>;<volume>7</volume>(<issue>1</issue>):<fpage>17</fpage>.
                    <pub-id pub-id-type="pmid">30736849</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s40168-019-0633-6</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6368784</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref52">
                <label>57</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Meyer</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hofmann</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Belmann</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>AMBER: Assessment of Metagenome BinnERs.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2018 Jun 1</year>;<volume>7</volume>(<issue>6</issue>).
                    <pub-id pub-id-type="pmid">29893851</pub-id>
                    <pub-id pub-id-type="doi">10.1093/gigascience/giy069</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6022608</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref53">
                <label>58</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Meyer</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fritz</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Deng</surname>
                            <given-names>ZL</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Critical Assessment of Metagenome Interpretation: the second round of challenges.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Methods.</italic>
</source>
                    <year>2022 Apr</year>;<volume>19</volume>(<issue>4</issue>):<fpage>429</fpage>&#x2013;<lpage>440</lpage>.
                    <pub-id pub-id-type="pmid">35396482</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41592-022-01431-4</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9007738</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref54">
                <label>59</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Keegan</surname>
                            <given-names>KP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Glass</surname>
                            <given-names>EM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Meyer</surname>
                            <given-names>F</given-names>
                        </name>
</person-group>:
                    <article-title>MG-RAST, a Metagenomics Service for Analysis of Microbial Community Structure and Function.</article-title>
                    <source>

                        <italic toggle="yes">Methods Mol. Biol.</italic>
</source>
                    <year>2016</year>:<volume>1399</volume>:<fpage>207</fpage>&#x2013;<lpage>233</lpage>.
                    <pub-id pub-id-type="pmid">26791506</pub-id>
                    <pub-id pub-id-type="doi">10.1007/978-1-4939-3369-3_13</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref55">
                <label>60</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Perkel</surname>
                            <given-names>JM</given-names>
                        </name>
</person-group>:
                    <article-title>Workflow systems turn raw data into scientific knowledge.</article-title>
                    <source>

                        <italic toggle="yes">Nature.</italic>
</source>
                    <year>2019 Sep</year>;<volume>573</volume>(<issue>7772</issue>):<fpage>149</fpage>&#x2013;<lpage>150</lpage>.
                    <pub-id pub-id-type="pmid">31477884</pub-id>
                    <pub-id pub-id-type="doi">10.1038/d41586-019-02619-z</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref57">
                <label>61</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zafeiropoulos</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Beracochea</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ninidakis</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>metaGOflow: a workflow for the analysis of marine Genomic Observatories shotgun metagenomics data.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2022 Dec 28</year>;<volume>12</volume>.
                    <pub-id pub-id-type="pmid">37850871</pub-id>
                    <pub-id pub-id-type="doi">10.1093/gigascience/giad078</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10583283</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref88">
                <label>62</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Soiland-Reyes</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sefton</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Crosas</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Packaging research artefacts with RO-Crate.</article-title>
                    <source>

                        <italic toggle="yes">Data Sci.</italic>
</source>
                    <year>2022</year>;<volume>5</volume>(<issue>2</issue>):<fpage>97</fpage>&#x2013;<lpage>138</lpage>.
                    <pub-id pub-id-type="doi">10.3233/DS-210053</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref58">
                <label>63</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Field</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Amaral-Zettler</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cochrane</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The Genomic Standards Consortium.</article-title>
                    <source>

                        <italic toggle="yes">PLoS Biol.</italic>
</source>
                    <year>2011 Jun</year>;<volume>9</volume>(<issue>6</issue>):<fpage>e1001088</fpage>.
                    <pub-id pub-id-type="pmid">21713030</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pbio.1001088</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3119656</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref59">
                <label>64</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yilmaz</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kottmann</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Field</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Biotechnol.</italic>
</source>
                    <year>2011 May 6</year>;<volume>29</volume>(<issue>5</issue>):<fpage>415</fpage>&#x2013;<lpage>420</lpage>.
                    <pub-id pub-id-type="pmid">21552244</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.1823</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3367316</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref60">
                <label>65</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>McDonald</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Clemente</surname>
                            <given-names>JC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kuczynski</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2012 Jul 12</year>;<volume>1</volume>(<issue>1</issue>): 2047&#x2013;217X &#x2013; 1&#x2013;7.
                    <pub-id pub-id-type="pmid">23587224</pub-id>
                    <pub-id pub-id-type="doi">10.1186/2047-217X-1-7</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3626512</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref62">
                <label>66</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Meisner</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kostic</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vernooij</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The global microbiome research landscape: mapping of research, infrastructures, policies and institutions in 2021.</article-title>
                    <source>

                        <italic toggle="yes">MicrobiomeSupport Consortium.</italic>
</source>
                    <year>2022</year>.</mixed-citation>
            </ref>
            <ref id="ref63">
                <label>67</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Van Den Bossche</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kunath</surname>
                            <given-names>BJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schallert</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Critical Assessment of MetaProteome Investigation (CAMPI): a multi-laboratory comparison of established workflows.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Commun.</italic>
</source>
                    <year>2021 Dec 15</year>;<volume>12</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>15</lpage>.
                    <pub-id pub-id-type="doi">10.1038/s41467-021-27542-8</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref64">
                <label>68</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Van Den Bossche</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Arntzen</surname>
                            <given-names>M&#x00d8;</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Becher</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The Metaproteomics Initiative: a coordinated approach for propelling the functional characterization of microbiomes.</article-title>
                    <source>

                        <italic toggle="yes">Microbiome.</italic>
</source>
                    <year>2021 Dec 20</year>;<volume>9</volume>(<issue>1</issue>):<fpage>243</fpage>.
                    <pub-id pub-id-type="pmid">34930457</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s40168-021-01176-w</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8690404</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref65">
                <label>69</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Eloe-Fadrosh</surname>
                            <given-names>EA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ahmed</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Anubhav</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The National Microbiome Data Collaborative Data Portal: an integrated multi-omics microbiome data resource.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2021 Oct 30</year>;<volume>50</volume>(<issue>D1</issue>):<fpage>D828</fpage>&#x2013;<lpage>D836</lpage>.</mixed-citation>
            </ref>
            <ref id="ref66">
                <label>70</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Parks</surname>
                            <given-names>DH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Imelfort</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Skennerton</surname>
                            <given-names>CT</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.</article-title>
                    <source>

                        <italic toggle="yes">Genome Res.</italic>
</source>
                    <year>2015 Jul</year>;<volume>25</volume>(<issue>7</issue>):<fpage>1043</fpage>&#x2013;<lpage>1055</lpage>.
                    <pub-id pub-id-type="pmid">25977477</pub-id>
                    <pub-id pub-id-type="doi">10.1101/gr.186072.114</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4484387</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref67">
                <label>71</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Parks</surname>
                            <given-names>DH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chuvochina</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rinke</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2021 Sep 14</year>;<volume>50</volume>(<issue>D1</issue>):<fpage>D785</fpage>&#x2013;<lpage>D794</lpage>.
                    <pub-id pub-id-type="pmid">34520557</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkab776</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8728215</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref68">
                <label>72</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chaumeil</surname>
                            <given-names>PA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mussig</surname>
                            <given-names>AJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hugenholtz</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2019 Nov 15</year>;<volume>36</volume>(<issue>6</issue>):<fpage>1925</fpage>&#x2013;<lpage>1927</lpage>.
                    <pub-id pub-id-type="pmid">31730192</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btz848</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref69">
                <label>73</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>IMA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chu</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Palaniappan</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The IMG/M data management and analysis system v.7: content updates and new features.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2023 Jan 6</year>;<volume>51</volume>(<issue>D1</issue>):<fpage>D723</fpage>&#x2013;<lpage>D732</lpage>.
                    <pub-id pub-id-type="pmid">36382399</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkac976</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9825475</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref70">
                <label>74</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Camargo</surname>
                            <given-names>AP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nayfach</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>IMA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>IMG/VR v4: an expanded database of uncultivated virus genomes within a framework of extensive functional, taxonomic, and ecological metadata.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2023 Jan 6</year>;<volume>51</volume>(<issue>D1</issue>):<fpage>D733</fpage>&#x2013;<lpage>D743</lpage>.
                    <pub-id pub-id-type="pmid">36399502</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkac1037</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9825611</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref71">
                <label>75</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Camargo</surname>
                            <given-names>AP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Call</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Roux</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>IMG/PR: a database of plasmids from genomes and metagenomes with rich annotations and metadata.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2024 Jan 5</year>;<volume>52</volume>(<issue>D1</issue>):<fpage>D164</fpage>&#x2013;<lpage>D173</lpage>.
                    <pub-id pub-id-type="pmid">37930866</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkad964</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10767988</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref72">
                <label>76</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Amid</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Alako</surname>
                            <given-names>BTF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Balavenkataraman Kadhirvelu</surname>
                            <given-names>V</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The European Nucleotide Archive in 2019.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2020 Jan 8</year>;<volume>48</volume>(<issue>D1</issue>):<fpage>D70</fpage>&#x2013;<lpage>D76</lpage>.
                    <pub-id pub-id-type="pmid">31722421</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkz1063</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref73">
                <label>77</label>
                <mixed-citation publication-type="journal">
                    <article-title>CAMI II: identifying best practices and issues for metagenomics software.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Methods.</italic>
</source>
                    <year>2022 Apr</year>;<volume>19</volume>(<issue>4</issue>):<fpage>412</fpage>&#x2013;<lpage>413</lpage>.
                    <pub-id pub-id-type="pmid">35396486</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41592-022-01419-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref74">
                <label>78</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sommers</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chatterjee</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Varsani</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Integrating Viral Metagenomics into an Ecological Framework.</article-title>
                    <source>

                        <italic toggle="yes">Annu. Rev. Virol.</italic>
</source>
                    <year>2021 Sep 29</year>;<volume>8</volume>(<issue>1</issue>):<fpage>133</fpage>&#x2013;<lpage>158</lpage>.
                    <pub-id pub-id-type="pmid">34033501</pub-id>
                    <pub-id pub-id-type="doi">10.1146/annurev-virology-010421-053015</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref75">
                <label>79</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Roux</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Camargo</surname>
                            <given-names>AP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Coutinho</surname>
                            <given-names>FH</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>iPHoP: An integrated machine learning framework to maximize host prediction for metagenome-derived viruses of archaea and bacteria.</article-title>
                    <source>

                        <italic toggle="yes">PLoS Biol.</italic>
</source>
                    <year>2023 Apr</year>;<volume>21</volume>(<issue>4</issue>):<fpage>e3002083</fpage>.
                    <pub-id pub-id-type="pmid">37083735</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pbio.3002083</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10155999</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref76">
                <label>80</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Camargo</surname>
                            <given-names>AP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Roux</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schulz</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Identification of mobile genetic elements with geNomad.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Biotechnol.</italic>
</source>
                    <year>2023 Sep 21</year>:<fpage>1</fpage>&#x2013;<lpage>10</lpage>.
                    <pub-id pub-id-type="doi">10.1038/s41587-023-01953-y</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref77">
                <label>81</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rasmussen</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chua</surname>
                            <given-names>PYS</given-names>
                        </name>
</person-group>:
                    <article-title>Genome-resolving metagenomics reveals wild western capercaillies (
                        <italic toggle="yes">Tetrao urogallus</italic>) as avian hosts for antibiotic-resistance bacteria and their interactions with the gut-virome community.</article-title>
                    <source>

                        <italic toggle="yes">Microbiol. Res.</italic>
</source>
                    <year>2023 Jun</year>;<volume>271</volume>:<fpage>127372</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.micres.2023.127372</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref78">
                <label>82</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fredslund</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Borchert</surname>
                            <given-names>MS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Poulsen</surname>
                            <given-names>JCN</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Structure of a hyperthermostable carbonic anhydrase identified from an active hydrothermal vent chimney.</article-title>
                    <source>

                        <italic toggle="yes">Enzym. Microb. Technol.</italic>
</source>
                    <year>2018 Jul</year>;<volume>114</volume>:<fpage>48</fpage>&#x2013;<lpage>54</lpage>.
                    <pub-id pub-id-type="pmid">29685353</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.enzmictec.2018.03.009</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref79">
                <label>83</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Schiebenhoefer</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Van Den Bossche</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fuchs</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Challenges and promise at the interface of metaproteomics and genomics: an overview of recent progress in metaproteogenomic data analysis.</article-title>
                    <source>

                        <italic toggle="yes">Expert Rev. Proteomics.</italic>
</source>
                    <year>2019 May</year>;<volume>16</volume>(<issue>5</issue>):<fpage>375</fpage>&#x2013;<lpage>390</lpage>.
                    <pub-id pub-id-type="pmid">31002542</pub-id>
                    <pub-id pub-id-type="doi">10.1080/14789450.2019.1609944</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref80">
                <label>84</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Felden</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>M&#x00f6;ller</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schindler</surname>
                            <given-names>U</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>PANGAEA - Data Publisher for Earth &amp; Environmental Science.</article-title>
                    <source>

                        <italic toggle="yes">Scientific Data.</italic>
</source>
                    <year>2023 Jun 2</year>;<volume>10</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="doi">10.1038/s41597-023-02269-x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref81">
                <label>85</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Argelaguet</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Velten</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Arnol</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multi-Omics Factor Analysis&#x2014;a framework for unsupervised integration of multi-omics data sets.</article-title>
                    <source>

                        <italic toggle="yes">Mol. Syst. Biol.</italic>
</source>
                    <year>2018 Jun 1</year>;<volume>14</volume>(<issue>6</issue>):<fpage>e8124</fpage>.
                    <pub-id pub-id-type="pmid">29925568</pub-id>
                    <pub-id pub-id-type="doi">10.15252/msb.20178124</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6010767</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report412934">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.185171.r412934</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Lui</surname>
                        <given-names>Lauren</given-names>
                    </name>
                    <xref ref-type="aff" rid="r412934a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-8720-5268</uri>
                </contrib>
                <aff id="r412934a1">
                    <label>1</label>E O Lawrence Berkeley National Laboratory, Berkeley, California, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>6</day>
                <month>10</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Lui L</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport412934" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.144515.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>
                <underline>Summary</underline>
            </p>
            <p> 
                <italic>Finn et al</italic>. have written a comprehensive white paper on the establishment of the ELIXIR Microbiome Community.&#x00a0; They outline the purpose of establishing the ELIXIR Microbiome Community, the history of its formation, interaction with other ELIXIR communities, plans for data management and computational resources, and challenges in implementation.&#x00a0; I found the manuscript informative about the intent of the ELIXIR Microbiome Community initiatives, and it was useful to have the historical and international context included. There is a lot of information packed into this manuscript!&#x00a0; I only have minor issues with the manuscript that I think can easily be addressed.</p>
            <p> </p>
            <p> 
                <underline>Major Comments</underline>
            </p>
            <p> On page 6, the authors state that the ELIXIR Microbiome Community is about providing the necessary infrastructure for &#x201c;nucleotide sequence data&#x201d; from a microbiome.&#x00a0; In the context of the entire manuscript this statement is confusing because proteomics and metabolomics are also discussed in relation to the entire ELIXIR metacommunity.&#x00a0; Should this statement be revised or can there be some clarification added here?</p>
            <p> </p>
            <p> 
                <underline>Why I put &#x201c;no&#x201d; for &#x201c;Are all factual statements correct and adequately supported by citations?&#x201d; and why I put &#x201c;no&#x201d; for &#x201c;Are arguments sufficiently supported by evidence from the published literature&#x201d;</underline>
            </p>
            <p> </p>
            <p> There are some blanket statements in the manuscript that are consistent with the literature but there aren&#x2019;t any citations. There are a few statements that are incorrect. 
                <list list-type="bullet">
                    <list-item>
                        <p>Introduction.&#x00a0; There are many statements here that could be supported by literature references.&#x00a0; For example, the statement that viruses that infect bacteria play critical roles in community dynamics could easily have a few references.&#x00a0; This statement could also be expanded to include archaeal and eukaryotic viruses.&#x00a0; For example, viruses that infect eukaryotic algae are also very important in the dynamics of algal blooms. The statement about the impact of dysbiosis of microbiomes could also have a few references.</p>
                    </list-item>
                    <list-item>
                        <p>One Health.&#x00a0; One Health is used in the manuscript but not defined nor does it have a citation.&#x00a0; Please describe briefly and include a citation. Page 8.</p>
                    </list-item>
                    <list-item>
                        <p>Microbial dark matter. There&#x2019;s at least two places that mentions &#x201c;microbial dark matter&#x201d; but don&#x2019;t define it (pages 8, and 17).&#x00a0; Capitalization is not consistent between these two instances.&#x00a0; Please include a reference and/or define this term.</p>
                    </list-item>
                    <list-item>
                        <p>Page 16.&#x00a0; Most of the content on this page could use more references: 
                            <list list-type="bullet">
                                <list-item>
                                    <p>Section on long reads. 
                                        <list list-type="bullet">
                                            <list-item>
                                                <p>This secion could include citations on long read metagenomics studies, notably the publication of Microflora Danica from &#x00a0;Mads Albertsen&#x2019;s lab (https://www.nature.com/articles/s41564-025-02062-z).&#x00a0; I acknowledge that this came out fairly recently and the authors may have drafted this section before this publication came out, but there are many other long read metagenome papers that can also be cited.</p>
                                            </list-item>
                                            <list-item>
                                                <p>My second issue with this section is that long reads don&#x2019;t really mitigate the computational burden &#x2013; you still can&#x2019;t assemble the metagenomes on a laptop and nanopore basecalling needs GPUs.&#x00a0; Long reads assist with genome assembly because the reads are longer than genomic repeats, eliminating the ambiguities of short read assembly. This is discussed by Ryan Wick on his github page (https://github.com/rrwick/Unicycler?tab=readme-ov-file) and in the introduction and conclusion of this paper https://pmc.ncbi.nlm.nih.gov/articles/PMC8172020/.</p>
                                            </list-item>
                                        </list> </p>
                                </list-item>
                                <list-item>
                                    <p>Paragraph on MAGs. 
                                        <list list-type="bullet">
                                            <list-item>
                                                <p>MAGs should be defined as &#x201c;metagenome-assembled genomes&#x201d; not &#x201c;environmental genomes.&#x201d; MAGs can also be from clinical metagenomes.&#x00a0;</p>
                                            </list-item>
                                            <list-item>
                                                <p>In the same sentence as above, the statement &#x201c;has allowed the identification of thousands of specific functions&#x201d; is unclear as to what it refers to.&#x00a0; If this refers to protein gene function, then all of these would be by computational prediction and not really identification of new functions.&#x00a0; A citation would be helpful here.</p>
                                            </list-item>
                                        </list> </p>
                                </list-item>
                                <list-item>
                                    <p>Paragraph about classification of MAGs.&#x00a0; 
                                        <list list-type="bullet">
                                            <list-item>
                                                <p>Please add &#x201c;taxonomic&#x201d; in front of &#x201c;classification&#x201d; in the first sentence for clarity</p>
                                            </list-item>
                                            <list-item>
                                                <p>There&#x2019;s been huge changes in prokaryotic taxonomy in recent years, and there are no citations to the committee reports here or opinion pieces of the use of &#x201c;candidatus&#x201d; when naming organisms.&#x00a0; It is worth a few sentences of discussion here or added references.</p>
                                            </list-item>
                                        </list> </p>
                                </list-item>
                                <list-item>
                                    <p>Paragraph on viruses 
                                        <list list-type="bullet">
                                            <list-item>
                                                <p>Please add a few references for the statements about the challenges of analyzing viral microbiomes.</p>
                                            </list-item>
                                            <list-item>
                                                <p>The statement that there is no centralized database is incorrect.&#x00a0; IMG/VR, which is mentioned in the article, has millions of viral genomes.</p>
                                            </list-item>
                                        </list> </p>
                                </list-item>
                            </list> </p>
                    </list-item>
                </list> Some statements may need to be qualified or rephrased. 
                <list list-type="bullet">
                    <list-item>
                        <p>Page 11, the statement &#x201c;Even relatively simple workflows that perform metagenomics assembly are computationally heavy.&#x201d; This statement is heavily qualified based on someone&#x2019;s experience with and access to computational resources, as well as the amount of data to be processed.&#x00a0; It would also be better to use something like &#x201c;resource-intensive&#x201d; instead of &#x201c;heavy.&#x201d;&#x00a0; If this statement could also be changed to something like &#x201c;most metagenomes cannot be assembled on a laptop&#x201d; to provide more context for the reader.</p>
                    </list-item>
                    <list-item>
                        <p>Page 17, last paragraph.&#x00a0; The sentence about the challenges in metaproteomics mentions proteins that are not identified in corresponding metagenomes and calls them &#x201c;so called unidentified proteins of unknown function.&#x201d; This seems to be a bit of a non sequitur because proteins of unknown function are those that cannot have a function assigned computationally, not because they are not found in the genomic data. It is also possible that "so called unidentified proteins of unknown function" is a separate list item - if so then numbers should be added into this sentence for clarity.</p>
                    </list-item>
                    <list-item>
                        <p>Page 14, the statement beginning with &#x201c;For example, it is widely accepted that current short-read assembly-based methods&#x2026;&#x201d; The accuracy of this statement depends on access to computational resources.&#x00a0; Terabase scale metagenomes have been done (
                            <ext-link ext-link-type="uri" xlink:href="https://www.nature.com/articles/s41598-020-67416-5">https://www.nature.com/articles/s41598-020-67416-5</ext-link>).&#x00a0; Perhaps change &#x201c;computationally intractable&#x201d; to &#x201c;highly computationally resource intensive.&#x201d;</p>
                    </list-item>
                </list> </p>
            <p> </p>
            <p> 
                <underline>Minor comments</underline>
            </p>
            <p> 
                <italic>Use of &#x201c;ELIXIR Microbiome Community&#x201d;</italic> 
                <list list-type="bullet">
                    <list-item>
                        <p>In most places, &#x201c;ELIXIR Microbiome Community&#x201d; &#x00a0;is used, but in some places it is only &#x201c;Microbiome Community.&#x201d; Please change all of these instances to &#x201c;ELIXIR Microbiome Community&#x201d; for consistency.</p>
                    </list-item>
                </list> 
                <italic>ELIXIR Platforms</italic> 
                <list list-type="bullet">
                    <list-item>
                        <p>Please add a table of what each of the platforms do or describe briefly in the text (top of page 5)</p>
                    </list-item>
                </list> 
                <italic>ASV</italic> 
                <list list-type="bullet">
                    <list-item>
                        <p>ASV is the abbreviation for &#x201c;amplicon sequence variant&#x201d; not &#x201c;amplified sequence variant&#x201d;</p>
                    </list-item>
                </list> 
                <italic>Typos</italic> 
                <list list-type="bullet">
                    <list-item>
                        <p>Page 11, &#x201c;non-sequenced&#x201d; should be &#x201c;non-sequence&#x201d;&#x2019;</p>
                    </list-item>
                    <list-item>
                        <p>Page 11, bottom of paragraph 4, should microbiome be lower case?</p>
                    </list-item>
                </list> 
                <italic>Citations</italic> 
                <list list-type="bullet">
                    <list-item>
                        <p>INSDC needs a citation (bottom of page 5)</p>
                    </list-item>
                </list> 
                <italic>Figure 1</italic> 
                <list list-type="bullet">
                    <list-item>
                        <p>I assume that the starting point for this diagram is the &#x201c;Microbiomes in their environment&#x201d; box.&#x00a0; This could either be a unique color (so something other than white, blue, or orange) or larger than the other boxes to visually indicate that this is the starting point of the data/material workflow.</p>
                    </list-item>
                </list> </p>
            <p> 
                <italic>Abbreviations</italic>
            </p>
            <p> Given the nature of this article, there are many acronyms and the definitions of a few were missed.&#x00a0; 
                <list list-type="bullet">
                    <list-item>
                        <p>GBIF &#x2013; page 8</p>
                    </list-item>
                    <list-item>
                        <p>Country abbreviations in Table 3 (FR, IT, DE)</p>
                    </list-item>
                </list> </p>
            <p> 
                <italic>Grammar</italic>
            </p>
            <p> There are some grammatical issues sprinkled throughout the manuscript but are easily fixed: 
                <list list-type="order">
                    <list-item>
                        <p>Colons and semicolons are misused throughout this manuscript given the large number of lists. &#x00a0;Colons only come after complete clauses.&#x00a0; Semi-colons can only separate two independent clauses. For example, on page 4, the last sentence of the first paragraph should be something like &#x201c;Key challenges facing the microbiome research community are how to (i) appropriately store the data, (ii) informatically process, integrate, compare and interpret microbiome-derived data, and (iii) how to make the data findable, accessible, interoperable, and reusable, 
                            <italic>i.e., </italic>FAIR.&#x201d;&#x00a0; The colon was removed and commas used to separate the list items.&#x00a0; This is also an issue in the third paragraph of section 2.2.2.5 Training, first paragraph of section 3, page 16 fifth paragraph, fourth paragraph on page 17, last row of table 3.</p>
                    </list-item>
                    <list-item>
                        <p>Use commas after i.e. and e.g.</p>
                    </list-item>
                    <list-item>
                        <p>I think that last sentence of the third paragraph on page 6 may need to have an &#x201c;and&#x201d; in front of &#x201c;to biotechnology&#x201d;.</p>
                    </list-item>
                    <list-item>
                        <p>Page 17, last paragraph.&#x00a0; The sentence about the challenges in metaproteomics could use numbers for the items. As it is currently structured, the sentence is confusing to read.</p>
                    </list-item>
                    <list-item>
                        <p>Page 18.&#x00a0; Second paragraph would benefit from the use of the oxford comma.&#x00a0; Example is add a comma after &#x201c;image data.&#x201d; The last sentence of this paragraph is awkwardly worded.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the topic of the opinion article discussed accurately in the context of the current literature?</p>
            <p>Yes</p>
            <p>Are arguments sufficiently supported by evidence from the published literature?</p>
            <p>No</p>
            <p>Are all factual statements correct and adequately supported by citations?</p>
            <p>No</p>
            <p>Are the conclusions drawn balanced and justified on the basis of the presented arguments?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Microbial ecology, Bioinformatics, Metagenomics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report412360">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.185171.r412360</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Pauvert</surname>
                        <given-names>Charlie</given-names>
                    </name>
                    <xref ref-type="aff" rid="r412360a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9832-2507</uri>
                </contrib>
                <aff id="r412360a1">
                    <label>1</label>University Hospital of RWTH, Aachen, Germany</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>16</day>
                <month>9</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Pauvert C</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport412360" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.144515.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>Thanks to the authors for addressing all my comments and suggestions. They successfully restructured and amended the tables 2 and 3 and the article for an increased impact to highlight how the scientific community can benefit from the ELIXIR Microbiome Community.</p>
            <p> </p>
            <p> </p>
            <p> On a small minor note, it seems possible to acknowledge the authors of the MIA package more than the URL by citing their work see 
                <ext-link ext-link-type="uri" xlink:href="https://microbiome.github.io/mia/authors.html#citation">https://microbiome.github.io/mia/authors.html#citation</ext-link> (and the added citation #1 via the form).</p>
            <p>Is the topic of the opinion article discussed accurately in the context of the current literature?</p>
            <p>Yes</p>
            <p>Are arguments sufficiently supported by evidence from the published literature?</p>
            <p>Partly</p>
            <p>Are all factual statements correct and adequately supported by citations?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn balanced and justified on the basis of the presented arguments?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>microbiome, bioinformatics, reproducible research, data and metadata standards</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-412360-1">
                    <label>1</label>
                    <mixed-citation>
                        <person-group person-group-type="author"/>:
                        <article-title>mia: Microbiome analysis. R package version 1.17.9</article-title>.
                        <source>
                            <italic>https://microbiome.github.io/mia/</italic>
                        </source>.<year>2025</year>;</mixed-citation>
                </ref>
            </ref-list>
        </back>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report251099">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.158321.r251099</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Heinken</surname>
                        <given-names>Almut</given-names>
                    </name>
                    <xref ref-type="aff" rid="r251099a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r251099a1">
                    <label>1</label>University of Lorraine, Lorraine, France</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>11</day>
                <month>5</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Heinken A</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport251099" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.144515.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In this article, the ELIXIR Microbiome community is described. The community was established in 2015 as the Marine Metagenomics community and was recently expanded to the scope of metagenomics research for multiple areas such as human health, agriculture, and ecology. The authors then describe their perspective for the community. One focus area will be the promotion of best practices for metagenomics analyses by benchmarking tools. Interoperability between different tools will also be facilitated. Another focus will be encouraging data sharing and reuse and providing data storage platforms. Finally, links to existing ELIXIR initiatives in related areas as well as with international initiatives will be established.</p>
            <p> Overall, this is a very clear, concise, and informative review. The scope and aims of ELIXIR Microbiome are well-described and detailed. The short-term and long-term objectives are also clearly described.</p>
            <p> </p>
            <p> Specific comments: 
                <list list-type="bullet">
                    <list-item>
                        <p>I appreciate the links to other ELIXIR initiatives in Table 2. I would be particularly interested in more detail on the integration with the Systems Biology community. How would the ability to reuse multi-omics data for systems biology approaches be improved?</p>
                    </list-item>
                    <list-item>
                        <p>It is a bit unclear to me which is the proposed data platforms, tools, and training will be freely available to non-ELIXIR members.</p>
                    </list-item>
                    <list-item>
                        <p>Regarding providing metadata of samples, how will GDPR regulations be handled for human samples?</p>
                    </list-item>
                </list>
            </p>
            <p>Is the topic of the opinion article discussed accurately in the context of the current literature?</p>
            <p>Yes</p>
            <p>Are arguments sufficiently supported by evidence from the published literature?</p>
            <p>Yes</p>
            <p>Are all factual statements correct and adequately supported by citations?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn balanced and justified on the basis of the presented arguments?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Systems biology</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <sub-article article-type="response" id="comment14228-251099">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Batut</surname>
                            <given-names>B&#x00e9;r&#x00e9;nice</given-names>
                        </name>
                        <aff>Institut Francais de Bioinformatique, &#x00c9;vry-Courcouronnes, &#x00ce;le-de-France, France</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>15</day>
                    <month>7</month>
                    <year>2025</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <italic>In this article, the ELIXIR Microbiome community is described. The community was established in 2015 as the Marine Metagenomics community and was recently expanded to the scope of metagenomics research for multiple areas such as human health, agriculture, and ecology. The authors then describe their perspective for the community. One focus area will be the promotion of best practices for metagenomics analyses by benchmarking tools. Interoperability between different tools will also be facilitated. Another focus will be encouraging data sharing and reuse and providing data storage platforms. Finally, links to existing ELIXIR initiatives in related areas as well as with international initiatives will be established.</italic>
                </p>
                <p>
                    <italic> Overall, this is a very clear, concise, and informative review. The scope and aims of ELIXIR Microbiome are well-described and detailed. The short-term and long-term objectives are also clearly described.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We would like to thank the reviewer for their positive comments concerning the ELIXIR microbiome community papers. Below we address their specific comments.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>Specific comments:</italic> 
                    <list list-type="bullet">
                        <list-item>
                            <p>
                                <italic>I appreciate the links to other ELIXIR initiatives in Table 2. I would be particularly interested in more detail on the integration with the Systems Biology community. How would the ability to reuse multi-omics data for systems biology approaches be improved?</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>We have added a few sentences to expand how the integration might be improved.</bold> 
                    <list list-type="bullet">
                        <list-item>
                            <p>
                                <italic>It is a bit unclear to me which is the proposed data platforms, tools, and training will be freely available to non-ELIXIR members.</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>All of the proposed activities are freely available to non-ELIXIR members. ELIXIR does not put explicit boundaries on who can use, but engagement with some ELIXIR events may be preferentially given to scientists coming from ELIXIR member states and funds from ELIXIR funding schemes would be restricted to member states.</bold> 
                    <list list-type="bullet">
                        <list-item>
                            <p>
                                <italic>Regarding providing metadata of samples, how will GDPR regulations be handled for human samples?</italic>
                            </p>
                        </list-item>
                    </list> 
                    <bold>The landscape concerning human microbiome samples is complicated. Currently there is little consensus across Europe whether human microbiomes should be under controlled access. Similarly, the GDPR landscape is also complicated and not entirely independent.&#x00a0; There are already established routes for suppression of data (should an individual wish to be forgotten), which can be propagated to other databases (e.g. MGnify will remove analyses associated with a suppressed sequence dataset). This is clearly an area that will need to be developed as part of the ELIXR community, as no specific solutions have been agreed, we would prefer not to comment about how they will be handled in this manuscript.&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;</bold>
                </p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report244855">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.158321.r244855</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Pauvert</surname>
                        <given-names>Charlie</given-names>
                    </name>
                    <xref ref-type="aff" rid="r244855a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9832-2507</uri>
                </contrib>
                <aff id="r244855a1">
                    <label>1</label>University Hospital of RWTH, Aachen, Germany</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>14</day>
                <month>3</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Pauvert C</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport244855" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.144515.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors make the case for mutating the ELIXIR Marine Metagenomics Community initiative into an ELIXIR Microbiome Community. It is indeed timely to have an such a pan-European initiative especially as it draws on previous experience, albeit a more narrow biome. The emphasis on multi-omics integration is also a strong suit as it is a complex topic where the microbiome research community would need support and infrastructure. The authors map out existing (but not all) similar initiatives, even if it is unclear how they would work together. In my opinion, a couple of claims should be strengthened and the argumentative power of this white paper would benefit from minor reorganization, a text slim down and extra proofreading. To this end, I highlight the strengths and weaknesses of the manuscript below.</p>
            <p> </p>
            <p> ## Strengths</p>
            <p> </p>
            <p> - Table 3 is a great idea to map out the others initiatives but needs a bit rework to be straight to the point.</p>
            <p> - In the Data section, the part between "Throughout the lifetime of the ELIXIR Marine Metagenomics Community" and the end of the paragraph is really relevant and draws from the experience of the ELIXIR Community. These sentences should be more highlighted, maybe upstream. I do wonder how the authors considered how the others initiatives (mentioned in Table 3) could also contribute to the addition of metadata for a global curation effort, possibly via the mentioned Contextual Data Clearinghouse.</p>
            <p> - The data re-analysis and integration between PRIDE and MGnify (named MetaPUF) is a good use case of the efforts that the new ELIXIR Community can promote.</p>
            <p> - I did appreciate the clear objective at the end of the Compute section, to help with scaling up analyses as well as the plan for the interaction with the ELIXIR Training Platform which promise to give a boost in training the current and next generation of microbiome researchers.</p>
            <p> - I liked how the authors highlighted (using the example of viral sequences databases) that time and resources should not be wasted in duplicating works, and that initiatives such as the ELIXIR Microbiome Community can promote this.</p>
            <p> - The authors thought ahead to promote and develop methods to limit confounding factors when re-using datasets, especially in multi-omics settings, and added this to one of the ELIXIR Microbiome Community challenges.</p>
            <p> </p>
            <p> ## Weaknesses</p>
            <p> </p>
            <p> ### Introduction</p>
            <p> </p>
            <p> - (minor) The microbiome definition used in the paper was revisited in Review reference 1 to include the interactions as well as others molecules than genomes as part of the microbiome. I suggest to use this definition given the emphasis on additional omics made in the paper.</p>
            <p> - (major) The first paragraph ends on challenges of our research community including how to make data FAIR. Given the experience gained by the ELIXIR Marine Metagenomics Community, I would have appreciated a sentence on why making our data FAIR is important (e.g., transparency to make the research more reproducible, accountability given the (public) source of funding). See major comment in next section.</p>
            <p> </p>
            <p> ### The scope of the ELIXIR Microbiome Community</p>
            <p> </p>
            <p> - (major) These following arguments for FAIR data should be in the Introduction section.</p>
            <p> "Moreover, when wishing to contextualise the results with similar experiments, the way a dataset has been produced and processed must be transparent to establish whether it is comparable (e.g. amplified sequence variants can only be compared when the same amplified regions are compared). Furthermore, when different methods are applied, best practices in data stewardship are required to ensure that the connectivity of the derived sequence data products, together with functional and taxonomic assertions are kept in context of the original sample/sequencing effort and associated contextual metadata."</p>
            <p> - (minor) The very first sentence of this section blames researchers for misuse of a term "[...] regularly (incorrectly) used [...]". I suggest to rephrase the sentence to simply state the differences, or back up the misuse of the term with references or a survey.</p>
            <p> - (minor) "Fundamentally, the ELIXIR Microbiome Community is about providing the necessary infrastructures required to perform analysis of 
                <bold>nucleotide</bold> sequence data derived from a microbiome"</p>
            <p> - (minor) "Finally, and possible unique to this ELIXIR Community, is the variety of researchers". I am not sure that these features are unique to the ELIXIR Community but rather to the field of microbiome research. I suggest to tone it down by simply pointing out that the ELIXIR Community gathers a variety of researchers.</p>
            <p> </p>
            <p> ### Table 1</p>
            <p> </p>
            <p> - (major) Metabolomics is listed in Table 1 but is omitted from the narrative in the paragraph.</p>
            <p> - (major) The Table 1 is not really used but could actually reduce some redundancies in the manuscript by providing a one-stop-shop to explain and detail these techniques.</p>
            <p> </p>
            <p> ### Figure 1</p>
            <p> </p>
            <p> - (major) The figure is early on in the manuscript and it is unclear which of the ELIXIR platforms and communities are already established, or foreseen. Especially since "current and future ELIXIR activities" is mentioned in the manuscript before referencing this figure. The Table 2 does not add more information in that regard.</p>
            <p> - (major) There is a need for different types of arrows as the same arrow represent interactions between communities or processes.</p>
            <p> - (minor) The "Data" node is quite generic, I was wondering whether it meant public repositories, institute repositories or both. Please be precise.</p>
            <p> - (minor) I guess the type of metadata illustrated in the figure is restricted to biological metadata, that is indeed collected during sampling, however, technical metadata such as the sequencing method used, the type of instrument or the library layout, are collected during the 
                <italic>processing</italic> of the sample not only the sampling itself.</p>
            <p> - (minor) There should be an arrow from "Archive" back to "Data" when the data produced is deposited and then contributes back to public repositories?</p>
            <p> </p>
            <p> ### Interactions with other ELIXIR Communities</p>
            <p> </p>
            <p> * (major) The sentence "Similarly, many of the biodiversity approaches use marker gene amplification for studying environmental DNA (eDNA)" is redundant with the previous one 
                <bold>and</bold> introduces a different nomenclature that was not used before&#x00a0; (marker gene amplication vs metabarcoding) and the differences, if any, are not explained. I would suggest to remove the sentence.</p>
            <p> * (major) The term "Isolation of genomes" is misleading and I guess the authors used it as a shorthand for "The isolation of bacteria, its DNA extraction, genome sequencing and their annotation". Please rephrase to avoid misinterpretation. Plus I would argue that these steps are also done when deconstructing microbiomes via cultivation strategies and are therefore not so out-of-scope.</p>
            <p> * (minor) "yet each one of these areas is far greater in scientific scope" feels exaggerated and vague. It should be rephrased. A suggestion is: "yet each one of these areas is too complex to be tacked individually"</p>
            <p> * (minor) There is no link nor transition between the paragraph that starts with "In summary, " before the Table 3 and the paragraph after that starts with "Similarly, microbiome research". Please rephrase or edit to connect the two sections. Plus, whilst this is good to have a concrete example in the "Similarly" paragraph, the paragraph before was very broad and doing a summary. I would suggest to try reordering the two paragraphs and bring the example earlier for a smoother transition.</p>
            <p> * (minor) Typo "Similarly, microbiome research has many translation
                <bold>al</bold> aspects"</p>
            <p> * (minor) The PET acronym is detailed but actually used only once, I am unsure if this is necessary.</p>
            <p> </p>
            <p> ### Table 2</p>
            <p> </p>
            <p> - (major) The text in the "Interaction" column needs rework as it does not use a consistent wording and could be more to the point, especially as it is a complement to the main text:</p>
            <p> &#x00a0;&#x00a0; &#x00a0;- The Food and Nutrition entry is a question.</p>
            <p> &#x00a0;&#x00a0; &#x00a0;- The Galaxy entry has an unnecessary return carriage, and a unspecific "ongoing evaluation study" that needs to be clarified.</p>
            <p> &#x00a0;&#x00a0; &#x00a0;- The Plant Science entry starts with a generic sentence that could be in the introduction or removed for clarity. Plus I would add a clarification that "plants maintain 
                <bold>or not</bold> their microbial communities across generations."</p>
            <p> - (minor) The status (e.g., currently active, inactive, planned, etc.) of each ELIXIR Communities would have been appreciated as it is missing also from the Figure 1.</p>
            <p> - (minor) The mention of "the field" for the Federated human data community is vague in a manuscript about gathering communities, which research field is implied? If this is the human microbiome research field as a whole, please indicate.</p>
            <p> </p>
            <p> ### Table 3</p>
            <p> </p>
            <p> - (major) Similarly to the Table 2 major comment, there is a lack of consistent wording in the "Aim" column that makes the Table 3 not as impactful as it should and could be. Maybe the authors could extract common/distinct features from each of these initiatives as an alternative way to the "Aim" column. A couple of suggestions for these features would be: is it a national initiative?, does it relates to data storage, data analysis, training? is it linked to ELIXIR?</p>
            <p> - (major) I was surprised not to see the NMDC listed in the table 3, especially when it is discussed in the main text. What about the NCCR Microbiomes initiative in Switzerland? I can understand that some initiatives are not included for space reason, but maybe state it in the legend of the table.</p>
            <p> - (minor) Is this table sorted? It seems not, but it could be by acronyms or names.</p>
            <p> - (minor) The aim for the NFDI4Microbiota is way too big a paragraph. The authors should reduce it for conciseness.</p>
            <p> - (minor) The Metaproteomics Initiative entry has a hyperlink and a reference when none of the others have. Please homogenise.</p>
            <p> - (minor) Some entries have country listed and some not. Please homogenise.</p>
            <p> - (minor) I am not questioning the existence of the European Reference Genome Atlas here, but how best to phrase its relevance to ELIXIR in the manuscript. There seems to have no prokaryotes genomes in their atlas, however, there seems to be a trove of fungi and protists genomes which are usually said to be understudied in microbiome. So I think there is a missed opportunity for the authors here to make the most out of this entry.</p>
            <p> - (minor) "With its headquarters in Bari (Apulia region)," seems irrelevant in the context of the table, please remove.</p>
            <p> </p>
            <p> ### Data</p>
            <p> </p>
            <p> - (minor) The end of the following sentence is redundant as the INSDC was introduced earlier already "INSDC, which in collaboration with the National Institute of Genetics DNA DataBank of Japan (DDBJ) and the United States National Center for Biotechnology&#x2019;s (NCBI) GenBank and Sequence Read Archive (SRA), facilitate the deposition and global exchange of sequence data.". Please adjust accordingly.</p>
            <p> - (minor) "A current challenge facing the field is connecting different multi &#x2018;omics data that have been derived from the same sample." Is this going to be tackled by the ELIXIR Microbiome Community? If so, I would state that this is part of its objectives.</p>
            <p> - (minor) The end of sentence "[...]overarching context to the experiment, which can be important for meta-analyses." seems like an euphemism, I would suggest to replace with "[...]overarching context to the experiment, re-analyses or meta-analyses." to include re-analyses as well.</p>
            <p> - (minor) "We will continue to promote such approaches, enriching metadata wherever possible." Is this going to be done via the CDCH?</p>
            <p> - (minor) "The ELIXIR Microbiome Community will also work to move the Marine Metagenomics domain in the RDMKit towards a more general Microbiome domain." What is the RDMKit? It is not explained, nor cited nor mentioned again.</p>
            <p> </p>
            <p> ### Tools</p>
            <p> </p>
            <p> - (minor) "will increase their use of BioContainers" should be "will increase the use of BioContainers"</p>
            <p> - (minor) "In order to make tools findable 
                <bold>by the end users,&#x00a0;</bold> the Community"</p>
            <p> - (minor) "workflow descriptions (e.g. Snakemake, CWL, Nextflow)" None of them have their references cited, is it an omission or space limitation?</p>
            <p> - (minor) "A 
                <bold>current</bold> joint effort between the Microbiome and Galaxy Communities"</p>
            <p> </p>
            <p> ### Benchmarking</p>
            <p> </p>
            <p> - (major) "Benchmarking" it is the only item at this hierarchy level, meaning that this is useless for structuring the text. Please edit.</p>
            <p> - (minor)&#x00a0;Review reference 2 published a recent review with guidelines to learn from that could have its place in this paragraph.</p>
            <p> - (minor) in the sentence: "As the Microbiome Community establishes, we will develop a broader understanding of the requirements of the 
                <bold>Community</bold>, feed this to the Tools Platform, as well as seek opportunities to interact with the Tools Platform to capture the diversity of tools and their utility via such benchmarking activities.", is this the ELIXIR Microbiome Community, or the wide community of microbiome researchers? Is it to mean that the ELIXIR Microbiome Community is going to act as an interface between microbiome researchers and ELIXIR Tools/Infrastructure?</p>
            <p> </p>
            <p> ### Compute</p>
            <p> - (minor) The first sentence would fit better in the introduction.</p>
            <p> </p>
            <p> ### Interoperability</p>
            <p> - (minor) The reference 57 should be at the end of the sentence starting with "This effort was paralleled" not in the middle.</p>
            <p> - (minor) Reference 58 should be removed at it is a duplicate of reference 47.</p>
            <p> - (minor) Use the full text "Global Alliance for Genomics and Health" instead of GA4GH.</p>
            <p> - (minor) "is in the process of applying to be a
                <bold>n ELIXIR</bold> Recommended Interoperability Resource."</p>
            <p> - (minor) In the sentence: "This will require the development of new data Interoperability layers for data resources that are not normally focused in Microbiome data" I think the authors meant "used" instead of "focused", and "microbiome" instead of "Microbiome".</p>
            <p> </p>
            <p> ### Training</p>
            <p> - (minor) In "Platforms such as MGnify support large-scale services for most, if not all, steps of a microbiome study", I would suggest to remove the "if not all".</p>
            <p> - (minor) "with areas of expertise covering different environments, &#x2018;omics approaches and data analysis pathways." I think the authors meant "learning paths", and I would refrain from using "pathways" as it also has a biological meaning.</p>
            <p> </p>
            <p> ### Context with other international initiatives</p>
            <p> </p>
            <p> - (major) Whilst I appreciated the emphasis that no initiative exists on its own, I feel the first paragraph on the Genom
                <bold>ic</bold> (please correct the typo) Standards Consortium feels lengthy for a manuscript whose topic is not the GSC. I would advise to summarize. In this respect, the second paragraph is particularly relevant to a tangible collaboration between ELIXIR Microbiome Community and GSC.</p>
            <p> - (major) The NMDC is discussed in this section but not part of the Table 3.</p>
            <p> - (minor) Is the mentioned M5 project still active as the website's last update is 2012?</p>
            <p> - (minor) "Combining the activities on standards concerning workflows [...]" does this means adding and providing workflows to the microbiome research community?</p>
            <p> - (minor) Given the emphasis on the fact that MicrobiomeSupport 
                <italic>was</italic> a program, the authors could update the readers and indicate that it is now MicrobiomeSupport Association.</p>
            <p> </p>
            <p> ### Interaction with other key data resources beyond ELIXIR</p>
            <p> </p>
            <p> - (major) There is an order issue with the main text that a proofread could solve, as MG-RAST is explained and cited in the first paragraph but already mentioned upstream of the main text in the Interoperability section.</p>
            <p> </p>
            <p> ### Specific challenges and objectives of the ELIXIR Microbiome Community</p>
            <p> </p>
            <p> - (major) The strong claim "it is 
                <italic>widely accepted</italic> that current short-read assembly-based methods do not generally work as well for soil microbiomes" would probably need at least one reference.</p>
            <p> - (major) "(iii) there is no centralised database collecting the millions of viral sequences". It seems to be the case indeed, and there are databases (~24) out there as recently compiled in Review reference 3. How ELIXIR Microbiome Community plans to integrate/aggregate these resources in a non-duplicating manner?</p>
            <p> - (minor) The word "through" is superfluous in the the sentence that starts with "This current limitation, [...]" and can be removed.</p>
            <p> - (minor) The CAMI was already explained and cited above, so the already defined acronym can be used.</p>
            <p> - (minor) Precise the area in :"Additionally, another key area of development 
                <bold>of taxonomy</bold> [...]".</p>
            <p> - (minor) The sentence "Viruses, particularly those that infect bacteria, are found ubiquitously in all environments and play critical roles in community dynamics." belongs in an introduction, not so downstream of the manuscript.</p>
            <p> - (minor) The sentence starting the sixth paragraph could be precised as "The increase in metagenomic assemblies has resulted in a parallel increase in the number of 
                <bold>predicted</bold> protein sequences, with sets of non-redundant proteins now in the billions."</p>
            <p> - (minor) Fix typo in "that are undetectable by current sequence based methods."</p>
            <p> - (minor) Would it make sense to also be able to access representative, of clusters for instance? "[...] develop new infrastructural frameworks for accessing slices of the data 
                <bold>or adequate representatives</bold> based on the requirements."</p>
            <p> - (minor) What is the "
                <bold>expanded</bold> Microbiome Community"? It was never mentioned before.</p>
            <p> - (minor) A few comments on the sentence: "Within the Community, we will develop and promote standards around the analysis provenance (analytical metadata),". In my opinion and how it was already stated in the manuscript, it would make more sense to promote existing standards first and then develop if need be. There is no mention of other type of metadata, so the "analytical metadata" precision seems superfluous. I would suggest: "Within the Community, we will 
                <bold>promote and develop</bold> standards 
                <bold>regarding</bold> the analysis provenance,"</p>
            <p> - (minor) Precise the term forms in "[...] ensuring that comput
                <bold>ing</bold> resources are accessible for performing the different 
                <italic>forms</italic> of data analysis [...]", do the authors mean types of/steps in the data analysis?</p>
            <p> - (minor) Same argument as before regarding reinventing the wheel, I would swap the part of the sentence: "This may require the 
                <bold>extensions to existing databases or development of new ones</bold>, but it requires an agreement from the research community to adopt them."</p>
            <p> - (minor) This part "Metaproteomics aims to elucidate the functional and taxonomic interplay of proteins in microbiomes," should have been in the Table 1, or to reuse the Table 1 here.</p>
            <p> - (minor) The tenth paragraph of this section starts with the mention of multiple major challenges, but detail "only" one of them. I would suggest to mention some of the others challenges.</p>
            <p> - (minor) The "MIA" method is just a hyperlink, without any reference. Either cite the website accordingly or add the reference.</p>
            <p> </p>
            <p> ### Table 4</p>
            <p> </p>
            <p> - (major) I was surprised to see that the objective "Foster international collaborations between other resources providers and databases to ensure global harmonisation of e-infrastructures for microbiome research" was long-term, as I would have imagined that a gap analysis would be short-term to ensure we do not reinvent the wheel, especially given the others initiatives discussed in the manuscript.</p>
            <p> - (minor) If the Objective column starts with action verbs (which is a good idea), then it should be "Survey the needs" instead of "Survey of needs".</p>
            <p> - (minor) Specify the type of workflow with "Address knowledge gaps in generating and adopting 
                <bold>data analysis</bold> workflows"</p>
            <p> - (minor) The verb is missing in "
                <bold>Teach a</bold>dvanced containerisation and cloud deployment"</p>
            <p> - (minor) The verb is missing in "
                <bold>Promote</bold> data analysis through the use of services". Is this ELIXIR services in general or specific ELIXIR Microbiome Community services?</p>
            <p> - (minor) Use "Share" instead of "Sharing" in the "Co-ordinate" entry.</p>
            <p> - (minor) The "Industry connection" entry does not fit the action verb pattern. A suggestion would be "Use ELIXIR and Node forums to understand pharmaceutical and biotechnological demands and current limitations impacting this sector." as the first sentence felt generic.</p>
            <p> - (minor) The verb is missing in "
                <bold>Design</bold> targeted training for different microbiome communities"</p>
            <p> - (minor) Use "findability" instead of "discoverability" for consistency and to fit with the FAIR principles.</p>
            <p> - (minor) Reorder the sentence to start with the action verb: "
                <bold>E</bold>stablish new standards for microbiome research, particularly with respect to data analysis reporting and contextual metadata reporting 
                <bold>in conjunction with GSC</bold>"</p>
            <p> - (minor) Correct "Established" to "Establish" in the "Promoting new approaches" entry.</p>
            <p> </p>
            <p> Editorial comments:</p>
            <p> - (major) "Community" is used in upper-case and this is unclear in many instances whether the ELIXIR Marine Metagenomics Community is referred to, the ELIXIR Microbiome Community, or the broader microbiome research community. I suggest to use consistently the full term for the sake of transparency. Abbreviations like EMMC and EMC could be even more misleading in my opinion.</p>
            <p> - (major) The structure of the white paper is not evident as the hierarchy is indicated only by change in font size, and some sections are quite lengthy for a white paper that is supposed to be concise. I understand that this is a constraint from the Editor, but see Review reference 4 for a white paper with a more clearer structure. An alternative could be to use numbered sections.</p>
            <p> </p>
            <p> - References</p>
            <p> &#x00a0;&#x00a0; &#x00a0;- (minor) The very first reference is oddly formatted in the text creating an artificial and confusing end of sentence.</p>
            <p> &#x00a0;&#x00a0; &#x00a0;- (minor) Title of reference 9 is truncated and should be "Methods included: standardizing computational reuse and portability with the Common Workflow Language"</p>
            <p> &#x00a0;&#x00a0; &#x00a0;- (minor) The superscript numbers of the numeric style of bibliography are in many instances after the final dot (see reference 2, 12-16, 17-19, 36, 37), after a comma (see reference 10, 20, 23), a bracket (see reference 60) or semi colon (see reference 65) when they should be before any of these symbols.</p>
            <p> &#x00a0;&#x00a0; &#x00a0;- (minor) In the paragraph "Interactions with other ELIXIR Communities", we jump from reference 24 to 29 when the numeric style of bibliography (that was chosen by the authors) is expected to mirror the mentions in the manuscript. Please either have the Table 2 earlier in the paper, or change the order of the references.</p>
            <p>Is the topic of the opinion article discussed accurately in the context of the current literature?</p>
            <p>Yes</p>
            <p>Are arguments sufficiently supported by evidence from the published literature?</p>
            <p>Partly</p>
            <p>Are all factual statements correct and adequately supported by citations?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn balanced and justified on the basis of the presented arguments?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>microbiome, bioinformatics, reproducible research, data and metadata standards</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-244855-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Microbiome definition re-visited: old concepts and new challenges</article-title>.
                        <source>
                            <italic>Microbiome</italic>
                        </source>.<year>2020</year>;<volume>8</volume>(<issue>1</issue>) :
                        <elocation-id>10.1186/s40168-020-00875-0</elocation-id>
                        <pub-id pub-id-type="doi">10.1186/s40168-020-00875-0</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-244855-2">
                    <label>2</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Navigating the Landscape: A Comprehensive Review of Current Virus Databases.</article-title>
                        <source>
                            <italic>Viruses</italic>
                        </source>.<year>2023</year>;<volume>15</volume>(<issue>9</issue>) :
                        <elocation-id>10.3390/v15091834</elocation-id>
                        <pub-id pub-id-type="pmid">37766241</pub-id>
                        <pub-id pub-id-type="doi">10.3390/v15091834</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-244855-3">
                    <label>3</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Challenges and best practices in omics benchmarking.</article-title>
                        <source>
                            <italic>Nat Rev Genet</italic>
                        </source>.<year>2024</year>;
                        <elocation-id>10.1038/s41576-023-00679-6</elocation-id>
                        <pub-id pub-id-type="pmid">38216661</pub-id>
                        <pub-id pub-id-type="doi">10.1038/s41576-023-00679-6</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-244855-4">
                    <label>4</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>A community proposal to integrate proteomics activities in ELIXIR.</article-title>
                        <source>
                            <italic>F1000Res</italic>
                        </source>.<year>2017</year>;<volume>6</volume>:
                        <elocation-id>10.12688/f1000research.11751.1</elocation-id>
                        <pub-id pub-id-type="pmid">28713550</pub-id>
                        <pub-id pub-id-type="doi">10.12688/f1000research.11751.1</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment14229-244855">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Batut</surname>
                            <given-names>B&#x00e9;r&#x00e9;nice</given-names>
                        </name>
                        <aff>Institut Francais de Bioinformatique, &#x00c9;vry-Courcouronnes, &#x00ce;le-de-France, France</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>15</day>
                    <month>7</month>
                    <year>2025</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <italic>The authors make the case for mutating the ELIXIR Marine Metagenomics Community initiative into an ELIXIR Microbiome Community. It is indeed timely to have an such a pan-European initiative especially as it draws on previous experience, albeit a more narrow biome. The emphasis on multi-omics integration is also a strong suit as it is a complex topic where the microbiome research community would need support and infrastructure. The authors map out existing (but not all) similar initiatives, even if it is unclear how they would work together. In my opinion, a couple of claims should be strengthened and the argumentative power of this white paper would benefit from minor reorganization, a text slim down and extra proofreading. To this end, I highlight the strengths and weaknesses of the manuscript below.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>## Strengths</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - Table 3 is a great idea to map out the others initiatives but needs a bit rework to be straight to the point.</italic>
                </p>
                <p>
                    <italic> - In the Data section, the part between "Throughout the lifetime of the ELIXIR Marine Metagenomics Community" and the end of the paragraph is really relevant and draws from the experience of the ELIXIR Community. These sentences should be more highlighted, maybe upstream. I do wonder how the authors considered how the others initiatives (mentioned in Table 3) could also contribute to the addition of metadata for a global curation effort, possibly via the mentioned Contextual Data Clearinghouse.</italic>
                </p>
                <p>
                    <italic> - The data re-analysis and integration between PRIDE and MGnify (named MetaPUF) is a good use case of the efforts that the new ELIXIR Community can promote.</italic>
                </p>
                <p>
                    <italic> - I did appreciate the clear objective at the end of the Compute section, to help with scaling up analyses as well as the plan for the interaction with the ELIXIR Training Platform which promise to give a boost in training the current and next generation of microbiome researchers.</italic>
                </p>
                <p>
                    <italic> - I liked how the authors highlighted (using the example of viral sequences databases) that time and resources should not be wasted in duplicating works, and that initiatives such as the ELIXIR Microbiome Community can promote this.</italic>
                </p>
                <p>
                    <italic> - The authors thought ahead to promote and develop methods to limit confounding factors when re-using datasets, especially in multi-omics settings, and added this to one of the ELIXIR Microbiome Community challenges.</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> ## Weaknesses</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> ### Introduction</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) The microbiome definition used in the paper was revisited in Review reference 1 to include the interactions as well as others molecules than genomes as part of the microbiome. I suggest to use this definition given the emphasis on additional omics made in the paper.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have added that the definition includes other molecules in addition to genomes.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (major) The first paragraph ends on challenges of our research community including how to make data FAIR. Given the experience gained by the ELIXIR Marine Metagenomics Community, I would have appreciated a sentence on why making our data FAIR is important (e.g., transparency to make the research more reproducible, accountability given the (public) source of funding). See major comment in next section.</italic>
                </p>
                <p> 
                    <bold>Thank you for this suggestion. We have added a few sentences to this effect in the manuscript.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> ### The scope of the ELIXIR Microbiome Community</p>
                <p> </p>
                <p> 
                    <italic>- (major) These following arguments for FAIR data should be in the Introduction section.</italic>
                </p>
                <p>
                    <italic> "Moreover, when wishing to contextualise the results with similar experiments, the way a dataset has been produced and processed must be transparent to establish whether it is comparable (e.g. amplified sequence variants can only be compared when the same amplified regions are compared). Furthermore, when different methods are applied, best practices in data stewardship are required to ensure that the connectivity of the derived sequence data products, together with functional and taxonomic assertions are kept in context of the original sample/sequencing effort and associated contextual metadata."</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have merged this part of the paragraph into the Introduction.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The very first sentence of this section blames researchers for misuse of a term "[...] regularly (incorrectly) used [...]". I suggest to rephrase the sentence to simply state the differences, or back up the misuse of the term with references or a survey.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have qualified this with an example of INSDC mislabelling.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "Fundamentally, the ELIXIR Microbiome Community is about providing the necessary infrastructures required to perform analysis of 
                        <bold>nucleotide</bold> sequence data derived from a microbiome"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>&#x201c;Nucleotide&#x201d; has been added to this sentence.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "Finally, and possible unique to this ELIXIR Community, is the variety of researchers". I am not sure that these features are unique to the ELIXIR Community but rather to the field of microbiome research. I suggest to tone it down by simply pointing out that the ELIXIR Community gathers a variety of researchers.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Modified accordingly.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Table 1</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (major) Metabolomics is listed in Table 1 but is omitted from the narrative in the paragraph.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have added a sentence in the paragraph.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (major) The Table 1 is not really used but could actually reduce some redundancies in the manuscript by providing a one-stop-shop to explain and detail these techniques.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have increased the cross linking to the table.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> ### Figure 1</p>
                <p> </p>
                <p> - (major) The figure is early on in the manuscript and it is unclear which of the ELIXIR platforms and communities are already established, or foreseen. Especially since "current and future ELIXIR activities" is mentioned in the manuscript before referencing this figure. The Table 2 does not add more information in that regard.</p>
                <p> </p>
                <p> 
                    <italic>- (major) There is a need for different types of arrows as the same arrow represent interactions between communities or processes.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Modified accordingly.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The "Data" node is quite generic, I was wondering whether it meant public repositories, institute repositories or both. Please be precise.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>"Data" refers here to the ELIXIR Data Platform</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) I guess the type of metadata illustrated in the figure is restricted to biological metadata, that is indeed collected during sampling, however, technical metadata such as the sequencing method used, the type of instrument or the library layout, are collected during the processing of the sample not only the sampling itself.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>This is everything from phenotypic, sample conditions, experimental methods</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) There should be an arrow from "Archive" back to "Data" when the data produced is deposited and then contributes back to public repositories?</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>### Interactions with other ELIXIR Communities</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> * (major) The sentence "Similarly, many of the biodiversity approaches use marker gene amplification for studying environmental DNA (eDNA)" is redundant with the previous one 
                        <bold>and</bold> introduces a different nomenclature that was not used before&#x00a0; (marker gene amplication vs metabarcoding) and the differences, if any, are not explained. I would suggest to remove the sentence.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have modified the sentence to refer to barcoding rather than removing the sentence. We feel it is important to mention eDNA.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>* (major) The term "Isolation of genomes" is misleading and I guess the authors used it as a shorthand for "The isolation of bacteria, its DNA extraction, genome sequencing and their annotation". Please rephrase to avoid misinterpretation. Plus I would argue that these steps are also done when deconstructing microbiomes via cultivation strategies and are therefore not so out-of-scope.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>This sentence has been removed as the reviewer is correct that cultivation of bacteria (and other organisms) from microbiomes is becoming an increasing trend.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>* (minor) "yet each one of these areas is far greater in scientific scope" feels exaggerated and vague. It should be rephrased. A suggestion is: "yet each one of these areas is too complex to be tacked individually"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have amended the sentence accordingly.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>* (minor) There is no link nor transition between the paragraph that starts with "In summary, " before the Table 3 and the paragraph after that starts with "Similarly, microbiome research". Please rephrase or edit to connect the two sections. Plus, whilst this is good to have a concrete example in the "Similarly" paragraph, the paragraph before was very broad and doing a summary. I would suggest to try reordering the two paragraphs and bring the example earlier for a smoother transition.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>The paragraphs have been reordered as suggested and slightly amended to improve readability.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>* (minor) Typo "Similarly, microbiome research has many translation
                        <bold>al</bold> aspects"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Fixed</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>* (minor) The PET acronym is detailed but actually used only once, I am unsure if this is necessary.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>There is another instance of this abbreviation, so we have retained this abbreviation.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Table 2</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (major) The text in the "Interaction" column needs rework as it does not use a consistent wording and could be more to the point, especially as it is a complement to the main text:</italic>
                </p>
                <p>
                    <italic> &#x00a0;&#x00a0;&#x00a0;&#x00a0;- The Food and Nutrition entry is a question.</italic>
                </p>
                <p>
                    <italic> &#x00a0;&#x00a0;&#x00a0;&#x00a0;- The Galaxy entry has an unnecessary return carriage, and a unspecific "ongoing evaluation study" that needs to be clarified.</italic>
                </p>
                <p>
                    <italic> &#x00a0;&#x00a0;&#x00a0;&#x00a0;- The Plant Science entry starts with a generic sentence that could be in the introduction or removed for clarity. Plus I would add a clarification that "plants maintain 
                        <bold>or not</bold> their microbial communities across generations."</italic>
                </p>
                <p>
                    <italic> - (minor) The status (e.g., currently active, inactive, planned, etc.) of each ELIXIR Communities would have been appreciated as it is missing also from the Figure 1.</italic>
                </p>
                <p>
                    <italic> - (minor) The mention of "the field" for the Federated human data community is vague in a manuscript about gathering communities, which research field is implied? If this is the human microbiome research field as a whole, please indicate.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>In response to all of the above comments, we have substantially reworked table 2. The only comment that we have not addressed so specifically, is whether an activity is in progress or not, because the activities ebb and flow, and it is not always easy to say when there is a start or an end. However, being somewhat more focused in the content, we feel this revised table provides a stronger view of the direction that the community wants to take with the other communities.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Table 3</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (major) Similarly to the Table 2 major comment, there is a lack of consistent wording in the "Aim" column that makes the Table 3 not as impactful as it should and could be. Maybe the authors could extract common/distinct features from each of these initiatives as an alternative way to the "Aim" column. A couple of suggestions for these features would be: is it a national initiative?, does it relates to data storage, data analysis, training? is it linked to ELIXIR?</italic>
                </p>
                <p>
                    <italic> - (major) I was surprised not to see the NMDC listed in the table 3, especially when it is discussed in the main text. What about the NCCR Microbiomes initiative in Switzerland? I can understand that some initiatives are not included for space reason, but maybe state it in the legend of the table.</italic>
                </p>
                <p>
                    <italic> - (minor) Is this table sorted? It seems not, but it could be by acronyms or names.</italic>
                </p>
                <p>
                    <italic> - (minor) The aim for the NFDI4Microbiota is way too big a paragraph. The authors should reduce it for conciseness.</italic>
                </p>
                <p>
                    <italic> - (minor) The Metaproteomics Initiative entry has a hyperlink and a reference when none of the others have. Please homogenise.</italic>
                </p>
                <p>
                    <italic> - (minor) Some entries have country listed and some not. Please homogenise.</italic>
                </p>
                <p>
                    <italic> - (minor) I am not questioning the existence of the European Reference Genome Atlas here, but how best to phrase its relevance to ELIXIR in the manuscript. There seems to have no prokaryotes genomes in their atlas, however, there seems to be a trove of fungi and protists genomes which are usually said to be understudied in microbiome. So I think there is a missed opportunity for the authors here to make the most out of this entry.</italic>
                </p>
                <p>
                    <italic> - (minor) "With its headquarters in Bari (Apulia region)," seems irrelevant in the context of the table, please remove.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>As with Table 2, we have substantially reworked Table 3, including ordering by the reach of the effort, harmonising the language and reducing text. We have also included the relevance of the activity to the community. We have not included NMDC, as this is a US effort, and while important to the community, this table is focused on Europe efforts.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Data</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) The end of the following sentence is redundant as the INSDC was introduced earlier already "INSDC, which in collaboration with the National Institute of Genetics DNA DataBank of Japan (DDBJ) and the United States National Center for Biotechnology&#x2019;s (NCBI) GenBank and Sequence Read Archive (SRA), facilitate the deposition and global exchange of sequence data.". Please adjust accordingly.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have removed the redundancy here.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "A current challenge facing the field is connecting different multi &#x2018;omics data that have been derived from the same sample." Is this going to be tackled by the ELIXIR Microbiome Community? If so, I would state that this is part of its objectives.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We feel this is likely to be a joint effort across different communities. Thus, we have retained the sentence as is.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The end of sentence "[...]overarching context to the experiment, which can be important for meta-analyses." seems like an euphemism, I would suggest to replace with "[...]overarching context to the experiment, re-analyses or meta-analyses." to include re-analyses as well.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have added &#x201c;re-analyses or&#x201d; as suggested.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "We will continue to promote such approaches, enriching metadata wherever possible." Is this going to be done via the CDCH?</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>While CDCH offers one approach, we believe that there are many different ways that metadata may be enriched, first by making scientists more aware of the need of submitting meta, promoting different, more specific checklists (e.g. STORMS or MicroB3) and mining metadata from published literature. This, specifically highlighting CDCH would not be appropriate.&#x00a0;&#x00a0;</bold>
                </p>
                <p> </p>
                <p> - (minor) "The ELIXIR Microbiome Community will also work to move the Marine Metagenomics domain in the RDMKit towards a more general Microbiome domain." What is the RDMKit? It is not explained, nor cited nor mentioned again.</p>
                <p> </p>
                <p> 
                    <bold>This has been expanded to the ELIXIR Research Data Management Kit and a link has been added to the text.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Tools</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) "will increase their use of BioContainers" should be "will increase the use of BioContainers"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Corrected.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "In order to make tools findable 
                        <bold>by the end users, </bold>&#x00a0;the Community"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Done.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "workflow descriptions (e.g. Snakemake, CWL, Nextflow)" None of them have their references cited, is it an omission or space limitation?</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Added references</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "A 
                        <bold>current</bold> joint effort between the Microbiome and Galaxy Communities"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Fixed</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Benchmarking</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (major) "Benchmarking" it is the only item at this hierarchy level, meaning that this is useless for structuring the text. Please edit.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Thank you for pointing this out. We have changed the sub-sub-heading into an introductory sentence to this paragraph.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Review reference 2 published a recent review with guidelines to learn from that could have its place in this paragraph.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) in the sentence: "As the Microbiome Community establishes, we will develop a broader understanding of the requirements of the 
                        <bold>Community</bold>, feed this to the Tools Platform, as well as seek opportunities to interact with the Tools Platform to capture the diversity of tools and their utility via such benchmarking activities.", is this the ELIXIR Microbiome Community, or the wide community of microbiome researchers? Is it to mean that the ELIXIR Microbiome Community is going to act as an interface between microbiome researchers and ELIXIR Tools/Infrastructure?</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We clarified this to mean the wider community of microbiome researchers.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Compute</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) The first sentence would fit better in the introduction.</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> ### Interoperability</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) The reference 57 should be at the end of the sentence starting with "This effort was paralleled" not in the middle.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>This citation has been moved to the end of the sentence.</bold>
                </p>
                <p> </p>
                <p> - (minor) Reference 58 should be removed at it is a duplicate of reference 47.</p>
                <p> </p>
                <p> 
                    <bold>References have been fixed</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Use the full text "Global Alliance for Genomics and Health" instead of GA4GH.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>This abbreviation has been expanded.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "is in the process of applying to be a
                        <bold>n ELIXIR</bold> Recommended Interoperability Resource."</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>ELIXIR has been added to this sentence.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) In the sentence: "This will require the development of new data Interoperability layers for data resources that are not normally focused in Microbiome data" I think the authors meant "used" instead of "focused", and "microbiome" instead of "Microbiome".</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have updated the sentence accordingly.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Training</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) In "Platforms such as MGnify support large-scale services for most, if not all, steps of a microbiome study", I would suggest to remove the "if not all".</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Agreed.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "with areas of expertise covering different environments, &#x2018;omics approaches and data analysis pathways." I think the authors meant "learning paths", and I would refrain from using "pathways" as it also has a biological meaning.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Actually, we do mean data analysis, but have changed to data analysis strategies.&#x00a0; This is in reference to the fact that there may be multiple different ways of analysing a data type, for example metagenomics can be analysed using tools such as Kraken or MetaPhlAn4, to full assembly, gene calling and functional analysis.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Context with other international initiatives</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (major) Whilst I appreciated the emphasis that no initiative exists on its own, I feel the first paragraph on the Genom
                        <bold>ic</bold> (please correct the typo) Standards Consortium feels lengthy for a manuscript whose topic is not the GSC. I would advise to summarize. In this respect, the second paragraph is particularly relevant to a tangible collaboration between ELIXIR Microbiome Community and GSC.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>The typo has been corrected.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (major) The NMDC is discussed in this section but not part of the Table 3.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Table 3 lists pan-European efforts, whereas the NMDC is a US specific initiative. Thus, we have not added this to the table.&#x00a0;&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Is the mentioned M5 project still active as the website's last update is 2012?</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>While the website has not been updated, there are ongoing efforts to expand and revitalise this effort. RDF is a member of the GSC board and is promoting aspects of the M5 initiative.&#x00a0; While the name may change, we feel this is useful to keep.&#x00a0; However, as part of condensing the overall length of the manuscript, we have removed the reference to this initiative.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) "Combining the activities on standards concerning workflows [...]" does this means adding and providing workflows to the microbiome research community?</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) Given the emphasis on the fact that MicrobiomeSupport was a program, the authors could update the readers and indicate that it is now MicrobiomeSupport Association.</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> ### Interaction with other key data resources beyond ELIXIR</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (major) There is an order issue with the main text that a proofread could solve, as MG-RAST is explained and cited in the first paragraph but already mentioned upstream of the main text in the Interoperability section.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>This has now been fixed.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> ### Specific challenges and objectives of the ELIXIR Microbiome Community</p>
                <p> </p>
                <p> - (major) The strong claim "it is 
                    <italic>widely accepted</italic> that current short-read assembly-based methods do not generally work as well for soil microbiomes" would probably need at least one reference.</p>
                <p> </p>
                <p> 
                    <bold>Added</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (major) "(iii) there is no centralised database collecting the millions of viral sequences". It seems to be the case indeed, and there are databases (~24) out there as recently compiled in Review reference 3. How</italic>
                </p>
                <p>
                    <italic> ELIXIR Microbiome Community plans to integrate/aggregate these resources in a non-duplicating manner?</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The word "through" is superfluous in the the sentence that starts with "This current limitation, [...]" and can be removed.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have removed the sentence &#x201c;through&#x201d;.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The CAMI was already explained and cited above, so the already defined acronym can be used.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>This has been fixed.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Precise the area in :"Additionally, another key area of development 
                        <bold>of taxonomy</bold> [...]".</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Added</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The sentence "Viruses, particularly those that infect bacteria, are found ubiquitously in all environments and play critical roles in community dynamics." belongs in an introduction, not so downstream of the manuscript.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The sentence starting the sixth paragraph could be precised as "The increase in metagenomic assemblies has resulted in a parallel increase in the number of 
                        <bold>predicted</bold> protein sequences, with sets of non-redundant proteins now in the billions."</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Added &#x201c;predicted&#x201d; to the sentence.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Fix typo in "that are undetectable by current sequence based methods."</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Fixed - sequenced -&gt; sequence</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Would it make sense to also be able to access representative, of clusters for instance? "[...] develop new infrastructural frameworks for accessing slices of the data 
                        <bold>or adequate representatives</bold> based on the requirements."</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Added</bold>&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>- (minor) What is the "
                        <bold>expanded</bold> Microbiome Community"? It was never mentioned before.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Removed &#x201c;expanded&#x201d; from the sentence.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) A few comments on the sentence: "Within the Community, we will develop and promote standards around the analysis provenance (analytical metadata),". In my opinion and how it was already stated in the manuscript, it would make more sense to promote existing standards first and then develop if need be. There is no mention of other type of metadata, so the "analytical metadata" precision seems superfluous. I would suggest: "Within the Community, we will 
                        <bold>promote and develop</bold> standards 
                        <bold>regarding</bold> the analysis provenance,"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We agree with the reviewer's comment and have re-ordered accordingly.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Precise the term forms in "[...] ensuring that comput
                        <bold>ing</bold> resources are accessible for performing the different forms of data analysis [...]", do the authors mean types of/steps in the data analysis?</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>&#x201c;compute resources&#x201d; is an accepted resource.&#x00a0; We have removed the &#x201c;different forms of&#x201d; as we feel it is a little superfluous.&#x00a0; We were meaning the different strategies, but it does not add to the sentence.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Same argument as before regarding reinventing the wheel, I would swap the part of the sentence:</italic>
                </p>
                <p>
                    <italic> "This may require the 
                        <bold>extensions to existing databases or development of new ones</bold>, but it requires an agreement from the research community to adopt them."</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We agree, and have amended accordingly.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) This part "Metaproteomics aims to elucidate the functional and taxonomic interplay of proteins in microbiomes," should have been in the Table 1, or to reuse the Table 1 here.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have highlighted this in table 1. We have kept the text here to provide the context for the rest of the sentence.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The tenth paragraph of this section starts with the mention of multiple major challenges, but detail "only" one of them. I would suggest to mention some of the others challenges.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The "MIA" method is just a hyperlink, without any reference. Either cite the website accordingly or add the reference.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>There is not a reference, and the hyperlink goes to the GitHub site, as requested by the authors. We have included &#x201c;
                        <ext-link ext-link-type="uri" xlink:href="https://github.com/microbiome/mia">https://github.com/microbiome/mia</ext-link>&#x201d; for completeness sake.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>### Table 4</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>- (major) I was surprised to see that the objective "Foster international collaborations between other resources providers and databases to ensure global harmonisation of e-infrastructures for microbiome research" was long-term, as I would have imagined that a gap analysis would be short-term to ensure we do not reinvent the wheel, especially given the others initiatives discussed in the manuscript.</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (minor) If the Objective column starts with action verbs (which is a good idea), then it should be "Survey the needs" instead of "Survey of needs".</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have amended accordingly.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Specify the type of workflow with "Address knowledge gaps in generating and adopting 
                        <bold>data analysis</bold> workflows"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Added</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The verb is missing in "
                        <bold>Teach a</bold>dvanced containerisation and cloud deployment"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Added.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The verb is missing in "
                        <bold>Promote</bold> data analysis through the use of services". Is this ELIXIR services in general or specific ELIXIR Microbiome Community services?</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Added Promote - generalised to ELIXIR services.</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Use "Share" instead of "Sharing" in the "Co-ordinate" entry.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Done</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The "Industry connection" entry does not fit the action verb pattern. A suggestion would be "Use ELIXIR and Node forums to understand pharmaceutical and biotechnological demands and current limitations impacting this sector." as the first sentence felt generic.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Done</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) The verb is missing in "
                        <bold>Design</bold> targeted training for different microbiome communities"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Done</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Use "findability" instead of "discoverability" for consistency and to fit with the FAIR principles.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Done</bold>.</p>
                <p> </p>
                <p> 
                    <italic>- (minor) Reorder the sentence to start with the action verb: "
                        <bold>E</bold>stablish new standards for microbiome research, particularly with respect to data analysis reporting and contextual metadata reporting 
                        <bold>in conjunction with GSC</bold>"</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Done</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (minor) Correct "Established" to "Establish" in the "Promoting new approaches" entry.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Done</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>Editorial comments:</italic>
                </p>
                <p>
                    <italic> </italic>
                </p>
                <p>
                    <italic> - (major) "Community" is used in upper-case and this is unclear in many instances whether the ELIXIR Marine Metagenomics Community is referred to, the ELIXIR Microbiome Community, or the broader microbiome research community. I suggest to use consistently the full term for the sake of transparency. Abbreviations like EMMC and EMC could be even more misleading in my opinion.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have prefixed all instances of &#x201c;Community&#x201d; with their explicit community name.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- (major) The structure of the white paper is not evident as the hierarchy is indicated only by change in font size, and some sections are quite lengthy for a white paper that is supposed to be concise. I understand that this is a constraint from the Editor, but see Review reference 4 for a white paper with a more clearer structure. An alternative could be to use numbered sections.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>We have introduced section numbers to aid the structuring of the manuscript.&#x00a0;</bold>
                </p>
                <p> </p>
                <p> 
                    <italic>- References</italic>
                </p>
                <p>
                    <italic> &#x00a0; &#x00a0; - (minor) The very first reference is oddly formatted in the text creating an artificial and confusing end of sentence.</italic>
                </p>
                <p>
                    <italic> &#x00a0;&#x00a0;&#x00a0;&#x00a0;- (minor) Title of reference 9 is truncated and should be "Methods included: standardizing computational reuse and portability with the Common Workflow Language"</italic>
                </p>
                <p>
                    <italic> &#x00a0;&#x00a0;&#x00a0;&#x00a0;- (minor) The superscript numbers of the numeric style of bibliography are in many instances after the final dot (see reference 2, 12-16, 17-19, 36, 37), after a comma (see reference 10, 20, 23), a bracket (see reference 60) or semi colon (see reference 65) when they should be before any of these symbols.</italic>
                </p>
                <p>
                    <italic> &#x00a0;&#x00a0;&#x00a0;&#x00a0;- (minor) In the paragraph "Interactions with other ELIXIR Communities", we jump from reference 24 to 29 when the numeric style of bibliography (that was chosen by the authors) is expected to mirror the mentions in the manuscript. Please either have the Table 2 earlier in the paper, or change the order of the references.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>References have been fixed</bold>
                </p>
            </body>
        </sub-article>
    </sub-article>
</article>
