<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.128320.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Concatenated 16S rRNA sequence analysis improves bacterial taxonomy</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 1 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Paul</surname>
                        <given-names>Bobby</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-7212-1886</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Bioinformatics, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal, Karnataka, 576104, India</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:bobby.paul@manipal.edu">bobby.paul@manipal.edu</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>3</day>
                <month>4</month>
                <year>2023</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2022</year>
            </pub-date>
            <volume>11</volume>
            <elocation-id>1530</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>23</day>
                    <month>3</month>
                    <year>2023</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Paul B</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/11-1530/pdf"/>
            <abstract>
                <p>
                    <bold>Background:</bold> Microscopic, biochemical, molecular, and computer-based approaches are extensively used to identify and classify bacterial populations. Advances in DNA sequencing and bioinformatics workflows have facilitated sophisticated genome-based methods for microbial taxonomy although sequencing of the 16S rRNA gene is widely employed to identify and classify the bacterial community as a cost-effective and single-gene approach. However, the 16S rRNA sequence-based species identification accuracy is limited by multiple copies of the gene and their higher sequence identity between closely related species. The availability of a large volume of bacterial whole-genome data provided an opportunity to develop comprehensive species-specific 16S rRNA reference libraries.</p>
                <p>
                    <bold>Methods:</bold> The 16S rRNA copies were retrieved from the whole genomes in the complete stage at the Genome database. With defined rules, four 16S rRNA gene copy variants were concatenated to develop a species-specific reference library. The sequence similarity search was performed with a web-based BLAST program, and MEGA software was used to construct the phylogenetic tree.</p>
                <p>
                    <bold>Results:</bold> Using this approach, species-specific 16S rRNA gene libraries were developed for four closely related 
                    <italic toggle="yes">Streptococcus</italic> species (
                    <italic toggle="yes">S. gordonii</italic>, 
                    <italic toggle="yes">S. mitis</italic>, 
                    <italic toggle="yes">S. oralis</italic>, and 
                    <italic toggle="yes">S. pneumoniae</italic>). Sequence similarity and phylogenetic analysis using concatenated 16S rRNA copies yielded better resolution than single gene copy approaches.</p>
                <p>
                    <bold>Conclusions:</bold> The approach is very effective in classifying genetically related species and may reduce misclassification of bacterial species and genome assemblies.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>bacterial nomenclature</kwd>
                <kwd>bacterial taxonomy</kwd>
                <kwd>concatenated phylogeny</kwd>
                <kwd>species-specific barcode reference library</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/100019305">
                    <funding-source>Manipal Academy of Higher Education</funding-source>
                </award-group>
                <funding-statement>Open access funding was provided by Manipal Academy of Higher Education, Manipal.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>The manuscript has been revised as per the reviewer's suggestions. Previous studies suggest the UPGMA algorithm for 16S rRNA-based phylogeny. Hence, phylogenetic trees were plotted using the UPGMA algorithm. Fig 3 split into two as Fig 3 and Fig 4, to highlight the difference in single copy and concatenated approaches. Fig 3 is tree plotted based on single copy approach, and Fig 4 is plotted using concatenated sequences. Supplementary images (S1-S4) were provided to show the intra-species Parsim-info variable sites from 
                    <italic>S. oralis</italic>, 
                    <italic>S. mitis</italic>, 
                    <italic>S. gordonii</italic>, and 
                    <italic>S. pneumoniae</italic>.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>The genomic region encoding the 16S ribosomal RNA (16S rRNA) is extensively studied, and used to identify and classify bacterial species. The 16S rRNA is a conserved component of the small subunit (30S) of the prokaryotic ribosome. The gene encoding the 16S rRNA is ~1500 base pair (bp) long, and it consists of nine variable regions (
                <xref ref-type="bibr" rid="ref31">Reller 
                    <italic toggle="yes">et al</italic>. 2007</xref>; 
                <xref ref-type="bibr" rid="ref8">Chakravorty 
                    <italic toggle="yes">et al.</italic> 2007</xref>; 
                <xref ref-type="bibr" rid="ref32">Sabat 
                    <italic toggle="yes">et al</italic>. 2017</xref>). The sequence of the 16S rRNA gene has been extensively used as a molecular marker in culture-independent methods to identify and classify diverse bacterial communities (
                <xref ref-type="bibr" rid="ref11">Clarridge 2004</xref>; 
                <xref ref-type="bibr" rid="ref17">Johnson 
                    <italic toggle="yes">et al</italic>. 2019</xref>). Bacterial 16S rRNA sequences are currently being used to study the evolution, phylogenetic relationships, and environmental abundance of various taxa (
                <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>; 
                <xref ref-type="bibr" rid="ref35">Srinivasan 
                    <italic toggle="yes">et al</italic>. 2015</xref>; 
                <xref ref-type="bibr" rid="ref29">Peker 
                    <italic toggle="yes">et al</italic>. 2019</xref>).</p>
            <p>Although 16S rRNA sequence analyses are the mainstay of taxonomic studies of bacteria, there are some limitations. For example, the 16S rRNA gene has poor discriminatory power at the species level (
                <xref ref-type="bibr" rid="ref43">Winand 
                    <italic toggle="yes">et al</italic>. 2020</xref>), and the copy number per genome can vary from 1 to 15 or even more (
                <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>; 
                <xref ref-type="bibr" rid="ref43">Winand 
                    <italic toggle="yes">et al</italic>. 2020</xref>). The variable copies of this gene within a genome makes distinct data for a species. Therefore, gene copy normalization (GCN) may be necessary prior to sequence analysis. However, GCN may not improve the 16S rRNA sequence analyses in all scenarios, and comprehensive, species-specific catalogues of 16S rRNA gene copies may be necessary (
                <xref ref-type="bibr" rid="ref37">Starke 
                    <italic toggle="yes">et al</italic>. 2021</xref>). Furthermore, intra-species variations in the 16S rRNA gene copies were observed in several bacterial genome assemblies (
                <xref ref-type="bibr" rid="ref27">Paul 
                    <italic toggle="yes">et al</italic>. 2019</xref>). Only a few bacterial species contain identical 16S rRNA gene copies, and sequence diversity increases with increasing copy numbers of 16S rRNA genes (
                <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>). The high levels of similarity of the 16S rRNA gene across some bacterial species poses a major challenge for taxonomic studies using bioinformatics methods (
                <xref ref-type="bibr" rid="ref12">Deurenberg 
                    <italic toggle="yes">et al</italic>. 2017</xref>; 
                <xref ref-type="bibr" rid="ref29">Peker 
                    <italic toggle="yes">et al</italic>. 2019</xref>).</p>
            <p>Factors such as purity of bacterial cultures, quality of the purified DNA samples, and potential DNA chimeras should be carefully considered while sequencing and analysis of 16S rRNA genes (
                <xref ref-type="bibr" rid="ref16">Janda and Abbott 2007</xref>; 
                <xref ref-type="bibr" rid="ref9">Church 
                    <italic toggle="yes">et al</italic>. 2020</xref>). Sequencing errors can lead to misidentification of bacteria and phylogenetic anomalies (
                <xref ref-type="bibr" rid="ref1">Alachiotis 
                    <italic toggle="yes">et al</italic>. 2013</xref>). Other concerns include sequence ambiguities, gaps generated during DNA sequencing and sequence comparisons, and choosing the appropriate algorithm (local or global) for sequence alignment. Since the local alignment algorithm is extensively used for sequence similarity-based comparisons, it is important to carefully consider whether a single variable region or a combination of variable regions of the 16S rRNA gene would be ideal for bacterial classification (
                <xref ref-type="bibr" rid="ref16">Janda and Abbott 2007</xref>; 
                <xref ref-type="bibr" rid="ref17">Johnson 
                    <italic toggle="yes">et al</italic>. 2019</xref>; 
                <xref ref-type="bibr" rid="ref43">Winand 
                    <italic toggle="yes">et al</italic>. 2020</xref>). Using erroneous 16S rRNA sequences as references and improper bioinformatics workflows can mislead bacterial identification. Further, the growth of bioinformatics and genetic data has led to the current genome-based microbial classification. However, the success rate of these approaches are highly dependent on the skill of data analyst personnel in next generation sequencing technologies, computational tools, operation of high performance computing systems. Researchers without sufficient experience or skill in such technologies may also mislead the bacterial taxonomy (
                <xref ref-type="bibr" rid="ref4">Baltrus 2016</xref>).</p>
            <p>Other methods for bacterial identification include the sequencing and analysis of the polymerase chain reaction (PCR) amplified &#x223c;4.5 kb 16S&#x2013;23S rRNA regions (
                <xref ref-type="bibr" rid="ref5">Benitez-Paez and Sanz 2017</xref>; 
                <xref ref-type="bibr" rid="ref32">Sabat 
                    <italic toggle="yes">et al</italic>. 2017</xref>; 
                <xref ref-type="bibr" rid="ref19">Kerkhof 
                    <italic toggle="yes">et al</italic>. 2017</xref>). However, the 16S&#x2013;23S rRNA sequence-based method is less practical application due to the lack of appropriate reference sequence databases and reliable tools/methods for sequence analysis (
                <xref ref-type="bibr" rid="ref32">Sabat 
                    <italic toggle="yes">et al</italic>. 2017</xref>). Recent advances in bioinformatics workflows (
                <xref ref-type="bibr" rid="ref43">Winand 
                    <italic toggle="yes">et al</italic>. 2020</xref>; 
                <xref ref-type="bibr" rid="ref33">Schloss 2020</xref>) and reference databases such as SILVA, EzBioCloud (
                <xref ref-type="bibr" rid="ref30">Quast 
                    <italic toggle="yes">et al</italic>. 2013</xref>; 
                <xref ref-type="bibr" rid="ref44">Yoon 
                    <italic toggle="yes">et al</italic>. 2017</xref>) have further improved 16S rRNA-based bacterial taxonomy. However, these approaches are not completely reliable due to misclassification of some bacterial species and erroneous genome assemblies (
                <xref ref-type="bibr" rid="ref38">Steven 
                    <italic toggle="yes">et al</italic>. 2017</xref>; 
                <xref ref-type="bibr" rid="ref24">Mart&#x00ed;nez-Romero 
                    <italic toggle="yes">et al</italic>. 2018</xref>; 
                <xref ref-type="bibr" rid="ref25">Mateo-Estrada 
                    <italic toggle="yes">et al</italic>. 2019</xref>; 
                <xref ref-type="bibr" rid="ref3">Bagheri 
                    <italic toggle="yes">et al</italic>. 2020</xref>).</p>
            <p>The entire 16S rRNA gene (~1500 bp) can be amplified and sequenced using the conventional or high throughput sequencing methods. However, many 16S rRNA sequence-based bacterial identification studies do not seem to include all of these nine variable regions (
                <xref ref-type="bibr" rid="ref36">Stackebrandt 
                    <italic toggle="yes">et al</italic>. 2021</xref>). Due to the large volume of whole-genome data that is being produced by high throughput sequencing technologies, there is an urgent need to translate the genomic data for convenient microbiome analyses that ensure clinical practitioners can readily understand and quickly implement (
                <xref ref-type="bibr" rid="ref9">Church 
                    <italic toggle="yes">et al</italic>. 2020</xref>). This study aimed to develop a workflow for accurate identification of bacteria using concatenated, species-specific 16S rRNA sequences. It was hoped that the species-specific libraries would yield much better resolution in sequence similarity- and phylogeny-based bacterial classification.</p>
        </sec>
        <sec id="sec2" sec-type="methods">
            <title>Methods</title>
            <sec id="sec2.1">
                <title>Estimation of variations in intra-genomic 16S rRNA gene copies</title>
                <p>It has been reported that sequence alignment of 16S rRNA gene copies at the intra-genomic level shows a higher degree of variability in species belonging to the 
                    <italic toggle="yes">Firmicutes</italic> and 
                    <italic toggle="yes">Proteobacteria</italic> (
                    <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>; 
                    <xref ref-type="bibr" rid="ref15">Ibal 
                        <italic toggle="yes">et al</italic>. 2019</xref>). Therefore, this study used eight 16S rRNA gene copies (Underlying data: Supplementary data 1 (
                    <xref ref-type="bibr" rid="ref28">Paul 2022</xref>)) retrieved from the complete genome of 
                    <italic toggle="yes">Enterobacter asburiae</italic> strain ATCC 35953 (NZ_CP011863.1). To estimate intra-genomic variability between these 16S rRNA gene copies, BLAST+ 2.13.0 (RRID:SCR_004870; 
                    <xref ref-type="bibr" rid="ref2">Altschul 
                        <italic toggle="yes">et al</italic>. 1990</xref>) and Clustal Omega 1.2.4 (RRID:SCR_001591; 
                    <xref ref-type="bibr" rid="ref34">Sievers 
                        <italic toggle="yes">et al</italic>. 2011</xref>) sequence alignment algorithms were used. Previous studies suggested unweighted pair group method with arithmetic averages (UPGMA) algorithm for the phylogenetic analysis of 16S rRNA genes (
                    <xref ref-type="bibr" rid="ref11">Clarridge 2004</xref>; 
                    <xref ref-type="bibr" rid="ref7">Caporaso 
                        <italic toggle="yes">et al</italic>. 2011</xref>). Hence, phylogenetic analysis of these 16S rRNA gene copies were performed using the UPGMA method (Maximum Composite Likelihood; 500 bootstrap replicates) provided in the MEGA software (version 11; RRID: SCR_000667; 
                    <xref ref-type="bibr" rid="ref20">Kumar 
                        <italic toggle="yes">et al</italic>. 2018</xref>).</p>
            </sec>
            <sec id="sec2.2">
                <title>Construction of species-specific concatenated 16S rRNA reference libraries</title>
                <p>Previous studies have reported that the genes encoding 16S rRNA from several bacterial species share &gt;99% sequence identity (
                    <xref ref-type="bibr" rid="ref12">Deurenberg 
                        <italic toggle="yes">et al</italic>. 2017</xref>; 
                    <xref ref-type="bibr" rid="ref29">Peker 
                        <italic toggle="yes">et al</italic>. 2019</xref>). Therefore, the 16S rRNA-based methods failed to correctly identify bacterial species that are genetically closely related (
                    <xref ref-type="bibr" rid="ref12">Deurenberg 
                        <italic toggle="yes">et al</italic>. 2017</xref>; 
                    <xref ref-type="bibr" rid="ref13">Devanga-Ragupathi 
                        <italic toggle="yes">et al</italic>. 2018</xref>). It has been reported that 16S rRNA-based methods cannot distinguish between 
                    <italic toggle="yes">Streptococcus mitis</italic> and 
                    <italic toggle="yes">Streptococcus pneumoniae</italic> due to the high sequence similarity (
                    <xref ref-type="bibr" rid="ref31">Reller 
                        <italic toggle="yes">et al</italic>. 2007</xref>; 
                    <xref ref-type="bibr" rid="ref21">Lal 
                        <italic toggle="yes">et al</italic>. 2011</xref>). Hence, the study decided to choose the 16S rRNA gene copies from four closely related species of 
                    <italic toggle="yes">Streptococcus.</italic>
                </p>
                <p>More than 498,000 whole-genome sequences are currently (Jan 2023) available for bacterial species in the Genome database (RRID:SCR_002474; 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/genome">https://www.ncbi.nlm.nih.gov/genome</ext-link>). Many of these genomes were sequenced using high throughput sequencing technologies such as Illumina/Ion-Torrent (short read sequencing) and PacBio/Nanopre (long read sequencing). Furthermore, most of these whole-genome sequences were obtained after a hybrid assembly of short and long read sequence data. This extensive, high throughput data can be effectively used to develop advanced genome-based methods for microbial systematics. Although the genomic data is available in four levels (contig, scaffold, chromosome, and complete), this study used only the complete genomes to retrieve 16S rRNA genes.</p>
                <p>To develop species-specific barcode reference libraries, this study retrieved full-length 16S rRNA genes from 16 complete genome sequences belonging to four 
                    <italic toggle="yes">Streptococcus</italic> species (
                    <italic toggle="yes">S. gordonii</italic>, 
                    <italic toggle="yes">S. mitis</italic>, 
                    <italic toggle="yes">S. oralis,</italic> and 
                    <italic toggle="yes">S. pneumoniae</italic>). Details of the dataset used to develop species-specific concatenated reference libraries are provided in 
                    <xref ref-type="table" rid="T1">Table 1</xref>, and the sequences are provided in the underlying data (Supplementary data 2 (
                    <xref ref-type="bibr" rid="ref28">Paul 2022</xref>)). Sequences were trimmed beyond the universal primer pair (fD1-5&#x2032;-GAG TTT GAT CCT GGC TCA-3&#x2032; and rP2-5&#x2032;-ACG GCT AAC TTG TTA CGA CT-3&#x2032;, which are used for full-length 16S rDNA amplification, 
                    <xref ref-type="bibr" rid="ref42">Weisburg 
                        <italic toggle="yes">et al</italic>. 1991</xref>) to maintain uniform length. To perform multiple sequence alignment and identify the intra-species parsimony informative (Parsim-info) variable sites, the MEGA 11 software was used. A species-specific barcode reference library that covers the entire Parsim-info variable sites was constructed by concatenating four 16S rRNA gene copies from four different strains of a species. The rationale for the selection of four copies for constructing a species-specific barcode reference library was: (i) a maximum of four variations can be found at a single site, and (ii) earlier studies have shown that the mean 16S rRNA copies per genome is four (
                    <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>).</p>
                <table-wrap id="T1" orientation="portrait" position="float">
                    <label>Table 1. </label>
                    <caption>
                        <title>Details of whole genome assemblies used for the development of concatenated 16S rRNA reference libraries.</title>
                        <p>One copy of 16S rRNA gene from each strain is used for the concatenation.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Species</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Strains</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Genome accession number</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">No. of 16S rRNA gene copies</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Sequencing platform</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Species-specific library name</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Library length (bp)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">No. of Parsim-info sites</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S. gordonii</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">FDAARGOS 1454</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP077224.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio; Illumina</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S.gordonii-</italic>Ref-I</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">6076</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">7</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NCTC7868</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">LR134291.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">KCOM 1506</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP012648.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">5</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">Illumina</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NCTC9124</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">LR594041.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S. mitis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">B6</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NC_013853.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NA</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S.mitis-</italic>Ref-I</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">6033</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">10</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">KCOM 1350</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP012646.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">3</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">Illumina</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">SVGS 061</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP014326.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio; Illumina</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NCTC 12261</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP028414.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S. oralis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NCTC 11427</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">LR134336.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S.oralis-</italic>Ref-I</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">6038</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">24</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">34</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP079724.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">Illumina; Nanopore</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">FDAARGOS 886</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP065706.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio; Illumina</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">F0392</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP034442.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S. pneumoniae</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">475</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP046355.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">
                                    <italic toggle="yes">S.pneumoniae</italic>-Ref-I</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">6032</td>
                                <td align="left" colspan="1" rowspan="4" valign="middle">6</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NU83127</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">AP018936.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">Nanopore; Illumina</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NCTC7465</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">LN831051.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">6A-10</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">CP053210.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">PacBio</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
            </sec>
            <sec id="sec2.3">
                <title>Demonstration of concatenated 16S rRNA in sequence similarity and phylogeny</title>
                <p>This study analyzed a few cases to demonstrate (i) the classical sequence similarity and (ii) phylogenetic analysis using concatenated species-specific 16S rRNA reference libraries. The study used nine 16S rRNA gene copies (sequenced using the Sanger method) showing higher sequence similarity to the 16S rRNA genes of multiple species of 
                    <italic toggle="yes">Streptococcus</italic> were retrieved from GenBank database (RRID:SCR_002760). The web-based BLAST2 (version 2.13.0) program for aligning two or more sequences was used to estimate the maximum score, total alignment score, and sequence identity of these nine 16S rRNA sequences selected. For the sequence similarity search, a single copy of the 16S rRNA (sequenced using the Sanger method or retrieved from a whole-genome assembly) can be considered as &#x2018;Query sequence&#x2019;. The concatenated species-specific reference libraries need to be provided in the text area for &#x2018;Subject sequence&#x2019;. However, to perform phylogenetic analysis, it is mandatory that the target sequence (length = n bp) be concatenated four times (length = 4 &#x00d7; n bp). Phylogenetic analysis was performed for single gene copies and concatenated approach using UPGMA method as indicated above.</p>
            </sec>
        </sec>
        <sec id="sec3" sec-type="results">
            <title>Results</title>
            <sec id="sec3.1">
                <title>Intra-genomic 16S rRNA variations in 
                    <italic toggle="yes">E. asburiae</italic>
                </title>
                <p>Historically, sequences of the 16S rRNA genes have been used to identify known and new bacterial species. However, efficiency of PCR-based amplification, poor discrimination at the species level, multiple polymorphic 16S rRNA gene copies, and improper bioinformatics workflows for the data analysis can impact the identification. The genome of 
                    <italic toggle="yes">E. asburiae</italic> contains eight copies of the 16S rRNA gene. Analysis using Clustal Omega (global alignment) and BLAST (local alignment) showed that the sequences of these eight alleles had average identities of 99.29 and 99%, respectively (
                    <xref ref-type="table" rid="T2">Table 2</xref>). Therefore, choosing the appropriate algorithm/tool is critical for the estimation of sequence identities and sequence-based species delineation. For analyzing sequence pairs that are highly identical, global sequence alignment algorithms seem to be more appropriate because they consider all the nucleotides for the estimation of sequence identity. Clustal Omega based multiple sequence alignment of the eight alleles of the 16S rRNA gene in the genome of 
                    <italic toggle="yes">E. asburiae</italic> showed 22 variable sites (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>). These results show that the computational analysis using a single gene copy makes different results for species harbouring variable copies of this gene.</p>
                <table-wrap id="T2" orientation="portrait" position="float">
                    <label>Table 2. </label>
                    <caption>
                        <title>Percent identity of eight intra genomic 16S rRNA regions from 
                            <italic toggle="yes">Enterobacter asburiae</italic> strain ATCC 35953 (NZ_CP011863.1).</title>
                        <p>Percent identity given below the diagonal line is calculated with Clustal Omega software (Mean identity: 99.29%) and those above the diagonal line were calculated with the BLASTN program (Mean identity: 99.00%). Genome coordinates of 16S rRNA copies: R1: 2686082&#x2013;2687660 (1579 bp); R2: 3148265&#x2013;3149814 (1550 bp); R3: 3313470&#x2013;3315019 (1550 bp); R4: 3583942&#x2013;3585481 (1540 bp); R5:3684745&#x2013;3686294 (1550 bp); R6: 3771751&#x2013;3773300 (1550 bp); R7: 3968538&#x2013;3970087 (1550 bp); R8: 4647650&#x2013;4649199 (1550 bp).</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical5.gif"/>
                                </td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Clustal Omega based multiple sequence alignment of eight intra genomic 16S rRNA gene copies from 
                            <italic toggle="yes">Enterobacter asburiae</italic> strain ATCC 35953 (NZ_CP011863.1) showing 22 variable sites.</title>
                        <p>According to 
                            <xref ref-type="bibr" rid="ref8">Chakravorty 
                                <italic toggle="yes">et al.</italic> (2007)</xref>, the nine variable regions of 16S rRNA gene spanned nucleotides 69-99, 137-242, 433-497, 576-682, 822-879, 986-1043, 1117-1173, 1243-1294, and 1435-1465 for V1 to V9 respectively.</p>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_figure1.gif"/>
                </fig>
                <p>The evolutionary relationship between species is usually represented using a phylogenetic tree based on the analysis of a single gene, multiple genes, or whole genomes. However, bacterial identification and classification is mainly based on the phylogenetic analysis of single copies of 16S rRNA genes. A phylogenetic tree was constructed to understand how variations in the sequences of the eight alleles of the 16S rRNA gene in the genome of 
                    <italic toggle="yes">E. asburiae</italic> influence species delineation (
                    <xref ref-type="fig" rid="f2">Figure 2</xref>). These results indicate that the intra-genomic variations in 16S rRNA copies may mislead the bacterial taxonomy in single gene copy approaches.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Phylogenetic tree of eight intra genomic 16S rRNA gene copies from 
                            <italic toggle="yes">Enterobacter asburiae</italic> strain ATCC 35953 (NZ_CP011863.1).</title>
                        <p>The node label denotes the coordinate of 16S rRNA regions in the genome.</p>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_figure2.gif"/>
                </fig>
            </sec>
            <sec id="sec3.2">
                <title>Species-specific concatenated 16S rRNA libraries</title>
                <p>This study selected four species of 
                    <italic toggle="yes">Streptococcus</italic> (
                    <italic toggle="yes">S. gordonii</italic>, 
                    <italic toggle="yes">S. mitis</italic>, 
                    <italic toggle="yes">S. oralis</italic>, and 
                    <italic toggle="yes">S. pneumoniae</italic>) to construct species-specific concatenated reference libraries based on 16S rRNA gene sequences obtained from complete genomes. Four variable copies of the 16S rRNA gene from a species are required to construct a species-specific concatenated reference library. The details of species-specific libraries are listed in 
                    <xref ref-type="table" rid="T1">Table 1</xref> and the sequences are provided in the underlying data (Supplementary data 3 (
                    <xref ref-type="bibr" rid="ref28">Paul 2022</xref>)). Analysis using the sequences of 16S rRNA genes showed 24, 10, 7, and 6 Parsim-info variable sites for 
                    <italic toggle="yes">S. oralis</italic>, 
                    <italic toggle="yes">S. mitis</italic>, 
                    <italic toggle="yes">S. gordonii,</italic> and 
                    <italic toggle="yes">S. pneumoniae</italic>, respectively
                    <italic toggle="yes">.</italic> The intra-species Parsim-info variable sites were located in both the conserved and variable regions of the 16S rRNA gene (Supplementary data 4 (
                    <xref ref-type="bibr" rid="ref28">Paul 2022</xref>)).</p>
                <p>The study used full-length 16S rRNA gene copies from four different strains to highlight the variations at the species level. However, a large number of partial 16S rRNA gene sequences are available in the public genetic databases. Further, many researchers are amplifying only few variable regions of the 16S rRNA gene. In such cases, a species-specific concatenated reference library can be constructed using partial sequences. Intra-species variations in the sequences of 16S rRNA gene copies influence the sequence-based bacterial identification. Therefore, concatenation of the sequences of 16S rRNA gene provides much better resolution compared to analysis using sequences from a single copy of the 16S rRNA gene.</p>
            </sec>
            <sec id="sec3.3">
                <title>Demonstration of concatenated 16S rRNA based species identification</title>
                <p>This study compared sequences of nine 16S rRNA genes from different species of 
                    <italic toggle="yes">Streptococcus</italic> (
                    <xref ref-type="table" rid="T3">Table 3</xref>) against the species-specific concatenated reference libraries constructed. The analysis showed that the concatenated sequences provide much better resolution in sequence similarity search and phylogenetic analysis. The sequence accession numbers GU470907.1 and KF933785.1 classified as 
                    <italic toggle="yes">S. mitis</italic> showed a higher maximum and total alignment score with concatenated 16S rRNA library of 
                    <italic toggle="yes">S. oralis</italic> than 
                    <italic toggle="yes">S. mitis</italic> (
                    <xref ref-type="table" rid="T3">Table 3</xref>). Two sequences (OM368574.1 classified as 
                    <italic toggle="yes">S. mitis</italic> and OM368578.1 classified as 
                    <italic toggle="yes">S. pneumoniae</italic>) showed same score against the four reference libraries constructed. Based on the maximum total alignment score these two sequences are belonging to 
                    <italic toggle="yes">S. pneumoniae</italic>, however, they classified as two separate species. Interestingly, the sequence GU470907.1 classified as 
                    <italic toggle="yes">S. mitis</italic> showed 100% identity with 
                    <italic toggle="yes">S. oralis</italic> reference library with a total alignment score of 10936.</p>
                <table-wrap id="T3" orientation="portrait" position="float">
                    <label>Table 3. </label>
                    <caption>
                        <title>Similarity of selected sequences against the concatenated species-specific 16S rRNA reference libraries.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="2" valign="top">GenBank Accession Number</th>
                                <th align="left" colspan="1" rowspan="2" valign="top">Species</th>
                                <th align="left" colspan="3" rowspan="1" valign="top">
                                    <italic toggle="yes">S. gordonii</italic>-Ref-I</th>
                                <th align="left" colspan="3" rowspan="1" valign="top">
                                    <italic toggle="yes">S. mitis</italic>-Ref-I</th>
                                <th align="left" colspan="3" rowspan="1" valign="top">
                                    <italic toggle="yes">S. oralis</italic>-Ref-I</th>
                                <th align="left" colspan="3" rowspan="1" valign="top">
                                    <italic toggle="yes">S. pneumoniae</italic>-Ref-I</th>
                            </tr>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Max Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Total Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Identity (%)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Max Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Total Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Identity (%)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Max Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Total Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Identity (%)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Max Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Total Score</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Identity (%)</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">AJ295848.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">S. mitis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2495</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">9967</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.45</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2769</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">11027</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.80</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2758</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10851</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.67</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2752</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10982</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.60</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">AM157428.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">S. mitis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2462</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">9845</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.05</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2724</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10866</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.27</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2702</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10685</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.01</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2708</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10805</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.07</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NR_028664.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">S. mitis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2499</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">9991</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.45</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2776</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10979</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.87</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2750</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10864</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.54</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2724</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10888</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.27</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">GU470907.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">S. mitis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2536</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10096</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.91</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2715</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10796</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.14</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2787</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10936</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">100</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2091</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10716</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">98.87</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">KF933785.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">S. mitis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2466</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">9832</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.06</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2667</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10593</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">98.54</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2673</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10650</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">98.61</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2632</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10502</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">98.15</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">OM368574.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">S. mitis</italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2475</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">9896</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.24</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2754</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10968</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.67</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2732</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10814</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.40</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2760</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10990</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.73</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">OM368578.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">
                                        <styled-content style="#0E101A" style-type="color">S. pneumoniae</styled-content>
                                    </italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2475</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">9896</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.24</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2754</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10968</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.67</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2732</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10814</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.40</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2760</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10990</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.73</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">AM157442.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">
                                        <styled-content style="#0E101A" style-type="color">S. pneumoniae</styled-content>
                                    </italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2470</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">9863</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.12</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2702</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10779</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.01</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2715</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10726</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.14</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2702</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10777</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.01</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">NR_117719.1</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <italic toggle="yes">
                                        <styled-content style="#0E101A" style-type="color">S. oralis</styled-content>
                                    </italic>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2531</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10074</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">96.84</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2710</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10774</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">99.07</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2787</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10925</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">100</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2697</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10739</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">98.94</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>The study plotted two phylogenetic tree to highlight the difference in single gene copy approach and concatenated approach. 
                    <xref ref-type="fig" rid="f3">Figure 3</xref> represent the single gene copy approach, shows phylogenetic tree of the nine 16S rRNA gene sequences selected along with the gene copies used for the construction of four concatenated species-specific reference libraries. The inclusion of misclassified sequences and intra-species variations in 16S rRNA copies may mislead the phylogenetic tree inference. 
                    <xref ref-type="fig" rid="f4">Figure 4</xref> shows the phylogenetic relationship of nine selected sequences with four concatenated species-specific reference libraries constructed. The concatenated GU470907.1 sequence showed a phylogenetic relationship with 
                    <italic toggle="yes">S. oralis</italic> and sequence OM368574.1 was genetically related to 
                    <italic toggle="yes">S. pneumoniae.</italic> Phylogenetic analysis showed that three sequences AM157428 (
                    <italic toggle="yes">S. mitis</italic>), KF933785 (
                    <italic toggle="yes">S. mitis</italic>), and AM157442 (
                    <italic toggle="yes">S. pneumoniae</italic>) stayed separately and might be other species than the four species tested. Furthermore, two sequences AJ295848 and NR_028664 classified as 
                    <italic toggle="yes">S. mitis</italic> showed significant similarity with concatenated 16S rRNA reference library of 
                    <italic toggle="yes">S. mitis.</italic> Similarly, sequence NR_117719 (
                    <italic toggle="yes">S. oralis</italic>) showed phylogenetic relationship with reference library of 
                    <italic toggle="yes">S. oralis</italic> and OM368578 (
                    <italic toggle="yes">S. pneumoniae</italic>) with 
                    <italic toggle="yes">S. pneumoniae</italic> reference library. These results further confirm that species-specific concatenated 16S rRNA reference libraries provide much better taxonomic resolution. Therefore, this study recommends concatenated sequences of 16S rRNA genes for sequence similarity- and phylogeny-based species identification.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>Phylogenetic analysis of randomly selected nine 16S rRNA sequences classified as 
                            <italic toggle="yes">Streptococcus</italic> species and sequences used for species-specific reference library.</title>
                        <p>The phylogenetic tree plotted using single copy approach. The node name highlighted in shapes (
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical1.gif"/>, 
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical2.gif"/>, 
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical3.gif"/>, 
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical4.gif"/>) represents the sequences which are used for the construction of four concatenated species-specific reference libraries.</p>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_figure3.gif"/>
                </fig>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Phylogenetic tree constructed using concatenated 16S rRNA approach.</title>
                        <p>The randomly selected nine 16S rRNA sequences classified as 
                            <italic toggle="yes">Streptococcus</italic> species were compared with four species-specific reference libraries constructed. The node name highlighted in shapes (
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical1.gif"/>, 
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical2.gif"/>, 
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical3.gif"/>, 
                            <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_Graphical4.gif"/>) represents the four species-specific reference libraries.</p>
                    </caption>
                    <graphic id="gr4" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/144651/5fb9e0f7-004c-4319-a0a9-012bda510597_figure4.gif"/>
                </fig>
            </sec>
        </sec>
        <sec id="sec4">
            <title>Discussion</title>
            <p>Sequencing and analysis of the 16S rRNA encoding region is a conventional and robust method for identifying and classifying bacterial species. The barcode gene is widely used in sequence similarity, phylogeny, and metagenome-based species identification. However, the accuracy of bacterial taxonomy based on 16S rRNA barcode regions is limited by the intra-genomic heterogeneity of multiple 16S rRNA gene copies and significant sequence identity of this gene among closely related taxa. Furthermore, identification of closely related species using sequences of the 16S rRNA gene is a challenge, and it may lead to species misidentification (
                <xref ref-type="bibr" rid="ref6">Boudewijns 
                    <italic toggle="yes">et al</italic>. 2006</xref>; 
                <xref ref-type="bibr" rid="ref9">Church 
                    <italic toggle="yes">et al</italic>. 2020</xref>). About 15% of the bacterial genomes have only a single copy of the 16S rRNA gene, and only a minority of bacterial genomes contain identical 16S rRNA gene copies (
                <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>). The 16S rRNA gene copies can vary from 1 to 15 in a genome, and the copy number is taxon specific (
                <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>). Sequence diversity increases with the increasing 16S rRNA copy numbers. The 16S rRNA sequence variation can even be found at intra-genomic level or in different strains of a species. Amplification of a limited number of variable regions cannot achieve the same taxonomic resolution as that of the entire gene (
                <xref ref-type="bibr" rid="ref17">Johnson 
                    <italic toggle="yes">et al</italic>. 2019</xref>). Usage of misclassified 16S rRNA sequences as a reference and inappropriate bioinformatics workflows can also mislead the taxonomic assignment. To overcome these challenges, it is important to translate high throughput microbial genomic data into meaningful, actionable information that clinicians can readily understand and quickly implement for bacterial identification. Hence, the study intended to develop a species-specific catalogue of concatenated 16S rRNA gene copies that can yield better inference in sequence similarity and phylogenetic analysis.</p>
            <p>Several bioinformatics resources are extensively used for the 16S rRNA sequence analysis and bacterial identification. However, several researchers report the sequence similarity derived through a local alignment algorithm. Earlier reports have suggested that the species belonging to the taxa Gammaproteobacteria show higher intra-species variability (
                <xref ref-type="bibr" rid="ref41">Vetrovsky and Baldrian 2013</xref>). Hence, the study estimated the percent identity of intra-genomic 16S rRNA gene copies of 
                <italic toggle="yes">E. asburiae</italic> using local and global alignment algorithms. The reference genome of 
                <italic toggle="yes">E. asburiae</italic> has eight 16S rRNA gene copies in its genome. The BLAST and Clustal sequence alignment algorithms yielded marginally varying results for the intra-genomic 16S rRNA gene copies. Local alignment algorithms may not consider base mismatches at the ends of sequences when calculating percent identity, while global alignment algorithms consider entire sequences. Therefore, global sequence alignment is best for estimating intra and inter-species identity for single gene copies. However, BLAST can calculate the total alignment score with multiple paralogue regions. Hence, web-based BLAST2 is suggested for estimating the sequence similarity using concatenated barcode reference libraries.</p>
            <p>The GenBank (
                <xref ref-type="bibr" rid="ref22">Leray 
                    <italic toggle="yes">et al</italic>. 2019</xref>) and NCBI 16S RefSeq databases for bacteria (
                <xref ref-type="bibr" rid="ref43">Winand 
                    <italic toggle="yes">et al</italic>. 2020</xref>) are reliable for species-level identification and classification. However, few earlier studies have highlighted the misclassification of species and genome assemblies in public genetic databases (
                <xref ref-type="bibr" rid="ref26">Parks 
                    <italic toggle="yes">et al</italic>. 2018</xref>; 
                <xref ref-type="bibr" rid="ref40">Varghese 
                    <italic toggle="yes">et al</italic>. 2015</xref>). For example, the 16S rRNA sequence accession number (Ac. No.) LT707617.1 shows the organism as 
                <italic toggle="yes">Streptococcus mitis.</italic> Conventional BLAST-based sequence similarity search shows the highest identity of 99.60% with 
                <italic toggle="yes">S. mitis</italic> 16S rRNA sequence (Ac. No. AB002520.1). However, the 16S rRNA sequence (Ac. No. LT707617.1) did not show significant similarity with other 16S rRNA reference sequences available for 
                <italic toggle="yes">S. mitis.</italic> Furthermore, the sequence also shows 99.44% identity with reference 16S rRNA sequences of 
                <italic toggle="yes">S. gordonii.</italic> Hence, the study performed a sequence alignment of the sequence (Acc. No. LT707617.1) against species-specific concatenated 16S rRNA reference libraries for 
                <italic toggle="yes">S. gordonii</italic> (
                <italic toggle="yes">S.gordonii-</italic>Ref-I), and 
                <italic toggle="yes">S. mitis</italic> (
                <italic toggle="yes">S.mitis-</italic>Ref-I). The alignment resulted in a significant identity of 99.44% with 
                <italic toggle="yes">S.gordonii-</italic>Ref-I (2279 maximum and 9041 total alignment score) than 
                <italic toggle="yes">S.mitis</italic>-Ref-I (97.13% identity with 2119 maximum and 8449 total alignment score). Single copy BLAST results may show only a minor fraction of the difference in percent identity and maximum or total alignment score for closely related species. However, sequence similarity estimation using species-specific concatenated reference libraries shows marginal difference in total alignment score, as it is aligned against four copies. Hence, 16S rRNA analysis with a species-specific concatenated barcode reference library will give better accuracy for bacterial classification than approaches using a single copy.</p>
            <p>Several 16S rRNA sequences show 100% identity with multiple species, which is the major challenge in sequence-based species identification. For example, the 16S rRNA sequence from 
                <italic toggle="yes">S. mitis</italic> (Accession. No. GU470907.1; 1522 bp) shares 100% identity with the 16S rRNA gene from 
                <italic toggle="yes">S. oralis</italic> strain ATCC 35037 genome (Ac. No. CP034442.1). Hence, the sequence (GU470907.1) aligned against the species-specific concatenated reference libraries for 
                <italic toggle="yes">S. oralis</italic> (
                <italic toggle="yes">S.oralis-</italic>Ref-I), and 
                <italic toggle="yes">S. mitis</italic> (
                <italic toggle="yes">S.mitis-</italic>Ref-I). The result showed 100% identity with 
                <italic toggle="yes">S. oralis</italic> (2787 maximum and 10936 total alignment score), and 99.14% identity with 
                <italic toggle="yes">S. mitis</italic> (2715 maximum and 10796 total alignment score). Further, a phylogenetic tree of GU470907.1 (1509 &#x00d7; 4 = 6036 bp) with reference libraries 
                <italic toggle="yes">S.mitis-</italic>Ref-I, and 
                <italic toggle="yes">S.oralis-</italic>Ref-I was plotted. The UPGMA-based phylogenetic tree showed that the 
                <italic toggle="yes">S. mitis</italic> (GU470907.1) sequence is more closely related to 
                <italic toggle="yes">S. oralis</italic> than 
                <italic toggle="yes">S. mitis</italic> (
                <xref ref-type="fig" rid="f4">Figure 4</xref>). Concatenated 16S rRNA-based estimation of sequence similarity and a phylogenetic inference provides better resolution than single-gene approaches. These results show that the concatenated 16S rRNA approach is very effective in discriminating genetically closely related bacterial species. Furthermore, other studies have also highlighted that the phylogenetic tree inferred from vertically inherited protein sequence concatenation provided higher resolution than those obtained from a single copy (
                <xref ref-type="bibr" rid="ref10">Ciccarelli 
                    <italic toggle="yes">et al</italic>. 2006</xref>; 
                <xref ref-type="bibr" rid="ref39">Thiergart 
                    <italic toggle="yes">et al</italic>. 2014</xref>).</p>
            <p>Recent phylogenetic studies using concatenated multi-gene sequence data highlighted the importance of incorporating variations in gene histories, which will improve the traditional phylogenetic inferences (
                <xref ref-type="bibr" rid="ref14">Devulder 
                    <italic toggle="yes">et al</italic>. 2005</xref>; 
                <xref ref-type="bibr" rid="ref18">Johnston 
                    <italic toggle="yes">et al</italic>. 2019</xref>). Furthermore, a single type of analysis should not be relied upon, instead, and to a certain extent, integrated bioinformatics approaches can avoid misclassification. As a cost-effective approach, the study combined substantial variations in 16S rRNA gene copies from a species to examine the performance of the single gene concatenation approach. Analyses using a concatenated 16S rRNA gene approach have the following advantages: (i) the gene is present in all the bacterial species, (ii) the gene is weakly affected by horizontal gene transfer, (iii) the approach is very cost-effective, (iv) there is a large volume of reference genomic data available for several bacterial species, (v) it is effective in discriminating closely related bacterial species, (vi) the analyses can be performed in a computer with minimum configuration, and (vii) the analyses can be employed with available tools for sequence similarity and molecular phylogeny.</p>
        </sec>
        <sec id="sec5" sec-type="conclusions">
            <title>Conclusions</title>
            <p>The concatenated 16S rRNA analyses showed that:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Full-length 16S rRNA gene amplification provides better accuracy than inference based on partial gene sequences with a limited number of variable regions.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Full-length 16S rRNA gene copies from whole-genome assemblies (in 'complete' stage) should be used rather than partial sequences available from the public genetic databases to construct species-specific concatenated 16S rRNA libraries and further downstream analysis.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>To avoid mismatches in the sequence alignment, trim the bases beyond the primer ends and correct the base-call errors prior to the analysis.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Estimation of mean 16S rRNA identity at the intra-species level helps to classify the species having a higher degree of intra-genomic 16S rRNA heterogeneity.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Four distinct 16S rRNA gene copies cover all the Parsim-Info variable sites and these can be used to construct a concatenated species-specific reference library.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>The total alignment score can be considered if the query sequence shows more or less the same percent identity with multiple species.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>It is not prudent to rely only on sequence similarity; the final decision must be based on the phylogenetic inference.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Species-specific concatenated 16S rRNA gene libraries are recommended for sequence similarity and phylogenetic analysis.</p>
                    </list-item>
                </list>
            </p>
        </sec>
    </body>
    <back>
        <sec id="sec10" sec-type="data-availability">
            <title>Data availability</title>
            <sec id="sec11">
                <title>Underlying data</title>
                <p>Zenodo: Underlying data for &#x2018;Concatenated 16S rRNA sequence analysis improves bacterial taxonomy&#x2019;, 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.7758747">https://doi.org/10.5281/zenodo.7758747</ext-link> (
                    <xref ref-type="bibr" rid="ref28">Paul 2022</xref>).</p>
                <p>This project contains the following underlying data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Supplementary data 1: The 16S rRNA copies retrieved from the whole genome of 
                                <italic toggle="yes">Enterobacter asburiae</italic> strain ATCC 35953.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Supplementary data 2: Full-length 16S rRNA gene copies retrieved from 16 genome assemblies belonging to four 
                                <italic toggle="yes">Streptococcus</italic> species (
                                <italic toggle="yes">S. gordonii</italic>, 
                                <italic toggle="yes">S. mitis</italic>, 
                                <italic toggle="yes">S. oralis</italic>, and 
                                <italic toggle="yes">S. pneumoniae</italic>).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Supplementary data 3: Species-specific concatenated 16S rRNA libraries constructed for four 
                                <italic toggle="yes">Streptococcus</italic> species (
                                <italic toggle="yes">S. gordonii</italic>, 
                                <italic toggle="yes">S. mitis</italic>, 
                                <italic toggle="yes">S. oralis</italic>, and 
                                <italic toggle="yes">S. pneumoniae</italic>).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Supplementary data 4: Intra-species Parsim-info variable sites in the 16S rRNA gene from for four 
                                <italic toggle="yes">Streptococcus</italic>
 species (
                                <italic toggle="yes">S. gordonii</italic>, 
                                <italic toggle="yes">S. mitis</italic>, 
                                <italic toggle="yes">S. oralis</italic>, and 
                                <italic toggle="yes">S. pneumoniae</italic>).
</p>
                        </list-item>
                    </list>
                </p>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International license</ext-link> (CC-BY 4.0)</p>
            </sec>
            <sec id="sec12">
                <title>Accession numbers</title>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus gordonii</italic> strain FDAARGOS 1454 chromosome, complete genome. Accession number CP077224.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP077224.1">https://www.ncbi.nlm.nih.gov/nuccore/CP077224.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus gordonii</italic> strain NCTC7869, chromosome 1, complete genome. Accession number LR134291.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/LR134291.1">https://www.ncbi.nlm.nih.gov/nuccore/LR134291.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus gordonii</italic> strain KCOM 1506 (=ChDC B679), complete genome. Accession number CP012648.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP012648.1">https://www.ncbi.nlm.nih.gov/nuccore/CP012648.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus gordonii</italic> strain NCTC9124, chromosome 1, complete genome. Accession number LR594041.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/LR594041.1">https://www.ncbi.nlm.nih.gov/nuccore/LR594041.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> B6, complete genome. Accession number NC_013853.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/NC_013853.1">https://www.ncbi.nlm.nih.gov/nuccore/NC_013853.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain KCOM 1350 (= ChDC B183), complete genome. Accession number CP012646.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP012646.1">https://www.ncbi.nlm.nih.gov/nuccore/CP012646.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain SVGS_061 chromosome, complete genome. Accession number CP014326.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP014326.1">https://www.ncbi.nlm.nih.gov/nuccore/CP014326.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> NCTC 12261 chromosome, complete genome. Accession number CP028414.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP028414.1">https://www.ncbi.nlm.nih.gov/nuccore/CP028414.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus oralis</italic> strain NCTC11427, chromosome 1, complete genome. Accession number LR134336.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/LR134336.1">https://www.ncbi.nlm.nih.gov/nuccore/LR134336.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus oralis</italic> strain 34 chromosome, complete genome. Accession number CP079724.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP079724.1">https://www.ncbi.nlm.nih.gov/nuccore/CP079724.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus oralis</italic> strain FDAARGOS_886 chromosome, complete genome. Accession number CP065706.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP065706.1">https://www.ncbi.nlm.nih.gov/nuccore/CP065706.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus oralis</italic> subsp. 
                    <italic toggle="yes">dentisani</italic> strain F0392 chromosome, complete genome. Accession number CP034442.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP034442.1">https://www.ncbi.nlm.nih.gov/nuccore/CP034442.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus pneumoniae</italic> strain 475 chromosome, complete genome. Accession number CP046355.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP046355.1">https://www.ncbi.nlm.nih.gov/nuccore/CP046355.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus pneumoniae</italic> NU83127 DNA, complete genome. Accession number AP018936.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/AP018936.1">https://www.ncbi.nlm.nih.gov/nuccore/AP018936.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus pneumoniae</italic> NCTC7465, chromosome 1, complete genome. Accession number LN831051.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/LN831051.1">https://www.ncbi.nlm.nih.gov/nuccore/LN831051.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus pneumoniae</italic> strain 6A-10 chromosome, complete genome. Accession number CP053210.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CP053210.1">https://www.ncbi.nlm.nih.gov/nuccore/CP053210.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain 127R, partial 16S rRNA gene. Accession number AJ295848.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/AJ295848.1">https://www.ncbi.nlm.nih.gov/nuccore/AJ295848.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> clone 2C4, 16S rRNA gene. Accession number AM157428.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/AM157428.1">https://www.ncbi.nlm.nih.gov/nuccore/AM157428.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain NS51, partial 16S rRNA gene. Accession number NR_028664.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/NR_028664.1">https://www.ncbi.nlm.nih.gov/nuccore/NR_028664.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> bv. 2 strain F0392, partial 16S rRNA gene. Accession number GU470907.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/GU470907.1">https://www.ncbi.nlm.nih.gov/nuccore/GU470907.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain ChDC B553, partial 16S rRNA gene. Accession number KF933785. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/KF933785.1">https://www.ncbi.nlm.nih.gov/nuccore/KF933785.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain FC6528, partial 16S rRNA gene. Accession number OM368574.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/OM368574.1">https://www.ncbi.nlm.nih.gov/nuccore/OM368574.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus pneumoniae</italic> strain FC6532, partial 16S rRNA gene. Accession number OM368578.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/OM368578.1">https://www.ncbi.nlm.nih.gov/nuccore/OM368578.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus pneumoniae</italic> clone 4V4, 16S rRNA gene. Accession number AM157442. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/AM157442.1">https://www.ncbi.nlm.nih.gov/nuccore/AM157442.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus oralis</italic> subsp. 
                    <italic toggle="yes">dentisani</italic> strain 7747, partial 16S rRNA gene. Accession number NR_117719. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/NR_117719.1">https://www.ncbi.nlm.nih.gov/nuccore/NR_117719.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Enterobacter asburiae</italic> strain ATCC 35953 chromosome, complete genome. Accession number NZ_CP011863. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/NZ_CP011863.1">https://www.ncbi.nlm.nih.gov/nuccore/NZ_CP011863.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain HAC11, isolate #11, partial 16S rRNA gene. Accession number LT707617. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/LT707617.1">https://www.ncbi.nlm.nih.gov/nuccore/LT707617.1</ext-link>
                </p>
                <p>GenBank: 
                    <italic toggle="yes">Streptococcus mitis</italic> strain NCTC 3165, MAFF 911479, 16S rRNA gene. Accession number AB002520.1. 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/AB002520.1">https://www.ncbi.nlm.nih.gov/nuccore/AB002520.1</ext-link>
                </p>
            </sec>
        </sec>
        <ack>
            <title>Acknowledgements</title>
            <p>The author would like to thank DST-FIST, the Government of India, TIFAC-CORE in Pharmacogenomics and Manipal Academy of Higher Education (MAHE), Manipal for the support and facilities provided.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Alachiotis</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vogiatzi</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pavlidis</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Chromatogate: a tool for detecting base mis-calls in multiple sequence alignments by semi-automatic chromatogram inspection.</article-title>
                    <source>

                        <italic toggle="yes">Comput. Struct. Biotechnol. J.</italic>
</source>
                    <year>2013</year>;<volume>6</volume>:<fpage>e201303001</fpage>.
                    <pub-id pub-id-type="pmid">24688709</pub-id>
                    <pub-id pub-id-type="doi">10.5936/csbj.201303001</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3962156</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Altschul</surname>
                            <given-names>SF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gish</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Miller</surname>
                            <given-names>W</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Basic local alignment search tool.</article-title>
                    <source>

                        <italic toggle="yes">J. Mol. Biol.</italic>
</source>
                    <year>1990</year>;<volume>215</volume>:<fpage>403</fpage>&#x2013;<lpage>410</lpage>.
                    <pub-id pub-id-type="doi">10.1016/S0022-2836(05)80360-2</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bagheri</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Severin</surname>
                            <given-names>AJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rajan</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>Detecting and correcting misclassified sequences in the large-scale public databases.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2020</year>;<volume>36</volume>:<fpage>4699</fpage>&#x2013;<lpage>4705</lpage>.
                    <pub-id pub-id-type="pmid">32579213</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa586</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7821992</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Baltrus</surname>
                            <given-names>DA</given-names>
                        </name>
</person-group>:
                    <article-title>Divorcing strain classification from species names.</article-title>
                    <source>

                        <italic toggle="yes">Trends Microbiol.</italic>
</source>
                    <year>2016</year>;<volume>24</volume>:<fpage>431</fpage>&#x2013;<lpage>439</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.tim.2016.02.004</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Benitez-Paez</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sanz</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <article-title>Multi-locus and long amplicon sequencing approach to study microbial diversity at species level using the MinIONTM portable Nanopore sequencer.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2017</year>;<volume>6</volume>:<fpage>1</fpage>&#x2013;<lpage>12</lpage>.
                    <pub-id pub-id-type="pmid">28605506</pub-id>
                    <pub-id pub-id-type="doi">10.1093/gigascience/gix043</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5534310</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Boudewijns</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bakkers</surname>
                            <given-names>JM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sturm</surname>
                            <given-names>PDJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>16S rRNA gene sequencing and the routine clinical microbiology laboratory: A perfect marriage?</article-title>
                    <source>

                        <italic toggle="yes">J. Clin. Microbiol.</italic>
</source>
                    <year>2006</year>;<volume>44</volume>:<fpage>3469</fpage>&#x2013;<lpage>3470</lpage>.
                    <pub-id pub-id-type="pmid">16954306</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.01017-06</pub-id>
                    <pub-id pub-id-type="pmcid">PMC1594676</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Caporaso</surname>
                            <given-names>JG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lauber</surname>
                            <given-names>CL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Walters</surname>
                            <given-names>WA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample.</article-title>
                    <source>

                        <italic toggle="yes">Proc. Natl. Acad. Sci. U S A.</italic>
</source>
                    <year>2011</year>;<volume>108</volume>:<fpage>4516</fpage>&#x2013;<lpage>4522</lpage>.
                    <pub-id pub-id-type="doi">10.1073/pnas.1000080107</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chakravorty</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Helb</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Burday</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria.</article-title>
                    <source>

                        <italic toggle="yes">J. Microbiol. Methods.</italic>
</source>
                    <year>2007</year>;<volume>69</volume>:<fpage>330</fpage>&#x2013;<lpage>339</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.mimet.2007.02.005</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Church</surname>
                            <given-names>DL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cerutti</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>G&#x00fc;rtler</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Performance and application of 16S rRNA gene cycle sequencing for routine identification of bacteria in the clinical microbiology laboratory.</article-title>
                    <source>

                        <italic toggle="yes">Clin. Microbiol. Rev.</italic>
</source>
                    <year>2020</year>;<volume>33</volume>:<fpage>e00053</fpage>&#x2013;<lpage>e00019</lpage>.
                    <pub-id pub-id-type="pmid">32907806</pub-id>
                    <pub-id pub-id-type="doi">10.1128/CMR.00053-19</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7484979</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ciccarelli</surname>
                            <given-names>FD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Doerks</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mering</surname>
                            <given-names>C</given-names>
                            <prefix>von</prefix>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Toward automatic reconstruction of a highly resolved tree of life.</article-title>
                    <source>

                        <italic toggle="yes">Science.</italic>
</source>
                    <year>2006</year>;<volume>311</volume>:<fpage>1283</fpage>&#x2013;<lpage>1287</lpage>.
                    <pub-id pub-id-type="doi">10.1126/science.1123061</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Clarridge</surname>
                            <given-names>JE</given-names>
                        </name>
</person-group>:
                    <article-title>Impact of 16S rRNA gene sequence analysis for identification of bacteria on clinical microbiology and infectious diseases.</article-title>
                    <source>

                        <italic toggle="yes">Clin. Microbiol. Rev.</italic>
</source>
                    <year>2004</year>;<volume>17</volume>:<fpage>840</fpage>&#x2013;<lpage>862</lpage>.
                    <pub-id pub-id-type="pmid">15489351</pub-id>
                    <pub-id pub-id-type="doi">10.1128/CMR.17.4.840-862.2004</pub-id>
                    <pub-id pub-id-type="pmcid">PMC523561</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Deurenberg</surname>
                            <given-names>RH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bathoorn</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chlebowicz</surname>
                            <given-names>MA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Application of next generation sequencing in clinical microbiology and infection prevention.</article-title>
                    <source>

                        <italic toggle="yes">J. Biotechnol.</italic>
</source>
                    <year>2017</year>;<volume>243</volume>:<fpage>16</fpage>&#x2013;<lpage>24</lpage>.
                    <pub-id pub-id-type="pmid">28042011</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.jbiotec.2016.12.022</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Devanga-Ragupathi</surname>
                            <given-names>NK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Muthuirulandi</surname>
                            <given-names>SDP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Inbanathan</surname>
                            <given-names>FY</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Accurate differentiation of 
                        <italic toggle="yes">Escherichia coli</italic> and 
                        <italic toggle="yes">Shigella</italic> serogroups: challenges and strategies.</article-title>
                    <source>

                        <italic toggle="yes">New Microbes New Infect.</italic>
</source>
                    <year>2018</year>;<volume>21</volume>:<fpage>58</fpage>&#x2013;<lpage>62</lpage>.
                    <pub-id pub-id-type="pmid">29204286</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.nmni.2017.09.003</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5711669</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Devulder</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Montclos</surname>
                            <given-names>MP</given-names>
                            <prefix>de</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Flandrois</surname>
                            <given-names>JP</given-names>
                        </name>
</person-group>:
                    <article-title>A multigene approach to phylogenetic analysis using the genus 
                        <italic toggle="yes">Mycobacterium</italic> as a model.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Syst. Evol. Microbiol.</italic>
</source>
                    <year>2005</year>;<volume>55</volume>:<fpage>293</fpage>&#x2013;<lpage>302</lpage>.
                    <pub-id pub-id-type="doi">10.1099/ijs.0.63222-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ibal</surname>
                            <given-names>JC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pham</surname>
                            <given-names>HQ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Park</surname>
                            <given-names>CE</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Information about variations in multiple copies of bacterial 16S rRNA genes may aid in species identification.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2019</year>;<volume>14</volume>:<fpage>e0212090</fpage>.
                    <pub-id pub-id-type="pmid">30768621</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0212090</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6377111</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Janda</surname>
                            <given-names>JM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Abbott</surname>
                            <given-names>SL</given-names>
                        </name>
</person-group>:
                    <article-title>16S rRNA gene sequencing for bacterial identification in the diagnostic laboratory: Pluses, perils, and pitfalls.</article-title>
                    <source>

                        <italic toggle="yes">J. Clin. Microbiol.</italic>
</source>
                    <year>2007</year>;<volume>45</volume>:<fpage>2761</fpage>&#x2013;<lpage>2764</lpage>.
                    <pub-id pub-id-type="pmid">17626177</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.01228-07</pub-id>
                    <pub-id pub-id-type="pmcid">PMC2045242</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Johnson</surname>
                            <given-names>JS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Spakowicz</surname>
                            <given-names>DJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hong</surname>
                            <given-names>BY</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Commun.</italic>
</source>
                    <year>2019</year>;<volume>10</volume>:<fpage>5011</fpage>&#x2013;<lpage>5029</lpage>.
                    <pub-id pub-id-type="pmid">31695033</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41467-019-13036-1</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6834636</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Johnston</surname>
                            <given-names>PR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Quijada</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Smith</surname>
                            <given-names>CA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A multigene phylogeny toward a new phylogenetic classification of 
                        <italic toggle="yes">Leotiomycetes.</italic>
                    </article-title>
                    <source>

                        <italic toggle="yes">IMA Fungus.</italic>
</source>
                    <year>2019</year>;<volume>10</volume>:<fpage>1</fpage>.
                    <pub-id pub-id-type="pmid">32647610</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s43008-019-0002-x</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7325659</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kerkhof</surname>
                            <given-names>LJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dillon</surname>
                            <given-names>KP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Haggblom</surname>
                            <given-names>MM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Profiling bacterial communities by MinION sequencing of ribosomal operons.</article-title>
                    <source>

                        <italic toggle="yes">Microbiome.</italic>
</source>
                    <year>2017</year>;<volume>5</volume>:<fpage>116</fpage>.
                    <pub-id pub-id-type="pmid">28911333</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s40168-017-0336-9</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5599880</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kumar</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stecher</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MEGA X: Molecular evolutionary genetics analysis across computing platforms.</article-title>
                    <source>

                        <italic toggle="yes">Mol. Biol. Evol.</italic>
</source>
                    <year>2018</year>;<volume>35</volume>:<fpage>1547</fpage>&#x2013;<lpage>1549</lpage>.
                    <pub-id pub-id-type="pmid">29722887</pub-id>
                    <pub-id pub-id-type="doi">10.1093/molbev/msy096</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5967553</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref21">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lal</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Verma</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lal</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>Exploring internal features of 16S rRNA gene for identification of clinically relevant species of the genus 
                        <italic toggle="yes">Streptococcus.</italic>
                    </article-title>
                    <source>

                        <italic toggle="yes">Ann. Clin. Microbiol. Antimicrob.</italic>
</source>
                    <year>2011</year>;<volume>10</volume>:<fpage>28</fpage>.
                    <pub-id pub-id-type="pmid">21702978</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1476-0711-10-28</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3151204</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Leray</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Knowlton</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ho</surname>
                            <given-names>SL</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>GenBank is a reliable resource for 21
                        <sup>st</sup> century biodiversity research.</article-title>
                    <source>

                        <italic toggle="yes">Proc. Natl. Acad. Sci. USA.</italic>
</source>
                    <year>2019</year>;<volume>116</volume>:<fpage>22651</fpage>&#x2013;<lpage>22656</lpage>.
                    <pub-id pub-id-type="pmid">31636175</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.1911714116</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6842603</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lai</surname>
                            <given-names>Q</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shao</surname>
                            <given-names>Z</given-names>
                        </name>
</person-group>:
                    <article-title>Genome analysis-based reclassification of 
                        <italic toggle="yes">Bacillus weihenstephanensis</italic> as a later heterotypic synonym of 
                        <italic toggle="yes">Bacillus mycoides.</italic>
                    </article-title>
                    <source>

                        <italic toggle="yes">Int. J. Syst. Evol. Microbiol.</italic>
</source>
                    <year>2018</year>;<volume>68</volume>:<fpage>106</fpage>&#x2013;<lpage>112</lpage>.
                    <pub-id pub-id-type="pmid">29095136</pub-id>
                    <pub-id pub-id-type="doi">10.1099/ijsem.0.002466</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mart&#x00ed;nez-Romero</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rodr&#x00ed;guez-Medina</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Beltr&#x00e1;n-Rojel</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genome misclassification of 
                        <italic toggle="yes">Klebsiella variicola</italic> and 
                        <italic toggle="yes">Klebsiella quasipneumoniae</italic> isolated from plants, animals and humans.</article-title>
                    <source>

                        <italic toggle="yes">Salud Publica Mex.</italic>
</source>
                    <year>2018</year>;<volume>60</volume>:<fpage>56</fpage>&#x2013;<lpage>62</lpage>.
                    <pub-id pub-id-type="pmid">29689657</pub-id>
                    <pub-id pub-id-type="doi">10.21149/8149</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mateo-Estrada</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Grana-Miraglia</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lopez-Leal</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Phylogenomics reveals clear cases of misclassification and genus-wide phylogenetic markers for 
                        <italic toggle="yes">Acinetobacter.</italic>
                    </article-title>
                    <source>

                        <italic toggle="yes">Genome Biol. Evol.</italic>
</source>
                    <year>2019</year>;<volume>11</volume>:<fpage>2531</fpage>&#x2013;<lpage>2541</lpage>.
                    <pub-id pub-id-type="pmid">31406982</pub-id>
                    <pub-id pub-id-type="doi">10.1093/gbe/evz178</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6740150</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Parks</surname>
                            <given-names>DH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Waite</surname>
                            <given-names>DW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Skarshewski</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Biotechnol.</italic>
</source>
                    <year>2018</year>;<volume>36</volume>:<fpage>996</fpage>&#x2013;<lpage>1004</lpage>.
                    <pub-id pub-id-type="pmid">30148503</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.4229</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Paul</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dixi</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Murali</surname>
                            <given-names>TS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genome-based taxonomic classification.</article-title>
                    <source>

                        <italic toggle="yes">Genome.</italic>
</source>
                    <year>2019</year>;<volume>62</volume>:<fpage>45</fpage>&#x2013;<lpage>52</lpage>.
                    <pub-id pub-id-type="doi">10.1139/gen-2018-0072</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref28">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Paul</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>Concatenated 16S rRNA sequence analysis improves bacterial taxonomy.</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.51281/zenodo.7384709</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Peker</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Garcia-Croes</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dijkhuizen</surname>
                            <given-names>B</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A comparison of three different bioinformatics analyses of the 16S-23S rRNA encoding region for bacterial identification.</article-title>
                    <source>

                        <italic toggle="yes">Front. Microbiol.</italic>
</source>
                    <year>2019</year>;<volume>10</volume>:<fpage>620</fpage>.
                    <pub-id pub-id-type="pmid">31040829</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fmicb.2019.00620</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6476902</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Quast</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pruesse</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yilmaz</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The SILVA ribosomal RNA gene database project: improved data processing and web-based tools.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2013</year>;<volume>41</volume>:<fpage>D590</fpage>&#x2013;<lpage>D596</lpage>.
                    <pub-id pub-id-type="pmid">23193283</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gks1219</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3531112</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref31">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Reller</surname>
                            <given-names>LB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Weinstein</surname>
                            <given-names>MP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Petti</surname>
                            <given-names>CA</given-names>
                        </name>
</person-group>:
                    <article-title>Detection and identification of microorganisms by gene amplification and sequencing.</article-title>
                    <source>

                        <italic toggle="yes">Clin. Infect. Dis.</italic>
</source>
                    <year>2007</year>;<volume>44</volume>:<fpage>1108</fpage>&#x2013;<lpage>1114</lpage>.
                    <pub-id pub-id-type="doi">10.1086/512818</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref32">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sabat</surname>
                            <given-names>AJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zanten</surname>
                            <given-names>E</given-names>
                            <prefix>van</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Akkerboom</surname>
                            <given-names>V</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Targeted next-generation sequencing of the 16S-23S rRNA region for culture-independent bacterial identification increased discrimination of closely related species.</article-title>
                    <source>

                        <italic toggle="yes">Sci. Rep.</italic>
</source>
                    <year>2017</year>;<volume>7</volume>:<fpage>1</fpage>&#x2013;<lpage>12</lpage>.
                    <pub-id pub-id-type="pmid">28611406</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41598-017-03458-6</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5469791</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref33">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Schloss</surname>
                            <given-names>PD</given-names>
                        </name>
</person-group>:
                    <article-title>Reintroducing mothur: 10 Years Later.</article-title>
                    <source>

                        <italic toggle="yes">Appl. Environ. Microbiol.</italic>
</source>
                    <year>2020</year>;<volume>86</volume>:<fpage>e02343</fpage>&#x2013;<lpage>e02319</lpage>.
                    <pub-id pub-id-type="pmid">31704678</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AEM.02343-19</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6952234</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref34">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sievers</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wilm</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dineen</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.</article-title>
                    <source>

                        <italic toggle="yes">Mol. Syst. Biol.</italic>
</source>
                    <year>2011</year>;<volume>7</volume>:<fpage>539</fpage>.
                    <pub-id pub-id-type="doi">10.1038/msb.2011.75</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref35">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Srinivasan</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Karaoz</surname>
                            <given-names>U</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Volegova</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Use of 16S rRNA gene for identification of a broad range of clinically relevant bacterial pathogens.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2015</year>;<volume>10</volume>:<fpage>e0117617</fpage>.
                    <pub-id pub-id-type="pmid">25658760</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0117617</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4319838</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Stackebrandt</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mondotte</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fazio</surname>
                            <given-names>LL</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Authors need to be prudent when assigning names to microbial isolates.</article-title>
                    <source>

                        <italic toggle="yes">Arch. Microbiol.</italic>
</source>
                    <year>2021</year>;<volume>203</volume>:<fpage>5845</fpage>&#x2013;<lpage>5848</lpage>.
                    <pub-id pub-id-type="pmid">34709418</pub-id>
                    <pub-id pub-id-type="doi">10.1007/s00203-021-02599-7</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Starke</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pylro</surname>
                            <given-names>VS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Morais</surname>
                            <given-names>DK</given-names>
                        </name>
</person-group>:
                    <article-title>16S rRNA gene copy number normalization does not provide more reliable conclusions in metataxonomic surveys.</article-title>
                    <source>

                        <italic toggle="yes">Microb. Ecol.</italic>
</source>
                    <year>2021</year>;<volume>81</volume>:<fpage>535</fpage>&#x2013;<lpage>539</lpage>.
                    <pub-id pub-id-type="pmid">32862246</pub-id>
                    <pub-id pub-id-type="doi">10.1007/s00248-020-01586-7</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7835310</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref38">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Steven</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hesse</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Soghigian</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Simulated rRNA/DNA ratios show potential to misclassify active populations as dormant.</article-title>
                    <source>

                        <italic toggle="yes">Appl. Environ. Microbiol.</italic>
</source>
                    <year>2017</year>;<volume>83</volume>:<fpage>e00696</fpage>&#x2013;<lpage>e00617</lpage>.
                    <pub-id pub-id-type="pmid">28363969</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AEM.00696-17</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5440720</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref39">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Thiergart</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Landan</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Martin</surname>
                            <given-names>WF</given-names>
                        </name>
</person-group>:
                    <article-title>Concatenated alignments and the case of the disappearing tree.</article-title>
                    <source>

                        <italic toggle="yes">BMC Evol. Biol.</italic>
</source>
                    <year>2014</year>;<volume>14</volume>:<fpage>212</fpage>&#x2013;<lpage>266</lpage>.
                    <pub-id pub-id-type="pmid">25547755</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12862-014-0266-0</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4302582</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref40">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Varghese</surname>
                            <given-names>NJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mukherjee</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ivanova</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Microbial species delineation using whole genome sequences.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2015</year>;<volume>43</volume>:<fpage>6761</fpage>&#x2013;<lpage>6771</lpage>.
                    <pub-id pub-id-type="pmid">26150420</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkv657</pub-id>
                    <pub-id pub-id-type="pmcid">PMC4538840</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref41">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Vetrovsky</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Baldrian</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>The variability of the 16S rRNA gene in bacterial genomes and its consequences for bacterial community analyses.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2013</year>;<volume>8</volume>:<fpage>e57923</fpage>.
                    <pub-id pub-id-type="pmid">23460914</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0057923</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3583900</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref42">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Weisburg</surname>
                            <given-names>WG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Barns</surname>
                            <given-names>SM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pelletier</surname>
                            <given-names>DA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>16S ribosomal DNA amplification for phylogenetic study.</article-title>
                    <source>

                        <italic toggle="yes">J. Bacteriol.</italic>
</source>
                    <year>1991</year>;<volume>173</volume>:<fpage>697</fpage>&#x2013;<lpage>703</lpage>.
                    <pub-id pub-id-type="pmid">1987160</pub-id>
                    <pub-id pub-id-type="doi">10.1128/jb.173.2.697-703.1991</pub-id>
                    <pub-id pub-id-type="pmcid">PMC207061</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Winand</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bogaerts</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hoffman</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Targeting the 16S rRNA gene for bacterial identification in complex mixed samples: Comparative evaluation of second (Illumina) and third (Oxford Nanopore technologies) generation sequencing technologies.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Mol. Sci.</italic>
</source>
                    <year>2020</year>;<volume>21</volume>:<fpage>298</fpage>.
                    <pub-id pub-id-type="pmid">31906254</pub-id>
                    <pub-id pub-id-type="doi">10.3390/ijms21010298</pub-id>
                    <pub-id pub-id-type="pmcid">PMC6982111</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref44">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yoon</surname>
                            <given-names>SH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ha</surname>
                            <given-names>SM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kwon</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Syst. Evol. Microbiol.</italic>
</source>
                    <year>2017</year>;<volume>67</volume>:<fpage>1613</fpage>&#x2013;<lpage>1617</lpage>.
                    <pub-id pub-id-type="pmid">28005526</pub-id>
                    <pub-id pub-id-type="doi">10.1099/ijsem.0.001755</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5563544</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report189677">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.144651.r189677</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Sjamsuridzal</surname>
                        <given-names>Wellyzar</given-names>
                    </name>
                    <xref ref-type="aff" rid="r189677a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-0794-9510</uri>
                </contrib>
                <aff id="r189677a1">
                    <label>1</label>Department of Biology, Faculty of Mathematics and Natural Sciences, Universitas Indonesia, Depok, West Java, Indonesia</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>21</day>
                <month>8</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Sjamsuridzal W</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport189677" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.128320.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>
                <bold>Are the conclusions drawn adequately supported by the results?&#x00a0;</bold>
            </p>
            <p> </p>
            <p> Partly: The conclusion should elaborate more by considering the limitation of the study. Developing a species-specific reference library for all bacteria using concatenated 16S rRNA gene copy variants is a challenging task.</p>
            <p> </p>
            <p> Here are a few considerations regarding its feasibility for species identification:</p>
            <p> </p>
            <p> Concatenating 16S rRNA gene copy variants for sequence similarity estimation and phylogenetic inference can offer improved resolution compared to single-gene approaches. However, careful design, data interpretation, and consideration of alternative approaches are still required to account for potential biases and supplementary analyses when necessary. 
                <list list-type="order">
                    <list-item>
                        <p>The 16S rRNA gene is commonly used for bacterial identification, but it may not capture the full diversity of all bacterial species. While it can be effective for many bacteria, there are certain groups that may have unique or divergent sequences not adequately represented by the 16S rRNA gene.</p>
                    </list-item>
                    <list-item>
                        <p>Some bacterial species can have significant intraspecies genetic variation, including variations in the 16S rRNA gene. Concatenating multiple gene copy variants can help account for this variability to a certain degree, but it may not capture all the genetic diversity within a species.</p>
                    </list-item>
                    <list-item>
                        <p>The 16S rRNA gene is suitable for identifying bacterial species at a broader taxonomic level. However, for more precise differentiation or identification, additional genetic markers or techniques may be required, such as whole-genome sequencing or multilocus sequence typing.</p>
                    </list-item>
                    <list-item>
                        <p>Concatenating multiple gene copy variants from a large number of bacteria would result in a substantial increase in the complexity and size of the reference library. Managing and interpreting such a comprehensive library would require significant computational resources and expertise.</p>
                    </list-item>
                </list> In summary, while concatenating 16S rRNA gene copy variants can be a valuable approach for bacterial identification, developing a species-specific reference library for all bacteria using this method may have limitations. It is important to consider the diversity and variability within bacterial species and explore additional techniques for more precise identification when necessary.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Microbial Systematics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment10125-189677">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Paul</surname>
                            <given-names>Bobby</given-names>
                        </name>
                        <aff>Manipal University, India</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>Nil</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>8</month>
                    <year>2023</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for the critical comments. I have included the limitation of this approach in the conclusion section, and the manuscript has been updated. The study used four genetically related species for the demonstration. Concatenated 16S rRNA analysis yielded better resolution than a single copy. Yes, developing concatenated species-specific 16S rRNA reference libraries for entire bacterial populations is challenging. The study developed reference libraries for four species. More than 552,575 whole-genome sequences are currently (Aug 2023) available for bacterial species in the Genome database. Hopefully, these datasets can be used for the development of species-specific reference libraries for remaining bacterial species. Further, we are in the process of developing reference libraries for other bacterial species and hosting a web server for easy analysis.</p>
                <p> Here is the response to the specific queries. 
                    <list list-type="order">
                        <list-item>
                            <p>The gene encoding the 16S rRNA is ~1500 base pair (bp) long, present in all bacterial populations, and the gene consists of nine variable regions. In sequence analysis, the variable regions are differentiating species. Hence, the proposed approach is applicable to delineate all the bacterial populations. However, as it is ~1500 bp, sequencing of this gene is not sufficient to estimate the genome-wide difference. Estimations such as average nucleotide identity (ANI), Genome BLAST Distance Phylogeny approach can be applied to whole genome sequences.</p>
                        </list-item>
                        <list-item>
                            <p>Genetic differences can be found between strains within a species, as well as between intra-genomic 16S rRNA gene copies. The approach proposed in this manuscript is exclusively to avoid misclassification due to genetic variations in the intra-species 16S rRNA gene copies. The 16S rRNA sequence analysis is not sufficient to estimate the entire genetic difference. Several whole genome-based approaches exist for genome-wide diversity estimation and bacterial taxonomy. The 16S rRNA sequencing is considered a cost-effective, gold-standard single gene approach for species identification. Hence, the method is more appropriate for bacterial species identification. The rate of mutations and lateral gene transfer is high in bacterial populations. However, 16S rRNA is less affected by these genetic changes. Further, factors such as genome size, genetic rearrangements, and unique genomic regions can slightly affect whole genome based species identification. However, concatenated 16S rRNA analysis won't be affected by these factors.</p>
                        </list-item>
                        <list-item>
                            <p>Approaches to bacterial taxonomy have improved from microscopic examination to genome-based methods over the years. However, few earlier studies have highlighted the misclassification of species and genome assemblies in public genetic databases (Parks et al. 2018; Varghese et al. 2015). The manuscript explains the proof of concept. Whether concatenation of multiple copies of 16S rRNA can improve bacterial taxonomy. The study used four closely related 
                                <italic>Streptococcus</italic> species (
                                <italic>S. gordonii</italic>, 
                                <italic>S. mitis, S. oralis</italic>, and 
                                <italic>S. pneumoniae</italic>), which is difficult to differentiate with current genome based approaches for bacterial taxonomy. The results show the proposed concatenated 16S rRNA analysis can easily differentiate bacterial species, even sharing high genetic similarity. Although, estimations such as average nucleotide identity (ANI), genome-to-genome distance calculation, Genome BLAST Distance Phylogeny approach can be applied to whole genome sequences.</p>
                        </list-item>
                        <list-item>
                            <p>Thank you. The study was completed using a laptop that has a 4GB of RAM. Whole genome sequence data available in the public genetic databases can be used to develop reference libraries for remaining bacterial species. The analyses can be employed with currently available tools for sequence similarity and molecular phylogeny. Therefore, interpreting results should not pose a challenge for researchers. However, we are in the process of hosting a web server for more easy and appropriate analysis of global bacterial populations.</p>
                        </list-item>
                    </list> Kindly respond if you have any other queries or concerns.</p>
                <p> </p>
                <p> Kind Regards</p>
                <p> Bobby Paul</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report168593">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.144651.r168593</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Shivakumara</surname>
                        <given-names>Siddaramappa</given-names>
                    </name>
                    <xref ref-type="aff" rid="r168593a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5566-1955</uri>
                </contrib>
                <aff id="r168593a1">
                    <label>1</label>Institute of Bioinformatics and Applied Biotechnology, Bengaluru, Karnataka, India</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>11</day>
                <month>4</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Shivakumara S</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport168593" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.128320.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The author has addressed most of my technical and non-technical (e.g., writing style and presentation) concerns. Version 2 is considerably improved compared to Version 1 in terms of readability, flow, and impact. However, I still believe there is scope to further improve the Discussion section. Nevertheless, I approve Version 2 without any further reservations.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Not applicable</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Bacterial Genomics, Bacterial Taxonomy, Comparative Genomics, Pathogenomics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report158444">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.140896.r158444</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Shivakumara</surname>
                        <given-names>Siddaramappa</given-names>
                    </name>
                    <xref ref-type="aff" rid="r158444a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5566-1955</uri>
                </contrib>
                <aff id="r158444a1">
                    <label>1</label>Institute of Bioinformatics and Applied Biotechnology, Bengaluru, Karnataka, India</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>10</day>
                <month>2</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Shivakumara S</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport158444" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.128320.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The manuscript entitled &#x201c;Concatenated 16S rRNA sequence analysis improves bacterial taxonomy [version 1]&#x201d; by Bobby Paul is generally well written and reports interesting results. The methods are appropriate and the analyses meet the quality standards.</p>
            <p> </p>
            <p> However, there are some concerns about the clarity and presentation of results. Major revisions in the text are required to improve clarity and comprehensibility. My suggestions/critiques for revising the text in different sections of the manuscript are provided. Author&#x2019;s attention has also been drawn to technical concerns, which need to be carefully addressed.</p>
            <p> </p>
            <p> The Discussion section is rather lengthy; it should be shortened and include only relevant results/studies (present and previous). I would be happy to provide detailed critiques on the Discussion section after the manuscript goes through the first round of revision.</p>
            <p> </p>
            <p> 
                <italic>Is the work clearly and accurately presented and does it cite the current literature?</italic>
            </p>
            <p> Some statements/results are missing citations and references. These have been highlighted and need to be revised.</p>
            <p> </p>
            <p> 
                <italic>Are the conclusions drawn adequately supported by the results?</italic>
            </p>
            <p> Some conclusions are unsubstantiated, and require relevant data and proper interpretation. These have been highlighted and need to be revised.</p>
            <p> </p>
            <p> </p>
            <p> 
                <bold>Introduction</bold>
            </p>
            <p> </p>
            <p> "The 16S ribosomal RNA (16S rRNA) encoding region is extensively studied to identify and classify bacterial species. The 16S rRNA is a conserved component of the 30S small subunit of a prokaryotic ribosome"</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "The DNA region encoding the 16S ribosomal RNA (16S rRNA)&#x00a0; is extensively studied, and used to&#x00a0;identify and classify bacterial species.&#x00a0;The 16S rRNA is a conserved component of the small subunit (30S) of the prokaryotic ribosome".</p>
            <p> </p>
            <p> The gene is ~1500 base pair (bp) long, and it consists of nine variable regions.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "The gene encoding the&#x00a0;16S rRNA is&#x00a0;base pair (bp) long, and it consists of nine variable regions"</p>
            <p> </p>
            <p> "For decades, the sequence of the 16S rRNA gene has been used as a potential molecular marker in culture-independent methods to identify and classify diverse bacterial communities (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref9">Clarridge, 2004</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref15">Johnson&#x00a0;
                    <italic>et al</italic>. 2019</ext-link>)"</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "The sequence of the 16S rRNA gene has been extensively used as a molecular marker in culture-independent methods to identify and classify diverse bacterial communities"</p>
            <p> </p>
            <p> "The 16S rRNA sequences are currently being used as an accurate and rapid method to study bacterial evolution, phylogenetic relationships, populations in an environment, and quantification of abundant taxa"</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "Bacterial 16S rRNA sequences are currently being used to study the evolution, phylogenetic relationships, and environmental abundanceof various taxa".</p>
            <p> </p>
            <p> "Despite the wide range of applications, a few shortcomings limit the accuracy of results derived through the 16S rRNA sequence analysis. One such aspect is that the 16S rRNA gene has poor discriminatory power at the species level (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref41">Winand&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>), and the copy number can vary from 1 to 15 or even more (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref39">Vetrovsky and Baldrian, 2013</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref41">Winand&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>)."</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "Although 16S rRNA sequence analyses are the mainstay of taxonomic studies of bacteria, there are some limitations. For example, the 16S rRNA gene has poor discriminatory power at the species level (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref41">Winand&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>), and the copy number is highly variable (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref39">Vetrovsky and Baldrian, 2013</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref41">Winand&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>).&#x00a0;".&#x00a0;</p>
            <p> </p>
            <p> "The presence of multiple variable copies of this gene makes distinct data for a species."</p>
            <p> 
                <bold>The above statement is a little confusing, and should be revised. Their presence in itself may not "make distinct data".</bold>
            </p>
            <p> </p>
            <p> "Hence, gene copy normalization (GCN) is necessary prior to sequence analysis."</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "Therefore, gene copy normalization (GCN) may be necessary prior to sequence analysis.</p>
            <p> </p>
            <p> "However, studies show that the GCN approach does not improve the 16S rRNA sequence analyses in real scenarios and suggests a comprehensive species-specific catalogue of gene copies (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref35">Starke&#x00a0;
                    <italic>et al</italic>. 2021</ext-link>)".</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "However, GCN may not improve the 16S rRNA sequence analyses in all scenarios, and comprehensive, species-specific catalogues of 16S rRNA gene copies may be necessary (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref35">Starke&#x00a0;
                    <italic>et al</italic>. 2021</ext-link>).</p>
            <p> </p>
            <p> "Secondly, the intra-genomic variations between the 16S rRNA gene copies were observed in several bacterial genome assemblies (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref25">Paul&#x00a0;
                    <italic>et al</italic>. 2019</ext-link>)."</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "Furthermore, intra-species variations in the number of copies of the16S rRNA gene were observed in several bacterial genome assemblies"</p>
            <p> </p>
            <p> "Only a minority of the bacterial genomes harbor identical 16S rRNA gene copies, and sequence diversity increases with increasing copy numbers (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref39">Vetrovsky and Baldrian, 2013</ext-link>). "</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "Only a few bacterial species contain identical 16S rRNA gene copies, and sequence diversity increases with increasing copy numbers of&#x00a0;16S rRNA genes".</p>
            <p> </p>
            <p> "Further, currently available 16S rRNA-based bioinformatics approaches are not always amenable to classify bacterium at the species level due to high inter-species sequence similarities (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref27">Peker&#x00a0;
                    <italic>et al</italic>. 2019</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref10">Deurenberg&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>)."</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> "The high levels of similiarty of the&#x00a0;16S rRNA genes across some bacterial species poses a major challege for&#x00a0;taxonomic studies using bioinformatics methods".</p>
            <p> </p>
            <p> &#x201c;A few other issues are also related to the sequencing and bioinformatics analysis of 16S rRNA gene regions. These include the purity of bacterial isolates, the quality of isolated DNA, and the possibility of chimeric molecules (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref14">Janda and Abbott, 2007</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref7">Church&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>)&#x201d;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Factors such as purity of bacterial cultures, quality of the purified DNA samples, and potential DNA chimeras should be carefully considered while sequencing and analysis of 16S rRNA genes.</p>
            <p> </p>
            <p> &#x201c;Base-call errors can also mislead the sequence identity and phylogenetic inferences (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref1">Alachiotis&#x00a0;
                    <italic>et al</italic>. 2013</ext-link>).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Sequencing errors can lead to misidentification of bacteria and phylogenetic anomalies&#x201d;.</p>
            <p> </p>
            <p> &#x201c;The other concerns on sequence-based analysis, comparison, and species identification include the number of base ambiguities processed, gaps generated during sequence comparison, and algorithm (local or global) used for the sequence alignment.&#x201d;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Other concerns include sequence ambiguities, gaps generated during DNA sequencing and sequence comparisons, and choosing the appropriate algorithm (local or global) for sequence alignment.&#x201d;.&#x201d;</p>
            <p> </p>
            <p> &#x201c;The local alignment algorithm is extensively used for sequence similarity-based species identification. Several studies were conducted to identify the best variable region or combination of variable regions for bacterial classification, and a consensus remains to be implemented (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref14">Janda and Abbott, 2007</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref15">Johnson&#x00a0;
                    <italic>et al</italic>. 2019</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref41">Winand&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>).&#x201d;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Since the local alignment algorithm is extensively used for sequence similarity-based comparisons, it is important to carefully consider whether a single variable region or a combination of variable regions of the 16S rRNA gene would be ideal for bacterial classification.&#x201d;</p>
            <p> </p>
            <p> &#x201c;Usage of misclassified sequence as a reference and improper bioinformatics workflows mislead the bacterial taxonomy.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Using erroneous 16S rRNA sequences as references and improper bioinformatics workflows can mislead bacterial identification&#x201d;.&#x00a0;</p>
            <p> </p>
            <p> "Further, the growth of bioinformatics and genetic data has placed genome-based microbial classification with researchers with little or no taxonomic experience, which may also mislead the bacterial taxonomy (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref4">Baltrus, 2016</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Kindly revise the above statement because it is too complex and confusing.</bold>
            </p>
            <p> </p>
            <p> &#x201c;A few bacterial identification systems with high resolution have been developed using the sequence of polymerase chain reaction (PCR) amplified &#x223c;4.5 kb long 16S&#x2013;23S rRNA regions (Ben&#x00ed;tez-P&#x00e1;ez and Sanz, 2017;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref30">Sabat&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref17">Kerkhof&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>).&#x201d;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Other methods for bacterial identification include the sequencing and analysis of the polymerase chain reaction (PCR) amplified &#x223c;4.5 kb 16S&#x2013;23S rRNA regions (Ben&#x00ed;tez-P&#x00e1;ez and Sanz, 2017;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref30">Sabat&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref17">Kerkhof&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>).&#x201d;</p>
            <p> </p>
            <p> &#x201c;However, these approaches have a few limitations, such as the lack of reference 16S&#x2013;23S rRNA sequence databases and complementary bioinformatics resources for reliable species identification (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref30">Sabat&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;However, the 16S&#x2013;23S rRNA sequence-based method is less practical due to the lack of appropriate reference sequence databases and reliable tools/methods for sequence analysis&#x201d;</p>
            <p> </p>
            <p> &#x201c;The recent advancements in bioinformatics workflows (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref41">Winand&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref31">Schloss, 2020</ext-link>) and reference databases such as SILVA, EzBioCloud (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref28">Quast&#x00a0;
                    <italic>et al</italic>. 2013</ext-link>; Yoon, 2017) improved 16S rRNA-based bacterial taxonomy. However, a few recent genome-based studies highlighted the misclassification incidences in bacterial species and genome assemblies (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref36">Steven&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref22">Mart&#x00ed;nez-Romero,&#x00a0;
                    <italic>et al</italic>. 2018</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref23">Mateo-Estrada&#x00a0;
                    <italic>et al</italic>. 2019</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref3">Bagheri&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>)&#x201d;.</p>
            <p> 
                <bold>The first part here talks about 16S rRNA-based bacterial taxonomy, and the second part talks about genome assemblies. It is not clear what the connection is. Kindly revise these two sentences. A suggestion for revision is shown below (although even this suggestion fails to convey the connection):</bold>
            </p>
            <p> &#x00a0;&#x201c;Although improvements in reference databases (such as SILVA and EzBioCloud) and bioinformatics workflows have facilitated 16S rRNA-based bacterial taxonomy, recent genome-based studies have indicated that incidences of misclassification of bacterial species and erroneous genome assemblies.&#x201d;</p>
            <p> </p>
            <p> &#x201c;Nowadays, conventional and high throughput sequencers can amplify all the nine variable regions of the 16S rRNA gene&#x201d;.</p>
            <p> 
                <bold>Please revise as</bold>
            </p>
            <p> &#x201c;The entire 16S rRNA gene (~1500 bp) can be amplified and sequenced using the conventional or high throughput sequencing methods&#x201d;.</p>
            <p> </p>
            <p> &#x201c;Although, many 16S rRNA-based bacterial identification studies lack a complete set of variable regions (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref34">Stackebrandt&#x00a0;
                    <italic>et al</italic>. 2021</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise as</bold>
            </p>
            <p> &#x201c;However, many 16S rRNA sequence-based bacterial identification studies do not seem to include all of these nine variable regions (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref34">Stackebrandt&#x00a0;
                    <italic>et al</italic>. 2021</ext-link>)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;The classical and high throughput sequencing technologies produce a large volume of whole-genome data. There is an urgent need to translate the genomic data for convenient microbiome analyses that ensure clinical practitioners can readily understand and quickly implement it (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref7">Church&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise as</bold>
            </p>
            <p> &#x201c;Due to the large volume of whole-genome data that is being produced by high throughput sequencing technologies, there is an urgent need to develop methods for analyzing this data for accurate identification of bacteria. It is also important to envisage that clinical practitioners would be able to understand these methods and implement them quickly (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref7">Church&#x00a0;
                    <italic>et al</italic>. 2020</ext-link>)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;Hence, the study intended to demonstrate a workflow to develop species-specific concatenated 16S rRNA reference libraries and its analysis. The species-specific libraries can yield better resolution in sequence similarity and phylogeny based bacterial classification approaches&#x201d;.</p>
            <p> 
                <bold>Please revise as</bold>
            </p>
            <p> &#x201c;This study aimed to develop a workflow for accurate identification of bacteria using concatenated, species-specific 16S rRNA sequences. It was hoped that the species-specific libraries would yield much better resolution in sequence similarity- and phylogeny-based bacterial classification&#x201d;.</p>
            <p> </p>
            <p> 
                <bold>Methods</bold>
            </p>
            <p> 
                <italic>Estimation of variations in intra-genomic 16S rRNA gene copies</italic>
            </p>
            <p> &#x201c;Sequence alignment of 16S rRNA copies at the intra-genomic level shows a higher degree of variability in species belonging to the&#x00a0;
                <italic>Firmicutes</italic>&#x00a0;and&#x00a0;
                <italic>Proteobacteria</italic>&#x00a0;(
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref39">Vetrovsky and Baldrian, 2013</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref13">Ibal&#x00a0;
                    <italic>et al</italic>. 2019</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;It has been reported that sequence alignment of 16S rRNA alleles at the intra-genomic level shows a higher degree of variability in species belonging to the&#x00a0;
                <italic>Firmicutes</italic>&#x00a0;and&#x00a0;
                <italic>Proteobacteria</italic>&#x00a0;(
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref39">Vetrovsky and Baldrian, 2013</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref13">Ibal&#x00a0;
                    <italic>et al</italic>. 2019</ext-link>)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;Hence, the study used eight 16S rRNA copies (Underlying data: Supplementary data 1 (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref26">Paul, 2022</ext-link>)) retrieved from the whole genome of&#x00a0;
                <italic>Enterobacter asburiae</italic>&#x00a0;strain ATCC 35953 (NZ_CP011863.1)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Therefore, this study used eight 16S rRNA alleles (Underlying data: Supplementary data 1 (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref26">Paul, 2022</ext-link>)) retrieved from the complete genome of&#x00a0;
                <italic>Enterobacter asburiae</italic>&#x00a0;strain ATCC 35953 (NZ_CP011863.1)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;The BLAST+ 2.13.0 (RRID:SCR_004870;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref2">Altschul&#x00a0;
                    <italic>et al</italic>. 1990</ext-link>) and Clustal Omega 1.2.4 (RRID:SCR_001591;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref32">Sievers&#x00a0;
                    <italic>et al</italic>. 2011</ext-link>) sequence alignment algorithms were used to estimate intra-genomic variability between the 16S rRNA gene copies&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;To estimate intra-genomic variability between these 16S rRNA alleles, BLAST+ 2.13.0 (RRID:SCR_004870;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref2">Altschul&#x00a0;
                    <italic>et al</italic>. 1990</ext-link>) and Clustal Omega 1.2.4 (RRID:SCR_001591;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref32">Sievers&#x00a0;
                    <italic>et al</italic>. 2011</ext-link>) sequence alignment algorithms were used&#x201d;.</p>
            <p> </p>
            <p> &#x201c;Phylogenetic relatedness between intra-genomic 16S rRNA copies were estimated using the Maximum Likelihood method (Tamura-Nei model; 500 bootstrap replicates) with MEGA software (version 11; RRID: SCR_000667;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref18">Kumar&#x00a0;
                    <italic>et al</italic>. 2018</ext-link>).&#x201d;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Phylogenetic analysis of these 16S rRNA alleles were performed using the maximum likelihood method (Tamura-Nei model; 500 bootstrap replicates) and the MEGA software (version 11; RRID: SCR_000667;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref18">Kumar&#x00a0;
                    <italic>et al</italic>. 2018</ext-link>)&#x201d;.</p>
            <p> </p>
            <p> 
                <italic>Construction of species-specific concatenated 16S rRNA reference libraries</italic>
            </p>
            <p> </p>
            <p> &#x201c;Previous studies have reported that several bacterial species share more than 99% sequence identity in the 16S rRNA encoding region&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Previous studies have reported that the genes encoding 16S rRNA from several bacterial species share &gt;99% sequence identity&#x201d;. [ALSO PROVIDE A REFERENCE HERE]</p>
            <p> </p>
            <p> &#x201c;Hence, the 16S rRNA-based bacterial identification methods failed to discriminate such genetically related species (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref10">Deurenberg&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref11">Devanga-Ragupathi&#x00a0;
                    <italic>et al</italic>. 2018</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Therefore, the 16S rRNA-based methods failed to correctly identify bacterial species that are genetically closely related (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref10">Deurenberg&#x00a0;
                    <italic>et al</italic>. 2017</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref11">Devanga-Ragupathi&#x00a0;
                    <italic>et al</italic>. 2018</ext-link>)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;It has been reported that&#x00a0;
                <italic>Streptococcus mitis</italic>&#x00a0;and&#x00a0;
                <italic>Streptococcus pneumoniae</italic>&#x00a0;are almost indistinguishable from each other based on the sequence similarity of their 16S rRNA regions (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref29">Reller&#x00a0;
                    <italic>et al</italic>. 2007</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref19">Lal&#x00a0;
                    <italic>et al</italic>. 2011</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> It has been reported that&#x00a0;16S rRNA-based methods cannot 
                <italic>&#x00a0;</italic>distinguish between 
                <italic>Streptococcus mitis</italic>&#x00a0;and&#x00a0;
                <italic>Streptococcus pneumoniae</italic>&#x00a0; due to the high sequence similarity (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref29">Reller&#x00a0;
                    <italic>et al</italic>. 2007</ext-link>;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref19">Lal&#x00a0;
                    <italic>et al</italic>. 2011</ext-link>).</p>
            <p> </p>
            <p> &#x201c;To develop species-specific barcode reference libraries, the study used 16S rRNA gene copies from whole-genome assemblies of four closely related species of&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;(
                <italic>S. gordonii</italic>,&#x00a0;
                <italic>S. mitis</italic>,&#x00a0;
                <italic>S. oralis,</italic>&#x00a0;and&#x00a0;
                <italic>S. pneumoniae</italic>)&#x201d;.</p>
            <p> 
                <bold>Please DELETE the above, because it is repeated below.</bold>
            </p>
            <p> </p>
            <p> &#x201c;More than 463,000 whole-genome assemblies are currently available for prokaryotes at the Genome database (RRID:SCR_002474;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/genome">https://www.ncbi.nlm.nih.gov/genome</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x2018;More than 463,000 whole-genome sequences are currently (please provide a date here) available for prokaryotes (please specify if this is only for bacteria, or also includes archaea) in the Genome database (RRID:SCR_002474;&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/genome">https://www.ncbi.nlm.nih.gov/genome</ext-link>)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;Most microbial genomes were sequenced with high throughput sequencing technologies such as Illumina/Ion-Torrent (short read sequencing) and PacBio/Nanopre (long read sequencing&#x201d;).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Many of these genomes were sequenced using high throughput sequencing technologies such as Illumina/Ion-Torrent (short read sequencing) and PacBio/Nanopre (long read sequencing)&#x201d;.</p>
            <p> &#x201c;Further, many of these whole-genome assemblies are derived through a hybrid assembly of short and long read sequence data&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Furthermore, most of these whole-genome sequences were obtained after a hybrid assembly of short and long read sequence data&#x201d;.</p>
            <p> </p>
            <p> &#x201c;The large volume of high throughput data can be effectively used to develop advanced genome-based approaches for microbial systematics&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;This extensive, high throughput data can be effectively used to develop advanced genome-based methods for microbial systematics.&#x201d;</p>
            <p> </p>
            <p> The genomic data is available in four assembly completion levels (contig, scaffold, chromosome, and complete). However, the study used only the genomes assemblies in the 'complete' stage to retrieve 16S rRNA gene copies.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Although the genomic data is available in four levels (contig, scaffold, chromosome, and complete), this study used only the complete genomes to retrieve 16S rRNA genes.</p>
            <p> </p>
            <p> &#x201c;The study retrieved full-length 16S rRNA gene copies from 16 genome assemblies belonging to four&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;species (
                <italic>S. gordonii</italic>,&#x00a0;
                <italic>S. mitis</italic>,&#x00a0;
                <italic>S. oralis,</italic>&#x00a0;and&#x00a0;
                <italic>S. pneumoniae&#x201d;</italic>).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;To develop species-specific barcode reference libraries, this study retrieved full-length 16S rRNA genes from 16 complete genome sequences belonging to four&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;species (
                <italic>S. gordonii</italic>,&#x00a0;
                <italic>S. mitis</italic>,&#x00a0;
                <italic>S. oralis,</italic>&#x00a0;and&#x00a0;
                <italic>S. pneumoniae</italic>)".</p>
            <p> </p>
            <p> &#x201c;The detailed information on the dataset used to develop species-specific concatenated reference libraries is provided in&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T1">Table 1</ext-link>&#x00a0;and the sequences are provided in the underlying data (Supplementary data 2 (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref26">Paul, 2022</ext-link>))&#x201d;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Details of the dataset used to develop species-specific concatenated reference libraries are provided in&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T1">Table 1</ext-link>, and the sequences are provided in the underlying data (Supplementary data 2 (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref26">Paul, 2022</ext-link>))&#x201d;</p>
            <p> </p>
            <p> &#x201c;To maintain equal length, sequences were trimmed out beyond the universal primer pair fD1-5'-GAG TTT GAT CCT GGC TCA-3' and rP2-5'-ACG GCT AAC TTG TTA CGA CT-3' (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref40">Weisburg&#x00a0;
                    <italic>et al</italic>. 1991</ext-link>) for full-length 16S rDNA amplification&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Sequences were trimmed beyond the universal primer pair (fD1-5'-GAG TTT GAT CCT GGC TCA-3' and rP2-5'-ACG GCT AAC TTG TTA CGA CT-3', which are used &#x00a0;for full-length 16S rDNA amplification, 
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref40">Weisburg&#x00a0;
                    <italic>et al</italic>. 1991</ext-link>) to maintain uniform length [PLEASE MENTION THIS IN BASE PAIRS]"</p>
            <p> </p>
            <p> &#x201c;The study used MEGA 11 software to perform multiple sequence alignment and identify the intra-species parsimony informative (Parsim-info) variable sites&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;To perform multiple sequence alignment and identify the intra-species parsimony informative (Parsim-info) variable sites, the MEGA 11 software was used&#x201d;.</p>
            <p> </p>
            <p> &#x201c;A species-specific barcode reference library covering entire Parsim-info variable sites was constructed by concatenating four 16S rRNA gene copies representing four different strains of a species&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;A species-specific barcode reference library that covers the entire Parsim-info variable sites was constructed by concatenating four 16S rRNA gene copies from four different strains of a species&#x201d;.</p>
            <p> </p>
            <p> &#x201c;The rationale behind the selection of four copies for a species-specific barcode reference library is: (i) a maximum of four variations can be found on a single site, and (ii) earlier studies have shown that the mean 16S rRNA copies per genome is four (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref39">Vetrovsky and Baldrian, 2013</ext-link>)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;The rationale for the selection of four copies for constructing a species-specific barcode reference library was: (i) a maximum of four variations can be found at a single site, and (ii) earlier studies have shown that the mean 16S rRNA copies per genome is four (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref39">Vetrovsky and Baldrian, 2013</ext-link>)&#x201d;.</p>
            <p> </p>
            <p> 
                <italic>Demonstration of concatenated 16S rRNA in sequence similarity and phylogeny</italic>
            </p>
            <p> </p>
            <p> &#x201c;The study analyzed a few cases to demonstrate the classical sequence similarity and phylogenetic analysis using concatenated species-specific 16S rRNA reference libraries&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;This study analyzed a few cases to demonstrate (i) the classical sequence similarity and (ii) phylogenetic analysis using concatenated species-specific 16S rRNA reference libraries&#x201d;.</p>
            <p> </p>
            <p> &#x201c;The study used nine Sanger sequenced 16S rRNA gene copies showing higher sequence similarity with multiple species of&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;retrieved from the GenBank database (RRID:SCR_002760)&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Nine 16S rRNA gene copies (sequenced using the Sanger method) showing higher sequence similarity to the 16S rRNA genes of multiple species of&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;were retrieved from GenBank database (RRID:SCR_002760)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;The web based BLAST2 (version 2.13.0) program for aligning two or more sequences was used to estimate the maximum score, total alignment score, and sequence identity&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;The web based BLAST2 (version 2.13.0) program for aligning two or more sequences was used to estimate the maximum score, total alignment score, and sequence identity of these nine 16S rRNA&#x201d;.</p>
            <p> </p>
            <p> &#x201c;A single copy of the 16S rRNA region derived through Sanger sequencing or retrieved from a whole-genome assembly can be considered as &#x2018;Query sequence&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;A single 16S rRNA gene (sequenced using the Sanger method or retrieved from a whole-genome assembly was the &#x2018;Query sequence&#x2019;&#x201d;. [IT IS NOT CLEAR WHETHER THIS GENE WAS FROM 
                <italic>Streptococcus.</italic> PLEAASE CLARIFY THIS]</p>
            <p> </p>
            <p> &#x201c;The concatenated species-specific reference libraries must be provided in the &#x2018;Subject sequence&#x2019; section&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;The concatenated species-specific reference libraries were provided in the &#x2018;Subject sequence&#x2019; window&#x201d;.</p>
            <p> </p>
            <p> &#x201c;To perform phylogenetic analysis, it is mandatory that the target sequence (length = n bp) has to be concatenated four times (length = 4 &#x00d7; n bp), appending next to the last base&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;To perform phylogenetic analysis, it is mandatory that the target sequence (length = n bp) be concatenated four times (length = 4 &#x00d7; n bp)&#x201d;.</p>
            <p> </p>
            <p> &#x201c;Phylogenetic relatedness was estimated using the Maximum Likelihood method (Tamura-Nei model; 500 bootstrap replicates) with MEGA 11 software&#x201d;.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> &#x201c;Phylogenetic analysis was performed as indicated above/&#x201d;</p>
            <p> </p>
            <p> 
                <bold>Results</bold>
            </p>
            <p>
                <bold> Intra-genomic 16S rRNA variations in&#x00a0;
                    <italic>Enterobacter asburiae</italic>
                </bold>
            </p>
            <p> </p>
            <p> Historically, the 16S rRNA gene sequences were used to identify known and new bacterial species.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Historically, sequences of the 16S rRNA genes have been used to identify known and new bacterial species.</p>
            <p> </p>
            <p> However, this method is impacted by several factors such as amplification efficiency, poor discriminatory power at the species level, multiple polymorphic 16S rRNA gene copies, and improper bioinformatics workflows for the data analysis.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> However, efficiency of PCR-based amplification, poor discrimination at the species level, multiple polymorphic 16S rRNA gene copies, and improper bioinformatics workflows for the data analysis can impact the identification. [PLEASE PROVIDE A REFERENCE HERE]</p>
            <p> </p>
            <p> The&#x00a0;
                <italic>E. asburiae</italic>&#x00a0;genome had eight 16S rRNA gene copies that showed a mean identity of 99.29% in sequence alignment using Clustal Omega (global alignment), whereas BLAST (local alignment) analysis resulted in an average of 99% identity between the copies (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T2">Table 2</ext-link>).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> The&#x00a0;genome of
                <italic> E. asburiae</italic>&#x00a0;contains eight copies of the 16S rRNA gene. Analysis using Clustal Omega (global alignment) and BLAST (local alignment) showed that the sequences of these eight alleles had average identities of 99.29 and 99%, respectively (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T2">Table 2</ext-link>).</p>
            <p> </p>
            <p> Hence, the selection of an appropriate algorithm has a significant role in the estimation of percent identity, and a vital role in sequence-based species delineation.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Therefore, choosing the appropriate algorithm/tool is critical for the estimation of sequence identities and sequence-based species delineation.</p>
            <p> </p>
            <p> Global sequence alignment programs generally perform better for highly identical sequence pairs, and the algorithm considers all the bases for the estimation of sequence identity. The multiple sequence alignment showed 22 variable sites in 16S rRNA gene copies of the&#x00a0;
                <italic>E. asburiae</italic>&#x00a0;genome (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#f1">Figure 1</ext-link>).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> For analyzing sequence pairs that are highly identical, global sequence alignment programs/tools seem to be more appropriate because they consider all the nucleotides for the estimation of sequence identity.</p>
            <p> </p>
            <p> The multiple sequence alignment showed 22 variable sites in 16S rRNA gene copies of the&#x00a0;
                <italic>E. asburiae</italic>&#x00a0;genome (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#f1">Figure 1</ext-link>).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Multiple sequence alignment [PLEASE MENTION THE TOOL/ALGORITHM HERE, AND ALSO IN THE LEGEND OF FIGURE 1] of the sequences of the eight alleles of the 16S rRNA gene in the genome of 
                <italic>E. asburiae</italic>&#x00a0;showed 22 variable sites (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#f1">Figure 1</ext-link>).</p>
            <p> </p>
            <p> The evolutionary relationship between species is usually represented in a phylogenetic tree drawn using a single barcode gene, multiple genes, or whole genomes.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> The evolutionary relationship between species is usually represented using a phylogenetic tree based on the analysis of a single gene, multiple genes, or whole genomes.</p>
            <p> </p>
            <p> However, bacterial species nomenclature is mainly designated based on the confidence obtained from the phylogenetic tree derived through single copy 16S rRNA analysis.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> However, bacterial identification and classification is mainly based on the phylogenetic analysis of single copies of 16S rRNA genes</p>
            <p> </p>
            <p> To highlight how the intra-genomic 16S rRNA variations influence the species delineation, a phylogenetic tree was constructed using eight 16S rRNA gene copies of&#x00a0;
                <italic>E. asburiae</italic>&#x00a0;reference genome showing multiple nodes (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#f2">Figure 2</ext-link>).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> A phylogenetic tree was constructed to understand how variations in the sequences of the eight alleles of the 16S rRNA gene in the genome of 
                <italic>E. asburiae</italic>&#x00a0;influence species delineation (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#f2">Figure 2</ext-link>).</p>
            <p> </p>
            <p> The sequence similarity and phylogeny-based analysis indicate that the intra-genomic variations in 16S rRNA copies may mislead the bacterial taxonomy in single gene copy approaches.</p>
            <p> 
                <bold>IT IS NOT CLEAR TO ME HOW THIS ANALYSIS &#x201c;</bold>indicate(s) that the intra-genomic variations in 16S rRNA copies may mislead the bacterial taxonomy in single gene copy approaches&#x201d;. 
                <bold>KINDLY ELABORATE ON THE SAME.</bold>
            </p>
            <p> </p>
            <p> 
                <bold>Species-specific concatenated 16S rRNA libraries</bold>
            </p>
            <p> </p>
            <p> The study selected four&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;species (
                <italic>S. gordonii</italic>,&#x00a0;
                <italic>S. mitis</italic>,&#x00a0;
                <italic>S. oralis</italic>, and&#x00a0;
                <italic>S. pneumoniae</italic>) to construct species-specific concatenated 16S rRNA reference libraries.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> This study selected four species of&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;(
                <italic>S. gordonii</italic>,&#x00a0;
                <italic>S. mitis</italic>,&#x00a0;
                <italic>S. oralis</italic>, and&#x00a0;
                <italic>S. pneumoniae</italic>) to construct species-specific concatenated reference libraries based on 16S rRNA gene sequences obtained from complete genomes.</p>
            <p> </p>
            <p> The study used 16S rRNA copies retrieved from four whole genome assemblies in the &#x2018;complete&#x2019; stage to construct a species-specific barcode library.</p>
            <p> 
                <bold>Please DELETE the above sentence.</bold>
            </p>
            <p> </p>
            <p> Four copies of the 16S rRNA gene are required to construct the concatenated library for a species.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Sequences from four copies of the 16S rRNA gene are required to construct a concatenated library for a species [PLEASE PROVIDE A REFERENCE HERE].</p>
            <p> The details of constructed species-specific libraries are listed in&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T1">Table 1</ext-link>&#x00a0;and the sequence is provided in the underlying data (Supplementary data 3 (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref26">Paul, 2022</ext-link>)).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> The details of species-specific libraries are listed in&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T1">Table 1</ext-link>&#x00a0;and the sequences are provided in the underlying data (Supplementary data 3 (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#ref26">Paul, 2022</ext-link>)). THIS REFERENCE IS INCOMPLETE IN THE LIST, AND THE LINK IS NOT WORKING. PLEASE CHECK ANC CORRECT.</p>
            <p> ALSO, IN THE TITLE OF TABLE 1, &#x201c;for the development of concatenated&#x201d; SHOULD BE REVISED AS &#x201c;for the construction of concatenated&#x201d;</p>
            <p> </p>
            <p> The 16S rRNA sequence analysis shows 24 Parsim-info variable sites for&#x00a0;
                <italic>S. oralis</italic>, 11 variations in&#x00a0;
                <italic>S. mitis</italic>, seven variations in&#x00a0;
                <italic>S. gordonii,</italic>&#x00a0;and six variations found in&#x00a0;
                <italic>S. pneumoniae.</italic>&#x00a0;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Analysis using the sequences of 16S rRNA genes showed 24, 11, 7, and 6 Parsim-info variable sites for&#x00a0;
                <italic>S. oralis</italic>, 
                <italic>S. mitis</italic>, 
                <italic>S. gordonii,</italic>&#x00a0;and 
                <italic>S. pneumoniae</italic>, respectively. [PLEASE PROVIDE/SHOW THE DATA FOR THIS]</p>
            <p> </p>
            <p> The observed intra-species Parsim-info variable sites are residing on both conserved and variable regions of the 16S rRNA gene.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> The intra-species [PLEASE CHECK IF THIS SHOULD BE &#x201c;inter-species&#x201d;] Parsim-info variable sites were located in both the conserved and variable regions of the 16S rRNA gene. [PLEASE PROVIDE/SHOW THE DATA FOR THIS]</p>
            <p> </p>
            <p> The study used full-length 16S rRNA copies from four different strains to highlight the variations at the species level.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> This study used full-length sequences of 16S rRNA genes from four different species to check the variations at the species level.</p>
            <p> </p>
            <p> However, a large volume of partial 16S rRNA sequences are available in the public genetic databases.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> However, a large number of partial sequences of 16S rRNA genes are available in the public genetic databases.</p>
            <p> </p>
            <p> In such cases, a species-specific concatenated 16S rRNA reference library can be developed with partial sequences.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> In such cases, a species-specific concatenated reference library can be constructed using partial sequences.</p>
            <p> </p>
            <p> Intra-species variation on 16S rRNA gene copies influences the sequence based bacterial taxonomy.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Intra-species [PLEASE CHECK IF THIS SHOULD BE &#x201c;inter-species&#x201d;] variations in the sequences of 16S rRNA gene copies influences the sequence-based bacterial identification. [PLEASE PROVIDE/SHOW THE DATA FOR THIS,&#x00a0; OR SUBSTANTIATE THE CONCLUSION]</p>
            <p> </p>
            <p> Hence, the concatenated 16S rRNA approach yields better resolution than single copy analysis in classical sequence similarity and phylogeny based species identification approaches.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Therefore, concatenation of the sequences of 16S rRNA genes/alleles provides much better resolution compared to analysis using sequences from a single copy of the 16S rRNA gene. PLEASE NOTE: IN THE ABSENCE OF DATA, THIS STATEMENT REMAINS UNSUBSTANTIATED.</p>
            <p> </p>
            <p> 
                <bold>Demonstration of concatenated 16S rRNA based species identification</bold>
            </p>
            <p> </p>
            <p> The study compared nine 16S rRNA sequences representing&#x00a0;
                <italic>Streptococcus</italic>&#x00a0;species (
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T3">Table 3</ext-link>) with species-specific concatenated reference libraries.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> This study compared sequences of nine 16S rRNA genes from different species of 
                <italic>Streptococcus</italic>&#x00a0;(
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T3">Table 3</ext-link>) using species-specific concatenated reference libraries.</p>
            <p> </p>
            <p> Concatenated sequence analysis gives better resolution in sequence similarity search and phylogenetic analysis.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Concatenated sequences provide much better resolution in sequence similarity search and phylogenetic analysis.</p>
            <p> </p>
            <p> The sequence accession numbers GU470907.1 and KF933785.1 classified as&#x00a0;
                <italic>S. mitis</italic>&#x00a0;showed a higher maximum and total alignment score with&#x00a0;
                <italic>S. oralis</italic>&#x00a0;than&#x00a0;
                <italic>S. mitis</italic>&#x00a0;(
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T3">Table 3</ext-link>).</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Two sequences (accession numbers GU470907.1 and KF933785.1) from 
                <italic>S. mitis</italic> had a higher maximum and total alignment score to sequences from&#x00a0;
                <italic>S. oralis</italic>&#x00a0;than&#x00a0;
                <italic>S. mitis</italic>&#x00a0;(
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#T3">Table 3</ext-link>). PLEASE PROVIDE THE ACCESSION NUMBERS FROM S. ORALIS THAT PRODUCED THIS RESULT.</p>
            <p> </p>
            <p> Whereas the sequence (OM368574.1; classified as&#x00a0;
                <italic>S. mitis</italic>) showed a higher sequence alignment score with&#x00a0;
                <italic>S. pneumoniae.</italic>
            </p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Furthermore, yet another sequence from 
                <italic>S. mitis </italic>(accession number OM368574.1) had a higher alignment score to sequences from&#x00a0;
                <italic>S. pneumoniae. </italic>PLEASE PROVIDE THE ACCESSION NUMBERS FROM S. PNEUMONIAE THAT PRODUCED THIS RESULT.</p>
            <p> </p>
            <p> 
                <ext-link ext-link-type="uri" xlink:href="https://f1000research.com/my/referee/report/158444#f3">Figure 3A</ext-link>&#x00a0;shows a maximum likelihood tree of the nine 16S rRNA gene sequences with four concatenated species-specific reference libraries.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> The maximum likelihood phylogenetic tree based on four concatenated species-specific reference libraries and the sequences of nine 16S rRNA genes is shown in Figure 3A. [THE LEGEND IS INSUFFICIENT; PLEASE PROVIDE MORE DETAILS IN THE LEGEND. ALSO, FIGURE 3B HAS NOT BEEN CALLED IN THE RESULTS SECTION, IT HAS BEEN CALLED DIRECTLY IN THE DISCUSSION SECTION]</p>
            <p> </p>
            <p> The concatenated GU470907.1 and KF933785.1 sequences showed a phylogenetic relationship with&#x00a0;
                <italic>S. oralis</italic>&#x00a0;and sequence OM368574.1 was genetically related to&#x00a0;
                <italic>S. pneumoniae.</italic>&#x00a0;</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> [BASED ON THE ACCESSION NUMBERS, GU470907.1 and KF933785.1 CANNOT BE CONCATENATED SEQUENCES, PLEASE CHECK AND CORRECT]</p>
            <p> Two sequences (accession numbers GU470907.1 and KF933785.1) from 
                <italic>S. mitis</italic> appeared to be phylogenetically closer to 
                <italic>S. oralis</italic>, and yet another sequence from 
                <italic>S. mitis </italic>(accession number OM368574.1) from 
                <italic>S. mitis</italic> was closer to&#x00a0;
                <italic>S. pneumoniae.</italic> [PLEASE NOTE: THE BOOTSTRAP VALUES ARE RATHER LOW; THEREFORE THE RESULTS NEED TO BE INTERPRETED CAREFULLY]</p>
            <p> </p>
            <p> These results indicate that the species-specific concatenated 16S rRNA reference libraries have great potential in the taxonomic classification.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> These results further confirm that species-specific concatenated 16S rRNA reference libraries provide much better taxonomic resolution.</p>
            <p> </p>
            <p> Hence, the study suggests the usage of concatenated variable 16S rRNA copies for sequence similarity and phylogeny-based species identification.</p>
            <p> 
                <bold>Please revise this as</bold>
            </p>
            <p> Therefore, this study recommends using concatenated sequences of 16S rRNA genes for sequence similarity- and phylogeny-based species identification.</p>
            <p> </p>
            <p> A species-specific reference library with concatenated 16S rRNA gene copies provides better resolution in phylogenetic analysis than the single copy inference.</p>
            <p> THE SENTENCE ABOVE MAY BE DELETED BECAUSE A SIMILAR STATEMENT HAS ALREADY BEEN MADE.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Not applicable</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Bacterial Genomics, Bacterial Taxonomy, Comparative Genomics, Pathogenomics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment9493-158444">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Paul</surname>
                            <given-names>Bobby</given-names>
                        </name>
                        <aff>Manipal University, India</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>Nil</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>22</day>
                    <month>3</month>
                    <year>2023</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Dear Sir,</p>
                <p> </p>
                <p> Thank you very much for your critical review and valuable suggestions for manuscript improvement. I have revised the manuscript by addressing all the suggestions and resubmitted it for your consideration. I sincerely hope that the modified manuscript is sufficiently improved, and kindly respond if you have any further questions or comments.</p>
                <p> </p>
                <p> Thank you</p>
                <p> </p>
                <p> Kind Regards</p>
                <p> Bobby Paul</p>
            </body>
        </sub-article>
    </sub-article>
</article>
