<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="data-paper" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.44714.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Data Note</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Draft genome assembly of the biofuel grass crop 
                    <italic>Miscanthus sacchariflorus</italic>
                </article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>De Vega</surname>
                        <given-names>Jose</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2847-5158</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Donnison</surname>
                        <given-names>Iain</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Dyer</surname>
                        <given-names>Sarah</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Farrar</surname>
                        <given-names>Kerrie</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Earlham Institute, Norwich, NR4 7UZ, UK</aff>
                <aff id="a2">
                    <label>2</label>Institute of Biological, Environmental &amp; Rural Sciences (IBERS) - Aberystwyth University, Aberystwyth, SY23 3EE, UK</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:jose.de-vega@earlham.ac.uk">jose.de-vega@earlham.ac.uk</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>18</day>
                <month>1</month>
                <year>2021</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2021</year>
            </pub-date>
            <volume>10</volume>
            <elocation-id>29</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>11</day>
                    <month>1</month>
                    <year>2021</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 De Vega J et al.</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/10-29/pdf"/>
            <abstract>
                <p>
                    <italic toggle="yes">Miscanthus sacchariflorus</italic> (Maxim.) Hack. is a highly productive C4 perennial rhizomatous biofuel grass crop. 
                    <italic toggle="yes">M. sacchariflorus</italic> is among the most widely distributed species in the genus, particularly at cold northern latitudes, and is one of the progenitor species of the commercial 
                    <italic toggle="yes">M. &#x00d7; giganteus</italic> genotypes. We generated a 2.54 Gb whole-genome assembly of the diploid 
                    <italic toggle="yes">M. sacchariflorus</italic> cv. &#x201c;Robustus 297&#x201d; genotype, which represented ~59% of the expected total genome size. We later anchored this assembly using the chromosomes from the 
                    <italic toggle="yes">M. sinensis</italic> genome to generate a second assembly with improved contiguity. We annotated 86,767 and 69,049 protein-coding genes in the unanchored and anchored assemblies, respectively. We estimated our assemblies included ~85% of the 
                    <italic toggle="yes">M. sacchariflorus</italic> genes based on homology and core markers. The utility of the new reference for genomic studies was evidenced by a 99% alignment rate of the RNA-seq reads from the same genotype.  The raw data, unanchored and anchored assemblies, and respective gene annotations are publicly available.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Miscanthus</kwd>
                <kwd>biofuel</kwd>
                <kwd>C4</kwd>
                <kwd>assembly</kwd>
                <kwd>annotation</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/501100000268">
                    <funding-source>Biotechnology and Biological Sciences Research Council</funding-source>
                    <award-id>BBS/E/T/000PR9818</award-id>
                    <award-id>BBS/E/W/10963A01A</award-id>
                </award-group>
                <funding-statement>This work was funded by core strategic funding from the Biotechnology and Biological Sciences Research Council (BBSRC) in projects BBS/E/T/000PR9818 and BBS/E/W/10963A01A.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>
                <italic toggle="yes">Miscanthus</italic> is a genus of C4 perennial rhizomatous grasses native to East Asia and Oceania, and naturally adapted to a wide range of climate zones and land types. 
                <italic toggle="yes">Miscanthus sacchariflorus</italic> is among the most widely distributed species within the genus. It originated in the Yellow Sea region of China and can be predominantly found in cool latitudes of East Asia with varying ploidy
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. 
                <italic toggle="yes">M. sacchariflorus</italic> occurs in both diploid (2n&#x2009;=&#x2009;38) and tetraploid (2n&#x2009;=&#x2009;76) forms, where tetraploid 
                <italic toggle="yes">M. sacchariflorus</italic> genotypes originated by autopolyploidy
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>. 
                <italic toggle="yes">M. sacchariflorus</italic> probably has the greatest winter hardiness among all the Saccharinae
                <sup>
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>.</p>
            <p>Natural interspecific 
                <italic toggle="yes">Miscanthus</italic> hybrids are commonly observed, even between individuals of different ploidy. For example, introgression of 
                <italic toggle="yes">M. sacchariflorus</italic> is often found among cultivated European 
                <italic toggle="yes">M. sinensis</italic> ecotypes
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>,
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. Furthermore, 
                <italic toggle="yes">M. x giganteus</italic>, a sterile triploid hybrid resulting from the hybridization between 
                <italic toggle="yes">M. sinensis</italic> and 
                <italic toggle="yes">M. sacchariflorus</italic>, is the predominant commercially grown species owing to its high biomass productivity and low chemical input requirements. The common occurrence of hybridization events and variable ploidy are challenging to the improvement of these bioenergy grasses and increase the need for genomic resources from different 
                <italic toggle="yes">Miscanthus</italic> species. A chromosomal-scale reference genome using a doubled-haploid 
                <italic toggle="yes">M. sinensis</italic> line was recently published
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>.</p>
            <p>We assembled, annotated and validated a draft genome from the diploid 
                <italic toggle="yes">M. sacchariflorus</italic> cv. &#x201c;Robustus 297&#x201d; genotype, as well as generating rhizome, stem and leaf RNA-Seq data from the same genotype. This dataset was previously used to verify that both 
                <italic toggle="yes">M. sinensis</italic> and 
                <italic toggle="yes">M. sacchariflorus</italic> share the same A/B ancestral tetraploidy
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. Here, we present the first draft genome of 
                <italic toggle="yes">M. sacchariflorus</italic>, the second 
                <italic toggle="yes">Miscanthus</italic> genome available after 
                <italic toggle="yes">M. sinensis</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <sec>
                <title>Plant materials and sequencing</title>
                <p>DNA was extracted from leaves from the diploid 
                    <italic toggle="yes">M. sacchariflorus</italic> cv. &#x201c;Robustus 297&#x201d; genotype (Biosample SAMN08580354) using the Qiagen DNeasy kit. RNA was also extracted from leaf, stem and root tissues from the same plant. All samples were taken from a plant grown from seed in trays in a glasshouse in 2009. This genotype is established and used in breeding at IBERS (Wales, UK). The RNA-seq libraries were deposited as part of previous work in the BioProject 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA639832">PRJNA639832</ext-link>.</p>
            </sec>
            <sec>
                <title>Whole genome sequencing and assembly</title>
                <p>We obtained ~5.86e9 pairs of 100 bp paired-end reads from an Illumina paired-end library with a 560 bp insert-size that was sequenced on Illumina HiSeq 2500 machines in rapid run mode by the Earlham Institute. This represents approximately 50X coverage of the heterozygous content and 100X coverage of the homozygous content of the genome. Read quality was assessed, and contaminants and adaptors removed using Kontaminant
                    <sup>
                        <xref ref-type="bibr" rid="ref-5">5</xref>
                    </sup>. These paired-end short-reads were assembled into 17M contigs with a total length of 3.27 Gb using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/bcgsc/abyss">ABySS</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-6">6</xref>
                    </sup> version 1.5.1, with default options and a kmer size of 71.</p>
                <p>We obtained ~141.1e6 pairs of reads from a Nextera 150 bp mate-pair library with approximately 7 Kb insert-size, which was used for scaffolding the previous contigs together with the paired-end reads, using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/nsoranzo/sspace_basic">SSPACE</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-7">7</xref>
                    </sup> without &#x201c;extension&#x201d; step. Nextera mate-pair reads were required to include a fragment of the adaptor to be used in the scaffolding step
                    <sup>
                        <xref ref-type="bibr" rid="ref-5">5</xref>
                    </sup>, and we filtered out sequences shorter than 500 bp. We obtained 589K scaffolds, a total length of 2.54 Gb with an N50 of 10.2 Kb. This whole-genome assembly was denominated &#x201c;Msac_v2&#x201d; and is deposited at NCBI in BioProject 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA679435">PRJNA679435</ext-link>.</p>
            </sec>
            <sec>
                <title>Gene model and functional annotations</title>
                <p>Our gene structure annotation pipeline
                    <sup>
                        <xref ref-type="bibr" rid="ref-8">8</xref>
                    </sup> used five sources of evidence that were provided to 
                    <ext-link ext-link-type="uri" xlink:href="http://bioinf.uni-greifswald.de/augustus/">AUGUSTUS</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-9">9</xref>
                    </sup> (version 2.7) for gene annotation: (1) Repetitive and low complexity regions of the scaffolds identified using 
                    <ext-link ext-link-type="uri" xlink:href="http://www.repeatmasker.org/RepeatMasker/">RepeatMasker</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-10">10</xref>
                    </sup> (version open-4.0.5) based on homology with the 
                    <ext-link ext-link-type="uri" xlink:href="https://www.girinst.org/repbase/">RepBase</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-11">11</xref>
                    </sup> public database (Release 20140131) and a new database of repeat elements identified in the assembly with 
                    <ext-link ext-link-type="uri" xlink:href="http://www.repeatmasker.org/RepeatModeler/">RepeatModeler</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-12">12</xref>
                    </sup>. The repeats annotation was deposited in Zenodo (See data availability); (2) exon-intron junctions identified by 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/infphilo/tophat">Tophat</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-13">13</xref>
                    </sup> (version 2.1.0); (3) 
                    <italic toggle="yes">de novo</italic> and genome-guided 
                    <italic toggle="yes">ab initio</italic> transcripts assembled with 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/cole-trapnell-lab/cufflinks">Trinity</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-14">14</xref>
                    </sup> (version 2.6.5 )and 
                    <ext-link ext-link-type="uri" xlink:href="http://cole-trapnell-lab.github.io/cufflinks/">Cufflinks</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-15">15</xref>
                    </sup> (version 2.2.1) from RNA-Seq reads obtained from several tissues from the same genotype; (4) 
                    <italic toggle="yes">ab initio</italic> gene models predicted by 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/KorfLab/SNAP">SNAP</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-16">16</xref>
                    </sup> (version 29-11-2013) and 
                    <ext-link ext-link-type="uri" xlink:href="https://genome.crg.cat/software/geneid/index.html">GeneID</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-17">17</xref>
                    </sup> (version 1.4.4); and (5) homology-based alignments of transcripts and proteins from 
                    <italic toggle="yes">Miscanthus sinensis</italic> and maize using Exonerate
                    <sup>
                        <xref ref-type="bibr" rid="ref-18">18</xref>
                    </sup> with a minimal identity of 0.7 and coverage of 0.7. Finally, AUGUSTUS
                    <sup>
                        <xref ref-type="bibr" rid="ref-9">9</xref>
                    </sup> was run with the options &#x201c;genemodel=complete&#x201d; and &#x201c;alternatives-from-evidence=true&#x201d; to ensure that the predicted genes were compatible with all the previous provided evidence.</p>
                <p>For the functional annotation of these predicted genes, translated gene sequences were compared with the NCBI non-redundant (nr 20170116) proteins and EBI&#x2019;s 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ebi.ac.uk/interpro/">InterPro</ext-link> (version 5.22.61) databases, and the results were imported into 
                    <ext-link ext-link-type="uri" xlink:href="https://www.blast2go.com/">Blast2GO</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-19">19</xref>
                    </sup> to annotate the GO and GO slim terms, enzymatic protein codes and KEGG pathways. A similar GO annotation from translated gene sequences can be done with 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/eggnogdb/eggnog-mapper">eggNOG-mapper</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-20">20</xref>
                    </sup>. These functional descriptors were deposited in Zenodo (See 
                    <italic toggle="yes">Underlying data</italic>).</p>
            </sec>
            <sec>
                <title>Anchoring the whole genome assembly using the Miscanthus sinensis reference</title>
                <p>To improve the genome contiguity, we anchored our 
                    <italic toggle="yes">M. sacchariflorus</italic> scaffolds to the Miscanthus sinensis genome
                    <sup>
                        <xref ref-type="bibr" rid="ref-4">4</xref>
                    </sup>. However, no nucleotide content from 
                    <italic toggle="yes">M. sinensis</italic> was incorporated in the 
                    <italic toggle="yes">M. sacchariflorus</italic> assemblies.</p>
                <p>Firstly, scaffolds longer than 2 kbps from the whole genome assembly &#x201c;Msac_v2&#x201d; were scaffolded again using SSPACE
                    <sup>
                        <xref ref-type="bibr" rid="ref-7">7</xref>
                    </sup> and the 
                    <italic toggle="yes">M. sinensis</italic> mate-pairs reads, the gaps between scaffolds were filled in with Ns. This new whole-genome assembly was denominated &#x201c;Msac_v3&#x201d;, and was deposited at NCBI in Bioproject 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA435476">PRJNA435476</ext-link>, under the GenBank accession 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/assembly/GCA_002993905.1/">GCA_002993905</ext-link>. It contains 137,916 scaffolds for a total of 2.074 Gb with an N50 of 25.6 Kbps. The gene annotation was projected to the &#x201c;Msac_v3&#x201d; assembly using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/PASApipeline/PASApipeline">PASA</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-21">21</xref>
                    </sup> (version 2.0.1): genes were aligned to the new assembly using GMAP, requiring a minimum identity of 0.85 and coverage of 0.55, and later validated using the default parameters in PASA.</p>
                <p>Finally, we obtained the chromosomal position in the 
                    <italic toggle="yes">M. sinensis</italic> chromosomes of the scaffolds from the &#x201c;Msac_v3&#x201d; assembly. Using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/bioinfologics/satsuma2">Satsuma2</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-22">22</xref>
                    </sup> (version untagged-330e3341a1151a978b37), we identified every perfect-identify match between both assemblies (3,635,504 matches in total). The coordinates of these matches in BED 8 format were used as input to the &#x201c;OrderOrientBySynteny&#x201d; script from Satsuma2, which identifies the best chromosomal position for each scaffold. These position coordinates are available as an AGP file as part of GCA_002993905, which anchors our final whole-genome assembly to 19 chromosomes (accessions 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CM009591.1">CM00959</ext-link> to 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/CM009609.1">CM009609</ext-link> in NCBI).</p>
            </sec>
            <sec>
                <title>Completeness assessment</title>
                <p>RNA-seq cleaned reads from each tissue were independently aligned to both assembly versions using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/alexdobin/STAR">STAR</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-23">23</xref>
                    </sup> (version 2.6.0c). 
                    <ext-link ext-link-type="uri" xlink:href="https://busco.ezlab.org/">BUSCO</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-24">24</xref>
                    </sup> (version 4.1.4) was used to assess completeness with the single-copy orthologs database for green plants (Viridiplantae, version 2020-09-10). Orthologs were identified using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/davidemms/OrthoFinder">Orthofinder2</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>
                    </sup> (version 2.3.12) with default parameters and the option &#x201c;-msa&#x201d;, which directly provided comprehensive statistics comparing the provided proteomes. All the proteomes from the other species used (
                    <xref ref-type="table" rid="T1">Table 1</xref>) were downloaded from 
                    <ext-link ext-link-type="uri" xlink:href="https://phytozome.jgi.doe.gov/pz/portal.html">Phytozome</ext-link> (v7.1 DOE-JGI). Genomes were aligned using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/lh3/minimap2">Minimap2</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup> (version 2.17) with the &#x201c;asm10&#x201d; parameter for related genomes, secondary alignments (tp:A:S) filtered out, and results visualised using 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/tpoorten/dotPlotly">dotPlotly</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-27">27</xref>
                    </sup> (Github version, latest updated on 4 May 2018).</p>
                <table-wrap id="T1" orientation="portrait" position="anchor">
                    <label>Table 1. </label>
                    <caption>
                        <title>Completeness statistics of the unanchored and anchored 
                            <italic toggle="yes">M. sacchariflorus</italic> whole-genome assemblies in comparison to the 
                            <italic toggle="yes">M. sinensis</italic> reference.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th colspan="1" rowspan="1"/>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">Msac_v2</italic>
                                    <break/>(unanchored)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">Msac_v3</italic>
                                    <break/>(anchored by 
                                    <italic toggle="yes">M. sinensis</italic>)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Reference:
                                    <break/>
                                    <italic toggle="yes">M. sinensis
                                        <sup>
                                            <xref ref-type="bibr" rid="ref-4">4</xref>
                                        </sup>
                                    </italic>.</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">NCBI bioproject</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">PRJNA679435</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">PRJNA435476
                                    <break/>(GCA_002993905)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">v.7.1 from Phytozome</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Length</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.539 Gb</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.074 Gb</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.68 Gbps</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Scaffolds</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">588,758 scaffolds</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">137,931 scaffolds
                                    <xref ref-type="other" rid="TFN1">*</xref>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">19 Chrs and 14,414
                                    <break/>scaffolds</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">N20</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">25.39 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">62.61 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">146.1 Mbps</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">N50</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">10.25 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">25.63 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">88.51 Mbps</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">N80</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.79 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">9.42 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">75.06 Mbps</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Max</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">378.48 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">458.83 Kbps</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">160.9 Mbps</td>
                            </tr>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="top">
                                    <underline>ANNOTATION</underline>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>Msac_v2</underline>
                                    </italic>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>Msac_v3</underline>
                                    </italic>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>M. sinensis</underline>
                                    </italic>
                                </th>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Gene models</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">81,431</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">68,578</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">67,967</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Proteins</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">86,767</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">68,578
                                    <xref ref-type="other" rid="TFN1">**</xref>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">67,789</td>
                            </tr>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="top">
                                    <underline>BUSCO</underline>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>Msac_v2</underline>
                                    </italic>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>Msac_v3</underline>
                                    </italic>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>M. sinensis</underline>
                                    </italic>
                                </th>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Complete</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">55.5%
                                    <break/>(48% in single copy)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">59.8%
                                    <break/>(50.4% in single copy)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">97.6%
                                    <break/>(36.2% in single copy)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Fragmented</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">32.2%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">26.4%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.6%</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Missing</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12.3%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">13.6%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.8%</td>
                            </tr>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="top">
                                    <underline>RNA MAPPING</underline>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>Msac_v2</underline>
                                    </italic>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>Msac_v3</underline>
                                    </italic>
                                </th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
                                    <italic toggle="yes">
                                        <underline>M. sinensis</underline>
                                    </italic>
                                </th>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Reads mapping in the genome
                                    <break/>once (root, stem and leaf)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">76.2%
                                    <break/>76.4%
                                    <break/>78.8%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">75%
                                    <break/>76.7% 
                                    <break/>78.1%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">78.8%
                                    <xref ref-type="other" rid="TFN1">***</xref>
                                    <break/>83.5%
                                    <break/>82.5%</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Reads mapping in the genome
                                    <break/>multiple times (root, stem and leaf)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">22.5%
                                    <break/>23%
                                    <break/>20.7%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">19.5%
                                    <break/>18.8%
                                    <break/>17.3%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">19.7%
                                    <xref ref-type="other" rid="TFN1">***</xref>
                                    <break/>15.5%
                                    <break/>16.6%</td>
                            </tr>
                        </tbody>
                    </table>
                    <table-wrap-foot>
                        <fn>
                            <p id="TFN1">*15 scaffolds from plastids were discarded during the deposit in NCBI resulting in 137,916 scaffolds. ** Only the longest transcript was considered in each projected locus. *** Cross-species alignments.</p>
                        </fn>
                    </table-wrap-foot>
                </table-wrap>
            </sec>
        </sec>
        <sec sec-type="results">
            <title>Results</title>
            <p>We produced two whole-genome assemblies for 
                <italic toggle="yes">M. sacchariflorus</italic> that we named &#x201c;Msac_v2&#x201d; and &#x201c;Msac_v3&#x201d;, with total lengths of 2.54 Gbps and 2.074 Gbps, respectively (
                <xref ref-type="table" rid="T1">Table 1</xref>). The difference in size is mainly a result of filtering 402 Mb from sequences under 2 kb in the latter before anchoring to the 
                <italic toggle="yes">M. sinensis</italic> genome. Our &#x201c;Msac_v2&#x201d; assembly covered ~59 % of 
                <italic toggle="yes">M. sacchariflorus</italic> genome size, which is estimated to be 4.3 Gb
                <sup>
                    <xref ref-type="bibr" rid="ref-28">28</xref>
                </sup>. Approximately 40% of the assembly was composed by transposable elements (987.3 Mb; 
                <xref ref-type="table" rid="T2">Table 2</xref>), including 491 Mb (19.4%) and 154 Mb (6.1%) by copies of the Gypsy and Copia LTRs, respectively; and 180 Mb (7.1%) by several class 2 DNA transposons (MULE, CMC, Harbinger, etc.)</p>
            <table-wrap id="T2" orientation="portrait" position="anchor">
                <label>Table 2. </label>
                <caption>
                    <title>Transposable elements identified in the Miscanthus sacchariflorus genome.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="center" colspan="1" rowspan="1" valign="middle">Category</th>
                            <th align="center" colspan="1" rowspan="1" valign="middle">Superfamily</th>
                            <th align="center" colspan="1" rowspan="1" valign="middle">Coverage(bp)</th>
                            <th align="center" colspan="1" rowspan="1" valign="middle">Fraction
                                <break/>(2.539 Gb)</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="center" colspan="1" rowspan="6" valign="middle">Class 1 TEs:
                                <break/>retrotransposons
                                <break/>(copy and paste)</td>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Gypsy LTR</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">491,915,558</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">19.37%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Copia LTR</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">154,244,411</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">6.08%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Other LTRs</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">87,661,401</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">3.45%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">SINEs</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">5,029,476</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.20%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">LINEs</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">25,076,275</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.99%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Other non-LTR
                                <break/>retrotransposons</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">29,192,841</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">1.15%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="7" valign="middle">Class 2 TEs: DNA
                                <break/>transposons (cut
                                <break/>and paste)</td>
                            <td align="center" colspan="1" rowspan="1" valign="middle">hAT</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">10,722,644</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.42%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Harbinger/PIF</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">24,553,614</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.97%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">MULE/MuDR</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">29,733,691</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">1.17%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Stowaway/TcMar</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">14,112,359</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.56%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">CMC_EnSpm</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">56,449,907</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">2.22%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Helitron</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">10,601,152</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.42%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Other</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">34,676,501</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">1.37%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Unclassified TEs</td>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Unclassified TEs</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">5,934,794</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.23%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="2" valign="middle">Non TEs</td>
                            <td align="center" colspan="1" rowspan="1" valign="middle">Satellites</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">5,339,464</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.21%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">snRNAs</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">23,147</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">0.00%</td>
                        </tr>
                        <tr>
                            <td align="center" colspan="1" rowspan="1" valign="middle">TOTAL</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1" valign="middle">985,267,235</td>
                            <td align="right" colspan="1" rowspan="1" valign="middle">38.81%</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>We identified 219,394 primary alignments longer than 2 kb between the unanchored 
                <italic toggle="yes">M. sacchariflorus</italic> (&#x201c;Msac_v2&#x201d;) and 
                <italic toggle="yes">M. sinensis</italic>. The resulting dotplot (
                <xref ref-type="fig" rid="f1">Figure 1</xref>) shows the conserved synteny between both species, which diverged 1.6 Mya
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. 
                <xref ref-type="fig" rid="f1">Figure 1</xref> also shows the highly conserved synteny between the pairs of homoeologous chromosomes (e.g. green boxes in chromosomes one and two), and the fusion in chromosome 7 of the chromosome homeolog to chromosome 13; which was also reported in 
                <italic toggle="yes">M. sinensis</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. There are several large inversions between chromosomes 9 and 10, and 3 and 4 (cyan boxes in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>). Our assembly of a heterozygous genotype resulted in multiple heterotigs (heterozygous contigs) containing the alternative or secondary haplotypes (e.g. pink boxes in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>).</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>Conserved synteny between 
                        <italic toggle="yes">M. sacchariflorus</italic> and 
                        <italic toggle="yes">M. sinensis</italic> genomes.</title>
                    <p>The plot shows the primary alignments longer than 2 kbps between both species. The 
                        <italic toggle="yes">M. sacchariflorus</italic> scaffolds (Y-axis) have been sorted by their coordinates in 
                        <italic toggle="yes">M. sinensis</italic> chromosomes (X-axis). Large homoeologous blocks and chromosomal rearrangements are highlighted in boxes. </p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/47786/59783fd6-44c0-44b3-ae44-993e29910d65_figure1.gif"/>
            </fig>
            <p>The utility of our assemblies for genomic studies is evidenced by the proportion of RNA-seq from three different tissues from the same 
                <italic toggle="yes">M. sacchariflorus</italic> genotype that aligned to the assemblies. On average 99% and 95% of the RNA-seq reads aligned in &#x201c;Msac_v2&#x201d; and &#x201c;Msac_v3&#x201d;, respectively (
                <xref ref-type="table" rid="T1">Table 1</xref>).</p>
            <p>We estimated that we assembled more than 85% of the 
                <italic toggle="yes">M. sacchariflorus</italic> genes. Furthermore, our assemblies include several alleles of genes in the heterozygous regions of the genome, while the 
                <italic toggle="yes">M. sinensis</italic> reference was generated from a double-haplotyped genotype. The estimation of the proportion of assembled genes (~85%) was supported by (1) the results from BUSCO, which reported 86.4&#x2013;87.7% of presented core genes, of which ~2/3rds were complete (
                <xref ref-type="table" rid="T1">Table 1</xref>); and (2) the difference in the number of proteins from related species for which we can identify an ortholog in 
                <italic toggle="yes">M. sacchariflorus</italic> compared to 
                <italic toggle="yes">M. sinensis</italic>, as control, using Orthofinder2 (
                <xref ref-type="table" rid="T3">Table 3</xref>).</p>
            <table-wrap id="T3" orientation="portrait" position="anchor">
                <label>Table 3. </label>
                <caption>
                    <title>Number of orthologs between 
                        <italic toggle="yes">Miscanthus sinensis</italic> (Msin), 
                        <italic toggle="yes">Setaria italica</italic> (Sita; foxtail millet), 
                        <italic toggle="yes">Sorghum bicolor</italic> (Sbic; sorghum), 
                        <italic toggle="yes">Zea mays</italic> (Zma; maize), and 
                        <italic toggle="yes">Panicum virgatum</italic> (Pvi; switchgrass) obtained using Orthofinder 2.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Orthologs</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Msac_v2</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Msac_v3</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Msin</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Sita</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Sbic</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Zma</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Pvi</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">From Msac_v2 (86,767)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">-</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">NA</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">44,151 (50.9%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">36,904 (42.5%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">37,219 (42.9%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">38,478 (44.3%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">45,792 (52.8%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">From Msac_v3 (68,578)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">NA</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">-</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">38,122 (55.6%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">32,273 (47.1%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">32,296 (47.1%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">33,395 (48.7%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">38,755 (56.5%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">From Msin (67,789)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">43,739 (64.5%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">37,501 (55%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">-</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">41,532 (64.1%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">43,475 (64.1%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">39,986 (58.9%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">45,913 (67.7%)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">From Sita (40,599)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">26,846 (66.1%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">23,559 (58%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">28,473 (70.1%)</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">From Sbic (39,441)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">27,877 (70.7%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">24,125 (61.2%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">30,907 (78.4%)</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">From Zma (88,760)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">41,530 (46.8%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">35,955 (40.5%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">41,784 (47.1%)</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">From Pvi (125,439)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">63,692 (50.8%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">56,120 (44.7%)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">64,271 (51.2%)</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>Based on the results from Orthofinder2 (
                <xref ref-type="table" rid="T3">Table 3</xref>), we found orthologs in 
                <italic toggle="yes">M. sacchariflorus</italic> for 64.5% of the 
                <italic toggle="yes">M.sinensis</italic> annotated proteins, so we estimated ~1/3
                <sup>rd</sup> of the Miscanthus proteins to be specific to each species. On the other hand, we estimated that ~3,000 genes may be missing in the &#x201c;Msac_v2&#x201d; annotation based on the number of 
                <italic toggle="yes">Sorghum bicolor</italic> proteins with orthologues in 
                <italic toggle="yes">M. sinensis</italic> but absent in 
                <italic toggle="yes">M. sacchariflorus</italic>. Better estimations were obtained with the other four species, where the genes absent in Msac_v2 compared with 
                <italic toggle="yes">M. sinensis</italic> were estimated to be 254, 579 and 1627 (
                <xref ref-type="table" rid="T3">Table 3</xref>). Additionally, ~6,000 genes could be missed in &#x201c;Msac_v3&#x201d; compared to &#x201c;Msac_v2&#x201d; based on the difference in the number of 
                <italic toggle="yes">M. sinensis</italic> orthologues in each assembly (
                <xref ref-type="table" rid="T3">Table 3</xref>). This is likely from genes in the sequences shorter than 2 Kbps (totalling 402 Mbps) that were filtered out before anchoring. There was a large difference in the proportion of &#x201c;fragmented&#x201d; BUSCO genes found in the 
                <italic toggle="yes">M. sacchariflorus</italic> (32.2%) and 
                <italic toggle="yes">M. sinensis</italic> (1.6%) assemblies (
                <xref ref-type="table" rid="T1">Table 1</xref>). To assess if that difference had an effect on the quality of the annotation, we compared the number of proteins from 
                <italic toggle="yes">M. sacchariflorus</italic> and 
                <italic toggle="yes">M. sinensis</italic> for which we can identify an ortholog in another species (
                <xref ref-type="table" rid="T3">Table 3</xref>); we found the difference between both 
                <italic toggle="yes">Miscanthus</italic> species ranged between 6,571 proteins when compared to sorghum (43,475 to 37,219; 
                <xref ref-type="table" rid="T2">Table 2</xref>) to only 121 when compared to maize (39,986 to 38,478, 
                <xref ref-type="table" rid="T3">Table 3</xref>).</p>
            <p>In conclusion, our 
                <italic toggle="yes">M. sacchariflorus</italic> genome can served as the basis for functional genetic analyses on 
                <italic toggle="yes">Miscanthus,</italic> one of the main biofuel grass crops used in temperate latitudes. However, there are opportunities to improve it using new approaches, such as long-reads.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <sec>
                <title>Underlying data</title>
                <p>NCBI BioProject: Miscanthus sacchariflorus cultivar:Robustus 297. Accession number 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA435476">PRJNA435476</ext-link>; 
                    <ext-link ext-link-type="uri" xlink:href="https://identifiers.org/bioproject:PRJNA435476">https://identifiers.org/bioproject:PRJNA435476</ext-link>.</p>
                <p>This BioProject contains the raw paired-end and mate-pair reads.</p>
                <p>NCBI BioProject: RNA-seq Miscanthus hybrids with contrasting phenotypes. Accession number PRJNA639832; 
                    <ext-link ext-link-type="uri" xlink:href="https://identifiers.org/bioproject:PRJNA639832">https://identifiers.org/bioproject:PRJNA639832</ext-link>.</p>
                <p>This BioProject contains RNA-seq reads, deposited as part of a previous project
                    <sup>
                        <xref ref-type="bibr" rid="ref-29">29</xref>
                    </sup>.</p>
                <p>NCBI BioProject: Miscanthus sacchariflorus cultivar:Robustus 297. Accession number 
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA679435">PRJNA679435</ext-link>; 
                    <ext-link ext-link-type="uri" xlink:href="https://identifiers.org/bioproject:PRJNA679435">https://identifiers.org/bioproject:PRJNA679435</ext-link>.</p>
                <p>This Bioproject contains the unanchored &#x201c;Msac_v2&#x201d; assemblies and gene annotations under accession JADQCR000000000.</p>
                <p>The anchored &#x201c;Msac_v3&#x201d; assemblies and gene annotations are deposited under accession accession GCA_002993905 under Bioproject PRJNA435476.</p>
                <p>The chromosomal positions in the 
                    <italic toggle="yes">M. sinensis</italic> chromosomes of the scaffolds from the &#x201c;Msac_v3&#x201d; assembly are available in an AGP file as part of GCA_002993905, which places the scaffolds in 19 chromosomes (accessions CM009591 to CM009609 in NCBI).</p>
                <p>Zenodo: Supplementary dataset to "Draft genome assembly of the biofuel grass crop Miscanthus sacchariflorus". 
                    <ext-link ext-link-type="uri" xlink:href="http://doi.org/10.5281/zenodo.4270235">http://doi.org/10.5281/zenodo.4270235</ext-link>.</p>
                <p>This project contains the assemblies in FASTA format, gene annotations in GFF3 format, functional annotations in tabulated text format, and AGP file with anchoring information.</p>
                <p>Data deposited with Zenodo are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International license</ext-link> (CC-BY 4.0).</p>
            </sec>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Clark</surname>
                            <given-names>LV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jin</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Petersen</surname>
                            <given-names>KK</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Population structure of 
                        <italic toggle="yes">Miscanthus sacchariflorus</italic> reveals two major polyploidization events, tetraploid-mediated unidirectional introgression from diploid 
                        <italic toggle="yes">M. sinensis</italic>, and diversity centred around the Yellow Sea.</article-title>
                    <source>

                        <italic toggle="yes">Ann Bot.</italic>
</source>
                    <year>2019</year>;<volume>124</volume>(<issue>4</issue>):<fpage>731</fpage>&#x2013;<lpage>48</lpage>.
                    <pub-id pub-id-type="pmid">30247525</pub-id>
                    <pub-id pub-id-type="doi">10.1093/aob/mcy161</pub-id>
                    <pub-id pub-id-type="pmcid">6821896</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dwiyanti</surname>
                            <given-names>MS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rudolph</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Swaminathan</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genetic analysis of putative triploid 
                        <italic toggle="yes">Miscanthus</italic> hybrids and tetraploid 
                        <italic toggle="yes">M. sacchariflorus</italic> collected from sympatric populations of Kushima, Japan.</article-title>
                    <source>

                        <italic toggle="yes">Bioenergy Res.</italic>
</source>
                    <year>2013</year>;<volume>6</volume>(<issue>2</issue>):<fpage>486</fpage>&#x2013;<lpage>93</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s12155-012-9274-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Clark</surname>
                            <given-names>LV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dzyubenko</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dzyubenko</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Ecological characteristics and 
                        <italic toggle="yes">in situ</italic> genetic associations for yield-component traits of wild 
                        <italic toggle="yes">Miscanthus</italic> from eastern Russia.</article-title>
                    <source>

                        <italic toggle="yes">Ann Bot.</italic>
</source>
                    <year>2016</year>;<volume>118</volume>(<issue>5</issue>):<fpage>941</fpage>&#x2013;<lpage>55</lpage>.
                    <pub-id pub-id-type="pmid">27451985</pub-id>
                    <pub-id pub-id-type="doi">10.1093/aob/mcw137</pub-id>
                    <pub-id pub-id-type="pmcid">5055818</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mitros</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Session</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>James</surname>
                            <given-names>BT</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genome biology of the paleotetraploid perennial biomass crop 
                        <italic toggle="yes">Miscanthus</italic>.</article-title>
                    <source>

                        <italic toggle="yes">Nat Commun.</italic>
</source>
                    <year>2020</year>;<volume>11</volume>(<issue>1</issue>):<fpage>5442</fpage>.
                    <pub-id pub-id-type="pmid">33116128</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41467-020-18923-6</pub-id>
                    <pub-id pub-id-type="pmcid">7595124</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Leggett</surname>
                            <given-names>RM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ramirez-Gonzalez</surname>
                            <given-names>RH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Clavijo</surname>
                            <given-names>BJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sequencing quality assessment tools to enable data-driven informatics for high throughput genomics.</article-title>
                    <source>

                        <italic toggle="yes">Front Genet.</italic>
</source>
                    <year>2013</year>;<volume>4</volume>:<fpage>288</fpage>.
                    <pub-id pub-id-type="pmid">24381581</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fgene.2013.00288</pub-id>
                    <pub-id pub-id-type="pmcid">3865868</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Simpson</surname>
                            <given-names>JT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wong</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jackman</surname>
                            <given-names>SD</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>ABySS: a parallel assembler for short read sequence data.</article-title>
                    <source>

                        <italic toggle="yes">Genome Res.</italic>
</source>
                    <year>2009</year>;<volume>19</volume>(<issue>6</issue>):<fpage>1117</fpage>&#x2013;<lpage>23</lpage>.
                    <pub-id pub-id-type="pmid">19251739 </pub-id>
                    <pub-id pub-id-type="doi">10.1101/gr.089532.108</pub-id>
                    <pub-id pub-id-type="pmcid">2694472</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Boetzer</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Henkel</surname>
                            <given-names>CV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jansen</surname>
                            <given-names>HJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Scaffolding pre-assembled contigs using SSPACE.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2011</year>;<volume>27</volume>(<issue>4</issue>):<fpage>578</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">21149342</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btq683</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>De Vega</surname>
                            <given-names>JJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ayling</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hegarty</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Red clover (
                        <italic toggle="yes">Trifolium pratense</italic> L.) draft genome provides a platform for trait improvement.</article-title>
                    <source>

                        <italic toggle="yes">Sci Rep.</italic>
</source>
                    <year>2015</year>;<volume>5</volume>:<fpage>17394</fpage>.
                    <pub-id pub-id-type="pmid">26617401</pub-id>
                    <pub-id pub-id-type="doi">10.1038/srep17394</pub-id>
                    <pub-id pub-id-type="pmcid">4663792</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Stanke</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Keller</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gunduz</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>AUGUSTUS: 
                        <italic toggle="yes">ab initio</italic> prediction of alternative transcripts.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2006</year>;<volume>34</volume>(<issue>Web Server issue</issue>):<fpage>W435</fpage>&#x2013;<lpage>W9</lpage>.
                    <pub-id pub-id-type="pmid">16845043</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkl200</pub-id>
                    <pub-id pub-id-type="pmcid">1538822</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tarailo&#x2010;Graovac</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>N</given-names>
                        </name>
</person-group>:
                    <article-title>Using RepeatMasker to identify repetitive elements in genomic sequences.</article-title>
                    <source>

                        <italic toggle="yes">Curr Protoc Bioinformatics.</italic>
</source>
                    <year>2009</year>;<volume>25</volume>(<issue>1</issue>): Chapter 4: Unit 4.10.
                    <pub-id pub-id-type="pmid">19274634</pub-id>
                    <pub-id pub-id-type="doi">10.1002/0471250953.bi0410s25</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jurka</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kapitonov</surname>
                            <given-names>VV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pavlicek</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Repbase Update, a database of eukaryotic repetitive elements.</article-title>
                    <source>

                        <italic toggle="yes">Cytogenet Genome Res.</italic>
</source>
                    <year>2005</year>;<volume>110</volume>(<issue>1&#x2013;4</issue>):<fpage>462</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">16093699</pub-id>
                    <pub-id pub-id-type="doi">10.1159/000084979</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Smit</surname>
                            <given-names>AF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hubley</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>RepeatModeler Open-1.0</article-title>.<year>2008</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://www.scienceopen.com/document?vid=c1050e3d-8dc1-48ba-81c8-b372af25be50">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Trapnell</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pachter</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Salzberg</surname>
                            <given-names>SL</given-names>
                        </name>
</person-group>:
                    <article-title>TopHat: discovering splice junctions with RNA-Seq.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2009</year>;<volume>25</volume>(<issue>9</issue>):<fpage>1105</fpage>&#x2013;<lpage>11</lpage>.
                    <pub-id pub-id-type="pmid">19289445</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btp120</pub-id>
                    <pub-id pub-id-type="pmcid">2672628</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Grabherr</surname>
                            <given-names>MG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Haas</surname>
                            <given-names>BJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yassour</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data.</article-title>
                    <source>

                        <italic toggle="yes">Nat Biotechnol.</italic>
</source>
                    <year>2011</year>;<volume>29</volume>(<issue>7</issue>):<fpage>644</fpage>&#x2013;<lpage>52</lpage>.
                    <pub-id pub-id-type="pmid">21572440</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.1883</pub-id>
                    <pub-id pub-id-type="pmcid">3571712</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Trapnell</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Roberts</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Goff</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.</article-title>
                    <source>

                        <italic toggle="yes">Nat Protoc.</italic>
</source>
                    <year>2012</year>;<volume>7</volume>(<issue>3</issue>):<fpage>562</fpage>&#x2013;<lpage>78</lpage>.
                    <pub-id pub-id-type="pmid">22383036</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nprot.2012.016</pub-id>
                    <pub-id pub-id-type="pmcid">3334321</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bromberg</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rost</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>SNAP: predict effect of non-synonymous polymorphisms on function.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2007</year>;<volume>35</volume>(<issue>11</issue>):<fpage>3823</fpage>&#x2013;<lpage>35</lpage>.
                    <pub-id pub-id-type="pmid">17526529</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkm238</pub-id>
                    <pub-id pub-id-type="pmcid">1920242</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Blanco</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Parra</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Guig&#x00f3;</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>Using geneid to identify genes.</article-title>
                    <source>

                        <italic toggle="yes">Curr Protoc Bioinformatics.</italic>
</source>
                    <year>2007</year>; Chapter 4: Unit 4.3.
                    <pub-id pub-id-type="pmid">18428791</pub-id>
                    <pub-id pub-id-type="doi">10.1002/0471250953.bi0403s18</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Slater</surname>
                            <given-names>GSC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Birney</surname>
                            <given-names>E</given-names>
                        </name>
</person-group>:
                    <article-title>Automated generation of heuristics for biological sequence comparison.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2005</year>;<volume>6</volume>(<issue>1</issue>):<fpage>31</fpage>.
                    <pub-id pub-id-type="pmid">15713233</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-6-31</pub-id>
                    <pub-id pub-id-type="pmcid">553969</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Conesa</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>G&#x00f6;tz</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Garc&#x00ed;a-G&#x00f3;mez</surname>
                            <given-names>JM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2005</year>;<volume>21</volume>(<issue>18</issue>):<fpage>3674</fpage>&#x2013;<lpage>6</lpage>.
                    <pub-id pub-id-type="pmid">16081474</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/bti610</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Huerta-Cepas</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Forslund</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Coelho</surname>
                            <given-names>LP</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper.</article-title>
                    <source>

                        <italic toggle="yes">Mol Biol Evol.</italic>
</source>
                    <year>2017</year>;<volume>34</volume>(<issue>8</issue>):<fpage>2115</fpage>&#x2013;<lpage>22</lpage>.
                    <pub-id pub-id-type="pmid">28460117</pub-id>
                    <pub-id pub-id-type="doi">10.1093/molbev/msx148</pub-id>
                    <pub-id pub-id-type="pmcid">5850834</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Haas</surname>
                            <given-names>BJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Delcher</surname>
                            <given-names>AL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mount</surname>
                            <given-names>SM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Improving the 
                        <italic toggle="yes">Arabidopsis</italic> genome annotation using maximal transcript alignment assemblies.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2003</year>;<volume>31</volume>(<issue>19</issue>):<fpage>5654</fpage>&#x2013;<lpage>66</lpage>.
                    <pub-id pub-id-type="pmid">14500829</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkg770</pub-id>
                    <pub-id pub-id-type="pmcid">206470</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Clavijo</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wright</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yanes</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>.
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/bioinfologics/satsuma2">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dobin</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Davis</surname>
                            <given-names>CA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schlesinger</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>STAR: ultrafast universal RNA-seq aligner.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2013</year>;<volume>29</volume>(<issue>1</issue>):<fpage>15</fpage>&#x2013;<lpage>21</lpage>.
                    <pub-id pub-id-type="pmid">23104886</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/bts635</pub-id>
                    <pub-id pub-id-type="pmcid">3530905</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sim&#x00e3;o</surname>
                            <given-names>FA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Waterhouse</surname>
                            <given-names>RM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ioannidis</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2015</year>;<volume>31</volume>(<issue>19</issue>):<fpage>3210</fpage>&#x2013;<lpage>2</lpage>.
                    <pub-id pub-id-type="pmid">26059717</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btv351</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Emms</surname>
                            <given-names>DM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kelly</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>OrthoFinder: phylogenetic orthology inference for comparative genomics.</article-title>
                    <source>

                        <italic toggle="yes">Genome Biol.</italic>
</source>
                    <year>2019</year>;<volume>20</volume>(<issue>1</issue>):<fpage>238</fpage>.
                    <pub-id pub-id-type="pmid">31727128</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s13059-019-1832-y</pub-id>
                    <pub-id pub-id-type="pmcid">6857279</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>Minimap2: pairwise alignment for nucleotide sequences.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2018</year>;<volume>34</volume>(<issue>18</issue>):<fpage>3094</fpage>&#x2013;<lpage>100</lpage>.
                    <pub-id pub-id-type="pmid">29750242</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/bty191</pub-id>
                    <pub-id pub-id-type="pmcid">6137996</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Poorten</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>.
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/tpoorten/dotPlotly">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rayburn</surname>
                            <given-names>AL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Crawford</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rayburn</surname>
                            <given-names>CM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genome Size of Three 
                        <italic toggle="yes">Miscanthus</italic> Species.</article-title>
                    <source>

                        <italic toggle="yes">Plant Mol Biol Report.</italic>
</source>
                    <year>2009</year>;<volume>27</volume>(<issue>2</issue>):<fpage>184</fpage>.
                    <pub-id pub-id-type="doi">10.1007/s11105-008-0070-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>De Vega</surname>
                            <given-names>JJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Peel</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Purdy</surname>
                            <given-names>SJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Differential expression of starch and sucrose metabolic genes linked to varying biomass yield in 
                        <italic toggle="yes">Miscanthus</italic> hybrids.</article-title>
                    <source>

                        <italic toggle="yes">BioRxiv.</italic>
</source>
                    <year>2020</year>;<fpage>2020</fpage>&#x2013;<lpage>08</lpage>.
                    <pub-id pub-id-type="doi">10.1101/2020.08.04.236885</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report78040">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.47786.r78040</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Dwiyanti</surname>
                        <given-names>Maria Stefanie</given-names>
                    </name>
                    <xref ref-type="aff" rid="r78040a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9042-0612</uri>
                </contrib>
                <aff id="r78040a1">
                    <label>1</label>Research Faculty of Agriculture, Hokkaido University, Sapporo, Japan</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>1</day>
                <month>3</month>
                <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Dwiyanti MS</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport78040" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.44714.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The availability of 
                <italic>Miscanthus sacchariflorus</italic> genome sequence will be useful for 
                <italic>Miscanthus</italic> related research, particularly in bioenergy related topics.&#x00a0;</p>
            <p> </p>
            <p> 
                <italic>"Better estimations were obtained with the other four species, where the genes absent in Msac_v2 compared with M. sinensis were estimated to be 254, 579 and 1627 (Table 3)."</italic> 
                <list list-type="bullet">
                    <list-item>
                        <p>I found that the difference between number of genes in 
                            <italic>"Msac_v2"</italic> compared to other four species is larger than 254, 579, and 1627; or the way I look into the table is wrong?</p>
                    </list-item>
                    <list-item>
                        <p>Perhaps the sentence above can be reworded so we can easily compare with the Table 3 content.</p>
                    </list-item>
                </list> Also, what are the predicted functions of genes absent in 
                <italic>"Msac_v2"</italic> compared to 
                <italic>M.sinensis</italic>? 
                <list list-type="bullet">
                    <list-item>
                        <p>This information may provide some clues to trait difference between 
                            <italic>M. sacchariflorus</italic> and 
                            <italic>M.sinensis.</italic>
                        </p>
                    </list-item>
                </list>
            </p>
            <p>Are sufficient details of methods and materials provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Is the rationale for creating the dataset(s) clearly described?</p>
            <p>Yes</p>
            <p>Are the datasets clearly presented in a useable and accessible format?</p>
            <p>Yes</p>
            <p>Are the protocols appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Plant genetics and genomics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report77622">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.47786.r77622</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ria&#x00f1;o-Pach&#x00f3;n</surname>
                        <given-names>Diego Mauricio</given-names>
                    </name>
                    <xref ref-type="aff" rid="r77622a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9803-3465</uri>
                </contrib>
                <aff id="r77622a1">
                    <label>1</label>Computational, Evolutionary and Systems Biology Laboratory, Center for Nuclear Energy in Agriculture (CENA), University of S&#x00e3;o Paulo, Piracicaba, Brazil</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>25</day>
                <month>2</month>
                <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Ria&#x00f1;o-Pach&#x00f3;n DM</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport77622" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.44714.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The Data Note, "Draft genome assembly of the biofuel grass crop&#x00a0;
                <italic>Miscanthus sacchariflorus</italic>", introduces two 
                <italic>Miscanthus sacchariflorus</italic> genome assemblies, which have been deposited in NCBI under the bioprojects:&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA679435">PRJNA679435</ext-link>&#x00a0;and&#x00a0;
                <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA435476">PRJNA435476</ext-link>. Genome sequencing was carried out with Illumina paired end reads and mate-pairs, the assemblies are greatly fragmented, which is expected due to the sequencing technologies used. This is the first 
                <italic>Miscanthus sacchariflorus</italic> genome assembly, which is of interested for the bioenergy community, and can be used to generate insigths with the genomes of other bioenergy crops, like sorghum and sugarcane.</p>
            <p> 
                <bold>Suggestions:</bold> 
                <list list-type="bullet">
                    <list-item>
                        <p>Look for contaminant organisms in the final assemblies using BlobPlots.</p>
                    </list-item>
                    <list-item>
                        <p>Provide GenomeScope and Smudgeplots for the clean reads, to generate further statistics prior to assembly.</p>
                    </list-item>
                </list>
            </p>
            <p>Are sufficient details of methods and materials provided to allow replication by others?</p>
            <p>No</p>
            <p>Is the rationale for creating the dataset(s) clearly described?</p>
            <p>Yes</p>
            <p>Are the datasets clearly presented in a useable and accessible format?</p>
            <p>Yes</p>
            <p>Are the protocols appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Bioinformatics, genome assembly and annotation.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
