ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Data Note

Initial genome sequencing of the sugarcane CP 96-1252 complex hybrid

[version 1; peer review: 2 approved]
PUBLISHED 17 May 2017
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Agriculture, Food and Nutrition gateway.

This article is included in the Genomics and Genetics gateway.

This article is included in the Data: Use and Reuse collection.

Abstract

The CP 96-1252 cultivar of sugarcane is a complex hybrid of commercial importance. DNA was extracted from lab-grown leaf tissue and sequenced. The raw Illumina DNA sequencing results provide 101 Gbp of genome sequence reads. The dataset is available from https://www.ncbi.nlm.nih.gov/bioproject/PRJNA345486/.

Keywords

Sugarcane genome, DNA sequencing, sequencing reads

Introduction

Sugarcane is an important crop for food and energy production. The genomes of modern cultivars are hybrids of species that are themselves polyploid; see for example (Vilela et al., 2017). Selected genomic BAC sequences have been sequenced and assembled (de Setta et al., 2014) (Okura et al., 2016). Chloroplast and mitochondrial genomes have been published (Asano et al., 2004) (Shearman et al., 2016), as have several transcriptomes (Cardoso-Silva et al., 2014). Whole genome sequence assemblies have not been published. CP 96-1252 is the top commercial sugarcane cultivar in Florida, USA (Sandhu & Davidson, 2016). CP 96-1252 was developed by USDA-ARS, the University of Florida, and the Florida Sugar Cane League and released to growers in 2003. CP 96-1252 is a complex hybrid of Saccharum officinarum L., S. barberi Jeswiet, S. spontaneum L., and S. sinense Roxb. amend. Jeswiet (Edmé et al., 2005). Toward better understanding of this cultivar through its genome sequence, DNA reads were generated and made public.

Methods

Using lab-grown plantlets, kindly provided by USDA, 14 g of tissue was harvested from the leaves of Saccharum hybrid cultivar CP 96-1252 (Reg. no CV-120, PI 634935, NCBI taxon ID 1983727). DNA was extracted from purified plant nuclei at Amplicon Express (Pullman, WA, USA). Separately, DNA was extracted from whole cells at JCVI (Rockville, MD, USA) using a Qiagen Plant DNA isolation kit. Extracted DNA was fragmented and size selected on the Blue Pippin (Sage Scientific) prior to library construction to ensure a 260 bp insert size. Standard Illumina PE libraries were generated using the NEBNext kit (NEB). Libraries were size selected, QC’d and quantified by qPCR prior to sequencing. Barcode BS78 AGCCATGC was used for the nuclei prep library and barcode BS79 AGGCTAAC was used for the cell prep library. The libraries were generated and sequenced at the JCVI sequencing core in La Jolla, CA, USA. To test for bacterial contamination, both DNA samples plus negative controls were used to generate amplicon libraries targeting the V4 16S region followed by Illumina MiSeq sequencing. These reads were processed by a pipeline using usearch version 8.1.1.1861 for clustering (Edgar, 2017), mothur version 1.36.1 for taxonomic classification (Schloss et al., 2011), and the SILVA SSURef NR99 123 database for reference (Quast et al., 2013). Hits to chloroplast and mitochondria were observed as expected, but bacteria were virtually absent and similar to controls.

An Illumina NextSeq 500 instrument was used to generate paired 150 bp shotgun reads. Run #1 applied the Illumina High Output kit to libraries BS78 and BS79. Run #1 instrument metrics were: 1.8 pM pool loaded, 1% PhiX spike-in with 1.8% aligned, cluster density 138 K/mm2, 96% pass filter, and 106 Gbp in 345 M PE reads. Barcode analysis indicated 46% BS78 and 49% BS79. Run #2 applied the Illumina High Output kit to library BS78 only. Run #2 metrics were: 1.8 pM pool loaded, 1% PhiX spike-in with 1% aligned, and 110 Gbp in 360 M PE reads. The resulting FASTQ files contained 101 Gbp in 161 M pairs from BS78 run #1, 169 M pairs from BS79 run #1, and 341 M pairs from BS78 run #2.

Dataset validation

To confirm sugarcane origin of the DNA, the run #1 reads were mapped to available BACs, namely the 608 Kbp of R570 BACs (GenBank accessions KF184657.1 to KF184973.1 (de Setta et al., 2014)). Reads were mapped with bowtie2 (Langmead & Salzberg, 2012) version 2.2.5 with options “-p 4 --no-unal --no-mixed --no-discordant --end-to-end --fast”. Both sequencing libraries demonstrated concordant pair mapping rates of 4.1% unique, 27% repeat, and 69% unmapped. Genome coverage analysis was inconclusive; the K-mer frequency distribution computed by Jellyfish (Marçais & Kingsford, 2011) version 2.2.4, with K=17 showed no peak above 1X coverage.

Data availability

The data are available at NCBI SRA under BioProject PRJNA345486, Study SRP091668. Amplified reads from BS78 and BS79 have respective accessions SRR5500242 and SRR5500243. Genomic reads from BS78 have accessions are SRR5500246 and SRR5500247. Genomic reads from BS79 have accession SRR5500249.

Comments on this article Comments (1)

Version 1
VERSION 1 PUBLISHED 17 May 2017
  • Author Response 29 Jun 2017
    Jason Miller, Shepherd University, USA
    29 Jun 2017
    Author Response
    A draft genome sequence assembly, for a different hybrid of sugarcane, appeared shortly after this paper.

    Riaño-Pachón DM and Mattiello L. Draft genome sequencing of the sugarcane hybrid SP80-3280. F1000Research 2017, 6:861 ... Continue reading
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Miller JR, Dilley KA, Harkins DM et al. Initial genome sequencing of the sugarcane CP 96-1252 complex hybrid [version 1; peer review: 2 approved]. F1000Research 2017, 6:688 (https://doi.org/10.12688/f1000research.11629.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 17 May 2017
Views
17
Cite
Reviewer Report 29 Jun 2017
Jeremy R. Shearman, National Center for Genetic Engineering and Biotechnology (BIOTEC), Khlong Luang, Thailand 
Approved
VIEWS 17
The manuscript describes the generation of whole genome shotgun sequence data from two separate DNA preparation methods. The methods for data generation are clearly described and the sample that was used has ample information about its origins publicly available and ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Shearman JR. Reviewer Report For: Initial genome sequencing of the sugarcane CP 96-1252 complex hybrid [version 1; peer review: 2 approved]. F1000Research 2017, 6:688 (https://doi.org/10.5256/f1000research.12560.r23843)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
22
Cite
Reviewer Report 30 May 2017
Paulo Arruda, Center of Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas (UNICAMP), Campinas, Brazil 
Approved
VIEWS 22
The data note reported was produced by 150bp paired-end Illumina sequencing of genomic DNA prepared from the sugarcane variety CP-96-1252. A raw data set of 101 Gbps was generated and made public available.The authors did not present assemblage data which would ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Arruda P. Reviewer Report For: Initial genome sequencing of the sugarcane CP 96-1252 complex hybrid [version 1; peer review: 2 approved]. F1000Research 2017, 6:688 (https://doi.org/10.5256/f1000research.12560.r22855)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (1)

Version 1
VERSION 1 PUBLISHED 17 May 2017
  • Author Response 29 Jun 2017
    Jason Miller, Shepherd University, USA
    29 Jun 2017
    Author Response
    A draft genome sequence assembly, for a different hybrid of sugarcane, appeared shortly after this paper.

    Riaño-Pachón DM and Mattiello L. Draft genome sequencing of the sugarcane hybrid SP80-3280. F1000Research 2017, 6:861 ... Continue reading
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.