Keywords
Stevia rebaudiana, Sweetleaf, genome, assembly, annotation
This article is included in the Genomics and Genetics gateway.
Stevia rebaudiana, Sweetleaf, genome, assembly, annotation
The Sweetleaf (Stevia rebaudiana: Asteraceae) is cultivated commercially for use as a sweetener. The sweetness is due to various steviol glycosides, primarily stevioside and rebaudioside. These compounds have 200-300X the sweetness of sugar (Abdullateef & Osman, 2012) but have no calories. The market for raw Stevia and derived products is expected to exceed 1B USD by 2021 (International Stevia Council, 2017).
Stevia rebaudiana has been used as a sweetener for centuries in Brazil and Paraguay (Misra et al., 2011). Botanist Moisés Santiago Bertoni first described the plant as growing in eastern Paraguay and noted its use as a sweetener (Bertoni, 1899).
Chemists Bridel and Lavielle isolated the glycosides stevioside and rebaudioside that give the leaves their sweet taste (Bridel & Lavielle, 1931). The chemical structures of the aglycone steviol and its glycoside have been solved (Mosettig & Nes, 1955).
A complete genome sequence for this species will assist with discovering markers for crop yields, disease and drought resistance, and determining the biochemical pathways for the relevant metabolites.
A single commercially grown Stevia rebaudiana plant was used for this study (Behnke Nurseries, Beltsville, MD, USA). DNA extraction was performed on tissue from a single leaf using the Qiagen DNAeasy genomic extraction kit for plants, using the standard process. A paired-end sequencing library was constructed using the Illumina TruSeq kit, according to the manufacturer’s instructions. The library was sequenced on an Illumina Hi-Seq platform in paired-end, 2 × 150bp format.
The resulting fastq files were trimmed of adapter/primer sequence and low-quality regions with Trimmomatic v0.33 (Bolger et al., 2014). The trimmed sequence was assembled by SPAdes v2.5 (Bankevich et al., 2012) followed by a finishing step using RagTag v1.0.0 (Alonge, 2020) to make additional contig joins based on conserved regions in related plant species: Erigeron canadensis (GCA_010389155), Mikania micrantha (GCA_009363875), and Helianthus annuus (GCA_002127325). Default parameters were used for all assembly steps.
Annotation was performed using GeneMark-ES v2.0 (Lomsadze et al., 2005). Annotation was performed fully de novo without a curated training set and default parameters.
The genome assembly yielded a total sequence length of 411,383,069 bp over 55,557 scaffolds with an N50 of 37,276,437. The GeneMark-ES annotation resulted in 24,994 genes.
Raw and assembled data is publicly available via GenBank:
Raw genome of Stevia rebaudiana, Accession number SRR6792730: https://www.ncbi.nlm.nih.gov/sra/?term=SRR6792730
Assembly of Stevia rebaudiana, Accession number ASM993640v1: https://www.ncbi.nlm.nih.gov/assembly/GCA_009936405.1/
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the rationale for creating the dataset(s) clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of methods and materials provided to allow replication by others?
Yes
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Plant systematics.
Is the rationale for creating the dataset(s) clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of methods and materials provided to allow replication by others?
Yes
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: I am a collaborator with Staci Pirro and we may publish a paper together in the future.
Reviewer Expertise: Sanger sequencing; mycology
Is the rationale for creating the dataset(s) clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Partly
Are sufficient details of methods and materials provided to allow replication by others?
Partly
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Computational genomics, bioinformatics
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | |||
---|---|---|---|
1 | 2 | 3 | |
Version 1 21 Jul 20 |
read | read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)