Keywords
Genome, assembly, arthropoda, insecta, hemiptera
This article is included in the Genomics and Genetics gateway.
Genome, assembly, arthropoda, insecta, hemiptera
Editorial note (3rd July 2023): Peer review activity for this article has been discontinued following discussion and agreement between the authors and F1000 Editorial Team. As per our policies on discontinuing peer review, the article is now considered equivalent to a preprint and the authors have chosen to submit the manuscript to a journal for peer review and publication elsewhere.
Periodical cicadas in North America spend 13 or 17 years in the larval stage underground, and emerge in very large numbers for 4–6 weeks to mate and lay eggs. This strategy, known as “predator satiation” is intended to ensure that after all predators have eaten as much as possible, most cicadas will survive. (Williams & Simon, 1995). The emergence occurring in prime-numbered years is thought to be a mechanism to avoid competition between species for egg-laying sites and accidental cross-species mating as the emergence of the 13- and 17-year cicadas would only coincide once every 221 years (Tanaka et al., 2009).
The length of time spent in the larval stage is thought to be dependent on a single gene, although this has not yet been demonstrated at the genomic level (Cox & Carlton, 1991).
Complete genome sequences for these two species will assist with studies on taxonomy, longevity, and the timing of long-term larval development.
Wild caught specimens of Magicicada septendecim and Magicicada septendecula from a small premature emergence of Brood X (2017) collected in Newark, Delaware, USA were used in this study. DNA extraction was performed using the Qiagen DNAeasy genomic extraction kit for tissue, using the standard process. A paired-end sequencing library was constructed using the Illumina TruSeq kit, according to the manufacturer’s instructions. The library was sequenced on an Illumina Hi-Seq platform in paired-end, 2 × 150bp format.
The resulting fastq files were trimmed of adapter/primer sequence and low-quality regions with Trimmomatic v0.33 (Bolger et al., 2014). The trimmed sequence was assembled by SPAdes v2.5 (Bankevich et al., 2012) followed by a finishing step using RagTag v1.0.0 (Alonge, 2020) to make additional contig joins based on conserved regions in related insect species: Rhopalosiphum maidis (GCA_003676215), Euschistus heros (GCA_003667255), and Aphis glycines (GCA_009928515). Default parameters were used for all assembly steps.
Annotation was performed using GeneMark-ES v2.0 (Lomsadze et al., 2005). Annotation was performed fully de novo without a curated training set and using default parameters.
The genome assembly for Magicicada septendecim yielded a total sequence length of 1,579,033,894 with an N50 value of 983 kb and 27,124 gene models.
The genome assembly of Magicicada septendecula yielded 1,585,977,997 with an N50 value of 281 kb and 28,651 gene models.
Genome data available from NCBI’s Short Read Archive (SRA):
Magicicada septendecim, Accession number SRR6782667: https://www.ncbi.nlm.nih.gov/sra/SRR6782667
Magicicada septendecula, Accession number SRR6792649: https://www.ncbi.nlm.nih.gov/sra/SRR6792649
Assembled genomes available from NCBI’s Assembly database:
Magicicada septendecim, Accession number GCA_011326945: https://www.ncbi.nlm.nih.gov/assembly/GCA_011326945.1/
Magicicada septendecula, Accession number GCA_011763675: https://www.ncbi.nlm.nih.gov/assembly/GCA_011763675.1/
Harold B. White is now retired from Department of Chemistry and Biochemistry, University of Delaware.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the rationale for creating the dataset(s) clearly described?
Partly
Are the protocols appropriate and is the work technically sound?
Partly
Are sufficient details of methods and materials provided to allow replication by others?
No
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Insect genomics
Is the rationale for creating the dataset(s) clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of methods and materials provided to allow replication by others?
Partly
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: comparative genomics, phylogenomics, population genomics
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 1 16 Mar 21 |
read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)