ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Data Note
Revised

Draft genome of tule elk Cervus canadensis nannodes

[version 2; peer review: 2 approved]
Previously titled: Draft genome of tule elk Cervus elaphus nannodes
PUBLISHED 11 Dec 2017
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Genomics and Genetics gateway.

Abstract

This paper presents the first draft genome of the tule elk (Cervus elaphus nannodes), a subspecies native to California that underwent an extreme genetic bottleneck in the late 1800s.  The genome was generated from Illumina HiSeq 3000 whole genome sequencing of four individuals, resulting in the assembly of 2.395 billion base pairs (Gbp) over 602,862 contigs over 500 bp and N50 = 6,885 bp. This genome provides a resource to facilitate future genomic research on elk and other cervids.

Keywords

Cervus elaphus nannodes, genome draft, mammalian genome assembly, tule elk

Revised Amendments from Version 1

In this version, the name Cervus elaphus nannodes was changed to Cervus canadensis nannoodes everywhere it appeared in the publication because most people now refer to the elk as Cervus canadensis to differentiate it from Eurasian red deer. Our original publication stated that we were presenting the first Cervidae genome, but this statement has been edited to reflect the recent addition (since our initial submission) of a red deer genome Cervus elaphus hippelaphus available on NCBI. Reference 1 has also been updated to point to this genome. The reported code in the “Bioinformatics processing” section contained an erroneous “SLIDING” parameter for trimmomatic, and this has been deleted to match the correct code on GitHub. Additional information about the quality of the sequencing run was added to the Results. Table 1 was reformatted for easier viewing.

See the authors' detailed response to the review by Rudiger Brauning
See the authors' detailed response to the review by Steve Olsen

Introduction

At the initiation of this project, no genome assembly existed for any member of the deer family (Cerivdae). We therefore sought to generate the first such assembly for the tule elk (Cervus canadensis nannodes). We note that after we completed our project and submitted the intial draft of this manuscript, a full assembly of red deer (Cervus elaphus hippelaphus) became available online1. The present paper presents the first de novo genomic draft of the tule elk. This California-endemic elk subspecies underwent a major genetic bottleneck when its numbers were reduced to as few as three individuals in the 1870s2,3. Although their numbers have increased to >5,000 today4, the historical bottleneck nevertheless left its mark on the elk’s genome, rendering it more homozygous than other elk subspecies.

Our motivation for generating a genomic resource for the tule elk was to create a reference for identifying single nucleotide polymorphisms (SNPs) to develop assays to monitor elk population abundance and for related population genetic applications. Due to the relatively low coverage generated in this work (40X overall with an average of 10X coverage from each individual), we used the MEGAHIT metagenome assembler, which has been found to perform well on low-quality or low-coverage DNA sequencing in bacteria5.

Methods

Sample collection and library prep

Elk were selected from four geographically distinct populations across northern California to maximize genomic diversity (San Luis Reservoir, California Valley, American Canyon, and the San Luis National Wildlife Refuge4). Genomic DNA was extracted from skin biopsies, which were obtained by the California Department of Fish and Wildlife as part of their elk management activities4. We extracted DNA from skin using Qiagen DNeasy blood & tissue kits (QIAGEN Inc., Valencia, CA), according to the manufacturer’s instructions. The DNA was then fragmented via sonication using a Bioruptor (Diagenode, Denville, NJ) to 300 to 400 base pairs (bp) prior to adapter ligation. After verification of fragment size range using agarose gel electrophoresis, NEBNext® Ultra™ DNA Library Prep Kit for Illumina® (New England Biolabs, Inc., Ipswich, MA) was used to ligate Illumina adapters. Multiplexed libraries were prepared using NEBNext Multiplex Oligos for Illumina (New England Biolabs) to individually barcode each of four individual elk. Barcodes were annealed using low-cycle polymerase chain reactions during library preparation. To assess library quality, trace analysis was performed using a Bioanalyzer 2100 (Agilent, Santa Clara, CA) and fluorometric DNA quantitation of libraries was performed using a Qubit fluorometer (Invitrogen, Carlsbad, CA) prior to equilibrating sample concentrations and pooling for sequencing. After library quality control, the four samples were pooled in equimolar concentrations and submitted for paired-end sequencing. Samples were sequenced on an Illumina HiSeq 3000 at the DNA Technologies and Expression Analysis Core of the UC Davis Genome Center.

Bioinformatics processing

Sequencing quality on demultiplexed reads was evaluated using FastQC v0.11.3 (RRID:SCR_014583)6. The Illumina TruSeq3-PE sequencing adapters were removed using Trimmomatic v0.30 (RRID:SCR_011848)7 with the ILLUMINACLIP parameter set to TruSeq3-PE.fa:2:40:15. The TruSeq3-PE.fa sequence was downloaded from https://anonscm.debian.org/cgit/debian-med/trimmomatic.git/plain/adapters/TruSeq3-PE.fa. LEADING and TRAILING parameters were set to 2, resulting in the removal of bases with a quality score of 2 or less according to a phred33 quality scoring matrix. The SLIDINGWINDOW parameter of 4:2 was used to clip reads once the quality score fell below 2 within the window. The MINLENGTH parameter set to 25 dropped any reads that fell below that length due to quality trimming. The demultiplexed, quality-filtered reads were interleaved using the interleave-reads.py script in khmer v2.0 (RRID:SCR_001156)8. The assembly was performed using MEGAHIT v1.0.59 on interleaved quality filtered reads. Genome statistical analysis was done using QUAST v3.0 (RRID:SCR_001228)10. All code used is publicly available at https://github.com/dib-lab/2017-tule-elk/.

Results

We obtained 377,980,276 demultiplexed 150 bp paired-end raw reads, containing a total of 113.394 Gbp of sequence, from which 99.830 Gbp (88%) had quality scores ≥ Q30 (average quality score = 37.2), or approximately 40X coverage of the approximately 3 Gbp tule elk genome. Sequence assembly resulted in the generation of a total genome sequence size of 2.395 Gbp. Reads were assembled into 602,862 contiguous sequences ("contigs") averaging 3,973 bp in length with a minimum contig length of 201 bp. The G+C content of the genome was 41.55%. The N50 was 6,885 bp and maximum contig length was 72,391 bp. Additional assembly statistics are available in Table 1. No contigs (e.g. under a certain size or likely to reflect repeats) were removed from the assembly.

Table 1. Quality metrics on tule elk (Cervus canadensis nannodes) assembly, as generated with QUAST v3.0.

MetricTule elk assembly
# contigs (≥ 200 bp)1,367,218
# contigs ≥ 500 bp602,862
# contigs (≥ 1000 bp)460,702
# contigs (≥ 5000 bp)160,229
# contigs (≥ 10000 bp)51,790
# contigs (≥ 25000 bp)2,606
# contigs (≥ 50000 bp)36
Total length (≥ 200 bp)2,607,088,486
Total length (≥ 1000 bp)2,295,163,580
Total length (≥ 5000 bp)1,531,314,985
Total length (≥ 10000 bp)771,863,493
Total length (≥ 25000 bp)80,157,993
Total length (≥ 50000 bp)2,056,962
Largest contig72,391
Total length2,395,105,945
GC41.55%
N506,885
N753,646
L50103,346
L75222,107
# N's per 100 kbp0

This genome can serve as the basis for further genomic work on tule elk and other cervids, such as the development of a SNP assay to track elk population movement across increasingly developed northern Californian terrain. Furthermore, it is one of the first whole genome assemblies available from the family Cervidae, providing a useful interim reference genome for bioinformatic analyses on other deer and elk species.

Data availability

Raw reads are available in the SRA under the BioProject ID PRJNA345218. The genome draft is available at https://doi.org/10.6084/m9.figshare.5382565.v111.

Code used in this study have been archived at http://doi.org/10.5281/zenodo.88793512

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 15 Sep 2017
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Mizzi JE, Lounsberry ZT, Brown CT and Sacks BN. Draft genome of tule elk Cervus canadensis nannodes [version 2; peer review: 2 approved]. F1000Research 2017, 6:1691 (https://doi.org/10.12688/f1000research.12636.2)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 2
VERSION 2
PUBLISHED 11 Dec 2017
Revised
Views
7
Cite
Reviewer Report 18 Dec 2017
Rudiger Brauning, Invermay Agricultural Centre, AgResearch, Mosgiel, New Zealand 
Approved
VIEWS 7
I'm happy with the ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Brauning R. Reviewer Report For: Draft genome of tule elk Cervus canadensis nannodes [version 2; peer review: 2 approved]. F1000Research 2017, 6:1691 (https://doi.org/10.5256/f1000research.14577.r28939)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Version 1
VERSION 1
PUBLISHED 15 Sep 2017
Views
18
Cite
Reviewer Report 16 Nov 2017
Rudiger Brauning, Invermay Agricultural Centre, AgResearch, Mosgiel, New Zealand 
Approved with Reservations
VIEWS 18
The authors describe the generation of a draft assembly for tule elk in the style of a brief genome announcement. For SNP detection and primer design this assembly is fine. It could e.g. be used in combination with Genotyping by Sequencing on ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Brauning R. Reviewer Report For: Draft genome of tule elk Cervus canadensis nannodes [version 2; peer review: 2 approved]. F1000Research 2017, 6:1691 (https://doi.org/10.5256/f1000research.13682.r27606)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 11 Dec 2017
    Jessica Mizzi, Microbiology Graduate Group, University of California, Davis, 95616, USA
    11 Dec 2017
    Author Response
    Thank you for your review of this paper. Version 2 has been edited to reflect the presence of the red deer genome and a citation to that genome has been ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 11 Dec 2017
    Jessica Mizzi, Microbiology Graduate Group, University of California, Davis, 95616, USA
    11 Dec 2017
    Author Response
    Thank you for your review of this paper. Version 2 has been edited to reflect the presence of the red deer genome and a citation to that genome has been ... Continue reading
Views
20
Cite
Reviewer Report 25 Oct 2017
Steve Olsen, Infectious Bacterial Diseases Research Unit, National Animal Disease Center, ARS-USDA, Ames, IA, USA 
Approved
VIEWS 20
This article describes the generation of a draft genome (40X coverage from 4 animals) of the tule elk (Cervus elaphus nannodes). The research methods are fairly standard for the Illumina sequencing used.  At 602,862 contigs, the genome is very prelminary and will require quite a bit of additional work ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Olsen S. Reviewer Report For: Draft genome of tule elk Cervus canadensis nannodes [version 2; peer review: 2 approved]. F1000Research 2017, 6:1691 (https://doi.org/10.5256/f1000research.13682.r26607)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 11 Dec 2017
    Jessica Mizzi, Microbiology Graduate Group, University of California, Davis, 95616, USA
    11 Dec 2017
    Author Response
    Thank you for your review of this paper.
    Competing Interests: I declare no competing interests.
COMMENTS ON THIS REPORT
  • Author Response 11 Dec 2017
    Jessica Mizzi, Microbiology Graduate Group, University of California, Davis, 95616, USA
    11 Dec 2017
    Author Response
    Thank you for your review of this paper.
    Competing Interests: I declare no competing interests.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 15 Sep 2017
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.