The complete mitogenome sequence of clam ( Corbicula fluminea ) [version 1; peer review: 1 approved, 1 approved with reservations]

The global invasion of the genus Corbicula has caused serious ecological and economic problems. The species of Corbicula fluminea stands out amongst the greater part of freshwater invaders around the world. Here, we sequenced the entire mitogenome of the Corbicula fluminea , which were gathered from Dongting Lake, in the Hunan province of central China. The circular genome is 17,073 bp long. The raw reads were obtained from the platform of Illumina HiSeq 2500, and assembled by the MITObim method followed by alignments to related species. The entire dataset was deposited at the NCBI Short Read Archive via accession number SRR14692229 and NCBI GenBank with accession number MZ231034.1.


Introduction
Clams of the genus Corbicula are currently spread worldwide, and they cause great ecological threats and tremendous impacts in the ecological system (Counts 1981;Gomes et al. 2016;Peñarrubia et al. 2017;Hünicken et al. 2019;Douglass et al. 2020). However, their taxonomic status and systematic status are still unclear (Wang et al. 2018;Haponski 2019;Bodon et al. 2020;Ramli et al. 2020). Corbicula fluminea is an important component of Asian freshwater macrobiota and has always been consumed as food in East Asia (Wang et al. 2014;López-Soriano et al. 2018;Zhang et al. 2019;Sano et al. 2020). In this work, the complete mitochondrial genome of Corbicula fluminea from Dongting Lake was sequenced, and the phylogenetic relationships among Venerida were investigated. These results could contribute to distinguishing the taxonomic placement and systematic status of genus Corbicula in further studies.
After sampling, the living specimens were stored in College of Animal Science and Technology, Hunan Agricultural University, the voucher number is DT1808-118. To obtain the total genomic DNA, the 40mg frozen muscle of the foot was derived from the specimen. DNA was obtained by DNeasy™ Blood & Tissue extraction kit (Qiagen, Hilden, Germany). The library was conducted by Berry Genomics Co. Ltd (Beijing, China) according to the Illumina TruSeq Nano DNA library Prep Kit and the manufacturer's recommendations. DNA fragments were selectively enriched using Illumina PCR Primer (F: 5 0 -AATGATACG GCGACCACCGAGA-3 0 and R: 5 0 -CAAGCAGAAGACGGCATACGAGT-3 0 ) Cocktail in a PCR reaction with the following cycling conditions: 95°C for 3 minutes, followed by 8 cycles of 98°C for 20 seconds, 60°C for 15 seconds and 72°C for 30 seconds, with a final extension step at 72°C for 5 minutes. The raw reads of the mitogenome sequence were obtained from the platform of Illumina HiSeq 2500. The adaptors and low sequencing qualities reads (N bases exceeding 10% and more than 50% of phred quality score ≤ 5) were trimmed and reduced using the program Trimmomatic v 0.38.0 (Bolger et al. 2014).
The sequence reads of untrimmed high-quality were assembled by the program MITObim (Hahn et al. 2013). The annotation was performed using MITOS WebServer (Bernt et al. 2013) for the entire mitogenome of Corbicula fluminea from Dongting Lake, and adjusted manually in Geneious Prime v2020.2.2 based on the published mitogenomes of Venerida species.
To validate our data, the CDS sequences of 19 Venerida species (GenBank accession numbers are shown in Figure 2) were aligned. The phylogenetic analysis among Venerida was investigated using maximum-likelihood (ML) and Bayesian inference (BI) approaches. The ML tree was estimated by RAxML version 8.2.12 (Stamatakis 2014) and the BI tree was estimated by MrBayes version 3.2.7 (Ronquist et al. 2012), with GTR-GAMMA model. RAxML was analyzed with 1,000 bootstrap replicates (-m GTRGAMMAI -f a -x 1 -N 1 -p 1 -N 1000), 1 million generations Markov chain Monte Carlo iterations (nCat=4) were analysed and every 1,000 generations were sampled with the initial 10% of samples removed as "burn-in" in MrBayes. The average standard deviation of split frequencies (<0.01) was used to assess the convergence.
A phylogenetic relationship was estimated based on 13 protein-coding genes sequences from the mitogenome sequences of Corbicula fluminea from Dongting Lake and other 19 Venerida species (Figure 2). The study has provided a complete mitochondrial genome of Corbicula fluminea from Dongting Lake. As a phylogenetic tool, it could contribute to systematic and comparative analysis for resolving evolutionary relationships.

Results
The complete circular mitogenome of Corbicula fluminea from Dongting Lake (GenBank accession no. MZ231034.1) is 17,073 bp in total length. As other Venerida mitogenomes, 37 mitochondrial structural genes were found in the mitogenome of Corbicula fluminea from Dongting Lake, including 13 protein-coding genes, two ribosomal RNA genes (12S and 16S rRNA) and 22 transfer RNA genes (Figure 1). Both are encoded on the positive strand.

Open Peer Review
Are the datasets clearly presented in a usable and accessible format, and the assembly and annotation available in an appropriate subject-specific repository? Yes