ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Correspondence

Comment on Falade et al. (2016) DNA-barcoding of Clarias gariepinus, Coptedon zillii and Sarotherodon melanotheron from Southwestern Nigeria

[version 1; peer review: 1 approved, 1 approved with reservations]
PUBLISHED 09 Nov 2016
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Phylogenetics collection.

Abstract

A publication by Falade et al. was selected for discussion by a Naturalis Biodiversity Center-Leiden University Journal Club. The study focused on the identification of fish from Southwestern Nigeria using a DNA barcoding approach. Questions raised during the discussion led to a reanalysis and reinterpretation of the data presented. The authors characterize the process of deriving a taxonomic identification from their sequence data as straightforward, but we were concerned that their approach made it nearly impossible to fail to obtain a taxonomic name for each sequence. The value of sophisticated DNA taxonomy, as well as the pitfalls of its naïve application, are discussed. We suggest that journal discussion groups may be an untapped resource for expanding rigorous peer review, particularly for journals that have adopted an open review model.

Keywords

Barcode, DNA taxonomy, FISH-BOL, peer review, scientific publishing

DNA sequence data has become widely accepted as a useful tool for taxonomic determination and discovery13. But the potential pitfalls of DNA taxonomy in operation have been forewarned for some time410.

The DNA barcode itself is simply a standard region selected to facilitate comparison11. A library built of many such sequences and based on a gene evolving at a rate that minimizes variation within and maximizes variation between species becomes a powerful taxonomic resource5. But the journey from DNA barcode sequence to species determination still requires critical application, particularly when applied to taxa or regions that are not currently well represented in sequence databases.

Falade et al.12 obtained DNA sequences for sixteen individual fish from Southwest Nigeria, a region with relatively sparse coverage in sequence databases. Such data are valuable because broad geographic and taxonomic representation provide insight into genetic diversity within taxonomic groups and help us to refine hypotheses of species circumscription and phylogenetic relationships.

Falade et al.,12 sequenced each specimen for the standard animal DNA barcode region cytochrome oxidase I (COI) and a region of the 16S mitochondrial ribosome. The authors queried their sequences against both the BOLD Systems (RRID: SCR_004278; boldsystems.org/index.php/IDS_OpenIdEngine) and NCBI GenBank (RRID: SCR_004860; BLASTN, RRID: SCR_001598; blast.ncbi.nlm.nih.gov/Blast.cgi) databases (because BOLD does not include 16S, these sequences were only compared to GenBank). Although the authors claim that “this resulted in straightforward identification”, we take a more nuanced view on their results.

The BOLD identification engine and BLASTN comparison with GenBank work differently and were created for different purposes1315; only BOLD is specifically intended to be used as a taxonomic identification tool, while BLASTN assesses sequence similarity. BLASTN will always return the most similar sequences in GenBank. BOLD is more discriminating, since it is limited to a handful of specific loci and uses similarity thresholds to assess whether or not a query sequence can be matched to identified sequences in the database with high confidence. BOLD will alert the user when it determines that no confident identification could be made. DNA-based identification is complicated by the fact that both BOLD and GenBank include misidentified sequences16.

BOLD failed to identify with confidence any of the sixteen COI sequences. Eight were classified as probably belonging to one of a handful of possible species, while the rest received no hit. From this, we infer that Falade et al. made their taxonomic determinations based almost entirely on BLASTN results. As reported (Table 1), all but one of these were scored as 98–99% identical to their top GenBank hit with the remaining sequence (KX231778; Coptodon_zilli_odooba_1) scoring 86% identical.

Table 1. Sequences from Falade et al.12 queried using the BOLD and GenBank databases.

Top BOLD hit and BOLD identification note summarize results from BOLD. Top Blast hit and Sequence name specify the best match in GenBank (excluding the Falade et al. sequences) according to BLASTN, with the Blast metrics Query cover and Ident. See also Table 2 in Falade et al. Note that BOLD contains no 16S data, so these sequences are listed as NA (not applicable).

Accession no.LocusSpecimen voucher no.Top BOLD
hit
BOLD identification noteTop Blast hitSequence nameQuery
cover
IdentBlast note
KX231778COICoptodon_zilli_odooba_1No HitJX173760.1Tilapia zillii isolate MAB08
cytochrome oxidase
subunit I (COI) gene,
partial cds; mitochondrial
7686
KX231779COICoptodon_zilli_odooba_2Top Hit:
Chordata -
Cichliformes
- Tilapia zillii
(99.65%)
A species level match could not
be made, the queried specimen
is likely to be one of the following:
Tilapia zillii, Coptodon zillii,
Coptodon sp., Oreochromis
mossambicus, Coptodon rendalli,
Tilapia guineensis
KM658974.1Coptodon zillii
mitochondrion, complete
genome
8799
KX231780COICoptodon_zilli_odooba_3Top Hit:
Chordata -
Cichliformes
- Tilapia zillii
(99.44%)
A species level match could not
be made, the queried specimen
is likely to be one of the following:
Tilapia zillii, Coptodon zillii,
Coptodon sp., Oreochromis
mossambicus, Coptodon rendalli
KM658974.1Coptodon zillii
mitochondrion, complete
genome
8899
KX231781COISarotherodon_
melanotheron_odooba_4
No HitJF894132.1Sarotherodon melanotheron
mitochondrion, complete
genome
9298
KX231782COISarotherodon_
melanotheron_odooba_5
No HitJF894132.1Sarotherodon melanotheron
mitochondrion, complete
genome
9298
KX231783COIClarias_gariepinus_
odooba_6
No HitKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9299
KX231784COIClarias_gariepinus_
odooba_7
Top Hit:
Chordata -
Siluriformes
- Clarias
gariepinus
(99.62%)
A species level match could not
be made, the queried specimen
is likely to be one of the following:
Clarias gariepinus, Clarias sp.
NM-2010
KT001082.1Clarias gariepinus
mitochondrion, complete
genome
9099
KX231785COIClarias_gariepinus_
odooba_8
No HitKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9299
KX231786COIClarias_gariepinus_
odooba_9
No HitKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9299
KX231787COIClarias_gariepinus_
odooba_10
Top Hit:
Chordata -
Siluriformes
- Clarias
gariepinus
(100%)
A species level match could not
be made, the queried specimen
is likely to be one of the following:
Clarias gariepinus, Clarias sp.
NM-2010
KT001082.1Clarias gariepinus
mitochondrion, complete
genome
9299
KX231788COIClarias_gariepinus_
odooba_11
No HitKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9098
KX231789COIClarias_gariepinus_
asejire_12
No HitKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9098
KX231790COIClarias_gariepinus_
asejire_13
Top Hit:
Chordata -
Siluriformes
- Clarias
gariepinus
(99.84%)
A species level match could not
be made, the queried specimen
is likely to be one of the following:
Clarias gariepinus, Clarias sp.,
Clarias magur, Clarias cf.
stappersii, Clarias ngamensis
JQ699203.1Clarias gariepinus isolate
CLGP5 cytochrome
oxidase subunit I (COI)
gene, partial cds;
mitochondrial
9399
KX231791COIClarias_gariepinus_
asejire_14
Top Hit:
Chordata -
Siluriformes
- Clarias
gariepinus
(100%)
A species level match could not
be made, the queried specimen
is likely to be one of the following:
Clarias gariepinus, Clarias sp.,
Clarias magur, Clarias cf.
stappersii, Clarias ngamensis
KX619412.1Clarias gariepinus
cytochrome oxidase
subunit 1 (COI) gene,
partial cds; mitochondrial
9399
KX231792COIClarias_gariepinus_
asejire_15
Top Hit:
Chordata -
Siluriformes
- Clarias
gariepinus
(100%)
A species level match could not
be made, the queried specimen
is likely to be one of the following:
Clarias gariepinus, Clarias sp.,
Clarias magur, Clarias cf.
stappersii, Clarias ngamensis
JQ699201.1Clarias gariepinus isolate
CLGP3 cytochrome
oxidase subunit I (COI)
gene, partial cds;
mitochondrial
9399
KX231793COIClarias_gariepinus_
asejire_16
No HitKX619412.1Clarias gariepinus
cytochrome oxidase
subunit 1 (COI) gene,
partial cds; mitochondrial
9299
KX24327616SCoptodon_zilli_odooba_1NAKM658974.1Coptodon zillii
mitochondrion, complete
genome
9399Top three
hits from
source
publication,
fourth hit
reported
KX24327716SCoptodon_zilli_odooba_2NAKM658974.1Coptodon zillii
mitochondrion, complete
genome
9399Top hit this
sequence,
second hit
reported
KX24327816SCoptodon_zilli_odooba_3NAGQ168017.1Tilapia aff. zillii 'Kisangani'
isolate J72 16S ribosomal
RNA gene, partial
sequence; mitochondrial
9099Top three
hits from
source
publication,
fourth hit
reported
KX24327916SSarotherodon_
melanotheron_odooba_4
NAJF894132.1Sarotherodon melanotheron
mitochondrion, complete
genome
9399Top two hits
from source
publication,
third hit
reported
KX24328016SSarotherodon_
melanotheron_odooba_5
NAJF894132.1Sarotherodon melanotheron
mitochondrion, complete
genome
8999Top two hits
from source
publication,
third hit
reported
KX24328116SClarias_gariepinus_
odooba_6
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9799Top three
hits from
source
publication,
fourth hit
reported
KX24328216SClarias_gariepinus_
odooba_7
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9499Top six hits
from source
publication,
seventh hit
reported
KX24328316SClarias_gariepinus_
odooba_8
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9499Top four hits
from source
publication,
fifth hit
reported
KX24328416SClarias_gariepinus_
odooba_9
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9499Top six hits
from source
publication,
seventh hit
reported
KX24328516SClarias_gariepinus_
odooba_10
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
8499Top four hits
from source
publication,
fifth hit
reported
KX24328616SClarias_gariepinus_
odooba_11
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9293Top two hits
from source
publication,
third hit
reported
KX24328716SClarias_gariepinus_
asejire_12
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
5195Top six hits
from source
publication,
seventh hit
reported
KX24328816SClarias_gariepinus_
asejire_13
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9499Top four hits
from source
publication,
fifth hit
reported
KX24328916SClarias_gariepinus_
asejire_14
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9499Top five hits
from source
publication,
sixth hit
reported
KX24329016SClarias_gariepinus_
asejire_15
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
8499Top four hits
from source
publication,
fifth hit
reported
KX24329116SClarias_gariepinus_
asejire_16
NAKT001082.1Clarias gariepinus
mitochondrion, complete
genome
9293Top two hits
from source
publication,
third hit
reported

To view the results in context, we downloaded from BOLD all COI sequences identified as one of the three species specified by Falade et al. [search ‘Taxonomy’ for Clarias gariepinus, Sarotherodon melanotheron, and Coptodon zillii (the latter also under the synonym Tilapia zillii)]. These sequences were combined with the Falade et al. data and initially aligned using MAFFT version 7.18717 with manual adjustments made using Mesquite version 3.1018 (mesquiteproject.wikispaces.com/). A phylogenetic analysis was performed using RAxML version 8.2.819. Initial alignment and phylogenetic analysis were performed through the CIPRES Science Gateway version 3.320 (RRID: SCR_008439; phylo.org/). Alignment required reversing or reverse-complementing some of the sequences from Falade et al. The problematic sequence KX231778 could not be satisfactorily aligned with the others and had to be excluded from the tree. The remaining COI sequences did cluster with other GenBank sequences in such a way as to suggest the remaining taxonomic determinations reported by Falade et al. are credible.

Another anomalous sequence is KX243287 (Clarias_gariepinus_asejire_12), a 16S sequence approximately twice the length of the others. We have no explanation for this.

The evidence presented by Falade et al. is not sufficient to determine at least the COI sequence KX231778. The method applied by Falade et al. made it nearly impossible to fail to obtain a taxonomic name for each sequence. This is a scientific flaw, and an example of the uncritical application of DNA taxonomy.

This paper was discussed as part of a regular journal discussion group offered by the Endless Forms research group at Naturalis Biodiversity Center, which involves students in the Evolution, Biodiversity, and Conservation program at Leiden University. Similar journal-article-based discussion groups can be found at many universities and Natural History Museums. We support the rationale behind open review journals (blog.f1000research.com/2014/05/21/what-is-open-peer-review/) and therefore decided to share the sense of our discussion with the broader community. We would like to encourage other journal discussion groups to include open review articles in their literature discussions, and consider sharing summaries of their discussions as article comments. Healthy science literature depends on a robust pool of potential reviewers21. We see journal discussion groups as an untapped resource for providing feedback on scientific literature, and also as incubators for developing student-scientists into constructive and rigorous peer reviewers.

Dataset 1.Aligned COI sequence data.
(FASTA format)
Dataset 2.Phylogenetic tree.
(RAxML tree, NEWICK format)

Data availability

F1000Research: Dataset 1. Aligned COI sequence data, 10.5256/f1000research.9829.d14138322

F1000Research: Dataset 2. Phylogenetic tree, 10.5256/f1000research.9829.d14138423

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 09 Nov 2016
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Miller JA, van Riemsdijk I, Khalik MZ et al. Comment on Falade et al. (2016) DNA-barcoding of Clarias gariepinus, Coptedon zillii and Sarotherodon melanotheron from Southwestern Nigeria [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2016, 5:2654 (https://doi.org/10.12688/f1000research.9829.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 09 Nov 2016
Views
10
Cite
Reviewer Report 30 Jan 2017
Dora Davies, Facultad de ciencias Naturales, Universidad Nacional de Salta, Salta, Argentina 
Carolina Davies, IPE-CONICET, Universidad Nacional de Salta, Salta, Argentina 
Approved
VIEWS 10
The title is appropriate for the content of the article. However, there is a spelling mistake since the correct genus is Coptodon, instead of Coptedon as the authors wrote it. The abstract concisely summarizes the ideas presented in the article. ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Davies D and Davies C. Reviewer Report For: Comment on Falade et al. (2016) DNA-barcoding of Clarias gariepinus, Coptedon zillii and Sarotherodon melanotheron from Southwestern Nigeria [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2016, 5:2654 (https://doi.org/10.5256/f1000research.10597.r19277)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
9
Cite
Reviewer Report 27 Jan 2017
Shadi Shokralla, Biodiversity Institute of Ontario and Department of Integrative Biology, University of Guelph, Guelph, ON, Canada 
Approved with Reservations
VIEWS 9
The authors discussed many aspects in Falade et al. (2016) article but their explanation didn't convince me with their findings. For example:
  • The authors focused on CO1 data and almost ignored 16S data.
     
... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Shokralla S. Reviewer Report For: Comment on Falade et al. (2016) DNA-barcoding of Clarias gariepinus, Coptedon zillii and Sarotherodon melanotheron from Southwestern Nigeria [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2016, 5:2654 (https://doi.org/10.5256/f1000research.10597.r19279)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 09 Nov 2016
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.