Keywords
Azadirachta excelsa, chloroplast genome, identification, identified seed stand, long-reads, phylogenetics, gene marker
Azadirachta excelsa (Jack) Jacobs, Kayu bawang (Meliaceae) is economically valuable and widely used by the local community in Bengkulu (Sumatra) as carpentry and construction wood because of its good durability class. However, it still has ambiguous scientific multiple names, such as Azadirachta excelsa, Protium javanicum, and Dysoxylum mollissimum. Additional tools such as molecular approaches can be used to verify whether it is true or not that Kayu bawang is scientifically named as Azadirachta excelsa based on previous morphological identification. This study aimed to construct draft chloroplast genome and verify the scientific name based on molecular identification using a single rbcL gene marker. Genomic DNA was extracted from bark cambium originated from three different provenances in Bengkulu, Indonesia, namely TBT-A, TBT-K, and TBT-S. MinION from Oxford Nanopore Technologies was used to sequence the samples following manufacture protocols SQK-LSK109 yielding 481.6 Mb for TBT-A, 597.4 Mb for TBT-K, and 853.1 Mb for TBT-S, respectively. Generated data were assembled and constructed, namely 58,780 bp (14 tRNAs and 47 encoding genes) for TBT-A, 142,139 bp (4 rRNAs, 24 tRNAs, and 78 encoding genes) for TBT-K, and 84,906 bp (24 tRNAs and 53 encoding genes) for TBT-S. Based on the phylogenetic tree, Azadirachta excelsa from three identified tree stands were placed in the same group with other Azadirachta excelsa accessions.
Azadirachta excelsa, chloroplast genome, identification, identified seed stand, long-reads, phylogenetics, gene marker
Azadirachta excelsa (Jack) Jacobs or known as Kayu bawang (Meliaceae) is a main local wood in Bengkulu Province that is cultivated in private forests, mainly found in North Bengkulu and Central Bengkulu Districts (Depari et al. 2015; Siahaan & Sumadi 2015). Kayu bawang has been used for carpentry wood. Historically, this wood has been a favorite one of the local community as the primary material for building. In addition, the wood is also known for its resistance to attack by destructive organisms (Krisdianto et al. 2015). Despite kayu bawang is widely used for building wood and furniture, but some references are still ambiguous with respect to its scientific naming. Some references mentioned the naming of kayu bawang tree in Bengkulu as Protium javanicum Burm F (Adfa et al. 2013). In some other references, kayu bawang is named differently such as Dysoxylum mollissimum Blume (Ishiguri et al. 2016), Azadirachta excelsa (Jack) M Jacobs (Premono & Lestari 2014). In addition, in the identified seed stand (TBT) in Bengkulu as registered under No. 002/BPSMBK/SSB/2016, No. 001/BPTH.I-3/SSB/2016, and No. 006/P2STH/SSB/2018, kayu bawang is named respectively as Azadirachta exelsa, Dysoxylum molissimum, and Melia excelsa (synonym of Azadirachta excelsa). Based on morphological identification on Herbarium Bogoriense under National Research and Innovation Agency (BRIN), kayu bawang has been confirmed as Azadirachta excelsa (Jack) Jacobs. We used three specimens from different identified seed stands, namely TBT-A, TBT-K, and TBT-S, that were collected with the specimen numbers BO1991351 (TBT-A), BO1991352;1991353;1991354 (TBT- K), and BO1991350 (TBT-S) for genome sequencing using Oxford Nanopore Technologies. The objectives of the research were to assembly long-read DNA sequences and to construct a phylogeny for verifying species identification.
Genomic DNA was extracted from kayu bawang (Azadirachta excelsa) cambium samples which were taken from three different identified seed stands namely Talang Boseng (TBT-A, -3.66890°S, 102.292122°E), Batu Ampar (TBT-K, -4.394333°S, 103.138056°E), and Penyangkak (TBT-S, -3.531333°S, 102.252861°E) in Bengkulu Province, Indonesia. The modified CTAB (cetyl trimethyl ammonium bromide) method was used to extract the genomic DNA from each A. excelsa cambium (Doyle & Doyle 1987). The DNA extract buffer contained 200 μL of 10% CTAB (CAT NO. 194004, LOT NO. QR15741, MP Biomedicals, LLC), 100 μL of 1 M Tris-HCL, 280 μL of 5M NaCl, 40 μL of 0.5 M EDTA, 100 μL of 1% PVP, 280 μL of aquades. 0.1 g cambium mashed using mortar milling was put into a 2 mL tube, 1000 μL of DNA extract buffer that had been incubated at 65°C (60 min), 40 μL of 26% PVP, and 40 μL of β-Mercaptoethanol. The mixture was homogenized using a vortex and subsequently incubated at 65°C (60 min) with rehomogenization performed every 10 min by gently flipping the tube. The cooled mixture was centrifuged for 10 min, at 10,000 rpm. The supernatant was transferred to a 1.5 mL tube, and 500 μL of chloroform: isoamyl alcohol solution (24:1) and 10 μL of phenol were added and centrifuged for 10 min, at 10,000 rpm. Separation of the mixture was done again by adding 500 μL of chloroform: isoamyl alcohol solution (24:1). The obtained supernatant was put into a 1.5 mL tube, then 500 μL cold isopropanol and 250 μL NaCl were added. The mixture was stored in a freezer at 4°C for 1 day so that the DNA precipitation process occurring. After DNA precipitation, the mixture was centrifuged for 10 min, at 10,000 rpm. The mixture was removed gradually to retrieve the DNA pellet. DNA pellet washing using 500 μL of 70% ethanol, centrifuged for 10 min, at 10,000 rpm. The DNA pellet was separated from the mixture and dried in a desiccator containing silica gel (±20 min). 50 μL of TE RNAse buffer was added to dissolve the DNA pellet. The quality of extracted genomic DNA was evaluated using 1% agarose gel electrophoresis. The quality and quantity of the extracted genomic DNA were measured using NanoPhotometer® NP80 (IMPLEN) and Qubit 1.0 Fluorometer (Thermo Fisher Scientific) with the Qubit dsDNA BR assay kit. The library preparation of long-read sequencing was performed using Oxford Nanopore Technologies (ONT) manufactured protocol SQK-LSK109 version NBE_9065_v109_revZ_14Aug2019 with the MinION R9.4.1 flow cell (FLO-MIN106D, LOT 11001352) on a MinION Mk1C sequencer (Oxford Nanopore Technologies).
The chloroplast genome assembly for TBT-A, TBT-K, and TBT-S (Accession number DRA015537; https://ddbj.nig.ac.jp/resource/bioproject/PRJDB14301) (IPB University 2023) was performed using Galaxy Server (https://usegalaxy.eu/, RRID: SCR_006281, MIT License) v.23.1.1.dev0 following long-read data with some modifications (Wang et al. 2018). Long-read data for each sample was quality checked using NanoPlot (https://usegalaxy.eu/, RRID:SCR_024128, MIT License) v1.41.0 (De Coster et al. 2018). Those clean reads data were aligned with the reference NC_023792.1 (Azadirachta indica) using Minimap2 (https://usegalaxy.eu/, RRID:SCR_018550, MIT License) v.2.26 (Li 2018) and SAMTools (https://usegalaxy.eu/, RRID:SCR_002105, MIT License) v.2.05 (Li et al. 2009). Chloroplast reads were assembled using Canu Assembler (https://usegalaxy.eu/, RRID:SCR_015880, GPLv2 and others) v.2.1.1+galaxy0 (Koren et al. 2017) involving parameters such as technology Nanopore, estimated genome size 160k, and min read length 1000. Medaka Consensus Pipeline (https://usegalaxy.eu/, RRID:SCR_005857, Mozilla Public License 2.0) v1.7.2+galaxy1 was used to polish the assembly result (Oosterbroek et al. 2021). Geseq (https://chlorobox.mpimp-golm.mpg.de/geseq.html, RRID:SCR_017336, MIT License) (Tillich et al. 2017) was used to annotate the assembled chloroplast genome using all Meliaceae reference in NCBI RefSeq and visualized using OGDRaw (https://chlorobox.mpimp-golm.mpg.de/OGDraw.html, RRID:SCR_017337, MIT License) in the CHLOROBOX (https://chlorobox.mpimp-golm.mpg.de/index.html, MIT License) (Greiner et al. 2019). Phylogenetic analysis was performed using MEGAX (https://www.megasoftware.net/, RRID:SCR_023471, MIT License) v10.2.2 (Kumar et al. 2018) with Maximum Likelihood method, Tamura-3 model and bootstrap value of 1000 replications. rbcL gene marker was used to construct the phylogenetic tree from TBT-A, TBT-K, and TBT-S samples (Owenia vernicocsa (DQ238063.1) as an outgroup) (Muellner et al. 2006).
The library yielded 481.6 Mb for TBT-A, 597.4 Mb for TBT-K, and 853.1 Mb for TBT-S. Based on the statistics, TBT-A has 3,339 bp of N50, 3,103 bp for TBT-K, and 3,882 bp for TBT-S (Figure 1). Chloroplast draft genomes of the TBT-A sample (36% GC content) produces 58,780 bp containing 14 tRNAs and 47 encoding genes, TBT-K sample (37% GC content) produced 142,139 bp containing 4 rRNAs, 24 tRNAs, and 78 encoding genes, also TBT-S sample (36% GC content) produced 84,906 bp containing 24 tRNAs and 53 encoding genes (Table 1). Based on the constructed phylogenetic tree, Azadirachta excelsa from three identified tree stands (TBT-A, TBT-K, and TBT-S) were placed in the same group with other Azadirachta excelsa accessions (Figure 2). We can confirm that all those samples belong to Azadirachta excelsa.
Gene groups | TBT-A genes | TBT-K genes | TBT-S genes |
---|---|---|---|
RNAs transfer | trnD-GUC, trnE-UUC, trnF-GAA, trnG-GCC, trnL-UAA*, trnM-CAU, trnP-UGG, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-UAC*, trnW-CCA, trnY-GUA | trnA-UGC*, trnC-GCA, trnD-GUC, trnE-UUCd*, trnF-GAA, trnG-GCC, trnH-GUG, trnK-UUU*, trnL-CAA, trnL-UAA*, trnL-UAG, trnM-CAU, trnN-GUU, trnP-UGG, trnQ-UUG, trnR-ACG, trnR-UCU, trnS-CGAd*, trnT-GGU, trnT-UGU, trnV-GAC, trnV-UACd*, trnW-CCA, trnY-GUA | trnC-GCA, trnD-GUC, trnE-UUCd, trnF-GAA, trnfM-CAU, trnG-GCC*, trnH-GUG, trnI-CAU, trnK-UUU*, trnL-UAA*, trnM-CAU, trnP-GGG, trnP-UGG, trnQ-UUG, trnR-UCU, trnS-CGA*, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGUd, trnV-UAC*, trnV-fragment, trnW-CCA, trnY-GUA |
RNAs ribosomal | rrn4.5, rrn5, rrn5-fragment, rrn16, rrn23 | ||
Small subunit ribosomal proteins (SSU) | rps3, rps4, rps8, rps11, rps12-fragment, rps14, rps18, rps19 | rps2, rps3, rps4, rps7, rps8, rps11, rps12*, rps14, rps15, rps16*, rps18, rps19 | rps2, rps4, rps12, rps12-fragment, rps14, rps16d*, rps18, rps19 |
Large subunit ribosomal proteins (LSU) | rpl2*, rpl14, rpl16*, rpl20, rpl22, rpl23, rpl33, rpl36 | rpl2d*, rpl14, rpl16*, rpl20, rpl22, rpl23, rpl32, rpl33, rpl36 | rpl2d*, rpl20, rpl22, rpl23, rpl33 |
RNA polymerase | rpoA | rpoA, rpoB, rpoC2, rpoC1* | rpoB, rpoC1*, rpoC2 |
ATP synthase | atpB, atpE | atpA, atpB, atpE, atpF*, atpH, atpI | atpA, atpB, atpE, atpFd*, atpH, atpI |
Subunit of Photosystem I | psaA, psaB, psaI, psaJ | psaA, psaB, psaC, psaI, psaJ | psaA, psaB, psaI, psaJ |
Subunit of Photosystem II | psbB, psbC, psbD, psbE, psbF, psbH, psbJ, psbL, psbT, psbZ | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbT, psbZ | psbA, psbB, psbC, psbD, psbE, psbF, psbI, psbJ, psbK, psbL, psbM, psbZ |
Subunit of rubisco | rbcL | rbcL | rbcL |
NADH dehydrogenase | ndhC, ndhJ, ndhK | ndhA*, ndhB*, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK, | ndhC, ndhJ, ndhK |
Subunit of cytochrome b/f complex | petA, petB*, petD*, petG, petL | petA, petB*, petD*, petG, petL, petN | petA, petG, petL, petN |
Photosystem assembly factors | pafI**, pafII | pafII, pafI** | pafId**, pafII |
Cytochrome c synthesis | ccsA | ||
Photosystem biogenesis factor | pbf1 | pbf1 | |
Transitional initiation factor | infA | infA | |
ATP-dependent protease subunit P | clpP1** | clpP1** | clpP1** |
Fatty metabolic acid | accD | accD | accD |
Carbon metabolism | cemA | cemA | cemA |
Maturase | matK | matK | |
Conserved open reading frames | ycf1, ycf2 | ycf2 |
Efratenta Katherina Depari: Conceptualization, Data Curation, Formal Analysis, Funding Acquisition, Investigation, Methodology, Resources, Visualization, Writing-Original Draft;
Nurheni Wijayanto: Investigation, Supervision, Writing-Review & Editing;
Lina Karlinasari: Supervision, Validation, Writing-Review & Editing;
Rafi M: Resources, Supervision, Writing-Review & Editing;
Iskandar Zulkarnaen Siregar: Conceptualization, Funding Acquisition, Methodology, Resources, Supervision, Validation, Writing-Review & Editing.
DNA Data Bank of Japan (DDBJ): Characterization of kayu bawang in agroforestry system in Bengkulu: species name, wood quality, and metabolite profile. Accession number DRA015537; https://ddbj.nig.ac.jp/resource/bioproject/PRJDB14301 (IPB University 2023).
This project contains the following underlying data:
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
Galaxy server available from: https://usegalaxy.eu/ (MIT license).
CHLOROBOX available from: https://chlorobox.mpimp-golm.mpg.de/index.html (MIT license).
MEGAX available from: https://www.megasoftware.net/ (MIT license).
Special thanks to the Forest Genetics and Molecular Forestry Laboratory, Department of Silviculture, Faculty of Forestry and Environment, IPB University and Molecular Science Laboratory, Advanced Research Laboratory, IPB University, Bogor, Indonesia, for providing lab facilities during this study.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Are the rationale for sequencing the genome and the species significance clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of the sequencing and extraction, software used, and materials provided to allow replication by others?
Yes
Are the datasets clearly presented in a usable and accessible format, and the assembly and annotation available in an appropriate subject-specific repository?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: My research area is plant genetics especially soybean and wild soybean, analysis of whole genome sequence, genotyping-by-sequencing, and SNP genotyping, GWAS and QTL analysis using SNP data.
Are the rationale for sequencing the genome and the species significance clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of the sequencing and extraction, software used, and materials provided to allow replication by others?
Yes
Are the datasets clearly presented in a usable and accessible format, and the assembly and annotation available in an appropriate subject-specific repository?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Genomics, Plant systematics.
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 1 05 Apr 24 |
read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)