The complete chloroplast genome of a centennial olive tree (Olea europaea, Oleaceae) from the southern Peruvian coast

Carla L. Saldaña; Luz S. Chura-Llanos; Richard Estrada; Elizabeth Fernandez; Franz Zirena Vilca; Juan C. Guerrero-Abad; Carlos A. Amasifuen-Guerra; Dina L. Gutierrez-Reynoso; Pedro Injante; Carlos I Arbizu

doi:10.12688/f1000research.155147.1


Genome Note


[version 1; peer review: 1 approved, 1 approved with reservations]

Carla L. Saldaña¹,², Luz S. Chura-Llanos³, Richard Estrada¹, Elizabeth Fernandez⁴, Franz Zirena Vilca³, Juan C. Guerrero-Abad⁴,⁵, Carlos A. Amasifuen-Guerra⁴,⁵, Dina L. Gutierrez-Reynoso⁴, Pedro Injante¹, Carlos I Arbizu¹,⁴

PUBLISHED 04 Dec 2024


¹ Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria, Av. La Molina 1981, Lima, 15024, Peru
² Instituto de Investigación en Ganadería y Biotecnología, Universidad Nacional Toribio Rodriguez de Mendoza de Amazonas, Chachapoyas, Amazonas, 01001, Peru
³ Laboratorio de Contaminantes Orgánicos y Ambiente del, Universidad Nacional de Moquegua, Urb. Ciudad Jardín-Pacocha-Ilo, Moquegua, 18610, Peru
⁴ Dirección de Recursos Genéticos y Biotecnología, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima, 15024, Peru
⁵ Facultad de Ingeniería y Ciencias Agrarias, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Cl. Higos Urco 342, Amazonas, 01001, Peru

Carla L. Saldaña
Roles: Data Curation, Formal Analysis, Investigation, Methodology, Writing – Original Draft Preparation, Writing – Review & Editing

Luz S. Chura-Llanos
Roles: Formal Analysis, Investigation, Methodology, Writing – Review & Editing

Richard Estrada
Roles: Formal Analysis, Investigation, Methodology, Writing – Review & Editing

Elizabeth Fernandez
Roles: Investigation, Writing – Review & Editing

Franz Zirena Vilca
Roles: Investigation, Resources, Writing – Review & Editing

Juan C. Guerrero-Abad
Roles: Investigation, Methodology, Writing – Review & Editing

Carlos A. Amasifuen-Guerra
Roles: Investigation, Writing – Review & Editing

Dina L. Gutierrez-Reynoso
Roles: Investigation, Writing – Review & Editing

Pedro Injante
Roles: Investigation, Writing – Review & Editing

Carlos I Arbizu
Roles: Investigation, Methodology, Supervision, Writing – Original Draft Preparation, Writing – Review & Editing


Abstract

Olive (Olea europaea Linnaeus 1753) is one of the most valuable fruit trees and an important edible-oil crop worldwide. The chloroplast (cp) genome of an olive tree (Olea europaea) from the arid southern Peruvian coast was obtained for the first time. High-quality genomic DNA was used to generate libraries for Illumina HiSeq paired-end sequencing. The cp genome is 155,886 bp in length and contains a large single-copy (LSC) region of 86,610 bp and a small single-copy (SSC) region of 17,790 bp, separated by two inverted repeat (IR) regions (25,741 bp each). The cp genome contains 124 genes: 80 protein-coding genes, 36 tRNA genes, and eight rRNA genes. Phylogenetic analysis showed that this olive tree is sister to O. europaea subsp. maroccana (tribe Oleeae). This study presents the first overview of the chloroplast genome organization and phylogenetics of O. europaea from Peru, offering valuable insights for genetic and evolutionary research in the genus Olea.

Keywords

plastome, NGS, coastal zone, phylogenomics

Corresponding author: Carlos I Arbizu

Competing interests: No competing interests were disclosed.

Grant information: This research was funded by “Universidad Nacional de Moquegua”. C.L.S. and C.I.A. were funded by “Programa Presupuestal No. 068” of the Peruvian Government.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2024 Saldaña CL et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Saldaña CL, Chura-Llanos LS, Estrada R et al. The complete chloroplast genome of a centennial olive tree (Olea europaea, Oleaceae) from the southern Peruvian coast [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2024, 13:1478 (https://doi.org/10.12688/f1000research.155147.1) First published: 04 Dec 2024, 13:1478 (https://doi.org/10.12688/f1000research.155147.1) Latest published: 04 Dec 2024, 13:1478 (https://doi.org/10.12688/f1000research.155147.1)

Introduction

The olive, Olea europaea (Oleaceae), is a subtropical tree distributed on most continents in tropical and temperate environments (Dupin et al., 2020). The genus Olea comprises about 40 Old World species (Jensen et al., 2002), and the majority of Oleaceae species are economically important; most olive fruits are used for oil extraction. Olive oil is valued for its nutritional and health advantages compared with other vegetable oils (Rallo et al., 2000). To date, ten olive tree chloroplast (cp) genomes have been reported, from the Mediterranean region (Besnard et al., 2011), Italy, Spain, China (Niu et al., 2020), and Jordan (Haddad et al., 2021). However, no olive tree chloroplast genome from Peru has been reported. In this study, we sequenced, assembled, and annotated for the first time the complete chloroplast genome of a centennial olive tree from Peru (Figure 1) using next-generation sequencing (NGS), providing valuable information for genetic and evolutionary studies in the genus Olea.

Figure 1. Picture of Olea europaea, a centennial olive tree from Moquegua, Peru.

Methods

Plant materials and extraction of genomic DNA

Fresh young leaves were collected in the “El Algarrobal” district, Ilo province, Moquegua region (latitude -17.61294, longitude -71.27136). The specimen (MOQ001169) was deposited at the National University of Moquegua Herbarium (http://www.hmoqueguensis.unam.edu.pe/, Hibert Huaylla-Limachi hmoqueguensis@unam.edu.pe) under voucher Nro. 001169. DNA was extracted using the CTAB method (Doyle and Doyle 1990) with minor modifications for this species. DNA quality was evaluated on a 1% agarose gel, and DNA quantity was measured by fluorescence using the Qubit™ 4 Fluorometer (Invitrogen, Waltham, MA, USA).

DNA sequence and genome assembly

Paired-end libraries were prepared with the Nextera XT DNA Library Preparation Kit (Illumina, San Diego, CA, USA) and sequenced (PE 150) on the Illumina HiSeq 2500 platform. We removed adapters and verified read quality with Trim Galore (Martin, 2011). The cp genome was assembled with GetOrganelle v1.7.2 (Jin et al., 2020) with the arguments -F embplant_pt -R 15 --reduce-reads-for-coverage inf, using Olea europaea subsp. europaea (NC_015401) as reference. SPAdes v3.11.1 (Bankevich et al., 2012), bowtie2 v2.4.2 (Langmead and Salzberg, 2012), and BLAST+ v2.11 (Camacho et al., 2009) were also used in the pipeline with default settings.
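The arguments quoted above (-F embplant_pt, -R 15, --reduce-reads-for-coverage inf) are options of GetOrganelle's get_organelle_from_reads.py script. As a minimal sketch of how such an assembly call could be put together, the Python function below builds the command line without running it. The read file names, the output directory, and the use of -s to pass NC_015401 as a seed sequence are assumptions made for illustration; the text only states that NC_015401 served as reference.

```python
import shlex


def getorganelle_command(fq1, fq2, out_dir, seed_fasta=None):
    """Build (but do not run) a GetOrganelle plastome assembly command.

    The -F, -R and --reduce-reads-for-coverage values are those quoted in
    the Methods; all file and directory names are hypothetical.
    """
    cmd = [
        "get_organelle_from_reads.py",
        "-1", fq1,                       # forward trimmed reads
        "-2", fq2,                       # reverse trimmed reads
        "-o", out_dir,                   # output directory
        "-F", "embplant_pt",             # target: embryophyte plastome
        "-R", "15",                      # maximum extension rounds
        "--reduce-reads-for-coverage", "inf",
    ]
    if seed_fasta is not None:
        # Assumption: the reference is supplied as a seed sequence.
        cmd += ["-s", seed_fasta]
    return cmd


if __name__ == "__main__":
    command = getorganelle_command(
        "olive_R1_val_1.fq.gz",          # hypothetical file names
        "olive_R2_val_2.fq.gz",
        "olive_plastome",
        seed_fasta="NC_015401.fasta",
    )
    print(shlex.join(command))
```

Returning the argument list rather than a single string keeps the command safe to hand to subprocess.run later and makes it easy to log exactly what was executed.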

Annotation of Olea europaea cp genome

The cp genome was annotated using GeSeq in the CHLOROBOX web service (Tillich et al., 2017). Default settings were applied, and comparisons were made with all available plastid genomes of Oleae in the NCBI database, followed by manual curation.
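Once an annotation has been exported as a GenBank flat file, a quick tally of feature keys (gene, CDS, tRNA, rRNA) is a convenient sanity check during manual curation. The following is a minimal sketch using only the Python standard library; it is not part of the authors' workflow, and the toy record is invented for illustration. It relies on the flat-file layout in which feature keys start in column 6 and qualifier lines are indented further.

```python
from collections import Counter


def count_feature_keys(flatfile_text):
    """Count feature keys in the FEATURES table of a GenBank flat file."""
    counts = Counter()
    in_features = False
    for line in flatfile_text.splitlines():
        if line.startswith("FEATURES"):
            in_features = True
            continue
        if in_features and line and not line[0].isspace():
            # A new top-level keyword (e.g. ORIGIN) ends the feature table.
            in_features = False
            continue
        # Feature keys sit after exactly five spaces; qualifiers are deeper.
        if (in_features and line.startswith("     ")
                and len(line) > 5 and not line[5].isspace()):
            counts[line.split()[0]] += 1
    return counts


# Invented toy record, used only to show the expected layout.
TOY_RECORD = "\n".join([
    "LOCUS       TOY                       30 bp    DNA",
    "FEATURES             Location/Qualifiers",
    "     source          1..30",
    "     gene            1..10",
    "                     /gene=\"psbA\"",
    "     CDS             1..10",
    "                     /gene=\"psbA\"",
    "     tRNA            complement(12..20)",
    "                     /gene=\"trnH-GUG\"",
    "     rRNA            22..30",
    "                     /gene=\"rrn5\"",
    "ORIGIN",
    "        1 acgtacgtac gtacgtacgt acgtacgtac",
    "//",
])

if __name__ == "__main__":
    print(dict(count_feature_keys(TOY_RECORD)))
```

Note that feature counts from a flat file include both copies of genes duplicated in the IR regions, so they need not match gene totals that count each gene once.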

Phylogenetic analyses

To understand the phylogenetic position of Olea europaea, a maximum likelihood (ML) tree was reconstructed with 19 genomes retrieved from GenBank. First, we aligned the genomes with MAFFT v7.475 (Katoh and Standley, 2013). Then, under a GTR + GAMMA model of evolution, we obtained the best-scoring ML tree with RAxML v8.2.11 (Stamatakis, 2014), using 1,000 bootstrap (BS) replicates. The cp genome of capirona (Calycophyllum spruceanum, OK326865) served as the outgroup. The alignment used for the phylogenetic analysis is 187,111 bp long (Supplemental Data 1, https://doi.org/10.5061/dryad.tmpg4f57q).
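Before running a tree search, it is worth confirming that the alignment file is internally consistent, that is, that the header's taxon and column counts match the sequences that follow. The sketch below does this for a relaxed sequential PHYLIP file (one "name sequence" pair per line); whether the supplementary .phy file uses that layout rather than interleaved PHYLIP is an assumption, and the toy alignment is invented for illustration.

```python
def check_relaxed_phylip(text):
    """Validate a relaxed sequential PHYLIP alignment.

    Returns (ntax, nchar) from the header after checking that the number
    of records and every sequence length agree with it.
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    ntax, nchar = (int(tok) for tok in lines[0].split()[:2])

    records = {}
    for ln in lines[1:]:
        name, seq = ln.split(None, 1)      # name, then the rest of the line
        records[name] = "".join(seq.split())

    if len(records) != ntax:
        raise ValueError(f"header says {ntax} taxa, found {len(records)}")
    wrong = [n for n, s in records.items() if len(s) != nchar]
    if wrong:
        raise ValueError(f"sequences with length != {nchar}: {wrong}")
    return ntax, nchar


# Invented toy alignment: 3 taxa, 8 aligned columns.
TOY_ALIGNMENT = "3 8\ntaxA ACGT-ACG\ntaxB ACGTTACG\ntaxC AC-TTACG\n"

if __name__ == "__main__":
    print(check_relaxed_phylip(TOY_ALIGNMENT))
```

For the alignment described above, the second value returned would be expected to equal the stated alignment length of 187,111.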

Results

Olea europaea chloroplast genome assembly

The complete chloroplast genome of olive is 155,886 bp in length, with a typical quadripartite structure comprising a large single-copy (LSC) region of 86,610 bp and a small single-copy (SSC) region of 17,790 bp, separated by two inverted repeat (IR) regions (25,741 bp each). Following annotation and subsequent modifications, we submitted the complete chloroplast (cp) genome sequence to the GenBank database under accession number ON767107. The average depth of coverage is 17,673.05X (Supplementary Figure 1, https://doi.org/10.5281/zenodo.14061216). The chloroplast genome contains 124 genes: 80 protein-coding genes, 36 tRNA genes, and 8 rRNA genes. Of these, 12 genes contain one intron, and pafI and clpP1 contain two introns. Most genes are present as a single copy, except 14 genes that are duplicated in the IR regions (Figure 2). A total of 11 cis-splicing genes (rps16, atpF, rpoC1, pafI, clpP1, petB, petD, rpl16, rpl2, ndhB, ndhA) were identified, along with one trans-splicing gene (rps12) (Supplementary Figure 2, Supplementary Figure 3).
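The figures in this paragraph can be cross-checked with simple arithmetic: in a quadripartite plastome, LSC + SSC + 2 × IR should equal the total length, and the gene classes should add up to the gene total. The sketch below uses only the numbers printed above and makes the arithmetic explicit. The gene classes sum to 124 as stated, while the four region lengths as printed sum to 155,882 bp, 4 bp fewer than the stated total; the sketch reports that difference without attempting to explain it.

```python
# Values copied from the text above.
REGIONS_BP = {"LSC": 86_610, "SSC": 17_790, "IRa": 25_741, "IRb": 25_741}
REPORTED_TOTAL_BP = 155_886
GENE_CLASSES = {"protein_coding": 80, "tRNA": 36, "rRNA": 8}
REPORTED_GENE_TOTAL = 124


def summarize():
    """Recompute totals from the per-region and per-class figures."""
    region_sum = sum(REGIONS_BP.values())
    ir_total = REGIONS_BP["IRa"] + REGIONS_BP["IRb"]
    return {
        "region_sum_bp": region_sum,
        "difference_from_reported_bp": REPORTED_TOTAL_BP - region_sum,
        "gene_sum": sum(GENE_CLASSES.values()),
        "gene_sum_matches": sum(GENE_CLASSES.values()) == REPORTED_GENE_TOTAL,
        # Share of the reported genome length occupied by the two IR copies.
        "ir_fraction": ir_total / REPORTED_TOTAL_BP,
    }


if __name__ == "__main__":
    for key, value in summarize().items():
        print(f"{key}: {value}")
```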

Figure 2. Schematic map of features of the Olea europaea chloroplast genome. Genes drawn outside the outer circle are transcribed clockwise, and those inside are transcribed counter-clockwise. Genes belonging to different functional groups are color-coded; the key to gene functions is shown in the bottom left corner. The dark grey inner circle indicates the presence of nodes in the LSC, SSC, and IR regions.

Phylogenetic inference of Olea europaea

This study examined 19 species of the Oleaceae family, with one outgroup, Calycophyllum spruceanum (OK326865), to understand their evolutionary relationships by analyzing complete chloroplast genome sequences. Maximum likelihood (ML) analysis revealed three distinct monophyletic groups corresponding to the tribes Jasmineae, Oleeae, and Forsythieae, with 17 of the nodes showing 100% bootstrap support. In addition, O. europaea is sister to O. europaea subsp. maroccana, and sister to this pair is O. europaea subsp. cuspidata (Figure 3). As expected, Olea europaea was placed in the tribe Oleeae. This phylogenetic tree is consistent with the established classification of the Oleaceae family.

Figure 3. Phylogenetic position of O. europaea according to the ML tree constructed from 19 chloroplast genomes: Jasminum tortuosum NC_034691 (Unpublished), Jasminum sambac NC_034694 (Unpublished), Jasminum fluminense NC_042272 (Olofsson et al., 2019), Jasminum nudiflorum NC_008407 (Lee et al., 2007), Jasminum polyanthum NC_042273 (Olofsson et al., 2019), Olea europaea subsp. maroccana NC_015623 (Besnard et al., 2011), Olea europaea subsp. cuspidata NC_015604 (Besnard et al., 2011), Olea lancea NC_042278 (Olofsson et al., 2019), Olea paniculata NC_042460 (Olofsson et al., 2019), Osmanthus aff. armatus Besnard NC_042263 (Olofsson et al., 2019), Osmanthus cooperi NC_053565 (Wang et al., 2019), Picconia excelsa NC_042466 (Olofsson et al., 2019), Forestiera angustifolia NC_042782 (Olofsson et al., 2019), Forestiera isabelae NC_036981 (Van de Paer et al., 2018), Forsythia mandschurica NC_048504 (Olofsson et al., 2019), Forsythia x intermedia NC_036982 (Park et al., 2019), Abeliophyllum distichum NC_031445 (Kim et al., 2016), Myxopyrum hainanense NC_047485 (Zhu et al., 2020), Calycophyllum spruceanum OK326865 (Saldaña et al., 2022).

Discussion and conclusions

In this study, we assembled for the first time the chloroplast genome sequence of a centennial olive tree (Olea europaea) from the southern Peruvian coast. Our gene counts differ from those reported for other chloroplast genomes of Olea europaea, such as O. europaea subsp. europaea (NC_015401), which has 85 protein-coding genes, 37 tRNA genes, and 8 rRNA genes (Haddad et al., 2021). Niu et al. (2020) reported that two taxa of O. europaea, subsp. europaea var. sylvestris and subsp. cuspidata, have 133 genes, including 87 protein-coding genes, 37 tRNA genes, and 9 rRNA genes. They also reported a chloroplast genome length of 155,886 bp for both (Niu et al., 2020). Our phylogenetic results show that the Peruvian O. europaea is sister to O. europaea subsp. maroccana, with O. europaea subsp. cuspidata sister to that pair, in a clade sister to the other species of the genus Olea. The chloroplast genome sequence and annotation were submitted to NCBI under accession number ON767107.1.

The Peruvian olive chloroplast genome is expected to stimulate additional work on molecular markers for use in genomic selection programs that identify superior individuals at early stages. In addition, this study may serve as a basis for locating genes of interest that could be used in gene editing. This work contributes to knowledge of the genetics of the Peruvian olive tree and supports its modern genetic improvement and conservation.

Authors’ contribution

C.L.S. analyzed the data and was also involved in drafting the article. L.C.-L., C.I.A. and R.E. analyzed the data. E.F.-H., F.Z.-V., J.C.G.-A., C.A.-G., D.L.G.-R. were involved in the conception and design of the work. L.C.-L. and F.Z.-V. were involved in sample collection. F.Z.-V., J.C.G.-A., P.I. and C.I.A. were involved in data interpretation.

F.Z.-V., J.C.G.-A., P.I. and C.I.A. were involved in funding acquisition. C.I.A was involved in drafting the article. All authors read and approved the manuscript.

Ethical approval

The olive sample collected and used in this work is not subject to any protection established by the Republic of Peru. Consequently, this study was exempted from ethical approval.

Data availability statement

Underlying data

NCBI: Complete chloroplast genome. Accession number ON767107; https://www.ncbi.nlm.nih.gov/nuccore/ON767107.

Extended data

Dryad: Supplementary material.

The complete chloroplast genome of a centennial olive tree (Olea europaea, Oleaceae) from the southern Peruvian coast. https://doi.org/10.5061/dryad.tmpg4f57q (Saldaña et al., 2024a).

This link contains the following extended data:

• Supplementary_Figure_1.jpg
• Supplementary_Figure_2.jpg
• Supplementary_Figure_3.jpg

Zenodo: Supplementary material. https://doi.org/10.5281/zenodo.14061216 (Saldaña et al., 2024b).

This link contains the following extended data:

• Supplementary_Data_1.phy
• README.md

Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication) (http://creativecommons.org/publicdomain/zero/1.0/).

References

Bankevich A, Nurk S, Antipov D, et al.: SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012; 19: 455–477.
Besnard G, Hernández P, Khadari B, et al.: Genomic profiling of plastid DNA variation in the Mediterranean olive tree. BMC Plant Biol. 2011; 11(1): 12–80.
Camacho C, Coulouris G, Avagyan V, et al.: BLAST+: Architecture and applications. BMC Bioinform. 2009; 10: 421.
Doyle J, Doyle J: Isolation of plant DNA from fresh tissue. Focus. 1990; 12(13): 39–40.
Dupin J, Raimondeau P, Hong-Wa C, et al.: Resolving the phylogeny of the olive family (Oleaceae): confronting information from organellar and nuclear genomes. Genes. 2020; 11(12): 1508.
Haddad N, Migdadi H, Brake M, et al.: Complete chloroplast genome sequence of historical olive (Olea europaea subsp. europaea) cultivar Mehras, in Jordan. Mitochondrial DNA Part B. 2021; 6(1): 194–195.
Jensen SR, Franzyk H, Wallander E: Chemotaxonomy of the Oleaceae: iridoids as taxonomic markers. Phytochemistry. 2002; 60(3): 213–231.
Jin JJ, Yu WB, Yang JB, et al.: GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 2020; 21(1): 1–31.
Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 2013; 30(4): 772–780.
Kim HW, Lee HL, Lee DK, et al.: Complete plastid genome sequences of Abeliophyllum distichum Nakai (Oleaceae), a Korea endemic genus. Mitochondrial DNA Part B. 2016; 1(1): 596–598.
Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012; 9: 357–359.
Lee HL, Jansen RK, Chumley TW, et al.: Gene relocations within chloroplast genomes of Jasminum and Menodora (Oleaceae) are due to multiple, overlapping inversions. Mol. Biol. Evol. 2007; 24(5): 1161–1180.
Martin M: Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011; 17(1): 10–12.
Niu E, Jiang C, Wang W, et al.: Chloroplast Genome Variation and Evolutionary Analysis of Olea europaea L. Genes. 2020; 11(8): 879.
Olofsson JK, Cantera I, Van de Paer C, et al.: Phylogenomics using low-depth whole genome sequencing: A case study with the olive tribe. Mol. Ecol. Resour. 2019; 19(4): 877–892.
Park J, Min J, Kim Y, et al.: The complete chloroplast genome of a new candidate cultivar, Dae Ryun, of Abeliophyllum distichum Nakai (Oleaceae). Mitochondrial DNA Part B. 2019; 4(2): 3713–3715.
Rallo P, Dorado G, Martin A: Development of simple sequence repeats (SSRs) in olive tree (Olea europaea L.). Theor. Appl. Genet. 2000; 101(5): 984–989.
Saldaña CL, Rodriguez-Grados P, Chávez-Galarza JC, et al.: Unlocking the complete chloroplast genome of a native tree species from the Amazon Basin, capirona (Calycophyllum spruceanum, Rubiaceae), and its comparative analysis with other Ixoroideae species. Genes. 2022; 13(1): 113.
Saldaña C, Chura-Llano L, Estrada R, et al.: Supplementary data of chloroplast genome of a centennial olive tree from the Peruvian coast. [Dataset]. Dryad. 2024a.
Saldaña C, Chura-Llano L, Estrada R, et al.: Supplementary data of chloroplast genome of a centennial olive tree from the Peruvian coast. Zenodo. 2024b.
Stamatakis A: RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014; 30(9): 1312–1313.
Tillich M, Lehwark P, Pellizzer T, et al.: GeSeq–versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017; 45(W1): W6–W11.
Van de Paer C, Bouchez O, Besnard G: Prospects on the evolutionary mitogenomics of plants: a case study on the olive family (Oleaceae). Mol. Ecol. Resour. 2018; 18: 407–423.
Wang X, Cai F, Zhang C, et al.: Characterization of the complete chloroplast genome of the ornamental plant Osmanthus cooperi. Mitochondrial DNA Part B. 2019; 4(2): 2314–2315.
Zhu X, Xiong Z, Yang K, et al.: The complete chloroplast genome of Myxopyrum hainanense and phylogenic analysis of Oleaceae. Mitochondrial DNA Part B. 2020; 5(3): 2103–2104.
