Nat-UV DB: A Natural Products Database Underlying of Veracruz-Mexico

Edgar López-López; Ana Margarita Hernández-Segura; Carlos Lara-Cuellar; Carolina Barrientos-Salcedo; Carlos M. Cerda-García-Rojas; José L. Medina-Franco

doi:10.12688/f1000research.161261.1

Home Browse Nat-UV DB: A Natural Products Database Underlying of Veracruz-Mexico

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Nat-UV DB: A Natural Products Database Underlying of Veracruz-Mexico

[version 1; peer review: 3 approved]

Edgar López-López ^1,2, Ana Margarita Hernández-Segura³, Carlos Lara-Cuellar³, Carolina Barrientos-Salcedo³, Carlos M. Cerda-García-Rojas², José L. Medina-Franco ¹

Edgar López-López ^1,2, Ana Margarita Hernández-Segura³, [...] Carlos Lara-Cuellar³, Carolina Barrientos-Salcedo³, Carlos M. Cerda-García-Rojas², José L. Medina-Franco ¹

PUBLISHED 04 Feb 2025

Author details Author details

¹ DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autonoma de Mexico, Mexico City, Mexico City, 04510, Mexico
² Department of Chemistry and Graduate Program in Pharmacology, Center for Research and Advanced Studies of the National Polytechnic Institute, Mexico City, Mexico City, 07000, Mexico
³ Laboratorio de Química Médica y Quimiogenómica, Facultad de Bioanálisis Campus Veracruz, Universidad Veracruzana, Veracruz, Veracruz, 91700, Mexico

Edgar López-López
Roles: Conceptualization, Formal Analysis, Investigation, Methodology, Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

Ana Margarita Hernández-Segura
Roles: Formal Analysis, Investigation

Carlos Lara-Cuellar
Roles: Formal Analysis, Investigation

Carolina Barrientos-Salcedo
Roles: Writing – Review & Editing

Carlos M. Cerda-García-Rojas
Roles: Writing – Review & Editing

José L. Medina-Franco
Roles: Funding Acquisition, Methodology, Resources, Supervision, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Cheminformatics gateway.

Abstract

Background

Natural products databases are well-structured data sources that offer new molecular development opportunities in drug discovery, agrochemistry, food, cosmetics, and several other research disciplines or chemical industries. The crescent world’s interest in the development of these databases is related to the exploration of chemical diversity in geographical regions with rich biodiversity.

Methods

In this work, we introduce and discuss Nat-UV DB, the first natural products database from a coastal zone of Mexico. We discuss its construction, curation, and chemoinformatic characterization of their content, and chemical space coverage compared with other compound databases, like approved drugs, and other Mexican (BIOFACQUIM and UNIIQUIM databases) and the Latin American natural products database (LaNAPDB).

Results

Nat-UV DB comprises 227 compounds that contain 112 scaffolds, of which 52 are not present in previous natural product databases. The compounds present in Nat-UV DB have a similar size, flexibility, and polarity to previously reported natural products and approved drug datasets.

Conclusions

Nat-UV DB compounds have a higher structural and scaffold diversity than the approved drugs, but they have low structural and scaffold diversity in contrast with other natural products in the reference datasets. This database serves as a valuable addition to the global natural products landscape, bridging gaps in exploring biodiversity-rich regions.

Keywords

Biodiversity; Chemical diversity; Chemoinformatics; Natural products; Chemical multiverse; Chemical space

Corresponding authors: Edgar López-López, José L. Medina-Franco

Competing interests: No competing interests were disclosed.

Grant information: The author(s) declared that no grants were involved in supporting this work.

Copyright: © 2025 López-López E et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: López-López E, Hernández-Segura AM, Lara-Cuellar C et al. Nat-UV DB: A Natural Products Database Underlying of Veracruz-Mexico [version 1; peer review: 3 approved]. F1000Research 2025, 14(Chem Inf Sci):157 (https://doi.org/10.12688/f1000research.161261.1) First published: 04 Feb 2025, 14(Chem Inf Sci):157 (https://doi.org/10.12688/f1000research.161261.1) Latest published: 24 Apr 2025, 14(Chem Inf Sci):157 (https://doi.org/10.12688/f1000research.161261.2)

Introduction

Mexico is one of the most biodiverse countries in the world, which has a large list of endemic organisms.^1,2 At the same time, the state of Veracruz, Mexico, is a coastal region next to the Gulf of Mexico, which has a large diversity in its geographic landscapes and weather, conditions, which have contributed to the increase in biodiversity, and is considered one of the most biodiverse states in the country.³ It has been reported that the state of Veracruz houses 34% of the total species in Mexico, which highlights the importance of the systematic study of their chemical diversity.⁴

Natural products have demonstrated their key role in developing new drugs, materials, nutraceuticals, pesticides, and insecticides, which justify their study.^5,6 Nowadays it is possible to establish structure-properties relationships using bioinformatics and chemoinformatics methodologies.⁷ To achieve this goal, it is necessary to condense, organize, and curate the databases. Recent efforts in Latin America have been developed on the construction of natural products databases that have contributed to the understanding of Latin American traditional medicine, and to accelerate the rational use of natural products in this geographical region.⁸ Particularly in Mexico, there are two compound databases^9,10 that are mainly focused on the natural products identified in the central zone of Mexico. However, there are no reports of databases from the most biodiverse regions in this country.

Figure 1 illustrates representative chemical structures that have been obtained from natural resources collected in the state of Veracruz, which are distinguished by their structural diversity,^11–21 and their great applicability domain in medicine (e.g. to develop antimicrobial and anticancer drugs), cosmetology (e.g. to develop skincare molecules), nutrition (e.g. to develop nutraceuticals), agriculture (e.g. to develop biopesticides and insecticides), and many other applications.^22–25

Figure 1. Representative natural products of the state of Veracruz, Mexico.

In the present scientific context, it is possible to establish highly efficient virtual screening protocols using chemoinformatics methods, natural product databases have covered a worldly interest in the past 20 years.^26,27 Multiple commercial and open-access natural product databases are available as valuable resources for molecular design. It is expected that databases will continue to grow in number and type. For example, focusing their creation on the organization of data and information on natural products based on their reported biological activity, chemical characterization method, geolocation, natural source, commercial availability, etc.²⁷ Web applications like COCONUT 2.0 (the COlleCtion of Open NatUral producTs) is an an excellent resource freely available at https://coconut.naturalproducts.net/ to unify and standardize multiple natural product databases,²⁸ which facilitates the systematic filtering of multipurpose data useful for chemoinformatic and natural products research.

The main objective of this work is to introduce Nat-UV DB, a database of natural products isolated and characterized in the state of Veracruz, Mexico. We also discuss a systematic analysis using chemoinformatics methods, identifying endemic natural products, and studying their chemical diversity.

Methods

Database construction and curation

The database of natural products from the state of Veracruz was assembled from a literature search. For the construction of the first version of NAT-UV DB, PubMed, Google Scholar, Sci-Finder, Redalyc, and the institutional repository of the Universidad Veracruzana (Mexico) databases were searched using the keywords “natural product”, “NMR”, and “Veracruz.” We collected information from research articles, and bachelor, master, and doctorate theses from universities and research centers. To complement the data mining, two additional criteria were used for the final selection of the literature used to construct the database. The first filter was that the elucidation of the reported chemical structures has been supported by nuclear magnetic resonance (NMR). The second one was that the compounds identified were obtained from a natural source from any region in the state of Veracruz (Mexico). The search was generated by publication year from 1970 to June of 2024. We want to emphasize that this is the first version of Nat-UV DB; future versions will have natural products from more years, and more research repositories, to assemble a database representative of the entire biodiversity of the state of Veracruz. For each collected molecule, their isomeric SMILES strings²⁹ were generated with ChemBioDraw Ultra V.13, maintaining the stereochemistry reported in the primary literature.³⁰ With the module’Wash’, from the molecular operating environment (MOE) program, version 2024,³¹ the database was curated, maintaining without changes the stereochemistry reported of each molecule. This was done to normalize and collect the most relevant information from the molecules. The data curation involved the elimination of salts, the adjustment of the protonation states, and the elimination of the duplicated molecules. The default settings of the ‘Wash’ module were used. The information collected for each identified compound is organized according to the natural origin of its place of collection, like kingdom, genera, species, and geographical collection. Finally, the list of curated compounds was manually cross-referenced to PubChem³² and ChEMBL v.34³³ databases, which enabled the annotation of databases with the bioactivities that have been associated with each chemical structure ( Figure 2). For those compounds reported in theses, and which were evaluated in a biological test, the biological activity was also included in the database.

Figure 2. Workflow used to construct the Nat-UV database.

Reference data sets

In order to characterize the chemical diversity of Nat-UV DB and to explore its chemical space coverage, approved drugs³⁴ and the Latin American natural products compound database (LaNAPDB)³⁵ were used to compare their chemical structures and properties. The structure files used in this work were taken from open repositories of previously published analyses of natural products databases.³⁶ The structures of the reference compounds were curated using the same procedure described to prepare Nat-UV DB. Table 1 summarizes Nat-UV DB and the reference databases and the number of compounds. Of note, the reference collections included data sets of natural products, including two from Mexico.

Table 1. Reference databases compared with Nat-UV DB.

Database	Description	Size*	Reference
Approved drugs (DrugBank v. 2024.0)	Drugs approved for clinical use	2,144^⧫	³⁴
LANaPDB 2.0	Latinoamerican natural products database with chemicals from Brazil, Colombia, Costa Rica, El Salvador, Mexico, Panama, and Peru	13,579	³⁶
BIOFACQUIM	Natural products from, Mexico	531	³⁷
UNIIQUIM	Natural products from, Mexico	855	¹⁰
Nat-UV DB	Natural products from the state of Veracruz (Mexico)	227	-

* Number of compounds after data curation.

⧫ Small molecules.

Druglikness profiling

The curated Nat-UV DB database was characterized by calculating six physicochemical properties of pharmaceutical interest, namely: molecular weight (MW), octanol/water partition coefficient (ClogP), topological surface area (TPSA), number of rotatable bonds (RB), number of H-bond donor atoms (HBD), and number of H-bond acceptor atoms (HBA). The statistical analysis was done with the program DataWarrior v.06,³⁸ by calculating the mean, median, and standard deviation of the calculated properties. Based on these statistics Nat-UV DB was further compared with other databases (LANAPDB, BIOFACQUIM, UNIIQUIM, and approved drugs from DrugBank) ( Table 1). The systematic comparison was generated using the Python programming language. The code is freely available at https://github.com/EdgL2/Nat-UV-DB.

Scaffold content analysis of Nat-UV DB

The most frequent and unique molecular scaffolds of Nat-UV DB and reference databases ( Table 1) were computed using the scaffold definition of Bemis and Murcko.³⁹ This analysis was done using Python, the code of which is freely available at https://github.com/EdgL2/Nat-UV-DB.

Visualization of the chemical space

In order to generate a visual representation of the chemical space of Nat-UV DB, the fingerprint ECFP4 (1024 bits) was calculated for each compound⁴⁰ and the visualization was done using t-distributed stochastic neighbor embedding (t-SNE).⁴¹ The selection of this visualization method was based on recent studies that support its utility for the systematic study of small and large datasets in terms of neighborhood preservation and visualization capabilities.⁴² The ECFP4 fingerprint and the t-SNE coordinates were calculated in KNIME software. The optimization parameters we used in t-SNE were dimensions (3), iterations (10,000), theta (0.3), perplexity (30.0), and number of threats (8), using 28 as the seed number. The interactive visualization was implemented using DataWarrior software, version 06.⁴³ The KNIME workflow and data generated are freely available in the Software availability section.

Chemical diversity analysis

To compare the chemical diversity of Nat-UV DB with the reference data sets, we employed the consensus diversity (CD) plot which is a simple two-dimensional graph that helps to visualize and compare the diversity of several compound data sets considering multiple representations such as chemical scaffolds, and fingerprint-based diversity.⁴⁴ In this study, the CD plot was generated using the median paired similarity (ECFP4-1024 bits)/Tanimoto; x-axis) and the median paired scaffold similarity (Bemis-Murck representations using ECFP4-1014 bits/Tanimoto; y-axis).⁴⁴ Both are established and are representative metrics of the scaffold and fingerprint-based diversity.⁴⁵ Subsets of the compounds were retrieved from control data sets ( Table 1). The workflow implemented in KNIME software is available in the Software availability section.

Results and discussion

In this section, we present the results of the construction of the Nat-UV database followed by a descriptive analysis of the contained data, and the chemoinformatic characterization in terms of physicochemical properties, scaffold content, chemical space coverage, and consensus chemical diversity.

Nat-UV database

As described in the Methods section, the scientific papers and thesis that complied with the inclusion criteria were selected. Each of the 45 scientific documents selected (1 Doctorate thesis degree; 8 Master thesis degrees; 36 research articles) was analyzed individually to extract manually the chemical structures of each identified natural product. The Nat-UV DB contains information that allows identifying the bibliography precedence of the data. For example: compound name, reference, digital object identifier (DOI), and publication year. Also, it contains data related to the natural source precedence of the data. For example: kingdom, genus, species, and geographical location of the collection of the natural source. Additionally, we added cross-referenced IDs with other databases (e.g. PubChem and ChEMBL). Finally, we manually cross-referenced each compound with their reported bioactivity contained in ChEMBL v. 34.

The current version of Nat-UV DB has 227 compounds collected from different geographical zones of Veracruz ( Figure 3A), mainly isolated by different kinds of gender plants ( Figure 3B). For example, the gender Hyptis, Capsicum, Nidema, Dryopteris, Ipomoea, Azadirachta, Hamelia, Croton, and Guarea are examples of the most frequently studied. Other species from other kingdoms also stand out as Aspergillus, Ganoderma, Colletotrichum, and Aegiale ( Figure 3C). Figure 1D illustrates the distribution of compounds per year reported since 1970 to date. Finally, 79% of the compounds contained in this database have been associated with almost one bioactivity report ( Figure 2E, D) which highlights compounds with anticancer and antimicrobial (antibacterial or antifungal) activities.

Figure 3. Descriptive analysis of the Nat-UV DB.

(A) Geographical collection of natural resources studied in this work, and the number of compounds obtained by each region; (B) Quantification of compounds contained in this database by genus; (C) Quantification of compounds contained in this database by specie precedence; (D) Number of isolated compounds by decades; (E) Associated bioactivity for the compounds contained in this database; and (F) Multi-activity landscape of compounds contained in this database.

Molecular scaffolds

From the total number of compounds contained in Nat-UV DB (227 compounds), 112 scaffolds were identified, of which 52 (52/112; 46%) are unique. Namely, Nat-UV DB contains scaffolds (52) that have not been reported previously in other Latin American datasets, and that are not present in the scaffolds collection of approved drugs ( Figure 4A). The most representative unique scaffolds of Nat-UV DB are shown in Figure 4B, highlighting the presence of derivatives of limonoids, butyrolactones, flavones, pentacyclic triterpenes, etc. The full list of unique scaffolds is available in the Data availability section.

Figure 4. Unique scaffold content in Nat-UV DB.

(A) Shared scaffolds of Nat-UV DB and reference datasets for natural products (LANaPDB, BIOFACQUIM, and UNIIQUIM) and approved drugs (Drugbank); (B) Representative unique scaffolds contained in Nat-UV DB.

There are previous reports of two Mexican natural products databases ( Table 1), but the BIOFACQUIM database is the unique one associated with collected geographical data. Interestingly, 74 compounds contained in this dataset were collected in the state of Veracruz (Mexico). This explains that 32 (32/112; 28%) scaffolds are shared in both databases ( Figure 4A). Also, 17 (17/112; 15%) scaffolds are shared between Nat-UV DB and UNIIQUIM, while 53 (53/112; 47%) scaffolds are shared between Nat-UV DB and the LaNaPDB. Finally, 24 (24/112; 21%) scaffolds were shared between Nat-UV DB and the approved drugs collection. In other words, Nat-UV DB contains some natural product scaffolds (60/112; 54%) that have been identified previously in Mexico and other Latin American countries or have been used as a drug.

Molecular properties

Figure 5 shows a violin plot of the distribution of the six drug-likeness properties calculated for Nat-UV DB. The distribution of the same properties for the two references used in this work was included in comparing the violin plots. ( Table 1). Intrinsic molecular properties like size, flexibility, and polarity are described by explicit molecular properties like weight (MW), coefficient of octanol/water partition (CLogP), number of H-acceptor and H-donors bonds, polar surface area (PSA), and number of rotatable bonds (RB) ( Figure 5A). Summary statistics are presented at the bottom of the violin plots ( Figure 5B).

Figure 5. Violin plots for the drug-likeness physicochemical properties of Nat-UV DB and reference data sets.

(A) The boxes inside of violins enclose data with values within the first and third quartile; (B) Summary statistics are included below each l plot. MW: molecular weight; ClogP: octanol/water partition coefficient; H-bond acceptors: number of H-bond acceptor atoms; H-Donors: number of H-bond donor atoms; PSA: polar surface area; RB: number of rotatable bonds.

According to Figure 5 the size (MW, HA, and HB), flexibility (RB), and permeability (PSA) profiling of Nat-UV are comparable with the control datasets. However, the polarity (CLogP) of the compounds contained in Nat-UV DB, LANaPDB, BIOFACQUIM, and UNIIQUIM is higher than the approved drugs, however, the Nat-UV DB exhibited a shorter distribution than each natural products databases. This finding agrees with previous reports indicating that natural products are slightly more hydrophobic than drugs approved for clinical use.³⁶

Chemical space and diversity analysis

Figure 6 shows a visual representation of the chemical space of Nat-UV DB based on ECFP4 fingerprint using t-SNE. Figure 6(B-E) compares Nat-UV DB with other natural products databases (i.e. LANaPDB, BIOFACQUIM, and UNIIQUIM) and approved drugs. Interestingly, Nat-UV DB shares part of its chemical space with the approved drugs dataset, but Nat-UV DB compounds are more distributed in the three dimensions of the plot, which suggests that have a higher structural diversity than the approved drug dataset. However, LANaPDB (the largest dataset analyzed in this study) has an apparently higher structural diversity than the other studied datasets. To quantify the diversity of each dataset, the calculation of structural diversity and scaffold diversity were done ( Figure 6F). To quantify the diversity of each dataset, we calculated the mean of the paired similarity of the structures (x-axis) and scaffolds (y-axis) based on the similarity of each pair of compounds in the dataset using ECFP4 fingerprint and Tanimoto coefficient, where the higher values confirm a higher structural or scaffold diversity of the dataset. These results showed that Nat-UV DB has a higher structural and scaffold diversity than the approved drugs. However, it has low structural and scaffold diversity in contrast with UNIIQUIM and LANaPDB. Finally, Nat-UV DB shows a higher structural diversity than BIOFACQUIM, but a lower scaffold diversity than this one.

Figure 6. Visual representation of the chemical space coverage of Nat-UV DB and reference datasets based on ECFP4 and t-SNE as a visualization method.

(A) Nat-UV DB; (B) Nat-UV DB and Drugbank (approved drugs); (C) Nat-UV DB and LANaPDB; (D) Nat-UV DB and BIOFACQUIM; (E) Nat-UV DB and UNIIQUIM; and (F) Consensus diversity plot of Nat-UV DB and the four reference datasets.

Conclusions

Nat-UV DB is a compound database of natural products from the state of Veracruz in Mexico, which is a coastal zone reported with a large biodiversity. The open-access database contains 227 compounds reported from 1970 to June 2024, which is available at https://github.com/EdgL2/Nat-UV-DB. The compound database contains information of bibliographic resources for each compound, information about the collected species that come from, and cross-referenced bioactivity data. The chemoinformatic characterization and analysis of the coverage and diversity of Nat-UV DB in the chemical space suggest broad coverage, overlapping with regions in the approved drugs chemical space. The analysis also indicated that there are unique compounds in Nat-UV DB concerning other Mexican and Latin American natural products databases. The main perspectives of this work are to use Nat-UV DB to identify active compounds using virtual screening methods and continue to augment the size of Nat-UV DB from the new natural products that would be identified in the state of Veracruz, Mexico.

Ethics and consent

Ethical approval and consent were not required.

Data availability

Zenodo: Nat-UV DB Data Availability. The datasets used in this work. https://doi.org/10.5281/zenodo.14715820.⁴⁶

This project contains the following underlying data:

• FinalDB_ForPaper_DB_cured.csv: The Nat-UV compounds and approved drugs datasets.
• Most_Frecuent_Scaffolds_NatUVDB.xlsx: The most frequent scaffolds contained in Nat-UV DB.

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Software availability

Source code available from: https://github.com/EdgL2/Nat-UV-DB.⁴⁷

Archived software available from: https://doi.org/10.5281/zenodo.14715820.⁴⁷

The codes and workflow are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Acknowledgements

ELL and AMHS thank the Consejo Nacional de Humanidades, Ciencias y Tecnología (CONAHCYT) for the scholarships 762342 (No. CVU: 894234) and 4011825 (CVU: 1322038), respectively.

References

1. Dávila-Aranda P, Lira-Saade R, Valdés-Reyna J: Endemic species of grasses in Mexico: a phytogeographic approach. Biodivers. Conserv. 2004; 13: 1101–1121. Publisher Full Text
2. Mapes C, Basurto F: Biodiversity and edible plants of Mexico.Lira R, Casas A, Blancas J, editors. Ethnobotany of Mexico. Ethnobiology. New York, NY: Springer; 2016. Publisher Full Text
3. Peterson AT, Egbert SL, Sánchez-Cordero V, et al.: Geographic analysis of conservation priority: endemic birds and mammals in Veracruz, Mexico. Biol. Conserv. 2000; 93: 85–94. Publisher Full Text
4. SEMARNAT: Informe de la situación del medio ambiente en México.2015. Accessed 15 November 2024. Reference Source
5. Chopra B, Dhingra AK: Natural products: A lead for drug discovery and development. Phytother. Res. 2021; 35: 4660–4702. Publisher Full Text
6. Zhang X, Jiang M, Niu N, et al.: Natural-product-derived carbon dots: From natural products to functional materials. ChemSusChem. 2017; 11: 11–24. PubMed Abstract | Publisher Full Text
7. López-López E, Medina-Franco JL: Toward structure-multiple activity relationships (SMARts) using computational approaches: A polypharmacological perspective. Drug Discov. Today. 2024; 29: 104046. PubMed Abstract | Publisher Full Text
8. Gómez-García A, Medina-Franco JL: Progress and impact of Latin American natural product databases. Biomolecules. 2022; 12: 1202. PubMed Abstract | Publisher Full Text | Free Full Text
9. Pilón-Jiménez B, Saldívar-González F, Díaz-Eufracio B, et al.: BIOFACQUIM: A Mexican compound database of natural products. Biomolecules. 2019; 9: 31. PubMed Abstract | Publisher Full Text | Free Full Text
10. UNIIQUIM: Lista de compuestos.2024. Accessed 15 November 2024. Reference Source
11. Hernandez-Medel MDR, Garcia-Salmones I, Santillan R, et al.: An anthrone from Picramnia antidesma. Phytochemistry. 1998; 49: 2599–2601. Publisher Full Text
12. Martínez-Fructuoso L, Pereda-Miranda R, Rosas-Ramírez D, et al.: Structure elucidation, conformation, and configuration of cytotoxic 6-heptyl-5,6-dihydro-2H-pyran-2-ones from hyptis species and their molecular docking to α-Tubulin. J. Nat. Prod. 2019; 82: 520–531. PubMed Abstract | Publisher Full Text
13. Mendoza Cervantes G: Obtención de macrosporina a partir de Stemphylium lycopersici hongo fitopatógeno de papaya.2006. Accessed 15 November 2024. Reference Source
14. Gutiérrez-Rebolledo GA, Garduño-Siciliano L, García-Rodríguez RV, et al.: Anti-inflammatory and toxicological evaluation of Moussonia deppeana (Schldl. & Cham) hanst and verbascoside as a main active metabolite. J. Ethnopharmacol. 2016; 187: 269–280. PubMed Abstract | Publisher Full Text
15. Hernández-Carlos B, Bye R, Pereda-Miranda R: Orizabins V−VIII, tetrasaccharide glycolipids from the Mexican Scammony Root (Ipomoea orizabensis). J. Nat. Prod. 1999; 62: 1096–1100. PubMed Abstract | Publisher Full Text
16. Espinoza C, Couttolenc A, Fernández JJ, et al.: Brefeldin-A: an antiproliferative metabolite of the fungus Curvularia trifolii collected from the Veracruz coral reef system, Mexico. J. Mex. Chem. Soc. 2016; 60: 79–82. Accessed 15 November 2024. Reference Source
17. Cruz-Miranda OL, Folch-Mallol J, Martínez-Morales F, et al.: Identification of a huperzine A-producing endophytic fungus from Phlegmariurus taxifolius. Mol. Biol. Rep. 2019; 47: 489–495. PubMed Abstract | Publisher Full Text
18. García A, Ramírez-Apan T, Cogordan JA, et al.: Absolute configuration assignments by experimental and theoretical approaches of ent-labdane- and cis-ent-clerodane-type diterpenes isolated from Croton glabellus. Can. J. Chem. 2006; 84: 1593–1602. Publisher Full Text
19. Rivera-Chávez J, Coporo-Blancas D, Morales-Jiménez J: One-step partial synthesis of (±)-asperteretone B and related hPTP1B1–400 inhibitors from butyrolactone I. Bioorg. Med. Chem. 2020; 28: 115817. PubMed Abstract | Publisher Full Text
20. Paniagua-Vega D, Cerda-García-Rojas CM, Ponce-Noyola T, et al.: A new monoterpenoid oxindole alkaloid from Hamelia Patens micropropagated plantlets. Nat. Prod. Commun. 2012; 7: 1934578X1200701. Publisher Full Text
21. Jimenez A, Villarreal C, Toscano RA, et al.: Limonoids from Swietenia humilis and Guarea grandiflora (Meliaceae). Phytochemistry. 1998; 49: 1981–1988. Publisher Full Text
22. Kaur K, Jain M, Kaur T, et al.: Antimalarials from nature. Bioorg. Med. Chem. 2009; 17: 3229–3256. Publisher Full Text
23. Pereda-Miranda R, Hernández L, Villavicencio MJ, et al.: Structure and stereochemistry of pectinolides A-C, novel antimicrobial and cytotoxic 5,6-dihydro-α-pyrones from Hyptis pectinata. J. Nat. Prod. 1993; 56: 583–593. PubMed Abstract | Publisher Full Text
24. Liu S, Luo XH, Liu YF, et al.: Emodin exhibits anti-acne potential by inhibiting cell growth, lipogenesis, and inflammation in human SZ95 sebocytes. Sci. Rep. 2023; 13: 21576. PubMed Abstract | Publisher Full Text | Free Full Text
25. Pastor R, Bouzas C, Tur JA: Beneficial effects of dietary supplementation with olive oil, oleic acid, or hydroxytyrosol in metabolic syndrome: Systematic review and meta-analysis. Free Radic. Biol. Med. 2021; 172: 372–385. PubMed Abstract | Publisher Full Text
26. Bajorath J, Chávez-Hernández AL, Duran-Frigola M, et al.: Chemoinformatics and artificial intelligence colloquium: progress and challenges in developing bioactive compounds. J Cheminform. 2022; 14: 82. PubMed Abstract | Publisher Full Text | Free Full Text
27. Sorokina M, Steinbeck C: Review on natural products databases: where to find data in 2020. J. Cheminform. 2020; 12: 20. PubMed Abstract | Publisher Full Text | Free Full Text
28. Nainala VC, Rajan K, Kanakam SRS, et al.: COCONUT 2.0: A comprehensive overhaul and curation of the collection of open natural products database. ChemRxiv. 2024. Publisher Full Text
29. Weininger D: SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Model. 1988; 28: 31–36. Publisher Full Text
30. Narayanaswamy VK, Rissdörfer M, Odhav B: Review on cambridgesoft ChemBioDraw ultra 13.0 v. Int. J. Theor. Appl. Sci. 2013; 5: 45–49.
31. Molecular Operating Environment (MOE): Chemical computing group ULC, 910-1010 Sherbrooke St. W., Montreal, QC H3A 2R7, 2025.2024.
32. Kim S, Chen J, Cheng T, et al.: PubChem 2023 update. Nucleic Acids Res. 2023; 51: D1373–D1380. PubMed Abstract | Publisher Full Text | Free Full Text
33. Zdrazil B, Felix E, Hunter F, et al.: The ChEMBL database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods. Nucleic Acids Res. 2024; 52: D1180–D1192. PubMed Abstract | Publisher Full Text | Free Full Text
34. Knox C, Wilson M, Klinger CM, et al.: DrugBank 6.0: the drugBank knowledgebase for 2024. Nucleic Acids Res. 2024; 52: D1265–D1275. PubMed Abstract | Publisher Full Text | Free Full Text
35. Gómez-García A, Acuña Jiménez DA, Zamora WJ, et al.: Navigating the chemical space and chemical multiverse of a unified Latin American natural product database: LANaPDB. Pharmaceuticals. 2023; 16: 1388. PubMed Abstract | Publisher Full Text | Free Full Text
36. Gómez-García A, Acuña Jiménez DA, Zamora WJ, et al.: Latin American Natural Product Database (LANaPDB): An Update. J. Chem. Inf. Model. 2024; 64: 8495–8509. In press. PubMed Abstract | Publisher Full Text | Free Full Text
37. Sánchez-Cruz N, Pilón-Jiménez BA, Medina-Franco JL: Functional group and diversity analysis of BIOFACQUIM: A Mexican natural product database. F1000Res. 2020; 8(Chem Inf Sci): 2071. PubMed Abstract | Publisher Full Text | Free Full Text
38. Sander T, Freyss J, von Korff M , et al.: DataWarrior: An open-source program for chemistry aware data visualization and analysis. J. Chem. Inf. Model. 2015; 55: 460–473. PubMed Abstract | Publisher Full Text
39. Bemis GW, Murcko MA: The properties of known drugs. 1. Molecular frameworks. J. Med. Chem. 1996; 39: 2887–2893. Publisher Full Text
40. Rogers D, Hahn M: Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010; 50: 742–754. Publisher Full Text
41. Medina-Franco JL, Sánchez-Cruz N, López-López E, et al.: Progress on open chemoinformatic tools for expanding and exploring the chemical space. J. Comput. Aided Mol. Des. 2021; 36: 341–354. PubMed Abstract | Publisher Full Text | Free Full Text
42. Orlov AA, Akhmetshin TN, Horvath D, et al.: From high dimensions to human insight: Exploring dimensionality reduction for chemical space visualization. Mol. Inform. 2024; 44: e202400265. PubMed Abstract | Publisher Full Text | Free Full Text
43. López-López E, Naveja JJ, Medina-Franco JL: DataWarrior: an evaluation of the open-source drug discovery tool. Expert Opin. Drug Discov. 2019; 14: 335–341. PubMed Abstract | Publisher Full Text
44. González-Medina M, Prieto-Martínez FD, Owen JR, et al.: Consensus diversity plots: a global diversity analysis of chemical libraries. J. Cheminf. 2016; 8: 63. PubMed Abstract | Publisher Full Text | Free Full Text
45. Dunn TB, López-López E, Kim TD, et al.: Exploring activity landscapes with extended similarity: is Tanimoto enough? Mol. Inf. 2023; 42: e2300056. PubMed Abstract | Publisher Full Text
46. Medina-Franco JL, et al.: Zenodo: Nat-UV DB Data Availability. The datasets used in this work.2025. Publisher Full Text
47. Medina-Franco JL, et al.: Source code. Archived software. 2025. Publisher Full Text Reference Source

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 04 Feb 2025

Author details Author details

Edgar López-López
Roles: Conceptualization, Formal Analysis, Investigation, Methodology, Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

Ana Margarita Hernández-Segura
Roles: Formal Analysis, Investigation

Carlos Lara-Cuellar
Roles: Formal Analysis, Investigation

Carolina Barrientos-Salcedo
Roles: Writing – Review & Editing

Carlos M. Cerda-García-Rojas
Roles: Writing – Review & Editing

José L. Medina-Franco
Roles: Funding Acquisition, Methodology, Resources, Supervision, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

The author(s) declared that no grants were involved in supporting this work.

Article Versions (2)

version 2

Revised

Published: 24 Apr 2025, 14:157

https://doi.org/10.12688/f1000research.161261.2

version 1

Published: 04 Feb 2025, 14:157

https://doi.org/10.12688/f1000research.161261.1

© 2025 López-López E et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

López-López E, Hernández-Segura AM, Lara-Cuellar C et al. Nat-UV DB: A Natural Products Database Underlying of Veracruz-Mexico [version 1; peer review: 3 approved]. F1000Research 2025, 14(Chem Inf Sci):157 (https://doi.org/10.12688/f1000research.161261.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 1

VERSION 1

PUBLISHED 04 Feb 2025

Views

Reviewer Report 12 Apr 2025

Kathia Maria Honorio, Universidade de Sao Paulo, São Paulo, State of São Paulo, Brazil

Approved

https://doi.org/10.5256/f1000research.177266.r370989

The manuscript deals with a topic of great relevance in the areas of chemoinformatics, natural products and drug discovery. The approach used is appropriate and has led to the development of a database of compounds that will be extremely useful to the scientific community. I suggest minor revisions to the text:
1) "...calculating six physicochemical properties of pharmaceutical interest..." - insert which tool was used to calculate the properties.
2) "Figure 1D illustrates the distribution of compounds per year reported since 1970 to date." - would the correct term be Figure 3D?
3) "Finally, 79% of the compounds contained in this database have been associated with almost one bioactivity report (Figure 2E, D)..." - would the correct term be "Figures 3E and F?

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Computational medicinal chemistry

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 17 Apr 2025

José L. Medina-Franco, DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autonoma de Mexico, Mexico City, 04510, Mexico

17 Apr 2025

Author Response

Dear Prof. Kathia Maria Honorio, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed.
Dear Prof. Kathia Maria Honorio, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Dear Prof. Kathia Maria Honorio, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 17 Apr 2025

José L. Medina-Franco, DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autonoma de Mexico, Mexico City, 04510, Mexico

17 Apr 2025

Author Response

Dear Prof. Kathia Maria Honorio, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed.
Dear Prof. Kathia Maria Honorio, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Dear Prof. Kathia Maria Honorio, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Views

Reviewer Report 02 Apr 2025

David J Newman, NIH Special Volunteer, Wayne, USA

Approved

https://doi.org/10.5256/f1000research.177266.r370987

This is a well-described analysis of the chemical structures found in reports from the most-biodiverse state in Mexico. The authors have provided links to their sources, their analytical tools and the manner in which they have integrated their data sets to provide the first NP-structure analyses from this highly biodiverse area of Mexico.

The authors have successfully linked analytical tools (in particular high-field NMR to confirm structures and biological sources have been identified.

The various data sets provided (together with the methodologies used) will not only act as a building block for subsequent work but more importantly, will permit comparison(s) with both existing agents from other disparate areas not only in Mexico but also in other biodiverse geographic areas in Central and South America, in particular those from Brazil’s Atlantic Forest and various parts of Amazonia where there are databases extant.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 28 Apr 2025

José L. Medina-Franco, DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autonoma de Mexico, Mexico City, 04510, Mexico

28 Apr 2025

Author Response

Dear Prof. David J Newman, Thank you very much for your thoughtful and encouraging comments. We truly appreciate your recognition of our work and its potential to serve as a ... Continue reading Dear Prof. David J Newman, Thank you very much for your thoughtful and encouraging comments. We truly appreciate your recognition of our work and its potential to serve as a foundation for future research. We are especially excited about the possibility of comparative studies and welcome future collaborations to expand this effort across other biodiverse regions in Latin America.
Dear Prof. David J Newman, Thank you very much for your thoughtful and encouraging comments. We truly appreciate your recognition of our work and its potential to serve as a foundation for future research. We are especially excited about the possibility of comparative studies and welcome future collaborations to expand this effort across other biodiverse regions in Latin America.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 28 Apr 2025

José L. Medina-Franco, DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autonoma de Mexico, Mexico City, 04510, Mexico

28 Apr 2025

Author Response

Dear Prof. David J Newman, Thank you very much for your thoughtful and encouraging comments. We truly appreciate your recognition of our work and its potential to serve as a ... Continue reading Dear Prof. David J Newman, Thank you very much for your thoughtful and encouraging comments. We truly appreciate your recognition of our work and its potential to serve as a foundation for future research. We are especially excited about the possibility of comparative studies and welcome future collaborations to expand this effort across other biodiverse regions in Latin America.
Dear Prof. David J Newman, Thank you very much for your thoughtful and encouraging comments. We truly appreciate your recognition of our work and its potential to serve as a foundation for future research. We are especially excited about the possibility of comparative studies and welcome future collaborations to expand this effort across other biodiverse regions in Latin America.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Views

Reviewer Report 20 Mar 2025

Virginia Flores-Morales, The Autonomous University of Zacatecas, Zacatecas, Mexico

Approved

https://doi.org/10.5256/f1000research.177266.r370994

The article is interesting and shows in a way very appropriate to the relevance of the results. The structure of the article allows an agile and pleasant reading. Future updates will make this database a reference for study in other regions of great diversity in Mexico. The findings are important and are very well reflected in the work.

I would recommend to standardize PSA and change the TPSA of page 5. A subtlety. I also suggest in Figure 6, arrange the descriptors in colour by order of apparition, including panel F, to make the explanation more harmonious

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: My research lines focus on generating knowledge in the medicinal chemical study of synthetic molecules and natural products and Theoretical chemistry.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 26 Apr 2025

José L. Medina-Franco, DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autonoma de Mexico, Mexico City, 04510, Mexico

26 Apr 2025

Author Response

Dear Prof. Virginia Flores-Morales, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed.
Dear Prof. Virginia Flores-Morales, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Dear Prof. Virginia Flores-Morales, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 26 Apr 2025

José L. Medina-Franco, DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autonoma de Mexico, Mexico City, 04510, Mexico

26 Apr 2025

Author Response

Dear Prof. Virginia Flores-Morales, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed.
Dear Prof. Virginia Flores-Morales, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Dear Prof. Virginia Flores-Morales, We appreciate your valuable comments and suggestions. The second version of this revised work addresses all of them.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 04 Feb 2025

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2	3
Version 2 (revision) 24 Apr 25			read
Version 1 04 Feb 25	read	read	read

Virginia Flores-Morales, The Autonomous University of Zacatecas, Zacatecas, Mexico
David J Newman, NIH Special Volunteer, Wayne, USA
Kathia Maria Honorio, Universidade de Sao Paulo, São Paulo, Brazil

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

9 Views

08 May 2025 | for Version 2

Kathia Maria Honorio, Universidade de Sao Paulo, São Paulo, State of São Paulo, Brazil

9 Views Cite this report Responses(0)

Approved

In my opinion, the manuscript can be approved.

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Computational medicinal chemistry

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

14 Views

12 Apr 2025 | for Version 1

Kathia Maria Honorio, Universidade de Sao Paulo, São Paulo, State of São Paulo, Brazil

14 Views Cite this report Responses(1)

Approved

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Computational medicinal chemistry

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Back to all reports

Reviewer Report

20 Views

02 Apr 2025 | for Version 1

David J Newman, NIH Special Volunteer, Wayne, USA

20 Views Cite this report Responses(1)

Approved

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Back to all reports

Reviewer Report

14 Views

20 Mar 2025 | for Version 1

Virginia Flores-Morales, The Autonomous University of Zacatecas, Zacatecas, Mexico

14 Views Cite this report Responses(1)

Approved

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

My research lines focus on generating knowledge in the medicinal chemical study of synthetic molecules and natural products and Theoretical chemistry.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] 1. Dávila-Aranda P, Lira-Saade R, Valdés-Reyna J: Endemic species of grasses in Mexico: a phytogeographic approach. Biodivers. Conserv. 2004; 13: 1101–1121. Publisher Full Text

[2] 2. Mapes C, Basurto F: Biodiversity and edible plants of Mexico.Lira R, Casas A, Blancas J, editors. Ethnobotany of Mexico. Ethnobiology. New York, NY: Springer; 2016. Publisher Full Text

[3] 3. Peterson AT, Egbert SL, Sánchez-Cordero V, et al.: Geographic analysis of conservation priority: endemic birds and mammals in Veracruz, Mexico. Biol. Conserv. 2000; 93: 85–94. Publisher Full Text

[4] 4. SEMARNAT: Informe de la situación del medio ambiente en México.2015. Accessed 15 November 2024. Reference Source

[5] 5. Chopra B, Dhingra AK: Natural products: A lead for drug discovery and development. Phytother. Res. 2021; 35: 4660–4702. Publisher Full Text

[6] 6. Zhang X, Jiang M, Niu N, et al.: Natural-product-derived carbon dots: From natural products to functional materials. ChemSusChem. 2017; 11: 11–24. PubMed Abstract | Publisher Full Text

[7] 7. López-López E, Medina-Franco JL: Toward structure-multiple activity relationships (SMARts) using computational approaches: A polypharmacological perspective. Drug Discov. Today. 2024; 29: 104046. PubMed Abstract | Publisher Full Text

[8] 8. Gómez-García A, Medina-Franco JL: Progress and impact of Latin American natural product databases. Biomolecules. 2022; 12: 1202. PubMed Abstract | Publisher Full Text | Free Full Text

[9] 9. Pilón-Jiménez B, Saldívar-González F, Díaz-Eufracio B, et al.: BIOFACQUIM: A Mexican compound database of natural products. Biomolecules. 2019; 9: 31. PubMed Abstract | Publisher Full Text | Free Full Text

[10] 10. UNIIQUIM: Lista de compuestos.2024. Accessed 15 November 2024. Reference Source

[11] 11. Hernandez-Medel MDR, Garcia-Salmones I, Santillan R, et al.: An anthrone from Picramnia antidesma. Phytochemistry. 1998; 49: 2599–2601. Publisher Full Text

[12] 12. Martínez-Fructuoso L, Pereda-Miranda R, Rosas-Ramírez D, et al.: Structure elucidation, conformation, and configuration of cytotoxic 6-heptyl-5,6-dihydro-2H-pyran-2-ones from hyptis species and their molecular docking to α-Tubulin. J. Nat. Prod. 2019; 82: 520–531. PubMed Abstract | Publisher Full Text

[13] 13. Mendoza Cervantes G: Obtención de macrosporina a partir de Stemphylium lycopersici hongo fitopatógeno de papaya.2006. Accessed 15 November 2024. Reference Source

[14] 14. Gutiérrez-Rebolledo GA, Garduño-Siciliano L, García-Rodríguez RV, et al.: Anti-inflammatory and toxicological evaluation of Moussonia deppeana (Schldl. & Cham) hanst and verbascoside as a main active metabolite. J. Ethnopharmacol. 2016; 187: 269–280. PubMed Abstract | Publisher Full Text

[15] 15. Hernández-Carlos B, Bye R, Pereda-Miranda R: Orizabins V−VIII, tetrasaccharide glycolipids from the Mexican Scammony Root (Ipomoea orizabensis). J. Nat. Prod. 1999; 62: 1096–1100. PubMed Abstract | Publisher Full Text

[16] 16. Espinoza C, Couttolenc A, Fernández JJ, et al.: Brefeldin-A: an antiproliferative metabolite of the fungus Curvularia trifolii collected from the Veracruz coral reef system, Mexico. J. Mex. Chem. Soc. 2016; 60: 79–82. Accessed 15 November 2024. Reference Source

[17] 17. Cruz-Miranda OL, Folch-Mallol J, Martínez-Morales F, et al.: Identification of a huperzine A-producing endophytic fungus from Phlegmariurus taxifolius. Mol. Biol. Rep. 2019; 47: 489–495. PubMed Abstract | Publisher Full Text

[18] 18. García A, Ramírez-Apan T, Cogordan JA, et al.: Absolute configuration assignments by experimental and theoretical approaches of ent-labdane- and cis-ent-clerodane-type diterpenes isolated from Croton glabellus. Can. J. Chem. 2006; 84: 1593–1602. Publisher Full Text

[19] 19. Rivera-Chávez J, Coporo-Blancas D, Morales-Jiménez J: One-step partial synthesis of (±)-asperteretone B and related hPTP1B1–400 inhibitors from butyrolactone I. Bioorg. Med. Chem. 2020; 28: 115817. PubMed Abstract | Publisher Full Text

[20] 20. Paniagua-Vega D, Cerda-García-Rojas CM, Ponce-Noyola T, et al.: A new monoterpenoid oxindole alkaloid from Hamelia Patens micropropagated plantlets. Nat. Prod. Commun. 2012; 7: 1934578X1200701. Publisher Full Text

[21] 21. Jimenez A, Villarreal C, Toscano RA, et al.: Limonoids from Swietenia humilis and Guarea grandiflora (Meliaceae). Phytochemistry. 1998; 49: 1981–1988. Publisher Full Text

[22] 22. Kaur K, Jain M, Kaur T, et al.: Antimalarials from nature. Bioorg. Med. Chem. 2009; 17: 3229–3256. Publisher Full Text

[23] 23. Pereda-Miranda R, Hernández L, Villavicencio MJ, et al.: Structure and stereochemistry of pectinolides A-C, novel antimicrobial and cytotoxic 5,6-dihydro-α-pyrones from Hyptis pectinata. J. Nat. Prod. 1993; 56: 583–593. PubMed Abstract | Publisher Full Text

[24] 24. Liu S, Luo XH, Liu YF, et al.: Emodin exhibits anti-acne potential by inhibiting cell growth, lipogenesis, and inflammation in human SZ95 sebocytes. Sci. Rep. 2023; 13: 21576. PubMed Abstract | Publisher Full Text | Free Full Text

[25] 25. Pastor R, Bouzas C, Tur JA: Beneficial effects of dietary supplementation with olive oil, oleic acid, or hydroxytyrosol in metabolic syndrome: Systematic review and meta-analysis. Free Radic. Biol. Med. 2021; 172: 372–385. PubMed Abstract | Publisher Full Text

[26] 26. Bajorath J, Chávez-Hernández AL, Duran-Frigola M, et al.: Chemoinformatics and artificial intelligence colloquium: progress and challenges in developing bioactive compounds. J Cheminform. 2022; 14: 82. PubMed Abstract | Publisher Full Text | Free Full Text

[27] 27. Sorokina M, Steinbeck C: Review on natural products databases: where to find data in 2020. J. Cheminform. 2020; 12: 20. PubMed Abstract | Publisher Full Text | Free Full Text

[28] 28. Nainala VC, Rajan K, Kanakam SRS, et al.: COCONUT 2.0: A comprehensive overhaul and curation of the collection of open natural products database. ChemRxiv. 2024. Publisher Full Text

[29] 29. Weininger D: SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Model. 1988; 28: 31–36. Publisher Full Text

[30] 30. Narayanaswamy VK, Rissdörfer M, Odhav B: Review on cambridgesoft ChemBioDraw ultra 13.0 v. Int. J. Theor. Appl. Sci. 2013; 5: 45–49.

[31] 31. Molecular Operating Environment (MOE): Chemical computing group ULC, 910-1010 Sherbrooke St. W., Montreal, QC H3A 2R7, 2025.2024.

[32] 32. Kim S, Chen J, Cheng T, et al.: PubChem 2023 update. Nucleic Acids Res. 2023; 51: D1373–D1380. PubMed Abstract | Publisher Full Text | Free Full Text

[33] 33. Zdrazil B, Felix E, Hunter F, et al.: The ChEMBL database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods. Nucleic Acids Res. 2024; 52: D1180–D1192. PubMed Abstract | Publisher Full Text | Free Full Text

[34] 34. Knox C, Wilson M, Klinger CM, et al.: DrugBank 6.0: the drugBank knowledgebase for 2024. Nucleic Acids Res. 2024; 52: D1265–D1275. PubMed Abstract | Publisher Full Text | Free Full Text

[35] 35. Gómez-García A, Acuña Jiménez DA, Zamora WJ, et al.: Navigating the chemical space and chemical multiverse of a unified Latin American natural product database: LANaPDB. Pharmaceuticals. 2023; 16: 1388. PubMed Abstract | Publisher Full Text | Free Full Text

[36] 36. Gómez-García A, Acuña Jiménez DA, Zamora WJ, et al.: Latin American Natural Product Database (LANaPDB): An Update. J. Chem. Inf. Model. 2024; 64: 8495–8509. In press. PubMed Abstract | Publisher Full Text | Free Full Text

[37] 37. Sánchez-Cruz N, Pilón-Jiménez BA, Medina-Franco JL: Functional group and diversity analysis of BIOFACQUIM: A Mexican natural product database. F1000Res. 2020; 8(Chem Inf Sci): 2071. PubMed Abstract | Publisher Full Text | Free Full Text

[38] 38. Sander T, Freyss J, von Korff M , et al.: DataWarrior: An open-source program for chemistry aware data visualization and analysis. J. Chem. Inf. Model. 2015; 55: 460–473. PubMed Abstract | Publisher Full Text

[39] 39. Bemis GW, Murcko MA: The properties of known drugs. 1. Molecular frameworks. J. Med. Chem. 1996; 39: 2887–2893. Publisher Full Text

[40] 40. Rogers D, Hahn M: Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010; 50: 742–754. Publisher Full Text

[41] 41. Medina-Franco JL, Sánchez-Cruz N, López-López E, et al.: Progress on open chemoinformatic tools for expanding and exploring the chemical space. J. Comput. Aided Mol. Des. 2021; 36: 341–354. PubMed Abstract | Publisher Full Text | Free Full Text

[42] 42. Orlov AA, Akhmetshin TN, Horvath D, et al.: From high dimensions to human insight: Exploring dimensionality reduction for chemical space visualization. Mol. Inform. 2024; 44: e202400265. PubMed Abstract | Publisher Full Text | Free Full Text

[43] 43. López-López E, Naveja JJ, Medina-Franco JL: DataWarrior: an evaluation of the open-source drug discovery tool. Expert Opin. Drug Discov. 2019; 14: 335–341. PubMed Abstract | Publisher Full Text

[44] 44. González-Medina M, Prieto-Martínez FD, Owen JR, et al.: Consensus diversity plots: a global diversity analysis of chemical libraries. J. Cheminf. 2016; 8: 63. PubMed Abstract | Publisher Full Text | Free Full Text

[45] 45. Dunn TB, López-López E, Kim TD, et al.: Exploring activity landscapes with extended similarity: is Tanimoto enough? Mol. Inf. 2023; 42: e2300056. PubMed Abstract | Publisher Full Text

[46] 46. Medina-Franco JL, et al.: Zenodo: Nat-UV DB Data Availability. The datasets used in this work.2025. Publisher Full Text

[47] 47. Medina-Franco JL, et al.: Source code. Archived software. 2025. Publisher Full Text Reference Source

Nat-UV DB: A Natural Products Database Underlying of Veracruz-Mexico

Abstract

Background

Methods

Results

Conclusions

Keywords

Introduction

Figure 1. Representative natural products of the state of Veracruz, Mexico.

Methods

Database construction and curation

Figure 2. Workflow used to construct the Nat-UV database.

Reference data sets

Table 1. Reference databases compared with Nat-UV DB.

Druglikness profiling

Scaffold content analysis of Nat-UV DB

Visualization of the chemical space

Chemical diversity analysis

Results and discussion

Nat-UV database

Figure 3. Descriptive analysis of the Nat-UV DB.

Molecular scaffolds

Figure 4. Unique scaffold content in Nat-UV DB.

Molecular properties

Figure 5. Violin plots for the drug-likeness physicochemical properties of Nat-UV DB and reference data sets.

Chemical space and diversity analysis

Figure 6. Visual representation of the chemical space coverage of Nat-UV DB and reference datasets based on ECFP4 and t-SNE as a visualization method.

Conclusions

Ethics and consent

Data availability

Software availability

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated