ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Note

Dimension reduction of Malaria Box data allows efficient compound prioritization

[version 1; peer review: 1 approved, 1 approved with reservations]
PUBLISHED 18 Nov 2016
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

Background: New anti-malarial drugs are needed to meet the challenge of artemisinin resistance and to achieve malaria elimination and eradication. The new anti-malarial compounds are expected to have many desirable properties, such as activity against multiple stages of Plasmodium, low host cytotoxicity, and low propensity for resistance development, but whether and how these properties might be linked to each other is not clear. A better understanding of the relationship between activities of compounds against different stages of Plasmodium could help in the development of strategies to prioritize compounds with maximum potential for further development. Methods: We utilized the large amount of data that has recently been generated on 400 anti-malarial Malaria Box compounds and performed statistical analyses, such as rank correlation, hierarchical clustering, and principal-component analyses, to test associations between activities against different stages of Plasmodium, other pathogens, and human cells. Results: We found significant positive correlations between the activities of compounds against different stages of Plasmodium. Our results also show toxicity associated with assays conducted at higher compound concentrations. Principal-component analyses (PCA) of the data allowed differentiation of Plasmodium-specific activity from general toxicity and predicted success in in vitro evolution of resistance. We found that a single principal-component can capture most of the desirable properties of Malaria Box compounds and can be used to rank compounds from most desirable to least desirable activity-profile. Conclusions: Here, we provide a systematic strategy to prioritize Malaria Box compounds for further development. This approach may be applied for prioritization of anti-malarial compounds in general.

Keywords

Malaria Box, Plasmodium falciparum

Introduction

Malaria killed about half a million people in the year 2015, and 70% were children under the age of five1. The emergence and spread of resistance towards frontline anti-malarial drugs in South-East Asia has created an urgent need to discover new drugs. In addition, new drugs are needed to meet the objective of malaria elimination and global eradication, for which the currently available drugs are not adequate2. The desirable characteristics of new clinical candidates, also called Target Compound Profile (TCP), include high potency and fast killing of the asexual erythrocytic stage for quick relief of symptoms, high plasma half-life to reduce treatment duration, activity against the sexual stages to prevent transmission, activity against the liver-stage to avoid relapse and for prophylactic use, activity against multiple species of Plasmodium, and reduced propensity for the development of resistance3. New anti-malarial drugs must also be safe for mass administration, and for children and pregnant women, who are most vulnerable to malaria3.

It is currently not well understood how TCP properties are related to each other. This makes it difficult to assess whether it would be feasible for a single compound to have all TCP properties and what strategies could be adopted to find such candidates. With the discovery of thousands of active compounds from the high-throughput assays against the erythrocytic stage of P. falciparum48, it has become imperative to find a prioritization strategy that can identify the most promising candidates for further development. For a subset of antimalarial compounds identified from high-throughput screens in the so-called “Malaria Box”, many of the TCP properties have been assessed. The Malaria Box is a set of 400 compounds selected based on their potent activity against the erythrocytic stage of P. falciparum, chemical diversity and commercial availability9. These compounds were made available free of cost to researchers, thus catalysing a number of studies, including the screening of these compounds against multiple Plasmodium stages, eukaryotic pathogens and human cells10. Some of these compounds have also been tested for their propensity for resistance generation11. Here, we utilized the large amount of data generated on Malaria Box compounds and found significant associations between different TCP properties. Based on these observations, we propose a prioritization strategy for anti-malarial compounds for further development.

Methods

The screening data on Malaria Box compounds was obtained from Van Voorhis et al,10 who compiled the previously published data on Malaria Box compounds (55 assays), as well as their own data (236 assays). We rank transformed all assay values, such that higher values represent higher inhibition.

In case multiple assays were available for a given stage or concentration, their median values were taken: there were nine assays reporting EC50 values against the asexual stage of P. falciparum, one assay against asexual stage at high compound concentration (10 µM), five gametocytocidal assays conducted at 0.5–1 µM compound concentrations, ten gametocytocidal assays conducted at 2.5–5 µM compound concentrations and six gametocytocidal assays conducted at 10–12.5 µM compound concentrations. There was one assay each at lower and higher compound concentrations against liver (5 µM and 50 µM, respectively) and ookinete stages (1 µM and 10 µM, respectively). Values were also similarly combined for parasites with multiple assays, such as Babesia sp. and Mycobacterium tuberculosis.

The data on the in vitro resistance evolution of Malaria Box compounds was obtained from Corey et al.11.

All statistical analyses were performed in the R software v3.3.1 (https://www.r-project.org/). R commands hclust and prcomp were used for hierarchical clustering and PCA analyses respectively. PCA analyses were performed on the activity data against different Plasmodium stages and human cells. Rank correlation values were used to create the distance matrix for the hierarchical clustering.

Results

Relationship between activity of Malaria Box compounds in different assays

Van Voorhis et al. have recently reported their large-scale screening data on Malaria Box compounds, which was compiled along with the previously published data10. We first reduced the dimensionality of this data by combining variables that describe activity against the same Plasmodium stage and pathogen. Assays conducted at higher concentration of compounds may provide different results from those performed at lower concentration, and thus assays conducted at different concentrations were combined separately.

Figure 1 provides an overview of the relationship between different properties of Malaria Box compounds in the form of a correlation matrix. Multiple observations can be made from this matrix. There is a moderate, but significant correlation, between activity against the asexual stage and the gametocyte stage of P. falciparum (Spearman rank correlation 0.43, between EC50 values and % inhibition of gametocytes at 1 µM compound concentration). The correlation between EC50 values against asexual stage and gametocyte stage gets lower when gametocytocidal activity was screened at higher compound concentrations (Spearman rank correlation 0.17, at 10µM). The gametocytocidal activity at higher concentrations shows a higher correlation with inhibitory activity against different pathogens, including M. tuberculosis and human fibroblast cells (Figure 1). Assays conducted at higher concentrations against asexual, liver and ookinete stages also show a higher correlation with toxicity against human cells and other cell types (Figure 1). These observations suggest that assays conducted at high compound concentration show general toxicity against a wide variety of cells, including human cells, thus hits identified from these assays should be used with caution.

439fbd84-9c1a-45bd-a547-e24620f0b699_figure1.gif

Figure 1. Correlations between inhibition values across different assays.

Spearman rank correlations are shown between assays whose values were rank transformed, such that higher values indicate higher inhibition. Gray boxes indicate p values > 0.05. The assays performed at higher concentrations in Plasmodium show higher positive correlations across different assays, including activity against human cells, suggesting that assays conducted at high compound concentration show general toxicity.

To further understand the relationship between activities against different Plasmodium stages, we performed hierarchical clustering of the data. Three major clusters were evident (Figure 2). Cluster 1 consists of asexual and gametocyte assays conducted at high compound concentrations. Cluster 2 consists of asexual and gametocyte assays conducted at low compound concentrations. Cluster 3 consists of assays conducted against liver and ookinete stages. Separate clustering of assays against asexual and gametocyte stage at different compound concentrations again suggests general toxicity of assays conducted at higher compound concentrations. There are two possibilities why liver and ookinete stages cluster together. These two stages may be physiologically more similar to each other, or it may reflect the fact that these assays were conducted against P. berghei, compared to other assays that were conducted against P. falciparum.

439fbd84-9c1a-45bd-a547-e24620f0b699_figure2.gif

Figure 2. Hierarchical clustering of the assay data against different Plasmodium stages.

The three major clusters are evident that correspond to activity against P. falciparum at high concentrations (Cluster 1, leftmost), activity against P. falciparum at low concentrations (Cluster 2, middle), and possibly activity against P. berghei (Cluster 3, rightmost). Rank correlation values were used to create the distance matrix for the clustering. The color key shows the inhibitory activity of the compounds with a higher number representing a higher activity.

PCA analyses allows differentiation of general and specific toxicity

Given the possible confounding roles of compound concentration and Plasmodium species used in the screening, the prioritization of compounds that have pan-stage activity, but low host cytotoxicity, becomes difficult. We thus tested whether the Principal-Component Analyses (PCA) may be utilized to differentiate Plasmodium-specific activity from general toxicity. PCA analyses of the data from different Plasmodium stages and human cells lead to the identification of principal-components, which showed different properties with respect to general and specific activity. PC1 showed high correlation with assays conducted at higher compound concentrations and against a variety of cell types, including human cells (Figure 3), suggesting that PC1 is related to general toxicity. PC3, on the other hand, showed higher correlations with assays conducted at low compound concentrations, but negative or lower correlation with assays conducted at high compound concentrations in Plasmodium, against different pathogens and human cells (Figure 3), suggesting that PC3 is related to specific activity against Plasmodium across different stages. PC2 showed high positive correlation with the liver and ookinete stage assays, but showed negative correlations with asexual and gametocyte stage assays, suggesting that this component reflects activity against these two stages or against P. berghei in which these assays were performed.

439fbd84-9c1a-45bd-a547-e24620f0b699_figure3.gif

Figure 3. Principal components (PC) differentiate general toxicity from Plasmodium specific activity.

The three principal components PC1, PC2 and PC3 explained 30%, 16% and 13% of the variation in the data, respectively. PC1 showed positive correlation across different assays, suggesting that it reflects general toxicity. PC3 showed higher positive correlation only with assays conducted at lower compound concentration, but lower or negative correlation with assays conducted at higher compound concentration, other parasites and human cells, suggesting that it reflects pan-stage specific activity against Plasmodium.

General toxicity predicts in vitro resistance evolution

In vitro resistance evolution has recently been attempted against 30 Malaria Box compounds with three independent lines for each compound11. We next tested whether the Plasmodium-specific activity or general toxicity estimated from the principal components might predict in vitro resistance evolution. Compounds for which resistance could not be developed showed significantly higher PC1 values (Figure 4A). These compounds also showed higher human toxicity (Figure 4B) and enrichment of probe-like compounds, which have chemical properties associated with higher non-specific activity9 (Figure 4C). These results suggest that general toxicity of compounds may lead to lower success in in vitro resistance evolution. On the other hand, high PC3 values were associated with higher likelihood of in vitro resistance generation (Wilcox test p = 0.02, not shown).

439fbd84-9c1a-45bd-a547-e24620f0b699_figure4.gif

Figure 4. Higher general toxicity of compounds against which in vitro resistance development was not successful.

The in vitro resistance evolution was attempted for 30 Malaria Box compounds and was successful for 13 compounds11. Compounds for which resistance evolution was not successful showed (A) higher PC1 values (Wilcox p 0.004), (B) lower EC50 against human fibroblasts cells (Wilcox p 0.130), and (C) a higher proportion of probe-like compounds, as classified by Medicines for Malaria Venture9 (Fisher p 0.100).

Prioritization of Malaria Box compounds

In general, our results suggest that compounds that show high PC3 values should be prioritized for further development, including target identification by in vitro resistance evolution. Table 1 lists the top 20 Malaria Box compounds with the highest PC3 values. These compounds show high activity against multiple stages at a low concentration, but low activity against human cells. In total, 11 of these compounds show favourable oral bioavailability values. Some of these are active against other pathogens (Table 1). The values of three principal components for all Malaria Box compounds are available in Dataset 114.

Table 1. Top Malaria Box compound with the highest PC3 values.

These compounds show high activity against multiple stages at a low concentration, but low activity against human cells. The mouse oral bioavailability was obtained by measuring the plasma concentration of the compounds with a single high oral dose (140 μM/kg)10. Compounds with favourable plasma concentration (plasma Cmax > 1μg/ml) are indicated. Activity of compounds against other parasites is also indicated. The oral bioavailability data and compound activity data against other parasites was obtained from Van Voorhis et al.10. The PC3 values for all Malaria Box compounds are available in Dataset 114.

MMV IDMouse oral single
dose Cmax > 1μM
Trypanosoma
brucei brucei
Leishmania
donovani
Trypanosoma
cruzi
Brugia malayiSchistosoma
mansoni
MMV007116Y
MMV666021
MMV006389YYY
MMV007127YYY
MMV020439Y
MMV665827
MMV665977Y
MMV000563YYYY
MMV666596
MMV666693Y
MMV396678
MMV007430Y
MMV665878YY
MMV006820Y
MMV007228
MMV665782
MMV396595Y
MMV019266Y
MMV020942Y
MMV007199Y
Dataset 1.PC1, PC2 and PC3 values for 400 Malaria Box compounds.

Discussion

The wide availability of Malaria Box has catalysed a number of studies on these compounds10. Prioritization of compounds based on a large number of variables is not straightforward. Here, we analysed this data and found that a single variable (PC3) can capture most of the desirable compound properties: activity against multiple Plasmodium stages and low host cytotoxicity, thus greatly simplifying the task of compound prioritization. Our analyses suggest that screening at high compound concentrations can lead to general toxicity and thus should be avoided. Thus the idea that hits identified from multiple assays should be more confident10 needs to be considered carefully when hits are identified from high concentration assays. The consensus approach might lead to the selection of compounds with general toxicity.

We found significant correlation between activity against the asexual stage and the gametocyte stage of P. falciparum (Spearman rank correlation 0.43), which suggests that it might be easier to find compounds that have activity against both these stages, even though the two stages have different growth properties. The correlations between asexual stage with the liver and ookinete stages were low (Figure 1 and Figure 2). This could reflect different physiological states of liver and ookinete stages from asexual and gametocyte stages, but it might also reflect that liver and ookinete stage assays were performed in P. berghei, rather than P. falciparum. Thus, the development of higher throughput liver and ookinete stage assays in P. falciparum could be valuable. It is important to note that the correlation values that we report should be considered an underestimate, as inhibition values for assays against the same stage show large variability, e.g. median rank correlation among nine EC50 values against asexual stage of P. falciparum was 0.51. The possible reasons for this variability have previously been discussed10.

The difficulty in the evolution of in vitro resistance is considered a very desirable property of a compound12 given that a number of anti-malarial drugs are becoming less effective because of resistance generation13. Our observations suggest caution in interpreting the results of in vitro resistance evolution experiments. The failure to obtain resistance in vitro could be because of general toxicity of the compound on the erythrocyte hosts. Thus we suggest that the host toxicity of compounds should be thoroughly evaluated before conducting the labour-intensive in vitro resistance evolution experiments.

While we have prioritized compounds according to their pan-stage activity and low human toxicity, we would like to stress that compounds that show activity across pathogens and human cells may also be potential leads, if their toxicity could be managed. One possibility to reduce the toxicity of a compound is to identify its target in the parasite and its human ortholog, and utilize the three-dimensional structures of the compound with the target to modify the compound to increase selectivity. However, target identification of these compounds might be more difficult using in vitro resistance development.

Data availability

This publication uses data on Malaria Box compounds, as reported and compiled by Van Voorhis et al.10 (DOI: 10.1371/journal.ppat.1005763.s002). The data on the in vitro resistance evolution is reported by Corey et al.11.

Dataset 1: PC1, PC2 and PC3 values for 400 Malaria Box compounds, DOI: 10.5256/f1000research.10121.d14257014

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 18 Nov 2016
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Singh GP. Dimension reduction of Malaria Box data allows efficient compound prioritization [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2016, 5:2701 (https://doi.org/10.12688/f1000research.10121.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 18 Nov 2016
Views
9
Cite
Reviewer Report 16 Jan 2017
Francisco-Javier Gamo, Tres Cantos Medicines Development Campus Unit, Malaria Unit, GlaxoSmithKline, Tres Cantos, Spain 
Approved with Reservations
VIEWS 9
The article shows an analysis of the available biological data on the 400 compounds of the Malaria Box set trying to understand if there is any correlation between favorable parasitological properties and undesirable unspecific or toxicity aspects. Most of the ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Gamo FJ. Reviewer Report For: Dimension reduction of Malaria Box data allows efficient compound prioritization [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2016, 5:2701 (https://doi.org/10.5256/f1000research.10902.r18780)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
20
Cite
Reviewer Report 29 Nov 2016
Timothy N. C. Wells, Medicines for Malaria Venture (MMV), Geneva, Switzerland 
Approved
VIEWS 20
This is an analysis of the Malaria Box set, and an attempt to try to understand whether there are any correlations between activity and safety signals or resistance development using Principal Component Analyses. The author defines a single principal component ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Wells TNC. Reviewer Report For: Dimension reduction of Malaria Box data allows efficient compound prioritization [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2016, 5:2701 (https://doi.org/10.5256/f1000research.10902.r18052)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 18 Nov 2016
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.