Keywords
polypharmacology, compound promiscuity, drug efficiacy
polypharmacology, compound promiscuity, drug efficiacy
In version 2, three references (9, 15, and 16) have been updated. In response to reviewer comments of Dr. Hans Matter, we now also report the results of compound promiscuity analysis for five well-known target families including G protein-coupled receptor (GPCR) class A, protein kinases, ion channels, proteases, and nuclear hormone receptors. In addition, we have determined promiscuity levels for compounds in different molecular weight ranges, as also suggested by Dr. Matter. Four new tables (3-6) have been added. Furthermore, in response to reviewer comments of Dr. Jeremy Jenkins, we report median promiscuity rates compared to average rates and briefly discuss a potential relationship between privileged structures and compounds displaying intra-family promiscuity.
To read any peer review reports and author responses for this article, follow the "read" links in the Open Peer Review table.
Polypharmacology is an emerging theme in drug discovery1,2. It is generally accepted that drugs often elicit their therapeutic effects through interactions with different targets and the ensuing modulation of multiple signaling pathways. In some therapeutic areas such as oncology, polypharmacology is heavily exploited, for example, through the use of promiscuous ATP site-directed protein kinase inhibitors3. In other areas, such as the treatment of infectious or chronic inflammatory diseases, achieving a high degree of target selectivity of drug candidates plays a major role.
The study of drug polypharmacology has become an important topic in pharmaceutical research4,5, especially focusing on combined computational and experimental analysis5. On the basis of drug-target networks, it was estimated early on that a drug interacts on average with approximately two targets4. More recent estimates from computational data analysis suggest that drugs might bind on average to two to seven targets, depending on the primary target families, and that more than 50% of current drugs might interact with more than five targets6.
Compound promiscuity as defined herein is the origin of polypharmacology. Promiscuity analysis can be extended from drugs to bioactive compounds through computational mining of currently available activity data. The results of activity data analysis are generally affected by data incompleteness7. This potential influence can only be eliminated by reaching the ultimate (and probably elusive) goal of chemogenomics8, i.e., testing all compounds against all targets. In the presence of data incompleteness, compound promiscuity rates are likely underestimated. However, it is not certain that further increasing amounts of assay data will indeed significantly alter the currently emerging view of compound promiscuity (vide infra).
Recent studies have generated a differentiated picture of compound promiscuity. The interested reader is also referred to comprehensive reviews of compound promiscuity analysis9 and polypharmacology6. In this commentary, we summarize key messages from recent promiscuity analysis in a compact format. It is hoped that this summary might be helpful as a reference for further studies.
Public data sources for compound promiscuity analysis discussed herein have been ChEMBL10, the major repository of compound activity data from medicinal chemistry (currently in May 2013 containing 1,295,510 compounds with a total of 11,420,351 activity annotations), the PubChem BioAssay database11, the major repository of screening data (with more than 3300 confirmatory assays), and DrugBank12, which currently contains 1518 approved and 5080 experimental drugs.
It is important to note that collecting all activity annotations for a compound reported in the literature including, for example, reporter gene or other cell-based assays is at best providing a measure of assay promiscuity, but not of specific interactions with different targets9. Therefore, it is generally required to apply data confidence criteria such as the presence of well-defined activity measurements or evidence for direct ligand-target interactions9 (as provided in ChEMBL as activity data filters).
When monitoring the growth of compound activity data in ChEMBL over a period of more than two years from its original release (January 2010) to release 13 (May 2012), a significant increase in the number of promiscuous compounds was detected13. However, by quantifying compound-based target relationships, it was determined that the increase in compounds with activity against targets from different families was largely due to (assay-dependent) IC50 measurements, rather than (assay-independent) equilibrium constants (Ki values)13. IC50 values are easier to determine than Ki values and provide the readout of most primary biochemical assays (except single-point screening assays), which might at least in part rationalize greater target coverage and the IC50-dependent increase in compound promiscuity across different families. However, it can also not be excluded that apparent promiscuity in different assays is higher on the basis of IC50 measurements, given their assay dependence (and often limited accuracy). Regardless, the type of activity measurements that are taken into account influences the outcome of promiscuity analysis. Thus, clear specification of activity measurements and data selection criteria are required.
The subset of compounds with available Ki measurements from ChEMBL release 13 was further investigated. On the basis of Ki measurements, approximately 62% of all compounds were only annotated with a single target, ~36% with two or more targets from the same family, and only ~2% of all active compounds with multiple targets from different families14. A promiscuous bioactive compound was found to interact on average with two to three targets.
Accordingly, compounds that display intra-family promiscuity might also be considered as candidates for privileged structures/compounds that are preferentially active against targets from a particular family. Therefore, these compounds can be distinguished from those that are promiscuous across different target families.
One might anticipate that the degree of compound promiscuity would be particularly high in screening assays (even if frequent hitters and other non-specific compounds are excluded). Therefore, 1085 confirmatory bioassays from PubChem were systematically analyzed. It was found that ~77% of all confirmed active compounds were tested in more than 50 different assays15. Thus, these active PubChem compounds provided a sound basis for promiscuity assessment. These results were in part surprising. An active PubChem compound displayed a ~50% probability to interact with two or more targets. The probability to interact with more than five targets was only ~8%. On average, a PubChem screening hit was active against 2.5 targets. For comparison, compounds from the IC50- and Ki-based subsets of ChEMBL release 14 (August 2012) interacted on average with 1.4 and 1.7 targets, respectively15. The comparably low ratios observed for both compound subsets indicated that IC50 measurements did not systematically increase promiscuity rates (vide supra). The analysis of active compounds from PubChem confirmatory assays provided an upper level estimate of promiscuity, which was not significantly higher than that for ChEMBL compounds.
Detailed analysis of compound activity data from ChEMBL release 14 (August 2012) has made it possible to derive a promiscuity profile that is most characteristic of bioactive compounds from medicinal chemistry sources. The majority of currently available promiscuous compounds is active in the sub-µM range against two to five targets from the same family and displays potency differences against these targets within one or two orders of magnitude16. An important aspect of this representative profile is that promiscuity does not imply low potency. Furthermore, compounds that are highly potent against a (primary) target and weakly potent against others are not frequently found16.
In Table 1, current average promiscuity rates are summarized for compounds from ChEMBL, PubChem, and DrugBank. For promiscuity assessment of drugs, all targets reported in DrugBank were considered.
The average number of targets is reported for compounds from ChEMBL release 14 (divided into Ki and IC50 value-based subsets), approved or experimental drugs from DrugBank 3.0, and active compounds from PubChem confirmatory bioassays. Corresponding statistics are provided in italics for promiscuous compounds (having two or more target annotations). For compounds from ChEMBL, only high-confidence activity annotations were taken into account (i.e., explicit activity measurements with the highest confidence level of direct ligand-target interactions). For calculations on drugs, all DrugBank target categories were taken into account.
If all compounds with single or multiple target annotations are analyzed, ChEMBL compounds interact on average with one to two targets and PubChem compounds with two to three. However, approved drugs have on average close to six targets. In contrast, the degree of promiscuity of experimental drugs is considerably lower, with less than two targets per drug candidate. If only promiscuous compounds or drugs are taken into account (i.e., if compounds with single target annotations are excluded), promiscuity rates only slightly increase by about one target per compound, the exception being experimental drugs whose average number of targets increases from 1.8 to 4.7. Furthermore, median promiscuity rates were also calculated for promiscuous compounds from different sources, i.e., ChEMBL compounds with activity against at least two targets (Ki and IC50), approved and experimental drugs annotated with more than four or at least two targets, respectively, and PubChem compounds active against at least three targets. Compared to the average promiscuity rates reported in Table 1, the median rates were consistently lower. However, the differences between the average and median rates were small, i.e., less than one for ChEMBL and PubChem compounds. By contrast, differences were larger than one for approved and experimental drugs, i.e., on the basis of median rates, drug target numbers were reduced by 1.9 and 2.7, respectively. Hence, average promiscuity rates for drugs were likely biased by highly promiscuous drugs.
In Table 2, the probability of promiscuity is reported for compounds from different sources (calculated from target distributions of compounds). For a ChEMBL compound with available IC50 and Ki measurements, the current probability of activity against two or more targets is ~25% and ~38%, respectively (if both IC50 and Ki measurements were available for a compound, they were separately considered). However, for activity against more than five targets, the probabilities are reduced to only ~1%. Similar observations are made for confirmed PubChem screening hits (providing an upper-limit promiscuity assessment for bioactive compounds, vide supra). In this case, the probability of activity against two or more, or against more than five targets is ~51% and ~8%, respectively. Furthermore, the probability of promiscuity of approved drugs from DrugBank is ~84% and the probability to interact with more than five targets still ~37%. For experimental drugs, the corresponding probabilities are much lower, with only ~24% and ~3%, respectively.
All available compounds active against targets belonging to the five target families, including G protein-coupled receptor (GPCR) class A, protein kinases, ion channels, proteases, and nuclear hormone receptors, were assembled from ChEMBL release 14 and separated into Ki and IC50 value-based subsets, as described above. Average promiscuity rates were calculated for all compounds active against a given family as well as compounds active against multiple targets within the family, as reported in Table 3. With the exception of the Ki subset of the ion channel family, promiscuity degrees for compounds active against these target families were similar to those reported in Table 1. In Table 4, the probability of promiscuity (i.e., activity against at least two or more than five targets) is reported for compounds active against these families (according to Table 2). Similar observations were made. A significant relative increase (~10%) in probability of promiscuity was only observed for compounds active against two or more targets from the nuclear receptor family on the basis of the IC50 subset. Thus, for prominent target families, no above-average compound promiscuity rates were detected.
From ChEMBL release 14, Ki and IC50 value-based compound subsets active against targets belonging to five prominent families were collected. The average number of targets is reported for compounds from individual target families. In addition, corresponding statistics are provided (in italics) for promiscuous compounds only (i.e., compounds having two or more target annotations within the family).
The degree of promiscuity was also determined for compounds with different sizes, i.e., molecular weight (MW). Seven subsets of compounds with increasing MW were collected from ChEMBL release 14 and organized into Ki and IC50 value-based subsets, as reported in Table 5. Average promiscuity rates of compounds with increasing MW were found to be comparable to the global rates. However, a significant relative increase in promiscuity was observed for the smallest compounds with MW ≤ 200 in Ki subset. Furthermore, the probability of activity against two or more targets also increased by more than 10% for the smallest compounds in both subsets, as reported in Table 6. For larger compounds across all MW ranges, no significant increases in promiscuity were observed compared to the global degree and probability of compound promiscuity reported in Table 1 and Table 2, respectively.
From ChEMBL release 14, compounds were selected and divided into seven subsets with increasing MW. The average number of targets is reported for compounds in all MW ranges. In addition, corresponding statistics are provided (in italics) for promiscuous compounds only (i.e., compounds having two or more target annotations).
Herein, we have provided a detailed and up-to-date view of compound promiscuity, the molecular basis of polypharmacology. For active compounds from medicinal chemistry and biological screening sources, the degree of promiscuity is lower than for drugs. There is a notable increase in promiscuity from bioactive compounds over drug candidates to approved drugs. The exploration of possible reasons for this apparent "promiscuity enrichment" along the drug discovery pathway should provide interesting opportunities for future research. On the basis of currently available high-confidence activity data, promiscuity of bioactive compounds is limited (and very low across different target families). However, if compounds are promiscuous, they typically bind to their targets with relatively high potency. Given the overall low degree of promiscuity of bioactive compounds including screening hits in the presence of nearly exponential data growth in recent years, it remains an open question if future chemogenomics efforts might substantially change the current picture of compound promiscuity (vide supra). The majority of available bioactive compounds have single target annotations and we believe it is unlikely that most of them will display a high degree of currently undiscovered promiscuity. Hence, we would also conclude that the target specificity paradigm that has long dominated small molecule discovery efforts should continue to play a major role, despite emerging "anti-reductionism" and the increasing focus on phenotypic readouts.
JB conceived the study, YH collected and organized the data and information, YH and JB wrote the manuscript.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Competing Interests: No competing interests were disclosed.
Competing Interests: No competing interests were disclosed.
Competing Interests: No competing interests were disclosed.
Competing Interests: No competing interests were disclosed.
Competing Interests: No competing interests were disclosed.
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | |||
---|---|---|---|
1 | 2 | 3 | |
Version 2 (update) 26 Jul 13 |
read | read | |
Version 1 27 Jun 13 |
read | read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
9. Drug Discov Today (2013); 18 (13-14): 644-650.
15. AAPS J. (2013); 15 (3): 808-815
16. Med. Chem. Commun. (2013), in press.
9. Drug Discov Today (2013); 18 (13-14): 644-650.
15. AAPS J. (2013); 15 (3): 808-815
16. Med. Chem. Commun. (2013), in press.
9. Drug Discov Today (2013); 18 (13-14): 644-650.
15. AAPS J. (2013); 15 (3): 808-815
16. Med. Chem. Commun. (2013), in press.