<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.12370.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Small Molecule Chemistry</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Towards a systematic assessment of assay interference: Identification of extensively tested compounds with high assay promiscuity</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 3 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Gilberg</surname>
                        <given-names>Erik</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Stumpfe</surname>
                        <given-names>Dagmar</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Bajorath</surname>
                        <given-names>J&#x00fc;rgen</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-0557-5714</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universit&#x00e4;t, Bonn, D-53113, Germany</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:bajorath@bit.uni-bonn.de">bajorath@bit.uni-bonn.de</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were declared.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>17</day>
                <month>8</month>
                <year>2017</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2017</year>
            </pub-date>
            <volume>6</volume>
            <elocation-id>Chem Inf Sci-1505</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>16</day>
                    <month>8</month>
                    <year>2017</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Gilberg E et al.</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/6-1505/pdf"/>
            <abstract>
                <p>A large-scale statistical analysis of hit rates of extensively assayed compounds is presented to provide a basis for a further assessment of assay interference potential and multi-target activities. A special feature of this investigation has been the inclusion of compound series information in activity analysis and the characterization of analog series using different parameters derived from assay statistics. No prior knowledge of compounds or targets was taken into consideration in the data-driven study of analog series. It was anticipated that taking large volumes of activity data, assay frequency, and assay overlap information into account would lead to statistically sound and chemically meaningful results. More than 6000 unique series of analogs with high hit rates were identified, more than 5000 of which did not contain known interference candidates, hence providing ample opportunities for follow-up analyses from a medicinal chemistry perspective.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Biological screening data</kwd>
                <kwd>statistical analysis</kwd>
                <kwd>active compounds</kwd>
                <kwd>assay frequency</kwd>
                <kwd>hit rate distribution</kwd>
                <kwd>assay interference</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/501100001659">
                    <funding-source>Deutsche Forschungsgemeinschaft</funding-source>
                    <award-id>SFB704</award-id>
                </award-group>
                <funding-statement>DS is supported by Sonderforschungsbereich 704 of the Deutsche Forschungsgemeinschaft. </funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Compounds with false-positive signals in biological assays cause substantial problems for biological screening and medicinal chemistry
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. Assay artifacts often remain undetected or are unveiled only at later stages of compound development efforts, leading to substantial loss of time and resources. Moreover, once published, artificial activities spread through the scientific literature and potentially cause even more harm by inspiring follow-up investigations that are doomed to fail. Known assay interference compounds include colloidal aggregators
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup> and many other compound classes that can react in different ways or are fluorescent under assay conditions
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-15">15</xref>
                </sup>. Systematic efforts to identify interference compounds include the compilation of aggregators
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup> and pan-assay interference compounds (PAINS)
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>,
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup>. The latter comprise a set of 480 classes of compounds originally identified in AlphaScreen assays
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup>. PAINS are typically contained as substructures in larger compounds. However, the assessment and prediction of assay interference is far from being a trivial exercise. For example, analysis of screening data from PubChem
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup> has revealed that many compounds containing PAINS, including most reactive chemical entities, have very different hit rates or might be consistently inactive
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>,
                    <xref ref-type="bibr" rid="ref-18">18</xref>
                </sup>. Moreover, analogs or different series of analogs containing the same PAINS substructure often have distinct activity profiles and are active against different targets
                <sup>
                    <xref ref-type="bibr" rid="ref-19">19</xref>
                </sup>. Thus, interference characteristics of related compounds frequently differ and a substructure with interference potential does not necessarily give rise to false-positive assay signals. To further complicate matters, promiscuous compounds may also have true multi-target activities
                <sup>
                    <xref ref-type="bibr" rid="ref-20">20</xref>
                </sup> that are relevant for polypharmacology
                <sup>
                    <xref ref-type="bibr" rid="ref-20">20</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-22">22</xref>
                </sup>. Moreover, even highly promiscuous screening hits include molecules with no apparent liabilities, in addition to obvious interference compounds
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup>.</p>
            <p>Without doubt, judging assay interference and candidate compounds requires profound chemical knowledge and experience. It is equally relevant, however, to strive for a data-driven assessment of promiscuity by exploring compound activity data on a large scale
                <sup>
                    <xref ref-type="bibr" rid="ref-20">20</xref>
                </sup>, aiming to identify compounds with interference potential for further analysis. Therefore, we have carried out a statistical analysis of hit rates of compounds that were extensively tested in screening assays. A special feature of this study has been its focus on pairs or larger series of analogs, rather than single compounds, which provides additional confidence criteria for activity assessment and further increases the information content of activity data analysis. Many series of analogs with much higher than typically observed hit rates and largely consistent activity profiles across many different assays were identified. This collection of series provides a basis for further investigating compounds with interference potential or true multi-target activities.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <p>All calculations were carried out using in-house scripts and implementations.</p>
            <sec>
                <title>Compounds</title>
                <p>From the PubChem BioAssay database
                    <sup>
                        <xref ref-type="bibr" rid="ref-16">16</xref>
                    </sup>, 437,257 compounds were pre-selected that were tested in both primary and confirmatory assays, representing extensively assayed screening compounds
                    <sup>
                        <xref ref-type="bibr" rid="ref-23">23</xref>
                    </sup>. Approximately 95% of these compounds were evaluated in more than 50 primary and/or confirmatory assays
                    <sup>
                        <xref ref-type="bibr" rid="ref-23">23</xref>
                    </sup>. Primary PubChem assays report compound activity (e.g., percentage activity) for a single dose, while confirmatory assays are dose-response assays yielding titration curves and IC
                    <sub>50</sub> values. Our current analysis focused on primary assays, for which much larger data volumes were available than for confirmatory assay. Primary assays also included assays for which no target was specified (such as cell-based assays). For pre-selected compounds, hit rate statistics were determined.</p>
            </sec>
            <sec>
                <title>Matched molecular pairs and series</title>
                <p>A matched molecular pair (MMP) is a pair of compounds that are only distinguished by a chemical change at a single site
                    <sup>
                        <xref ref-type="bibr" rid="ref-24">24</xref>
                    </sup>, termed a chemical transformation
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>
                    </sup>. As an extension of the MMP concept, a matched molecular series (MMS) was defined as the union of all MMP compounds that are only distinguished by chemical modifications at a given site
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup>. Accordingly, an MMS represents a series of analogs sharing a single substitution site. To generate MMPs, exocyclic single bonds in screening compounds were systematically fragmented
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>
                    </sup> following retrosynthetic fragmentation rules
                    <sup>
                        <xref ref-type="bibr" rid="ref-27">27</xref>
                    </sup>, yielding so-called RECAP-MMPs
                    <sup>
                        <xref ref-type="bibr" rid="ref-28">28</xref>
                    </sup>. These MMPs were subject to transformation size restrictions in order to limit chemical changes to modifications typically observed in series of analogs
                    <sup>
                        <xref ref-type="bibr" rid="ref-29">29</xref>
                    </sup>. An MMS was designated as redundant if it was a subset of a larger MMS or if there was another MMS representing the same series of analogs but having a larger MMP core. For screening compounds with high hit rates, non-redundant MMS were systematically determined.</p>
            </sec>
            <sec>
                <title>MMS parameters</title>
                <p>For each MMS, three parameters were calculated. First, the 
                    <italic toggle="yes">MMS hit rate (HR)</italic> was obtained from the union of all assays (i.e., the number of unique assays in which one or more analogs were tested in) and assays with activity signals (active assays, i.e., the number of unique assays in which one or more analogs were found to be active). Second, 
                    <italic toggle="yes">assay overlap</italic> was determined as the proportion of assays in which all MMS compounds were tested in (shared assays, i.e., the intersection of assays) relative to the union of assays. Third, from assay overlap, 
                    <italic toggle="yes">assays with inconsistent activity</italic> were calculated as the proportion of shared assays in which different MMS compounds were active or inactive.</p>
            </sec>
        </sec>
        <sec sec-type="results | discussion">
            <title>Results and discussion</title>
            <sec>
                <title>Study design</title>
                <p>A statistical analysis of hit rates of extensively tested screening compounds is presented taking assay frequency into account. On the basis of the hit rate distribution, ranges of unusually high hit rates were determined. From compounds with high hit rates, analog series with single substitution sites (MMSs), i.e., &#x201c;minimal&#x201d; chemical modifications within series, were systematically extracted, which provided structural context information and hit rate controls for closely related compounds. For MMSs, different parameters were calculated, making it possible to compare and prioritize these series. The collection of MMSs with high hit rates provides a basis for investigating assay interference candidates, as well as chemical entities with potential multi-target activities.</p>
            </sec>
            <sec>
                <title>Source compounds and assay distribution</title>
                <p>
                    <xref ref-type="fig" rid="f1">Figure 1</xref> (boxplot on the left) shows the global distribution of primary assays for 437,257 extensively tested PubChem compounds, with a median value of 347 assays per compound. From these, a subset of 327,532 compounds was selected that were tested in more than 257 primary assays, corresponding to the lower quartile boundary of the global distribution. For this subset, the assay distribution was separately monitored (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>, boxplot on the right), yielding a median of 426 (and a maximum of 626) assays per compound. Hence, half of these compounds were tested in more than 426 primary assays.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Assay frequency distribution.</title>
                        <p>The frequency distribution of primary assays is shown in a boxplot format for 437,257 pre-selected PubChem compounds and a subset of 327,532 compounds. The plot gives the smallest number of primary assays (lower whisker), first quartile (lower boundary of the box), median value (thick line), third quartile (upper boundary of the box), and largest number of assays (upper whisker). Outliers are not displayed. The dashed blue line indicates the selection criterion for the compound subset (i.e., tested in more than 257 primary assays).</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/13396/5f33bf3b-9ae5-45f2-9d84-1cc4f8893ecc_figure1.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Hit rate distribution</title>
                <p>For 327,532 compounds tested in more than 257 assays, hit rates were determined. The distribution is reported in 
                    <xref ref-type="fig" rid="f2">Figure 2</xref> (boxplot on the left), resulting in a median hit rate of 0.4%. The lower quartile boundary and lower whisker of the boxplot were identical and represented consistently inactive compounds, which were not of interest for our current analysis. On the basis of the distribution, the interval of &#x201c;bulk hit rates&#x201d; (b_hr) for these extensively assayed PubChem compounds was defined as 0% &lt; b_hr &#x2264; 1.0%, covering the lower quartile, median, and upper quartile (and hence the &#x201c;bulk&#x201d; of the distribution). There were 80,495 compounds with hit rates &#x2265; 1.0%. The hit rate distribution of this compound subset is shown in 
                    <xref ref-type="fig" rid="f2">Figure 2</xref> (middle), yielding a median of 1.8%. This value was set as the hit rate threshold for most active screening compounds. The threshold was exceeded by 41,609 compounds, representing 12.7% of the initial compound pool. The hit rate distribution of these compounds is reported in 
                    <xref ref-type="fig" rid="f2">Figure 2</xref> (right), resulting in a median of 2.9%. We determined that 93.1% of the compounds with hit rates greater than 1.8% in primary assays were also active in confirmatory assays (yielding IC
                    <sub>50</sub> values). Hence, their activity was not confined to primary assays.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Hit rate distribution.</title>
                        <p>For three different subsets of PubChem compounds, hit rate distributions are shown in boxplots according to 
                            <xref ref-type="fig" rid="f1">Figure 1</xref>. The subsets are characterized by increasing hit rates (marked by dashed blue lines).</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/13396/5f33bf3b-9ae5-45f2-9d84-1cc4f8893ecc_figure2.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Compound series and parameters</title>
                <p>From the 41,609 compounds with highest hit rates, MMSs were systematically extracted on the basis of RECAP-MMPs. After removal of redundant MMSs (see Methods), 6941 unique MMSs were obtained comprising 14,646 compounds, which represented our final hit rate- and series-based selection set. 
                    <xref ref-type="table" rid="T1">Table 1</xref> reports the size distribution of the MMSs, ranging from two to 17 analogs per series. With 6111 instances, compound pairs and triplets dominated the distribution, but more than 800 larger MMSs were also obtained. As further discussed below, compound pairs and triplets already provide informative controls for activity analysis and enable a more confident assessment compared to the analysis of individual compounds. This was a major motivation for focusing the analysis on MMSs.</p>
                <table-wrap id="T1" orientation="portrait" position="anchor">
                    <label>Table 1. </label>
                    <caption>
                        <title>Size distribution of matched molecular series (MMSs).</title>
                        <p>The distribution of 6941 frequently active MMSs (#MMSs) over increasing numbers of compounds (#CPDs) is reported.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="1" rowspan="1">#CPDs</th>
                                <th align="center" colspan="1" rowspan="1">#MMSs</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">4965</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">3</td>
                                <td align="center" colspan="1" rowspan="1"> 1156</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">4</td>
                                <td align="center" colspan="1" rowspan="1">435</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">5</td>
                                <td align="center" colspan="1" rowspan="1">190</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">6</td>
                                <td align="center" colspan="1" rowspan="1">70</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">7</td>
                                <td align="center" colspan="1" rowspan="1">48</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">8</td>
                                <td align="center" colspan="1" rowspan="1">22</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">9</td>
                                <td align="center" colspan="1" rowspan="1">21</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">10</td>
                                <td align="center" colspan="1" rowspan="1">11</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">11</td>
                                <td align="center" colspan="1" rowspan="1">12</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">12</td>
                                <td align="center" colspan="1" rowspan="1">3</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">13</td>
                                <td align="center" colspan="1" rowspan="1">4</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">14</td>
                                <td align="center" colspan="1" rowspan="1">2</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">15</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">17</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>
                    <xref ref-type="fig" rid="f3">Figure 3a</xref> illustrates the derivation of three parameters for the characterization and comparison of MMSs (rationalized in the Methods section). The cumulative 
                    <italic toggle="yes">MMS hit rate</italic> is a direct measure for the activity of a series. In addition, 
                    <italic toggle="yes">assay overlap</italic> represents a confidence criterion for MMS assessment, i.e., large assay overlap of compounds comprising a series assigns high confidence to hit rate comparisons. By contrast, the proportion of 
                    <italic toggle="yes">assays with inconsistent activity</italic> should best be minimal to draw firm conclusions. 
                    <xref ref-type="fig" rid="f3">Figure 3b</xref> reports the distribution of these three parameters for the 6941 MMSs. Assay overlap (upper left plot) and MMS hit rates (lower left) were generally high, with median values of 79.3% and 5.8%, respectively. By contrast, the proportion of inconsistent assays (upper right) was overall low, with a median of only 3.7%. Thus, the distributions of MMS parameters indicated that the set of MMSs was suitable for the analysis of series-based hit rates and hit rate comparison of compounds comprising individual MMSs. We note that MMSs can be ranked in the order of decreasing assay overlap and MMS hit rates and increasing inconsistent assays and prioritized, for example, on the basis of rank fusion calculations.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>Characterization of matched molecular series (MMSs).</title>
                        <p>(
                            <bold>a</bold>) An exemplary MMS comprising three analogs is shown. The MMS core and varying substituents are colored in black and orange, respectively. For each compound, the number of assays it was tested and active in is reported, respectively. Furthermore, the assay union, intersection, and MMS hit rate (purple) are given. From these data, the assay overlap (green) of MMS analogs was determined as well as the proportion of assays with consistent activity, inactivity, and inconsistent activity (blue). (
                            <bold>b</bold>) Boxplots are shown reporting the distribution of assay overlap, assays with inconsistent activity as well as assay- or target-based MMS hit rates for PubChem compounds with greater than 1.8% hit rate.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/13396/5f33bf3b-9ae5-45f2-9d84-1cc4f8893ecc_figure3.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Target distribution in primary assays</title>
                <p>Our analysis was intentionally focused on hit rates over assays (i.e., assay promiscuity) to take as many activity readouts as possible into account. Therefore, as a control, assay- and target-based hit rates were also compared. Compounds forming the 6941 MMS were evaluated in a total of 1213 assays. For 255 of these assays, no individual target was specified. The remaining 958 assays covered 426 different targets. 
                    <xref ref-type="fig" rid="f3">Figure 3b</xref> reports the distributions of MMS hit rates over assays (lower left plot) and targets (lower right). The distributions were overall similar, with median values of assay- and target-based hit rates of 5.8% and 4.9%, respectively. Hence, despite the presence of multiple assays for a subset of targets, assay-based hit rates were only slightly higher than target-based rates, indicating that corresponding conclusions would be drawn from the analysis of these distributions.</p>
            </sec>
            <sec>
                <title>Known interference candidates</title>
                <p>The computational aggregation advisor
                    <sup>
                        <xref ref-type="bibr" rid="ref-4">4</xref>
                    </sup> and compound strings taken from PAINS filters
                    <sup>
                        <xref ref-type="bibr" rid="ref-30">30</xref>,
                        <xref ref-type="bibr" rid="ref-31">31</xref>
                    </sup> (
                    <ext-link ext-link-type="uri" xlink:href="http://www.rdkit.org">http://www.rdkit.org</ext-link>) were used to search the MMSs for known assay interference candidates. The 14,646 MMS compounds contained 783 aggregators (on the basis of 100% similarity) and 2381 compounds with PAINS substructures. There were 611 MMSs with one or more aggregators, 1139 MMSs with one or more PAINS, and 126 MMSs including aggregators and PAINS. However, 5065 MMSs with high hit rates did not contain known compounds with aggregation potential or PAINS substructures. Thus, the MMSs provide a large source of analogs for the exploration of other interference candidates, as well as compounds with true multi-assay/target activities.</p>
            </sec>
            <sec>
                <title>Exemplary series</title>
                <p>
                    <xref ref-type="fig" rid="f4">Figure 4</xref> shows exemplary compound pairs and triplets with high assay promiscuity. The two analogs in 
                    <xref ref-type="fig" rid="f4">Figure 4a</xref> were tested in more than 380 assays with 93.5% assay overlap and only 1.6% inconsistent assays, yielding comparable hit rates of 2.8% and 3.2%, respectively, resulting in an MMS hit rate of 4.5%. These analogs contained a classical PAINS substructure (ene_rhodanine)
                    <sup>
                        <xref ref-type="bibr" rid="ref-8">8</xref>,
                        <xref ref-type="bibr" rid="ref-9">9</xref>
                    </sup>. Furthermore, compounds in 
                    <xref ref-type="fig" rid="f4">Figure 4b</xref> were analogs of a molecule with aggregation potential
                    <sup>
                        <xref ref-type="bibr" rid="ref-4">4</xref>
                    </sup>. They were tested in more than 300 and 400 assays, respectively, yielding a relatively low assay overlap of 59%, and had hit rates of 2.2% and 2.6%, respectively, resulting in a low MMS hit rate of 2.9%. Thus, these analogs were far from being consistently active, as one might assume for strong aggregators. In 
                    <xref ref-type="fig" rid="f4">Figure 4c</xref>, a pair of thieno[2,3-d]pyrimidine-2-acetic acid ethyl ester analogs is shown that were tested in 442 assays with large overlap. These compounds had high hit rates of 5.9% and 7.9%, respectively, resulting in a high MMS hit rate of 9.8%. Moreover, 
                    <xref ref-type="fig" rid="f4">Figure 4d</xref> shows a triplet of sulfonylpyrimidines that were tested in 357&#x2013;361 assays with 89.7% overlap, having very high hit rates of 8.7% (one analog) and more than 13% (two analogs). Compounds forming each of the MMSs in 
                    <xref ref-type="fig" rid="f4">Figure 4</xref> displayed consistent hit rate characteristics, hence assigning confidence to their observed activity phenotype. The analogs in 
                    <xref ref-type="fig" rid="f4">Figure 4c</xref> and 
                    <xref ref-type="fig" rid="f4">Figure 4d</xref> have previously not been classified as interference candidates. However, they might well be reactive under assay conditions. Taken together, these examples of analog pairs and triplets (i.e., minimally sized MMSs) are indicative of the potential of well characterized MMSs for follow-up investigations focusing on assay interference and multi-target activities.</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Exemplary matched molecular series (MMSs).</title>
                        <p>(
                            <bold>a</bold>&#x2013;
                            <bold>d</bold>) Four exemplary MMSs (core, black; substituents, orange) are shown and the MMS hit rate, assay overlap, and proportion of assays with inconsistent activity are reported. In addition, for each individual analog, its assay frequency and hit rate are provided.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/13396/5f33bf3b-9ae5-45f2-9d84-1cc4f8893ecc_figure4.gif"/>
                </fig>
            </sec>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusions</title>
            <p>Herein, a detailed analysis of hit rates of nearly 440,000 extensively assayed screening compounds has been presented. On the basis of hit rate distributions, 12.7% of the compounds with highest hit rates were selected. From these compounds, analog series with single substitution sites were systematically extracted to complement hit rate statistics with the assessment of structural relationships between active compounds. A total of 6941 unique MMSs were obtained comprising 14,646 compounds. These MMSs were characterized using different parameters prioritizing high-confidence series for activity analysis. A major goal of our study has been the data-driven generation of a pool of analog series for the evaluation of assay interference potential and multi-target activities. More than 5000 MMSs did not contain known interference candidates, providing an opportunity to evaluate compounds with interference potential on a large scale. In the next step, analog series will be evaluated from a medicinal chemistry perspective to complement and further extend statistical considerations. Annotated series and associated assay/target information will then be made freely available. The statistics and selection steps reported herein also make it possible to regenerate compound subsets at different hit rate levels and subject them to further analysis. In addition, large numbers of compounds with high hit rates that were not part of MMSs are also available. For reasons discussed, our preferred approach is taking compound series information into account when judging assay promiscuity.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>The data sets used in this study are freely available in PubChem and can be generated following the selection protocol reported in the Methods.</p>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Aldrich</surname>
                            <given-names>C</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bertozzi</surname>
                            <given-names>C</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Georg</surname>
                            <given-names>GI</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>The Ecstasy and Agony of Assay Interference Compounds.</article-title>
                    <source>
					
                        <italic toggle="yes">ACS Cent Sci.</italic>
				</source>
                    <year>2017</year>;<volume>3</volume>(<issue>3</issue>):<fpage>143</fpage>&#x2013;<lpage>147</lpage>.
                    <pub-id pub-id-type="pmid">28386587</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acscentsci.7b00069</pub-id>
                    <pub-id pub-id-type="pmcid">5364449</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>McGovern</surname>
                            <given-names>SL</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Caselli</surname>
                            <given-names>E</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Grigorieff</surname>
                            <given-names>N</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>A Common mechanism underlying promiscuous inhibitors from virtual and high-throughput screening.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>1996</year>;<volume>45</volume>(<issue>8</issue>):<fpage>1712</fpage>&#x2013;<lpage>1722</lpage>.
                    <pub-id pub-id-type="pmid">11931626</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm010533y</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Shoichet</surname>
                            <given-names>BK</given-names>
                        </name>
				</person-group>:
                    <article-title>Screening in a spirit haunted world.</article-title>
                    <source>
					
                        <italic toggle="yes">Drug Discov Today.</italic>
				</source>
                    <year>2006</year>;<volume>11</volume>(<issue>13&#x2013;14</issue>):<fpage>607</fpage>&#x2013;<lpage>615</lpage>.
                    <pub-id pub-id-type="pmid">16793529</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.drudis.2006.05.014</pub-id>
                    <pub-id pub-id-type="pmcid">1524586</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Irwin</surname>
                            <given-names>JJ</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Duan</surname>
                            <given-names>D</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Torosyan</surname>
                            <given-names>H</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>An Aggregation Advisor for Ligand Discovery.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2015</year>;<volume>58</volume>(<issue>17</issue>):<fpage>7076</fpage>&#x2013;<lpage>7087</lpage>.
                    <pub-id pub-id-type="pmid">26295373</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jmedchem.5b01105</pub-id>
                    <pub-id pub-id-type="pmcid">4646424</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Feng</surname>
                            <given-names>BY</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Simeonov</surname>
                            <given-names>A</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Jadhav</surname>
                            <given-names>A</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>A high-throughput screen for aggregation-based inhibition in a large compound library.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2007</year>;<volume>50</volume>(<issue>10</issue>):<fpage>2385</fpage>&#x2013;<lpage>2390</lpage>.
                    <pub-id pub-id-type="pmid">17447748</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm061317y</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Jadhav</surname>
                            <given-names>A</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Ferreira</surname>
                            <given-names>RS</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Klumpp</surname>
                            <given-names>C</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>Quantitative analyses of aggregation, autofluorescence, and reactivity artifacts in a screen for inhibitors of a thiol protease.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2010</year>;<volume>53</volume>(<issue>1</issue>):<fpage>37</fpage>&#x2013;<lpage>51</lpage>.
                    <pub-id pub-id-type="pmid">19908840</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm901070c</pub-id>
                    <pub-id pub-id-type="pmcid">2992957</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Ferreira</surname>
                            <given-names>RS</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Simeonov</surname>
                            <given-names>A</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Jadhav</surname>
                            <given-names>A</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>Complementarity between a docking and a high-throughput screen in discovering new cruzain inhibitors.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2010</year>;<volume>53</volume>(<issue>13</issue>):<fpage>4891</fpage>&#x2013;<lpage>4905</lpage>.
                    <pub-id pub-id-type="pmid">20540517</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm100488w</pub-id>
                    <pub-id pub-id-type="pmcid">2895358</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Baell</surname>
                            <given-names>JB</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Holloway</surname>
                            <given-names>GA</given-names>
                        </name>
				</person-group>:
                    <article-title>New substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2010</year>;<volume>53</volume>(<issue>7</issue>):<fpage>2719</fpage>&#x2013;<lpage>2740</lpage>.
                    <pub-id pub-id-type="pmid">20131845</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm901137j</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Baell</surname>
                            <given-names>J</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Walters</surname>
                            <given-names>MA</given-names>
                        </name>
				</person-group>:
                    <article-title>Chemistry: Chemical con artists foil drug discovery.</article-title>
                    <source>
					
                        <italic toggle="yes">Nature.</italic>
				</source>
                    <year>2014</year>;<volume>513</volume>(<issue>7519</issue>):<fpage>481</fpage>&#x2013;<lpage>483</lpage>.
                    <pub-id pub-id-type="pmid">25254460</pub-id>
                    <pub-id pub-id-type="doi">10.1038/513481a</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Dahlin</surname>
                            <given-names>JL</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Nissink</surname>
                            <given-names>JW</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Strasser</surname>
                            <given-names>JM</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>PAINS in the assay: chemical mechanisms of assay interference and promiscuous enzymatic inhibition observed during a sulfhydryl-scavenging HTS.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2015</year>;<volume>58</volume>(<issue>5</issue>):<fpage>2091</fpage>&#x2013;<lpage>2113</lpage>.
                    <pub-id pub-id-type="pmid">25634295</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm5019093</pub-id>
                    <pub-id pub-id-type="pmcid">4360378</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Dahlin</surname>
                            <given-names>JL</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Nissink</surname>
                            <given-names>JW</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Francis</surname>
                            <given-names>S</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>Post-HTS case report and structural alert: Promiscuous 4-aroyl-1,5-disubstituted-3-hydroxy-2
                        <italic toggle="yes">H</italic>-pyrrol-2-one actives verified by ALARM NMR.</article-title>
                    <source>
					
                        <italic toggle="yes">Bioorg Med Chem Lett.</italic>
				</source>
                    <year>2015</year>;<volume>25</volume>(<issue>21</issue>):<fpage>4740</fpage>&#x2013;<lpage>4752</lpage>.
                    <pub-id pub-id-type="pmid">26318992</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.bmcl.2015.08.020</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Gilberg</surname>
                            <given-names>E</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Jasial</surname>
                            <given-names>S</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Stumpfe</surname>
                            <given-names>D</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>Highly Promiscuous Small Molecules from Biological Screening Assays Include Many Pan-Assay Interference Compounds but Also Candidates for Polypharmacology.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2016</year>;<volume>59</volume>(<issue>22</issue>):<fpage>10285</fpage>&#x2013;<lpage>10290</lpage>.
                    <pub-id pub-id-type="pmid">27809519</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jmedchem.6b01314</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Baell</surname>
                            <given-names>JB</given-names>
                        </name>
				</person-group>:
                    <article-title>Feeling Nature&#x2019;s PAINS: Natural Products, Natural Product Drugs, and Pan Assay Interference Compounds (PAINS).</article-title>
                    <source>
					
                        <italic toggle="yes">J Nat Prod.</italic>
				</source>
                    <year>2016</year>;<volume>79</volume>(<issue>3</issue>):<fpage>616</fpage>&#x2013;<lpage>628</lpage>.
                    <pub-id pub-id-type="pmid">26900761</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jnatprod.5b00947</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Bisson</surname>
                            <given-names>J</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>McAlpine</surname>
                            <given-names>JB</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Friesen</surname>
                            <given-names>JB</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>Can Invalid Bioactives Undermine Natural Product-Based Drug Discovery?</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2016</year>;<volume>59</volume>(<issue>5</issue>):<fpage>1671</fpage>&#x2013;<lpage>1690</lpage>.
                    <pub-id pub-id-type="pmid">26505758</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jmedchem.5b01009</pub-id>
                    <pub-id pub-id-type="pmcid">4791574</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Nelson</surname>
                            <given-names>KM</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Dahlin</surname>
                            <given-names>JL</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bisson</surname>
                            <given-names>J</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>The Essential Medicinal Chemistry of Curcumin.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2017</year>;<volume>60</volume>(<issue>5</issue>):<fpage>1620</fpage>&#x2013;<lpage>1637</lpage>.
                    <pub-id pub-id-type="pmid">28074653</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jmedchem.6b00975</pub-id>
                    <pub-id pub-id-type="pmcid">5346970</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>Y</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Xiao</surname>
                            <given-names>J</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Suzek</surname>
                            <given-names>TO</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>PubChem&#x2019;s BioAssay Database.</article-title>
                    <source>
					
                        <italic toggle="yes">Nucleic Acids Res.</italic>
				</source>
                    <year>2012</year>;<volume>40</volume>(<issue>Database issue</issue>):<fpage>D400</fpage>&#x2013;<lpage>D412</lpage>.
                    <pub-id pub-id-type="pmid">22140110</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkr1132</pub-id>
                    <pub-id pub-id-type="pmcid">3245056</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Capuzzi</surname>
                            <given-names>SJ</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Muratov</surname>
                            <given-names>EN</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Tropsha</surname>
                            <given-names>A</given-names>
                        </name>
				</person-group>:
                    <article-title>Phantom PAINS: Problems with the Utility of Alerts for Pan-Assay INterference CompoundS.</article-title>
                    <source>
					
                        <italic toggle="yes">J Chem Inf Model.</italic>
				</source>
                    <year>2017</year>;<volume>57</volume>(<issue>3</issue>):<fpage>417</fpage>&#x2013;<lpage>427</lpage>.
                    <pub-id pub-id-type="pmid">28165734</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jcim.6b00465</pub-id>
                    <pub-id pub-id-type="pmcid">5411023</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Jasial</surname>
                            <given-names>S</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>Y</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bajorath</surname>
                            <given-names>J</given-names>
                        </name>
				</person-group>:
                    <article-title>How Frequently Are Pan-Assay Interference Compounds Active? Large-Scale Analysis of Screening Data Reveals Diverse Activity Profiles, Low Global Hit Frequency, and Many Consistently Inactive Compounds.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2017</year>;<volume>60</volume>(<issue>9</issue>):<fpage>3879</fpage>&#x2013;<lpage>3886</lpage>.
                    <pub-id pub-id-type="pmid">28421750</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jmedchem.7b00154</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Gilberg</surname>
                            <given-names>E</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Stumpfe</surname>
                            <given-names>D</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bajorath</surname>
                            <given-names>J</given-names>
                        </name>
				</person-group>:
                    <article-title>Activity profiles of analog series containing pan assay interference compounds.</article-title>
                    <source>
					
                        <italic toggle="yes">RSC Adv.</italic>
				</source>
                    <year>2017</year>;<volume>7</volume>(<issue>57</issue>):<fpage>35638</fpage>&#x2013;<lpage>35649</lpage>.
                    <pub-id pub-id-type="doi">10.1039/C7RA06736D</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>Y</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bajorath</surname>
                            <given-names>J</given-names>
                        </name>
				</person-group>:
                    <article-title>Compound promiscuity: what can we learn from current data?</article-title>
                    <source>
					
                        <italic toggle="yes">Drug Discov Today.</italic>
				</source>
                    <year>2013</year>;<volume>18</volume>(<issue>13&#x2013;14</issue>):<fpage>644</fpage>&#x2013;<lpage>650</lpage>.
                    <pub-id pub-id-type="pmid">23524195</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.drudis.2013.03.002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Paolini</surname>
                            <given-names>GV</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Shapland</surname>
                            <given-names>RH</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>van Hoorn</surname>
                            <given-names>WP</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>Global mapping of pharmacological space.</article-title>
                    <source>
					
                        <italic toggle="yes">Nat Biotechnol.</italic>
				</source>
                    <year>2006</year>;<volume>24</volume>(<issue>7</issue>):<fpage>805</fpage>&#x2013;<lpage>815</lpage>.
                    <pub-id pub-id-type="pmid">16841068</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt1228</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Boran</surname>
                            <given-names>AD</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Iyengar</surname>
                            <given-names>R</given-names>
                        </name>
				</person-group>:
                    <article-title>Systems approaches to polypharmacology and drug discovery.</article-title>
                    <source>
					
                        <italic toggle="yes">Curr Opin Drug Discov Devel.</italic>
				</source>
                    <year>2010</year>;<volume>13</volume>(<issue>3</issue>):<fpage>297</fpage>&#x2013;<lpage>309</lpage>.
                    <pub-id pub-id-type="pmid">20443163</pub-id>
                    <pub-id pub-id-type="pmcid">3068535</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Jasial</surname>
                            <given-names>S</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>Y</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bajorath</surname>
                            <given-names>J</given-names>
                        </name>
				</person-group>:
                    <article-title>Determining the Degree of Promiscuity of Extensively Assayed Compounds.</article-title>
                    <source>
					
                        <italic toggle="yes">PLoS One.</italic>
				</source>
                    <year>2016</year>;<volume>11</volume>(<issue>4</issue>):<fpage>e0153873</fpage>.
                    <pub-id pub-id-type="pmid">27082988</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0153873</pub-id>
                    <pub-id pub-id-type="pmcid">4833426</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Griffen</surname>
                            <given-names>E</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Leach</surname>
                            <given-names>AG</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Robb</surname>
                            <given-names>GR</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>Matched molecular pairs as a medicinal chemistry tool.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2011</year>;<volume>54</volume>(<issue>22</issue>):<fpage>7739</fpage>&#x2013;<lpage>7750</lpage>.
                    <pub-id pub-id-type="pmid">21936582</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm200452d</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Hussain</surname>
                            <given-names>J</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Rea</surname>
                            <given-names>C</given-names>
                        </name>
				</person-group>:
                    <article-title>Computationally efficient algorithm to identify matched molecular pairs (MMPs) in large data sets.</article-title>
                    <source>
					
                        <italic toggle="yes">J Chem Inf Model.</italic>
				</source>
                    <year>2010</year>;<volume>50</volume>(<issue>3</issue>):<fpage>339</fpage>&#x2013;<lpage>348</lpage>.
                    <pub-id pub-id-type="pmid">20121045</pub-id>
                    <pub-id pub-id-type="doi">10.1021/ci900450m</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Wawer</surname>
                            <given-names>M</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bajorath</surname>
                            <given-names>J</given-names>
                        </name>
				</person-group>:
                    <article-title>Local structural changes, global data views: graphical substructure-activity relationship trailing.</article-title>
                    <source>
					
                        <italic toggle="yes">J Med Chem.</italic>
				</source>
                    <year>2011</year>;<volume>54</volume>(<issue>8</issue>):<fpage>2944</fpage>&#x2013;<lpage>2951</lpage>.
                    <pub-id pub-id-type="pmid">21443196</pub-id>
                    <pub-id pub-id-type="doi">10.1021/jm200026b</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Lewell</surname>
                            <given-names>XQ</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Judd</surname>
                            <given-names>DB</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Watson</surname>
                            <given-names>SP</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>RECAP--retrosynthetic combinatorial analysis procedure: a powerful new technique for identifying privileged molecular fragments with useful applications in combinatorial chemistry.</article-title>
                    <source>
					
                        <italic toggle="yes">J Chem Inf Comput Sci.</italic>
				</source>
                    <year>1998</year>;<volume>38</volume>(<issue>3</issue>):<fpage>511</fpage>&#x2013;<lpage>522</lpage>.
                    <pub-id pub-id-type="pmid">9611787</pub-id>
                    <pub-id pub-id-type="doi">10.1021/ci970429i</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>de la Vega de Le&#x00f3;n</surname>
                            <given-names>A</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bajorath</surname>
                            <given-names>J</given-names>
                        </name>
				</person-group>:
                    <article-title>Matched molecular pairs derived by retrosynthetic fragmentation.</article-title>
                    <source>
					
                        <italic toggle="yes">Med Chem Commun.</italic>
				</source>
                    <year>2014</year>;<volume>5</volume>(<issue>1</issue>):<fpage>64</fpage>&#x2013;<lpage>67</lpage>.
                    <pub-id pub-id-type="doi">10.1039/C3MD00259D</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>X</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>Y</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Vogt</surname>
                            <given-names>M</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>MMP-Cliffs: systematic identification of activity cliffs on the basis of matched molecular pairs.</article-title>
                    <source>
					
                        <italic toggle="yes">J Chem Inf Model.</italic>
				</source>
                    <year>2012</year>;<volume>52</volume>(<issue>5</issue>):<fpage>1138</fpage>&#x2013;<lpage>1145</lpage>.
                    <pub-id pub-id-type="pmid">22489665</pub-id>
                    <pub-id pub-id-type="doi">10.1021/ci3001138</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Sterling</surname>
                            <given-names>T</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Irwin</surname>
                            <given-names>JJ</given-names>
                        </name>
				</person-group>:
                    <article-title>ZINC 15--Ligand Discovery for Everyone.</article-title>
                    <source>
					
                        <italic toggle="yes">J Chem Inf Model.</italic>
				</source>
                    <year>2015</year>;<volume>55</volume>(<issue>11</issue>):<fpage>2324</fpage>&#x2013;<lpage>2337</lpage>.
                    <pub-id pub-id-type="pmid">26479676</pub-id>
                    <pub-id pub-id-type="doi">10.1021/acs.jcim.5b00559</pub-id>
                    <pub-id pub-id-type="pmcid">4658288</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
					
                        <name name-style="western">
                            <surname>Gaulton</surname>
                            <given-names>A</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bellis</surname>
                            <given-names>LJ</given-names>
                        </name>
					
                        <name name-style="western">
                            <surname>Bento</surname>
                            <given-names>AP</given-names>
                        </name>
					
                        <etal/>
				</person-group>:
                    <article-title>ChEMBL: a large-scale bioactivity database for drug discovery.</article-title>
                    <source>
					
                        <italic toggle="yes">Nucleic Acids Res.</italic>
				</source>
                    <year>2012</year>;<volume>40</volume>(<issue>Database issue</issue>):<fpage>D1100</fpage>&#x2013;<lpage>D1107</lpage>.
                    <pub-id pub-id-type="pmid">21948594</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkr777</pub-id>
                    <pub-id pub-id-type="pmcid">3245175</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report25426">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.13396.r25426</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Walters</surname>
                        <given-names>Michael</given-names>
                    </name>
                    <xref ref-type="aff" rid="r25426a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5650-9277</uri>
                </contrib>
                <aff id="r25426a1">
                    <label>1</label>Department of Medicinal Chemistry, University of Minnesota, Minneapolis, MN, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>2</day>
                <month>10</month>
                <year>2017</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Walters M</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport25426" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.12370.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This is an excellent manuscript that will certainly help researchers further understand the liabilities of "good-actors"; compounds that provide structure-interference relationships (SIR) that may not be pertinent to the biology being studied.</p>
            <p> </p>
            <p> Notes: 
                <list list-type="order">
                    <list-item>
                        <p>Inconsistent is misspelled in Panel B of Figure 3.</p>
                    </list-item>
                    <list-item>
                        <p>Though this will need to be experimentally-verified, the sulfonylpyrimidines&#x00a0;are almost certainly interfering by SnAr reactions. A literature search to gauge this potential reactivity of the core structure retrieved &gt;100 articles and &gt;1000 reactions. This group is certainly not in the PAINS definitions, but its assay interference potential can readily be ascertained by a quick reaction substructure search. Most importantly, this observation reinforces that lack of PAINS "flagging" is not sufficient to ensure that compounds don't have assay interference potential. The authors may wish to highlight this in their discussion.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>assay interference, medicinal chemistry</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report25674">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.13396.r25674</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Medina-Franco</surname>
                        <given-names>Jose L.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r25674a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-4940-1107</uri>
                </contrib>
                <contrib contrib-type="author">
                    <name>
                        <surname>Sald&#x00ed;var-Gonz&#x00e1;lez</surname>
                        <given-names>Fernanda I.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r25674a1">1</xref>
                    <role>Co-referee</role>
                </contrib>
                <aff id="r25674a1">
                    <label>1</label>Department of Pharmacy, Faculty of Chemistry, National Autonomous University of Mexico, Mexico City, Mexico</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>12</day>
                <month>9</month>
                <year>2017</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Medina-Franco JL and Sald&#x00ed;var-Gonz&#x00e1;lez FI</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport25674" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.12370.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The manuscript addresses a timely topic and is part of an effort to understand the multitarget activities of small molecules and their potential utility or drawbacks for drug discovery.</p>
            <p> </p>
            <p> Using the MMs formalism, the authors developed a protocol to identify analog series with high hit rates. The MMs approach responds to the need to generate SAR information of interference compounds. This approach allows for more reliable evaluation compared to the analysis of individual compounds.</p>
            <p> </p>
            <p> The analogue series identified with data from PubChem should be useful for medicinal chemistry programs. Similarly, the protocol developed in this work using MMPs should be useful to mine other large screening data sources (either public or proprietary data sets).</p>
            <p> </p>
            <p> Minor suggestions to further improve the quality of the manuscript: 
                <list list-type="bullet">
                    <list-item>
                        <p>Comment on the manuscript the effect of the compound concentration that is used to define a &#x201c;hit&#x201d; compound in a given assay. In other words, since it is unlikely that the same compound concentration is used across all assays, how this variable influences the &#x201c;hit rates&#x201d; and conclusions of the study?</p>
                    </list-item>
                    <list-item>
                        <p>The current analysis is made based primarily in primary assays in PubChem. In the Methods authors justify that there are larger data volumes for primary assays. We agree but would be nice to see in the manuscript a comment regarding the balance between volume vs. quality of the data.</p>
                    </list-item>
                    <list-item>
                        <p>We found it very pertinent that the study focused on compounds with high rates in primary assays, since more than 90% of these compounds are also active in confirmatory trials. It is interesting to comment how the hit rate is related to the quality of the data.</p>
                    </list-item>
                    <list-item>
                        <p>An earlier study 
                            <sup>
                                <xref ref-type="bibr" rid="rep-ref-25674-1">1</xref>
                            </sup> mentions that the assay frequency is not correlated with increased promiscuity. It is desirable to include a commentary in the manuscript when discussing the frequency of the test distribution and the generation of the first subset of 327,532 compounds.</p>
                    </list-item>
                    <list-item>
                        <p>Three parameters for the characterization and comparison of MMSs were determined (hit rate (HR), assay overlap, assays with inconsistent activity). Does the MMS size affect these determinations?</p>
                    </list-item>
                </list> Other suggestions (per section): 
                <list list-type="bullet">
                    <list-item>
                        <p>Introduction, last paragraph:&#x00a0; briefly summarize the major findings of previous works related to this study (e.g., refs. 18, 19, 23) and emphasize the novelty of this work.</p>
                    </list-item>
                    <list-item>
                        <p>Shorten the title (and include the concept of MMPs).</p>
                    </list-item>
                    <list-item>
                        <p>In the Introduction (last paragraph), include figure numbers for expressions such as &#x201c;extensively&#x201d;, &#x201c;larger&#x2019;, &#x201c;many&#x201d;, &#x201c;much higher&#x201d;. For instance, when the authors mention &#x201c;extensively tested&#x201d; in the title and through the manuscript, they mean &#x201c;&gt;10,000&#x201d;, &#x201c;&gt;100,000&#x201d;, &#x201c;&gt;400,000&#x201d;, etc.</p>
                    </list-item>
                    <list-item>
                        <p>Methods: elaborate a bit more on the in-house scripts, e.g., the program language.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-25674-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Entering the 'big data' era in medicinal chemistry: molecular promiscuity analysis revisited.</article-title>
                        <source>
                            <italic>Future Sci OA</italic>
                        </source>.<year>2017</year>;<volume>3</volume>(<issue>2</issue>) :
                        <elocation-id>10.4155/fsoa-2017-0001</elocation-id>
                        <fpage>FSO179</fpage>
                        <pub-id pub-id-type="pmid">28670471</pub-id>
                        <pub-id pub-id-type="doi">10.4155/fsoa-2017-0001</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report25140">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.13396.r25140</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Lowe III</surname>
                        <given-names>John A.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r25140a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-7799-4144</uri>
                </contrib>
                <aff id="r25140a1">
                    <label>1</label>Jl3pharma LLC, Stonington, CT, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>18</day>
                <month>8</month>
                <year>2017</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Lowe III JA</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport25140" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.12370.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This article makes an important contribution to addressing a serious complication in screening for new biological activity, especially in the context of new drug discovery.&#x00a0; Several researchers continue to point out the waste of time and money in following up on false positive hits from biological screening programs.&#x00a0; While paradigms to filter out false positives, such as aggregators or other &#x201c;frequent hitters&#x201d; such as PAINs, are becoming mainstream techniques, these may not be sufficient, either because they miss some false positives or because they mistakenly classify legitimate hits as false positives.&#x00a0; By carrying out a statistical analysis of a large set of screening data for hit rates using matched molecular pairs and series, this article offers a valuable perspective on this issue.</p>
            <p> There are several aspects of this article that are particularly valuable.&#x00a0; For example, Figure 3b is an important control showing that using assay hit rate does not overstate promiscuity, in that target hit rate is similar, linking the results to a biological mechanism of action.&#x00a0; In Figures 4c and 4d, the exemplified compounds appear to be Michael acceptors, and could react with Cys residues covalently, or sequester thiol reagents used in the assay, which could explain their promiscuity.&#x00a0; It is worth pointing these out, as they would not be picked up in PAINs filters and only by a knowledgeable chemist.&#x00a0; Overall, this work is as important contribution to the ongoing effort to reduce the occurrence of false positive hits serving as starting points for discovery programs.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
