ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Method Article
Revised

The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding that includes unmeasured confounding

[version 2; peer review: 2 approved]
PUBLISHED 29 Apr 2015
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

Background:  Nonrandomized studies typically cannot account for confounding from unmeasured factors. 
 
Method:  A method is presented that exploits the recently-identified phenomenon of  “confounding amplification” to produce, in principle, a quantitative estimate of total residual confounding resulting from both measured and unmeasured factors.  Two nested propensity score models are constructed that differ only in the deliberate introduction of an additional variable(s) that substantially predicts treatment exposure.  Residual confounding is then estimated by dividing the change in treatment effect estimate between models by the degree of confounding amplification estimated to occur, adjusting for any association between the additional variable(s) and outcome.
 
Results:  Several hypothetical examples are provided to illustrate how the method produces a quantitative estimate of residual confounding if the method’s requirements and assumptions are met.  Previously published data is used to illustrate that, whether or not the method routinely provides precise quantitative estimates of residual confounding, the method appears to produce a valuable qualitative estimate of the likely direction and general size of residual confounding.
 
Limitations:  Uncertainties exist, including identifying the best approaches for: 1) predicting the amount of confounding amplification, 2) minimizing changes between the nested models unrelated to confounding amplification, 3) adjusting for the association of the introduced variable(s) with outcome, and 4) deriving confidence intervals for the method’s estimates (although bootstrapping is one plausible approach).
 
Conclusions:  To this author’s knowledge, it has not been previously suggested that the phenomenon of confounding amplification, if such amplification is as predictable as suggested by a recent simulation, provides a logical basis for estimating total residual confounding. The method's basic approach is straightforward.  The method's routine usefulness, however, has not yet been established, nor has the method been fully validated. Rapid further investigation of this novel method is clearly indicated, given the potential value of its quantitative or qualitative output.

Keywords

observational study, nonrandomized study, intervention research, residual confounding, unmeasured confounding, bias amplification, confounding amplification, propensity scores

Revised Amendments from Version 1

The following are the principal changes made in response to the reviewer’s helpful comments. While this manuscript has been accepted by both reviewers, I gave great weight to the comments from both reviewers mentioning that they had some difficulties, at least initially, in completely understanding the method as described. The manuscript and Appendices have therefore been substantially revised and expanded, as follows:
 
- The description of the method and hypothetical examples have been expanded, adding cross-references to the exact steps in the Appendix Table.
 
- My language and presentation have been made more precise, detailed, and consistent.
 
- Perhaps most importantly, the entire method has been expressed mathematically in a single Summary Equation.
 
- New Appendices were added to 1) expand the metaphors for the method offered (Appendix 1); 2) discuss other possible challenges to the proposed correction for the association between the Introduced Variable(s) and outcome (Appendix 4); explain the Summary Equation in detail and map various uncertainties to terms in the Equation (Appendix 5); initiate the consideration of how multiple Introduced Variables might be used (Appendix 6); and offer practical tips for how the method might be implemented and the key trade-offs that need to be considered (Appendix 7).
 
Finally, the manuscript’s Discussion better describes what subsequent research steps are most immediately needed, and points out such research should be a high priority given the possibility that the ACCE Method may permit unmeasured confounding to be estimated in a system that 1) can use variables with an association with outcome, 2) can use multiple variables, and 3) may help address residual/unmeasured confounding arising during treatment, as well as at baseline. The overall method needs validation in general, however, in addition to exploration of whether the method can perform one or more of these three valuable functions.

See the author's detailed response to the review by Gregory Matthews
See the author's detailed response to the review by Mark Lunt

Introduction

Confounding is a central challenge for virtually all nonrandomized studies. Recent research14 has revealed that propensity score methods or other highly-multivariate methods may actually significantly increase, or “amplify,” the residual confounding remaining after their application. Understandably, this recently-recognized property of propensity score methods and other highly-multivariate methods have been generally viewed as a limitation or complication to their use. More recently, however, a study has indicated that the degree of confounding amplification (also termed “bias amplification”4) occurring between propensity score models appears to be quantitatively predictable (at least in simulation)5. This quantitative predictability of confounding amplification may also be suggested by more theoretical presentations of confounding amplification4.

Not yet recognized, to my knowledge, is the extremely valuable corollary that results: the predictability of confounding amplification should, in principle, permit extrapolation back to an unamplified value of the total residual confounding originally present. (Throughout this manuscript “confounding” refers to baseline confounding. Confounding occurring after treatment initiation from differential discontinuation of the intervention in the treatment group of interest versus the comparison group is not addressed in this manuscript. A possible approach to estimating confounding occurring after treatment initiation, based on the same principles described here, is briefly discussed in Appendix 2.3b). In this manuscript and the associated appendices, I describe the general framework and detailed specifics of a new method designed to use amplified confounding to estimate total residual confounding (including from unmeasured factors), and thus provide an unconfounded treatment effect estimate.

The basic logic of this method is straightforward, but its performance in practice has yet to be confirmed. Testing of this method on both simulated and real-world data is clearly needed. This manuscript does illustrate, however, that even when this method is not able to provide a precise quantitative estimate of residual confounding, it may provide a very useful qualitative estimate of the likely direction and general size of residual confounding. This manuscript provides detailed information to the research community intended to facilitate the rapid evaluation of the performance of this method when applied to simulated and real-world data.

Method

This four-step method deliberately amplifies confounding to permit estimation of unmeasured confounding. This estimate is then subtracted, along with the measured confounding of the variable or variables producing the amplification, from the original treatment effect estimate. This approach of deliberately amplifying confounding initially may seem counterintuitive. The text below seeks to explain the steps of the method (and their rationale) and follows in parallel to their mathematical description in the Appendix Table. Ultimately, a single equation incorporating all components of the method is derived. Some readers may also find the largely nonmathematical metaphors provided in Appendix 1 helpful.

Step 1 – Create nested propensity score models and generate treatment effect estimates

The “Amplified Confounding-based Confounding Estimation (ACCE) Method” depends on the use of two propensity score models, one (“Model 1”) nested in the other (“Model 2”) so that Model 2 contains all the Model 1 covariates plus an additional variable or variables. The variable(s) introduced to produce Model 2 from Model 1 is termed the “Introduced Variable(s).” Importantly, the Introduced Variable(s) should be sufficiently associated with treatment exposure to produce discernible confounding amplification. That is, the Introduced Variable(s) should further predict treatment exposure sufficiently to substantively increase differences between the treatment groups in the prevalence of those confounding factors that are not present in either model.

Step 2 – Estimate both the proportional amplification of confounding and the quantitative change in the treatment effect estimate between Model 1 and Model 2

In principle, the original confounding existing prior to amplification can be estimated by extrapolation backwards if both the proportional amount of confounding amplification and the quantitative change in the treatment effect estimate occurring between two propensity score models can be estimated accurately. To give a very simple example, consider a model (Model 2), which adds a single Introduced Variable compared to the original model (Model 1). This Introduced Variable sufficiently explains treatment exposure that its inclusion is expected to exhibit 50% confounding amplification compared to Model 1. Assume the observed treatment effect risk ratio (RR) changed in this circumstance from 1.10 in Model 1 (a beta coefficient of 0.09531) to approximately 1.15 (a beta coefficient of 0.142965, which is strictly equivalent to an RR of 1.15369). If the Introduced Variable(s) had no association itself with exposure, then the increase in beta coefficient would result entirely from the 50% amplification of confounding (ignoring random variation). Since a 50% increase in the confounding increased the beta coefficient 0.047655, this would imply that the original confounding was 0.047655 / 0.5 = 0.09531. This is enough to account for the entire association with outcome originally attributed to treatment. That is, the observed increase of the effect estimate upon confounding amplification is sufficient to suggest that the entire originally-observed “treatment effect” estimate was in fact due to confounding (Endnote A).

Attention is needed during the method’s implementation, however, to ensure: 1) that either the Introduced Variable(s) truly does have no association with outcome, or to correct for this Introduced Variable-outcome association if it is present (Steps 3 and 4 of the method); and 2) that changes between the two models distinct from confounding amplification are minimized to the extent feasible (Appendix 2). In addition, the method requires an ability to estimate the proportional amount of confounding amplification occurring between two propensity score models.

Two very different approaches to estimating proportional confounding amplification suggest themselves. One approach would be to estimate amplification from existing or future simulation research based on particular metrics of exposure prediction. An example of this approach is research published using the linear measure of exposure prediction, R25. This work demonstrated that, for propensity score stratification or matching approaches, a linear relationship exists between unexplained variance in exposure (i.e., 1 - R2) and the proportional amount of confounding amplification occurring across the range of R2 = 0.04 to 0.56. This simulation study5, using a propensity score based on a linear probability model, also made the important demonstration that different unmeasured confounders appear to be amplified to a highly similar degree. A key assumption of the ACCE Method is that residual confounding attributable to different confounders is uniformly or relatively uniformly amplified in Model 2 compared to Model 1.

There is still a possibility, however, that the mathematical predictability of the proportional amplification based on 1 - R2 that was observed in simulation may be merely a consequence of the particular conditions of that simulation. Other work, however, also suggests the possibility of estimating the proportional amount of confounding amplification through the 1 - R2 relationship4.

Further research is needed to thoroughly confirm that a predictable relationship does indeed exist between predication of exposure as measured by R2 and resulting confounding amplification. Research is also needed to determine if a similarly predictable relationship exists for other metrics of exposure prediction (such as those proposed for logistic regression6,7). Finally, research is needed to establish whether the apparent nonlinearities between the prediction of exposure and confounding amplification at more extreme ranges of prediction, suggested by some manuscripts5 but not others4, actually do exist.

A second approach to estimating the proportional amplification of confounding between two models would be to adopt an “internal marker” strategy. This strategy consists of deliberately withholding a measured covariate from both models to allow the increase in its imbalance between treatment groups in Model 2 to serve as an approximate indicator of the proportional confounding amplification that has occurred. It is possible, however, that the “internal marker” strategy might consistently yield at least a slight degree of underestimate of the amount of confounding amplification (Appendix 3.1).

If the “Introduced Variable” is known to be a true instrumental variable, then Steps 1 and 2 are the only steps required. Whether this approach would be any advantage over a conventional instrumental variable regression, however, is uncertain. The next two steps describe the additional calculations necessary to adjust for an association between the Introduced Variable and outcome if the Introduced Variable is not known to be an instrumental variable. These steps add a minor amount of computational complexity to the method, as well as increase the uncertainty concerning the strict quantitative accuracy of the method’s estimates (as discussed below). Importantly, however, these steps also may greatly broaden the method’s applicability, since many more variables with substantial association with exposure (i.e., candidate Introduced Variables) are likely to exist that have some association with outcome than do not.

Step 3 – Adjust for the association between the Introduced Variable and outcome

In most cases, the addition of a variable(s) to Model 2 will alter the amount of residual confounding present in Model 2 compared to Model 1, independent of its effect producing confounding amplification (i.e., it is rare for a variable to have absolutely no association with outcome). The consequence of this is that what is being amplified in Model 2 is not the actual quantity being sought (the total residual confounding in Model 1) but only a fraction of this quantity. Specifically, the quantity being amplified is the fraction of the Model 1 residual confounding separate from that attributable to the Introduced Variable.

Because the Introduced Variable(s) is included in the Model 2 propensity score, the Introduced Variable does not amplify. Not only does the confounding from this variable not amplify, but any contribution to confounding attributable to the Introduced Variable would be generally expected to decrease in Model 2 compared to Model 1. This decrease results from the fact that the Introduced Variable will almost certainly become more balanced now that it is included in the propensity score. As a result, when we want to estimate the quantitative change in the treatment effect estimates attributable to amplified confounding, we must first subtract the contribution of the decreased confounding attributable to the change in the balance of the Introduced Variable(s). Arriving at an estimate of the change in treatment effect estimates between Model 2 and Model 1 that is solely attributable to the amplification of confounding between Model 1 and Model 2 is crucial, because this quantity will allow us to extrapolate backwards to an estimate of the residual confounding attributable to all unmeasured confounders except the Introduced Variable(s). Because the Introduced Variable does not amplify, its contribution to residual confounding cannot be estimated through this extrapolation. Instead, its effect must be removed separately. Second, we must also remove the contribution of the Introduced Variable(s) from the original, Model 1 treatment effect estimate to obtain the desired unconfounded treatment effect estimate.

To illustrate the need for the first adjustment (adjusting the change in treatment effect estimate to account for the change attributable to improved balance in the Introduced Variable), consider the following case: the amplification of residual confounders that are unmeasured or nonincluded in both Models (which we will term the “amplifiable fraction” of total residual confounding) increases the treatment effect estimate by beta = 0.09531, and insertion of the Introduced Variable into Model 2 changes its confounding by beta = -0.09531. In this case, the observed change in the treatment effect estimate between Model 2 and Model 1 would be zero. However, it would not be correct to conclude that no quantitative change in the Model 2 treatment effect estimate attributable to confounding amplification had occurred. Instead, a sizeable quantitative change due to confounding amplification occurred, but it had simply been concealed by an equal change in the other direction due to reduced confounding from the Introduced Variable. Only by subtracting the change in confounding expected to result from increased balance in the Introduced Variable does the quantitative impact of the confounding amplification become apparent.

The need for the second adjustment exists because the Introduced Variable did not amplify, and thus its contribution to Model 1 confounding will not be included in the back-extrapolation that is performed to estimate the original amount of confounding due to the “amplifiable fraction” (the fraction of confounding that can be amplified). The Introduced Variable(s)’s contribution to Model 1 confounding must be directly estimated and removed separately. These two adjustments involving the Introduced Variable(s) usually will be similar in magnitude, but not identical, as explained later.

To make both of these adjustments I propose obtaining a coefficient(s) for the Introduced Variable(s) from regression models of the outcome that include all other propensity score covariates (Endnote B and Appendix Table Step 3a). This Introduced Variable-outcome regression coefficient can then be inserted into the Bross equation8 is used to estimate the confounding attributable to the Introduced Variable(s) in both Model 1 and Model 2. (The Bross equation8, which recently has been used by Schneeweiss and colleagues in their high-dimensional propensity score algorithm9, quantifies the amount of confounding attributable to a confounder. The Bross equation provides this estimate by combining the strength of the association between the covariate and outcome with the imbalance in the covariate between the treatment groups. Its use is demonstrated in the Appendix Table Step 3b1 and Appendix Table Step 3b2).

This regression-based correction appears, in theory, to be an imperfect solution, but how much these imperfections routinely interfere with the method’s performance is uncertain. The potential imperfections arise from two sources. First, it is plausible that the regression-based coefficient may not fully reflect the sum effect upon confounding that results when the Introduced Variable is inserted into Model 2. If the Introduced Variable is correlated with any unmeasured confounders, then inserting of the Introduced Variable(s) in Model 2 would also be expected to also reduce the imbalance in these other unmeasured confounders (at least relative to their Model 2 imbalance if no correlation existed). Of note, this correlation would also be expected to affect the Introduced Variable(s) regression coefficient: it is well appreciated that in regression models two correlated variables can influence the regression coefficient obtained for each of the variables. Unfortunately, it is not well understood, to my knowledge, whether the effect of correlation in regression alters the regression coefficient in a manner that, when this coefficient is used in the Bross equation, the estimated change in confounding approximates the actual change in confounding resulting from the change in the Introduced Variable(s) and its correlates.

Second, the regression equations used to derive the Introduced Variable(s)-outcome regression coefficient optimally would have the same number of variables within them as the propensity score. Thus, some degree of confounding amplification may also exist in the Introduced Variable(s)-outcome coefficients, although this amplification is likely less than observed for the treatment effect estimate, and possibly less problematic (Appendix 4).

Using the Bross equation, the estimate of the confounding attributable to the Introduced Variable in Model 1 is then subtracted from estimate of such confounding in Model 2. This produces an estimate of the change in the treatment effect estimate between Model 1 and Model 2 that is attributable to increased balance in the Introduced Variable(s) (and potentially, to some degree, its correlates) (Appendix Table Step 3b3). This estimate then is subtracted from the overall change in the treatment effect estimate observed between Model 2 and Model 1 (Appendix Table Step 3c). The result is an important quantity: the quantitative change in the treatment effect estimate attributable to the proportional confounding amplification that occurred between Model 1 and Model 2 (Appendix Table Step 3d).

Step 4 – Calculate the unconfounded treatment effect estimate

The final step involves two substeps. First, divide the final result from Step 3d (the change in the treatment effect estimate from Model 1 to Model 2, adjusted to remove the change produced by increased balance in the Introduced Variable(s) and potentially its correlates) by the proportional amount of confounding amplification occurring between Model 1 and Model 2 (Appendix Table Step 4a). This proportional confounding amplification can be calculated by the ratio of the proportional confounding amplification of both models relative to a state with no confounding amplification. For example if Model’s R2 = was 0.50, then, based on the 1 - R2 relationship, this model it would be expected to contain 2-fold confounding amplification (1 / (1 - 0.5) = 1 / 0.5 = 2). But since we are comparing Model 2’s treatment effect estimate to Model 1’s treatment effect estimate, rather than to a hypothetical, unobserved model with absolutely no confounding amplification, we instead need to take into account the difference in confounding amplification that occurs between the models. In a sense, the comparison of amplification between Model 2 and Model 1 respecifies the Model 2 confounding amplification by quantifying it relative to the “starting point” of the confounding amplification observed in Model 1.

For example, if Model 1 had an R2 of 0.25 (leading to a confounding amplification of 1 / (1 - 0.25) = 1 / 0.75 = 1.33, then the proportional amplification of confounding occurring between the 2-fold confounding amplification in Model 2 and the 1.33-fold confounding amplification in Model 1 would be 2 / 1.33 = 1.5. If the difference between treatment effect estimates reflected confounding amplification of 1.5, this means that the difference in the treatment effect estimate represents 1.5 × the original confounding (or a 50% increase in confounding). Therefore the adjusted treatment effect estimate difference observed between the two models would then need to be multiplied by a factor of 2 (i.e., 1 / 0.5) to extrapolate back to an estimate of the original confounding in Model 1. This factor of 2 can be obtained mathematically by subtracting 1 from the proportional confounding amplification predicted between the models (1 / (1.5 - 1) = 1 / 0.5 = 2). (Determining the ratio of the confounding amplification occurring in each of the two models provides the proportional change in confounding amplification between Model 1 and Model 2; subtracting “1” from this ratio accounts for the fact that if no amplification between the models occurs, the ratio will equal “1”, but the confounding amplification will be “0”).

This calculation derives by extrapolation an estimate of the total residual confounding originally in Model 1, except for the confounding attributable to the yet-to-be-inserted Introduced Variable(s), and can be represented by the following mathematical term:

((TEEM2TEEM1)ConfIntVΔ(M2M1)((1(1RM22)1(1RM12))1))

In this overall term, “TEE” refers to the treatment effect estimate of the particular model (subscript “M2” denoting Model 2 and “M1” denoting Model 1), thus the term (TEEM2 - TEEM1) represents the observed difference in treatment effects between the two models. The term “ConfIntVΔ(M2-M1)” represents the change in confounding attributable to the change in balance in the Introduced Variable in Model 2 compared to Model 1. This term is calculated by use of the multivariate Introduced Variable-outcome regression coefficient and the Bross equation. The terms (1 / (1 - R2M2)) and (1 / (1 - R2M1)) represent the proportional confounding amplification expected in Model 2 and Model 1. The ratio of this proportional confounding amplification provides the proportional amplification occurring between the two models. Subtracting 1 from this ratio permits the extrapolation, from the adjusted change in treatment effect estimates, of the total residual confounding from the amplifiable fraction of Model 1. An algebraic derivation of this term is provided in Endnote F.

One further term is needed to estimate the total residual confounding in Model 1. To reiterate, the confounding from the Introduced Variable is not subject to amplification in Model 2, since it is now included in the propensity score, unlike the rest of the residual confounding in Model 1. Thus, the Introduced Variable(s)’s contribution to Model 1 residual confounding must be accounted for separately, through use of the Introduced Variable-outcome regression coefficient and the Bross equation (Appendix Table Step 3b1). Entering the imbalances in the Introduced Variable between the treatment groups that are present in Model 1 (that is, relative to a perfect 50%/50% balance) into the Bross equation provides an estimate of the contribution of the Introduced Variable to the Model 1 residual confounding (Endnote G).

Adding the two components of Model 1 residual confounding (i.e., the estimate of residual confounding from the amplifiable fraction plus the confounding attributable to the original, Model 1 imbalance in the yet-to-be-inserted Introduced Variable(s)) produces the method’s estimate of total residual confounding present in Model 1 (Appendix Table Step 4b1). This total is then subtracted from the Model 1 treatment effect estimate to produce an estimate of the unconfounded treatment effect (Appendix Table Step 4b2).

The entire approach to estimating residual confounding using the ACCE Method can be summarized by the following equation:

TEEM1((TEEM2TEEM1)ConfIntVΔ(M2M1)((1(1RM22)1(1RM12))1))ConfIntVM1=UnconfoundedTEE

This equation subtracts from the original, Model 1 treatment effect estimate (TEEM1) the back-extrapolation term estimating the Model 1 residual confounding from the amplifiable confounding fraction (discussed above), as well as the term ConfIntVM1, which represents the separate contribution of the Introduced Variable(s) to Model 1 residual confounding. Subtracting these terms from the Model 1 treatment effect estimate produces, in general principle, an estimate of the unconfounded treatment effect (Appendix Table Step 4 and Endnote F).

The accuracy of this unconfounded treatment effect estimate, however, is not yet established. The largest uncertainties in this estimate likely come from several factors, including the basic uncertainty whether the proportional confounding amplification occurring between two models is consistently predictable. Two manuscripts suggest such prediction may be possible4,5, but certainly a more extensive confirmation of this relationship, for R2 and possibly for other metrics of exposure prediction, would be beneficial. In addition, neither of these manuscripts examined real-world data. Thus, questions remain, such as whether real-world data might contain “constraints” to confounding amplification (Appendix 3.3). Also pertinent are the two uncertainties discussed in Step 3 concerning the adequacy of the Introduced Variable(s) regression coefficient(s) for performing the necessary adjustments to the Model 2 - Model 1 treatment effect estimate term and to the Model 1 total residual confounding. These uncertainties relate to whether the Introduced Variable-outcome regression coefficient adequately reflect changes that would occur in Model 2 in unmeasured confounders correlated with the Introduced Variable(s), as well as whether this Introduced Variable(s)-outcome regression coefficient(s) would also suffer some confounding amplification. Investigation is also needed into the practical question of whether other differences between the models can be sufficiently minimized to prevent them from producing changes in the Model 2 treatment effect estimate separate from confounding amplification (Appendix 2). These uncertainties, plus others, are highlighted in Appendix Figure 1b in Appendix 5 and listed as research needs in the Discussion.

Nevertheless, the method’s potential to perform an adjustment for the association between the Introduced Variable(s) and outcome suggests that this method might provide quantitatively or qualitatively useful unconfounded treatment effect estimates when instrumental variable analysis is not possible. Associations between the Introduced Variable and outcome may merely complicate, but not preclude, use of the method. In other words, Introduced Variables may not have to meet the “exclusion restriction” traditionally applied to instrumental variables (i.e., having no correlation with outcome other than exclusively through an association with treatment). However, for optimal performance it may still prove advantageous for the Introduced Variable to meet, or nearly meet, the condition of having no correlation with other confounders, at least with respect to unmeasured confounders.

Conceptualizing the ACCE Method as consisting of two basic components

Since these steps and substeps may seem somewhat complex initially, it may help to conceptualize the ACCE Method as simply involving two overarching components: 1) attempting to quantify the two contributions to residual confounding in Model 1; and 2) subtracting these estimates of unmeasured confounding from the Model 1 treatment effect estimate.

Component 1 involves several operations: creating models to deliberately amplify confounding, measuring their treatment effect estimates, and dividing the change in the treatment effect estimate (adjusted to remove the effect of the change in confounding attributable to the Introduced Variable(s)) by the predicted change in confounding amplification. This entire process estimates one contribution to Model 1 residual confounding: the Model 1 confounding that was amplified through insertion of the Introduced Variable(s) in the Model 2 propensity score. The separate contribution of the Introduced Variable(s) to Model 1 confounding needs to be estimated. This estimate is achieved by entering the imbalance of the Introduced Variable, and its Introduced Variable-outcome regression coefficient, into the Bross equation. (Using slightly different values, the Bross equation also generates the adjustment mentioned above to the change in the treatment effect estimate).

The second component is much simpler, involving only the summing the two parts of original residual confounding estimate and subtracting this sum from the Model 1 treatment effect estimate.

Illustrative examples

Four hypothetical examples

Four hypothetical examples are presented to help illustrate the ACCE Method. (As mentioned previously, largely nonmathematical metaphors to help illustrate the method are provided in Appendix 1).

The first hypothetical example simply fleshes out in more detail the particularly simple case already discussed. A propensity score model with an R2 of 0.25 for the prediction of treatment exposure yields a treatment effect estimate of approximately RR = 1.10 when it is used to compare the treated group to a comparison group by matching or stratification. A second propensity score model is generated by adding a single additional covariate that boosts the overall R2 of the expanded propensity score model to 0.5. This change in R2 leads to a decrease in the unexplained variance of exposure (1 - R2), and, as discussed in Step 4 on the preceding page, a predicted 50% amplification of confounding between the models. This second propensity score model yields a treatment effect estimate of approximately RR = 1.15 (Endnote C). If the Introduced Variable added to the set of Model 1 covariates to produce Model 2 has no genuine association with outcome (and no association with unmeasured covariates that have an association with outcome), then there is no need to adjust for this association in Steps 3 and 4 of the method. In this case, a simple conclusion results: if the treatment effect estimate increased by 50% when confounding amplification is expected to increase by 50%, this suggests that the entire, apparent treatment effect estimate that was observed in Model 1 is due to confounding.

In this simple scenario (i.e., involving no Introduced Variable-outcome association), the only way for a genuine treatment effect to exist, if the genuine treatment effect and unmeasured confounding are in the same direction, is if the Model 2 treatment effect estimate increased by an amount less than the proportional confounding amplification occurring from Model 1 to Model 2. This is because it is only the confounding that amplifies, not the treatment effect, as the propensity score model’s R2 increases. In a sense, the treatment effect provides a “kernel” of constant effect amidst the change (amplification) of confounding. In this case, the more the original (Model 1) treatment effect estimate reflects genuine treatment effect, the more refractory the treatment effect estimate should be to amplification in Model 2.

Alternatively, if a genuine treatment effect existed in the opposite direction of confounding, then the change in the Model 2 treatment effect estimate would have to be greater than the amount predicted by strictly applying the expected proportional confounding amplification to the Model 1 treatment effect. This is because more confounding would be necessary than simply that needed to account for the difference between the treatment effect estimate and a null association: additional confounding would be required to also account for the “distance’ between the genuine treatment effect (in the opposite direction) and the null value. As a result, this additional confounding beyond that required to explain simply the entire treatment effect estimate (compared to the null) would lead to a change greater than predicted if the treatment effect estimate represented only the effect of confounding. In both these cases, the presence of a genuine treatment effect means that a change would be observed that was different (either greater or lesser) from that expected from the simple multiplication of the Model 1 treatment effect estimate by the amount of increased confounding amplification.

The second Hypothetical Example makes this clearer. Assume an identical scenario to the first example above, with only one difference: the Model 2 treatment effect estimate remains completely unchanged at RR 1.10. In this case the same 50% increase in confounding amplification between the two models produced a complete lack of a difference in the treatment effect estimates, implying essentially no residual confounding exists in Model 1. Furthermore, if residual confounding is in the same direction as the genuine treatment effect, the only way (absent an effect of random variability) that the Model 1 estimate can reflect a genuine treatment effect is if the Model 2 RR ends up between 1.10 and 1.14. A Model 2 estimate of RR = 1.15 would imply essentially no genuine treatment effect (Hypothetical Example 1), while an RR > 1.15 would imply that the treatment effect and residual confounding are in opposite directions (and that some degree of a genuine, protective treatment effect exists) (Endnote D).

Hypothetical Example 3 examines a simplified version of the example provided in the Appendix Table. (The more complex version is discussed as Hypothetical Example 4). The simplification is to assume no association between the Introduced Variable and outcome. Assume Model 1 has a treatment effect estimate of RR = 1.265 (beta coefficient = 0.235072) and an R2, in terms of prediction of exposure, of 0.25. Assume Model 2 has a treatment effect estimate of RR = 1.2985 (a beta coefficient = 0.2612) and an R2 of 0.5. (This again would produce 50% confounding amplification). If the Introduced Variable has no association with outcome, we can immediately determine, by mere inspection, that confounding is relatively modest and the effect estimate of 1.265 primarily represents genuine treatment effect. The reason is that little change occurs in the treatment effect estimate. Certainly the increase in the treatment effect estimate does not come close to the value of RR = 1.422 that would be expected if the entire original RR = 1.265 was due to confounding (beta coefficient 0.235072 × 1.5 = 0.352608, which exponentiated equals 1.422). In fact, the difference in beta coefficients between Model 1 and Model 2 is just +0.0261. This means that, if 50% confounding amplification increases beta by 0.0261, then total confounding in Model 1 = 0.0261 / (1 - 0.75) / (1 - 0.5) = 0.0522, and thus the genuine treatment effect is beta = 0.2351 - 0.0522, or 0.1829, or RR = 1.20. This demonstrates that if no Introduced Variable-outcome association is present, then a treatment effect estimate that is generally refractory to the addition of the Introduced Variable(s) suggests that most of the effect estimate is genuine treatment effect.

The three examples above help demonstrate the important need to be able to accurately detect small differences in the treatment effect estimate between Model 2 and Model 1. In addition, the differences that are detected need to be due to confounding amplification, rather than due to other differences (Appendix 2) or random variation. The next example illustrates the importance of accurately detecting and correcting for any Introduced Variable-outcome relationship.

Hypothetical Example number 4 illustrates the somewhat more complex, but still relatively straightforward, calculations required when the Introduced Variable does have an association with outcome. The full calculations have been described in the Methods section, with further detail provided in the Appendix Table. Let us return to Hypothetical Example 3 but assume that the Introduced Variable has an association with outcome of beta = 0.04879 [RR = 1.05]. We also need to know how the degree of imbalance in this variable initially between the treatment groups in Model 1 (in this example, there is an 80% [treatment group] to 20% [comparison group] imbalance), and how much more closely into balance it becomes in Model 2 (in this example, a 52% to 48% difference). Immediately, we can appreciate that the quantity of unmeasured confounding in the treatment effect estimate is considerably larger than in Hypothetical Example 3, for one simple reason: ordinarily, if we markedly reduce the imbalance in a variable that is more prevalent in the treatment group (as in this case) and which biases in the same direction as the treatment effect estimate, we would expect to see a decrease in the treatment effect estimate, not an increase. (The treatment effect beta coefficient increases by +0.0261). The fact that an increase is observed must mean that considerable additional confounding exists, separate from the effects of the Introduced Variable. This additional confounding must be large enough so that its relatively modest amplification (50%) is more than sufficient to overcome the effect of the increased balance in the Introduced Variable. Using the Bross equation allows us to quantify the expected effect of the increased balance of the Introduced Variable. This change in balance would be estimated to decrease the treatment effect estimate by beta = 0.0273. So, to determine the actual quantitative amount of amplified confounding, we subtract this decrease in treatment effect estimate from the change in the treatment effect estimate that was observed. Thus the change in the treatment effect estimate due to confounding amplification is beta = 0.0534 (i.e., 0.0261 minus -0.0273). Dividing this quantity by the predicted amplification of 50%, as determined by the calculation ((1 / (1 - 0.5)) / (1 / (1 - 0.25))) - 1 = 0.5, gives an estimate of total Model 1 residual confounding attributable to the amplifiable fraction (i.e., the total residual confounding except that contributed by the Introduced Variable and its correlates) of beta = 0.0534 / 0.5, or 0.1068.

We now must subtract this value, plus the confounding attributable to the Introduced Variable, from the Model 1 treatment effect estimate. At this point, the Bross equation uses the difference between an 80%/20% imbalance and the 50%/50% balance that would be observed if there was no confounding in Model 1 due to the Introduced Variable. This produces a slightly larger number (beta = 0.0293) than in the previous application of the Bross equation (beta = 0.0273), which estimated confounding resulting from the difference between the initial 80%/20% imbalance and the 52%/48% balance observed after propensity score balancing. Adding this beta = 0.0293 quantity to our estimate of the confounding attributable to the amplifiable fraction means that we estimate that Model 1 contained residual confounding of beta = 0.1068 + 0.0293, or 0.1361. Since the treatment effect estimate is RR = 1.265 (beta = 0.2351), this means the genuine treatment effect estimate is beta = 0.2351 - 0.1361, or beta = 0.09903 [i.e., RR = 1.10]). That is, the findings imply that more than half of the original, sizeable “treatment effect estimate” (beta = 0.2351; RR = 1.265) was attributable to residual confounding. (Please see the Appendix Table for complete calculations).

Put in the form of the Summary Equation, the following calculation of the unconfounded treatment effect estimate would result:

0.2351 - ((0.0261 - -0.273) / (((1 / (1 - 0.5)) / (1 / (1 - 0.25))) - 1)) - 0.293 = 0.09903

Thus, despite the fact that the treatment effect estimates for Model 1 and Model 2 are both confounded by an unknown amount of unmeasured confounding, it is possible, in principle, to derive an estimate of an unconfounded treatment effect. This estimate is possible because knowledge of the quantitative change between these two treatment estimates and the estimated proportional confounding amplification underlying this change allows, in a few steps, the derivation of an estimate of the unconfounded treatment effect.

Results

Partial application using published data

The example provided here from published data builds from a rare opportunity in the literature in which sufficient information has already been provided to partially apply the method. Thanks to their detailed reporting, Patrick et al.,10 fortuitously present results that provide an opportunity to apply some aspects of the ACCE Methodology on real-world data. Obviously, their study was not constructed to illustrate the ACCE Method; therefore it is being used post hoc to explore the potential of the method. The full quantitative version of the ACCE Method cannot be applied for several reasons (discussed below). As a result, the data provided include several additional uncertainties beyond those that would accompany a deliberate implementation of the ACCE Method. However, by permitting the performance of even a partial version of the ACCE Method to be assessed, this study illustrates the value this method may have in serving as a probe to provide at least a qualitative sense of whether substantial residual confounding is likely, along with its likely direction.

Patrick et al.,10 analyzed the associations between statins and both all-cause mortality and hip fracture using a number of propensity scores. For both of the outcomes, two of the propensity scores formed an important nested pair. One propensity score was nested within a slightly larger propensity score that only differed in a single added covariate (glaucoma diagnosis). Glaucoma diagnosis was considered to be a potential instrumental variable in these analyses. First, glaucoma diagnosis was strongly associated with treatment exposure (since the comparison group for both analyses consisted of individuals who used medications to treat glaucoma). Patients with a glaucoma diagnosis had an odds ratio for statin exposure of 0.07. That is, patients with glaucoma diagnosis had approximately 14:1 odds of being in the comparison group (the group receiving medications for glaucoma) than the statin treatment group. Second, it is plausible (although not certain) that glaucoma diagnosis lacks a substantial association with the outcomes of all-cause mortality and hip fracture, and thus may be functioning as an instrumental variable or near-instrumental variable. (Although not termed an “instrumental variable” originally10, such a term was used for glaucoma diagnosis in these analyses in a subsequent manuscript describing these findings11).

Several information gaps limit this example, however, making it not possible to derive quantitative estimates of unmeasured confounding. Patrick et al.10 did not report R2 since they used logistic propensity scores, but rather provided c statistics. The relationship of the c statistic to confounding amplification has not been examined, in contrast to the relationship between R2 and confounding amplification. In addition, it is not possible to adjust for any association between the Introduced Variable (glaucoma diagnosis) with outcome, since the needed coefficient from a full multivariate regression containing all the propensity score covariates are not provided. The manuscript does note that the minimally-adjusted hazard ratio (HR) for glaucoma diagnosis (adjusted for age, age2, and sex) is >1.175 or <1 / 1.175 for both outcomes. (The actual age and-sex-adjusted HR observed is HR ≈ 0.85 for both outcomes [Dr. Amanda Patrick, Personal Communication]). While the age- and sex-adjusted HR has some value, what is truly needed is the glaucoma diagnosis HR, adjusted for all the other propensity score covariates. (This would total 143 covariates for the mortality analysis and 120 covariates for the hip fracture analysis10). This fully-adjusted HR would provide information about whether the glaucoma diagnosis HR would approximate a null value if all the other covariates were included. It is also not possible to determine if close similarity exists between the models in the balance achieved in the measured covariates and the intervention delivered (e.g., dose or duration) (Appendix 2). Finally, the measure of treatment effect, the hazard ratio, may possibly complicate efforts to derive a quantitative estimate of confounding due to the noncollapsibility of the hazard ratio.

Interpretation of the published results using a partial version of the ACCE Method

Despite such limitations, application of even this partial version of the ACCE Method appears to provide useful qualitative estimates of the residual confounding present in these analyses. Table 1A shows that in the all-cause mortality analyses the addition of the Introduced Variable (glaucoma diagnosis) moves the treatment effect estimate away from the null by a modest amount. This implies that the total residual confounding (including residual confounding from unmeasured factors) likely biases, but only very modestly, towards observing a larger effect size for statins than is genuinely present. This observation is consistent with randomized data12. In contrast, Table 1B shows that addition of the same Introduced Variable in the hip fracture analysis changes the observed treatment effect HR from 0.76 to 0.69. This is a much more sizeable change, implying a larger quantity of underlying residual confounding biasing the estimate away from the null. If glaucoma diagnosis is in fact a near-instrumental variable, the results would suggest that the unconfounded treatment effect estimate is considerably closer (than HR = 0.76) to the null value suggested by randomized data13.

Table 1. Examples of qualitative application of the ACCE Method (drawn from Reference 10).

Table 1A. Statin - mortality analysis
Model 1
Exposure (Treatment) of Interest: Receipt of Statin (vs. Glaucoma Medication)

Outcome: All-Cause Mortality

Included Covariates: 143 Variables with a +/- 20% association with All-Cause Mortality
c statisticTreatment Effect
Estimate
(Hazard Ratio)
Expected Result
(from RCT meta-analyses)
Likely Confounding and Treatment Effect
(based on comparison with RCT data)
0.82HR = 0.84HR = 0.85 or less
(i.e., closer to null)
Away from null (treatment effect estimate overestimates statin
protective effect), although confounding appears small to
modest.

Genuine treatment effect most likely closer to the null than
HR = 0.84.
Model 2

(identical to Model 1 except for Addition of a Single “Introduced Variable”: Glaucoma Diagnosis)
Exposure (Treatment) of Interest: Receipt of Statin (vs. Glaucoma Medication)

Outcome: All-Cause Mortality

Introduced Variable “Probe”: Glaucoma Diagnosis, a variable with a strong association with exposure (approximately 14X more
common in the comparison group [glaucoma medication users] than in the statin group)

Expected Association of Introduced Variable with Outcome: Minimal

Included Covariates: 143 Variables with a +/- 20% association with All-Cause Mortality, plus the “Introduced Variable”
c statisticTreatment Effect
Estimate
(Hazard Ratio)
Size & Direction of Change
of Treatment Effect
Estimate,
compared to Model 1
Likely Confounding and Treatment Effect
(based on ACCE Method)
0.90HR = 0.82Small
(0.84 → 0.82)

in direction away from null
Away from null (since amplifying confounding pushes
treatment effect estimate in that direction), although
confounding appears small to modest.

Genuine treatment effect most likely closer to the null than
HR = 0.84.
Table 1B. Statin - hip fracture analysis
Model 1
Exposure (Treatment) of Interest: Receipt of Statin (vs. Glaucoma Medication)

Outcome: Hip Fracture

Included Covariates: 120 Variables with a +/- 20% association with Hip Fracture
c statisticTreatment Effect
Estimate
(Hazard Ratio)
Expected Result
(from RCT meta-analyses)
Likely Confounding and Treatment Effect
(based on comparison with RCT data)
0.81HR = 0.76Approximately
HR = 1.0
Away from null (treatment effect estimate overestimates statin
protective effect), and confounding appears likely to be large.

Genuine treatment effect most likely closer to the null than
HR = 0.76, potentially substantially closer.
Model 2

(identical to Model 1 except for Addition of a Single “Introduced Variable”: Glaucoma Diagnosis)
Exposure (Treatment) of Interest: Receipt of Statin (vs. Glaucoma Medication)

Outcome: Hip Fracture

Introduced Variable “Probe”: Glaucoma Diagnosis, a variable with a strong association with exposure (approximately 14X more
common in the comparison group [glaucoma medication users] than in the statin group)

Expected Association of Introduced Variable with Outcome: Minimal

Included Covariates: 120 Variables with a +/- 20% association with Hip Fracture, plus the “Introduced Variable”
c statisticTreatment Effect
Estimate
(Hazard Ratio)
Size & Direction of Change of
Treatment Effect Estimate,
compared to Model 1
Likely Confounding and Treatment Effect
(based on ACCE Method)
0.89HR = 0.69Sizeable
(0.76 → 0.69)

in direction away from null
Away from null (since amplifying confounding pushes treatment
effect estimate in that direction), and confounding might be
large.

Genuine treatment effect most likely closer to the null than
HR = 0.76, potentially substantially closer.
Comparison Between the Two Analyses
Use of the glaucoma diagnosis Introduced Variable “probe” suggests substantially more confounding in the hip fracture analysis
than the all-cause mortality analysis. From a highly similar starting point (c = 0.81 or 0.82) and highly similar magnitude of c statistic
change (0.08), the treatment effect estimate for the hip fracture analysis moved substantially further away from the null than for the
all-cause mortality analysis. This matches what is suggested from RCT data.
Additional Considerations:

1) For these two examples there are randomized trial meta-analyses (extrinsic information) to separately inform judgments about
likely confounding. This helps boost confidence in the ACCE Method in that it provides the same qualitative conclusions about likely
confounding that reference to the RCT meta-analysis provides. However, the ACCE Method is likely to have its greatest value in
circumstances in which such meta-analyses are lacking, since it permits evidence from the analysis itself to inform judgments about
confounding.

2) Several elements are lacking that are necessary to derive a quantitative estimate of residual confounding. Missing data elements
include knowledge of whether (and how) c statistics reliably index confounding amplification in a manner analogous to R2 values,
and a regression coefficient for glaucoma diagnosis that includes all the covariates in the propensity score. What is known is that in
age and sex-adjusted analyses, the association of glaucoma diagnosis with both outcomes (as measured by the hazard ratio) was
extremely similar (HR = 0.85) (Dr. Amanda Patrick, personal communication). However, whether this similarity between the analyses
in the Introduced Variable-outcome association persists in the full propensity score analyses is uncertain (since the two propensity
score analyses contain at least some different covariates). Nor is it known if the glaucoma diagnosis-outcome associations in the fully
multivariate regression are close to HR = 1.0 (which is possible, but far from certain).
Notes:

Data taken from Reference 10. Specifically, data for each “Model 1” presented here was taken from Table 2 of Reference 10 (the
Outcome +/- 20% model, or the 6th model listed for both the all-cause mortality and the hip fracture outcomes). Data for each “Model 2”
presented here was taken from information provided in the text of Reference 10 (page 554).

HR = hazard ratio

Even if glaucoma diagnosis is not functioning as a near-instrumental variable, as long as the fully-multivariate regression coefficients for glaucoma diagnosis for each outcome are generally similar (and the age- and sex-adjusted hazard ratios for glaucoma diagnosis presented previously are highly similar), the similar change in c statistics observed would suggest the presence of considerably more residual confounding in the hip fracture analysis than the all-cause mortality analysis (Endnote E). This is a conclusion independently suggested by the randomized trial meta-analyses12,13 cited by the authors. Notably, the ACCE Method, even when applied in a very partial and qualitative form, suggests the same conclusion. In this fashion, the ACCE Method may prove useful for estimating at least the likely general size and direction of residual confounding in the many circumstances where substantial randomized trial data does not exist. This capacity of the method to provide even a qualitative estimate of residual confounding may constitute an important analytic advance.

Discussion

This paper presents a relatively straightforward method exploiting the phenomenon of confounding amplification to potentially obtain quantitative estimates of total residual confounding and unconfounded treatment effects. To my knowledge, it has not been previously recognized that the phenomenon of confounding amplification, if predictable (as suggested by both recent simulation5 and theoretical work4), provides a potential mechanism to estimate total residual confounding. The fundamental approach of deliberately introducing amplified confounding into an analysis to evaluate, qualitatively or quantitatively, the total residual confounding originally present appears to possess both clear logic and considerable promise.

Even if subsequent research determines that ACCE Method estimates are too imprecise to serve as useful quantitative estimates, this general approach may have considerable value as a semi-quantitative or qualitative “probe” for detecting the general size and direction of residual confounding. While important facets of the method are not yet fully resolved concerning its quantitative accuracy and optimum implementation (see below), further research is clearly indicated given the potential value of a new approach to removing confounding from nonrandomized treatment effect estimates. It is hoped that the description of the method provided here is sufficient to permit the larger research community to immediately begin participating in the validation and refinement of this novel approach.

Considerations for validation and further research

This method will have its greatest value to the extent it succeeds in providing a useful quantification of residual confounding. Establishing such performance by the method will involve more detailed and precise examination of both simulated and real-world data, and almost certainly will involve the contributions of multiple research teams. Useful avenues for validation research include (in anticipated priority order):

1) Determining the predictability of the relationship between the proportional amount of confounding amplification and measures of exposure prediction or change in internal markers. The predictability of the proportional amount confounding amplification is the linchpin of this proposed method. While this predictability is suggested by two publications4,5, ways can be envisioned in which this predictability might break down in real-world datasets (Appendix 3.1, Appendix 3.2 and Appendix 3.3).

2) Establishing that multivariate regressions can be used to accurately estimate the contribution of the Introduced Variable and its correlate(s) to both the original confounding and change in confounding between models. This is discussed extensively in Appendix 3.2 and Appendix 4.

3) Determining whether sufficiently precise results can be routinely obtained from the ACCE Method despite random variability in the treatment effect estimates. Some recent studies do suggest that quite subtle changes in relative risk or hazard ratio resulting from the application of slightly different propensity score models can be detected9,10.

4) Developing methodology to develop confidence limits around the ACCE Method’s final treatment effect estimate. An obvious need for such methods exists. The procedure of bootstrapping would be one candidate approach.

5) Identifying approaches to, or circumstances that would, ensure other differences between the two models (e.g., in the balance achieved for included confounders, in the patient sample, and in the intervention received) are minimized. Whether these differences (discussed in Appendix 2) would create substantial error is uncertain.

An important need also exists to determine whether a set of Introduced Variables can be used, as appears possible (Appendix 6), if a single Introduced Variable does not produce sufficient confounding amplification. Indeed, part of the imperative for research on the ACCE Method stems from the possibility that the method may have unusually broad flexibility by: 1) permitting estimation using variables with a substantial association with exposure but also having an independent associations with outcome; 2) permitting a set of variables to be used to predict exposure for the purposes of the method; and 3) possibly also functioning to permit estimates of unmeasured confounding after treatment initiation (Appendix 2.3b).

Simulation studies will almost certainly be the most immediate approach to addressing these research needs and evaluating the performance of this method in general. (These studies have the advantage that given that the genuine treatment-outcome association and the amount of unmeasured confounding is able to be precisely specified by the investigator). Such simulations might build upon the recent simulation study reporting predictable confounding amplification within the lower range of R25, and others that have considered the impacts of unmeasured confounding14,15. Simulations might start with a simple 4- or 5-variable scenario: the treatment, a measured confounder, the Introduced Variable, and one or two unmeasured confounders. The simulations might start by testing the accuracy of the treatment effect estimates achieved when the Introduced Variable does or does not have an association without outcome, and expand to examine whether, and how much, the performance of the method suffers when varying strengths of correlation exist between the Introduced Variable and one or more unmeasured confounders.

Real-world studies will also be needed to help resolve whether the ACCE Method, when applied to complex and often highly-correlated real-world data, succeeds in making results from nonrandomized studies better parallel results from randomized trials16,17 (Appendix 3.2 and Appendix 7).

Potential application of the method to comparative effectiveness and surveillance research

Even if this method ultimately does not demonstrate strong quantitative precision, the potential qualitative estimates of this method may prove to have some benefit for nonrandomized comparative effectiveness research in general, especially for studies in which substantial residual or unmeasured confounding is expected. For example, many studies of mental health and/or behavioral interventions might be expected to have substantial unmeasured confounding. Important elements of the conversation between patient and provider that inform judgments of the severity of the patient’s condition and help influence treatment decisions may often go unrecorded in administrative data or even in the patient’s chart, and thus be unmeasured.

Another notable use would be to enhance medication surveillance efforts. By providing even an approximate sense of whether substantial unmeasured confounding is likely to be present, the ACCE Method could help more accurately indicate which prominent “signals” (either in effectiveness or safety) observed during the screening of large datasets appear to be less confounded (and thus should be a priority for additional investigation).

Conclusions

This paper has outlined a relatively straightforward yet novel method to potentially obtain a quantitative estimate of total residual confounding. This total residual confounding estimate then allows, in principle, for an estimate of the unconfounded treatment effect to be calculated. This paper has described the two overarching components of the method and described the specific individual steps and substeps necessary for its implementation. This paper has also offered a preliminary examination of the performance of a simple, partial version of this method on published data, and outlined research needs for refinement and validation of this method. Given the importance of identifying methods that may help remove confounding from nonrandomized treatment effect estimates, further investigation of this method by multiple research groups is clearly warranted. Even if the ACCE Method is eventually shown to have limitations or evolves from the form proposed here, the method’s general approach of deliberately amplifying confounding to permit estimation of the residual confounding originally present may have enduring analytic value. The ACCE Method and its underlying logic therefore have the potential to constitute a substantial advance for nonrandomized intervention research, and follow-up research should be rapidly conducted.

ENDNOTES

  • A. In actual application, these calculations need to account for any association between the Introduced Variable(s) and outcome if present. This adjustment is not included in this very simple example, but is likely needed in most implementations of the method (and is discussed in Steps 3 and 4).

  • B. These regressions could be performed either within treatment arms or across both treatment arms while including an indicator for treatment arm, as well as a covariate(s) for treatment arm-Introduced Variable interaction(s). My expectation is that regressions within each treatment arm may be more useful, given the correlation between the Introduced Variable and treatment, although this approach raises the question about how to best combine the two within-treatment arm coefficients (e.g., an average, weighted by the number of patients in each treatment arm).

  • C. This example and the subsequent examples use a linear propensity score model and a linear (risk ratio) outcome model. This is because: 1) the existing simulation demonstrating proportional confounding amplification is for a linear propensity score model, and 2) it is conceivable (but not certain) that the noncollapsibility of logistic outcome models might interfere with the accuracy of the subtraction of the Model 1 treatment effect estimate from the Model 2 treatment effect estimate.

  • D. The only way to produce such a large quantitative change in the treatment effect estimate (i.e., RR > 1.15) with 50% confounding amplification and starting from an RR of 1.10 would be for the unmeasured confounding to be so substantial as to exceed the entire Model 1 treatment effect estimate. Subtracting this estimated amount of confounding from the Model 1 treatment effect estimate would therefore produce an estimate of the genuine treatment effect that was below the null (ignoring, once again, random variability and making the important assumption of no association between the Introduced Variable and outcome).

  • E. In fact, the change in c statistic is highly similar (all-cause mortality: Model 1 c = 0.82, Model 2 c = 0.90; hip fracture: Model 1 c = 0.81, Model 2 c = 0.89). Thus, even though it is not clear whether the c statistic in general can serve as even an approximate index of confounding amplification, the c statistics in this case are so similar as to suggest similar proportional confounding amplification is likely. The much greater change in the treatment effect estimate observed for the hip fracture analysis implies this analysis contains greater residual confounding biasing in the direction of a protective effect, if the assumption made, that the fully-adjusted glaucoma diagnosis HR is similar in the two analyses, is valid.

  • F. At the most fundamental level, the ACCE Method can be conceptualized as determining the unconfounded treatment effect estimate by subtracting from the Model 1 Treatment Effect Estimate both a) the estimate of confounding in Model 1 for the “amplifiable fraction” of Model 1 confounding (that is, the confounding from all the confounders except the Introduced Variable(s) and, to some degree, its correlates), and b) subtracting the confounding due to the Introduced Variable, and, to some degree, its correlates. The terms described in “a)” and “b)” are intended to be complements of each other, in that together they are intended to encompass between them all the (baseline) confounding present in Model 1. Thus, what remains when they are removed is the unconfounded treatment effect estimate for Model 1. (The most rigorous, but much more labor-intensive, estimate of Model 1 residual confounding would also include the confounding contributed by the residual imbalance of each of the propensity score covariates and, potentially, correlates of these covariates, as well as the change in the balance of these propensity score covariates occurring between Model 1 and Model 2. For simplicity, those terms are not considered here, but are discussed in Appendix 2.1, Appendix 3.2a, and Appendix Figure 1c).

    The quantities that we can estimate by relatively conventional analysis of the data (i.e., in a sense, the “known quantities”) are the Model 1 treatment effect estimate and the estimated confounding attributable to the Introduced Variable(s) and its correlates (by using multivariate regression coefficients and the Bross equation). Therefore, the only term for which we are lacking an estimate is the confounding due to the “amplifiable fraction” of Model 1 confounding. Furthermore, we can obtain important information that bears upon the confounding attributable to the “amplifiable fraction” of Model 1 confounding. This information consists of the Model 2 treatment effect estimate, and the proportional amount of confounding amplification expected, which recent simulation and theoretical work suggests can be predicted for linear propensity score models from the model R2 (for predicting exposure). More precisely, the proportional confounding amplification is estimated from (1 / (1 - R2)).

    The most fundamental contribution of the ACCE Method is to call to attention to the fact that, with this readily available information, the final term needed (the Model 1 confounding attributable to the “amplifiable fraction”) should be able to be estimated. While the accuracy of this estimate has yet to be determined, the ability to come up with even an approximate estimate of the aggregate effect of all remaining residual confounding is noteworthy. In the manuscript and Appendices, the confounding in Model 1 attributable to the “amplifiable fraction” is represented by the following term:

    ((TEEM2TEEM1)ConfIntVΔ(M2M1)((1(1RM22)1(1RM12))1))

    where TEEM2 equals the Model 2 Treatment Effect Estimate, TEEM1 equals the Model 1 Treatment Effect Estimate, ConfIntVΔ(M2-M1) equals the confounding associated with the change in the balance between the treatment groups of the Introduced Variable(s) and its correlates in Model 2 compared to Model 1, and R2M1 and R2M2 equal the R2 values for propensity score Model 1 and Model 2, respectively.

    To derive this term mathematically, we can proceed with the following reasoning. Consider the simplest, “ideal” case in which no changes occur between Model 1 and Model 2 except confounding amplification. For example, that would mean that there are no differences between Model 1 and Model 2 in the balance in measured confounders or in the ”dose” of intervention received (although it is important to recognize, as discussed in Appendix 2, that means exist to address differences in either of these characteristics). Then the difference in the Model 2 treatment effect and the Model 1 treatment effect, once the differences attributable to the change in balance of the Introduced Variable are taken into account (i.e., ConfIntVΔ(M2-M1)) should reflect amplification of the Model 1 confounding that was not attributable to the Introduced Variable(s) and, to some degree, its unmeasured or nonincluded correlates. For this case, let us assume an Introduced Variable or Variables that is not correlated with other unmeasured or nonincluded confounders. To represent the difference in treatment effect estimates independent of the contribution from the change in balance of the Introduced Variable as it is added to generate Model 2, we can use the term:

    (TEEM2 - TEEM1) - ConfIntVΔ(M2-M1)

    where TEEM2, TEEM1, and ConfIntVΔ(M2-M1) are as defined above.

    For simplicity for the next few steps, let us substitute a single term for this quantity, “AdjΔTEE”, with “Adj” referring to “Adjusted”, that is, this change in Treatment Effect Estimates between Model 2 and Model 1 has been adjusted to reflect the contribution from the change in balance of the Introduced Variable. The “AdjΔTEE” terminology matches that used in Steps 3d and 4a in Appendix Table 1. That is:

    (TEEM2 - TEEM1) - ConfIntVΔ(M2-M1) = AdjΔTEE

    Since we are examining conditions where this AdjΔTEE reflects only the amplification of the amplifiable fraction of Model 1 confounding, this term can be set equal to this amplification. This amplification can be represented algebraically, using the 1 / (1 - R2) relationships, in a simple form, as follows, starting with the Model 1 Treatment Effect Estimate:

    Model 1 Treatment Effect Estimate = x + y + k

    where x equals confounding from the amplifiable fraction in Model 1, y equals the confounding due to the Introduced Variable and, to some degree, any nonincluded correlated confounders, and k equals the constant that is of crucial interest (the unconfounded treatment effect estimate). With this notation, Model 2’s Treatment Effect Estimate can be represented as:

    Model2TreatmentEffectEstimate=((1(1RM22)1(1RM12))x)+y+k

    In this equation, the term (1(1RM22)1(1RM12)) represents the proportional amount of additional confounding in Model 2 compared to Model 1. Confounding amplification in Model 2 cannot be estimated simply by (1(1RM22)) because that would represent the confounding amplification relative to a model with an R2 of 0.0. Instead, the proportional amount of confounding amplification in Model 2 relative to Model 1 needs to be determined, and for this reason the (1(1RM22)) term is divided by (1(1RM12)), creating the term (1(1RM22)1(1RM12)).

    Given these terms for the Model 1 and Model 2 Treatment Effect Estimates, then AdjΔTEE, the Model 2 – Model 1 change in the Treatment Effect Estimate, adjusted to reflect only amplification of confounding (that is, not the change attributable to insertion of the Introduced Variable(s) “probe” needed to generate confounding), can be expressed as:

    AdjΔTEE=(((1(1RM22)1(1RM12))x)+y+k)(x+y+k)

    Focusing on the right side of the equation, the y and k terms cancel out and can be removed: + y + k - ( + y + k) = 0. This makes intuitive sense. Regarding y, the component of y that is unchanging (i.e., in common between Model 1 and Model 2) cancels out and is not part of the difference between TEEM2 and TEEM1. Regarding k, the genuine underlying treatment effect estimate has not changed between Model 1 and Model 2, assuming that we can keep elements such as balance in the propensity score covariates and/or dose received the same or, for practical purposes, extremely similar between the models. If the genuine treatment effect is not varying between models, then it is not making a contribution to the change in the treatment effect estimates between Model 2 and Model 1. (Alternatively, in the case of the balance of measured confounders included as covariates in the propensity score, the effect of changes in balance of those confounders could be estimated by the Bross equation [Appendix 2.1 and Appendix Figure 1c]).

    This gives the equation:

    AdjΔTEE=((1(1RM22)1(1RM12))x)x

    Next, we can factor out x to produce the following term.

    AdjΔTEE=((1(1RM22)1(1RM12))1)x

    Solving for x yields:

    (AdjΔTEE((1(1RM22)1(1RM12))1))=x

    Substituting back in the ((TEEM2 - TEEM1) - ConfIntVΔ(M2-M1)) term for AdjΔTEE yields the equation for the original confounding (in Model 1) attributable to the amplifiable fraction of confounding, x:

    ((TEEM2TEEM1)ConfIntVΔ(M2M1)((1(1RM22)1(1RM12))1))=x

    Of note, the left hand side of the equation consists entirely of terms that can be readily estimated from data. This gives us the final quantity needed to estimate an unconfounded treatment effect estimate. Therefore, if we subtract this term plus an estimate of the Model 1 Confounding due to the Introduced Variable (s) and, to some degree, its correlates, from the Model 1 Treatment Effect Estimate, we should obtain an estimate of an unconfounded treatment effect:

    TEEM1((TEEM2TEEM1)ConfIntVΔ(M2M1)((1(1RM22)1(1RM12))1))ConfIntVM1=UnconfoundedTEE

    Quantities are as defined previously, with the additions of ConfIntVM1, which represents the confounding association with the Introduced Variable and, to some degree, its correlates, given the initial imbalance of the Introduced Variable observed in Model 1, and Unconfounded TEE, which represents the unconfounded treatment effect estimate that thereby results. As a point of clarity, no subscript is given to the Unconfounded TEE term to designate it as being the Unconfounded TEE for Model 1, even though the equation develops its Unconfounded TEE by subtracting the two components of Model 1 confounding from the observed Model 1 treatment effect estimate. No subscript is used because a key assumption is that the underlying treatment effect estimate for both models is the same.

    As pointed out in the manuscript, Appendix 3.2, and Appendix 4, the ability of the Introduced Variable(s) coefficient(s) to capture the contribution of confounding from the unmeasured or nonincluded confounders that are correlated with the Introduced Variable is a major source of uncertainty for the method. While it may (or may not) be determined that often this is not a major practical concern, or that Introduced Variables can be identified that largely lack any significant correlation with unmeasured confounding (Appendix 4), further research is clearly needed. It should be somewhat straightforward to use simulated datasets to make at least an initial inquiry into the impact of unmeasured confounding correlated with the Introduced Variable on the unconfounded treatment effect estimate that this method produces. Another question concerns whether real-world constraints exist that may limit confounding amplification. A more complete list of potential uncertainties is provided in Appendix 5, Appendix Figure 1b.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 11 Aug 2014
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Smith EG. The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding that includes unmeasured confounding [version 2; peer review: 2 approved]. F1000Research 2015, 3:187 (https://doi.org/10.12688/f1000research.4801.2)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 11 Aug 2014
Views
15
Cite
Reviewer Report 05 Jan 2015
Gregory Matthews, Department of Mathematics and Statistics, Loyola University Chicago, Chicago, IL, USA 
Approved
VIEWS 15
The authors present a manuscript describing a procedure that allows for the quantification of the total amount of residual confounding prior to bias amplification caused by propensity score models.  I believe the procedure described in reasonable, and my biggest concerns ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Matthews G. Reviewer Report For: The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding that includes unmeasured confounding [version 2; peer review: 2 approved]. F1000Research 2015, 3:187 (https://doi.org/10.5256/f1000research.5125.r7091)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading
Views
33
Cite
Reviewer Report 27 Nov 2014
Mark Lunt, Arthritis Research UK Epidemiology Unit, University of Manchester, Manchester, UK 
Approved
VIEWS 33
This article outlines a very interesting approach to using propensity score methods to correct for unmeasured confounding. That was not the aim of the propensity score, and current methods are not able to do this, so it potentially represents a ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Lunt M. Reviewer Report For: The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding that includes unmeasured confounding [version 2; peer review: 2 approved]. F1000Research 2015, 3:187 (https://doi.org/10.5256/f1000research.5125.r6843)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 11 Aug 2014
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.