ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Method Article

The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding

[version 1; peer review: 2 approved]
PUBLISHED 11 Aug 2014
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

Background:  Nonrandomized studies typically cannot account for confounding from unmeasured factors. 
 
Method:  A method is presented that exploits the recently-identified phenomenon of  “confounding amplification” to produce, in principle, a quantitative estimate of total residual confounding resulting from both measured and unmeasured factors.  Two nested propensity score models are constructed that differ only in the deliberate introduction of an additional variable(s) that substantially predicts treatment exposure.  Residual confounding is then estimated by dividing the change in treatment effect estimate between models by the degree of confounding amplification estimated to occur, adjusting for any association between the additional variable(s) and outcome.
 
Results:  A hypothetical example is provided to illustrate how the method produces a quantitative estimate of residual confounding if the method’s requirements and assumptions are met.  Previously published data is used to illustrate that, whether or not the method routinely provides precise quantitative estimates of residual confounding, the method appears to produce a valuable qualitative estimate of the likely direction and general size of residual confounding.
 
Limitations:  Uncertainties exist, including identifying the best approaches for: 1) predicting the amount of confounding amplification, 2) minimizing changes between the nested models unrelated to confounding amplification, 3) assessing the association of the introduced variable(s) with outcome, and 4) deriving confidence intervals for the method’s estimates (although bootstrapping is one plausible approach).
 
Conclusions:  To this author’s knowledge, it has not been previously suggested that the phenomenon of confounding amplification, if such amplification is as predictable as suggested by a recent simulation, provides a logical basis for estimating total residual confounding. The method's basic approach is straightforward.  The method's routine usefulness, however, has not yet been established, nor has the method been fully validated. Rapid further investigation of this novel method is clearly indicated, given the potential value of its quantitative or qualitative output.

Introduction

Confounding is a central challenge for virtually all nonrandomized studies. Recent research14 has revealed that propensity score methods may actually increase, or “amplify”, the residual confounding remaining after their application. In general, this recently recognized property of propensity score methods has been viewed as a limitation or complication to the use of propensity scores, for understandable reasons. More recently, however, a study has indicated that the degree of confounding amplification (also termed “bias amplification”4) occurring between propensity score models appears to be quantitatively predictable (at least in simulation)5. Not yet recognized, to my knowledge, is the extremely valuable corollary that results: the predictability of confounding amplification should, in principle, permit extrapolation back to an unamplified value of the total residual confounding originally present prior to amplification. (Throughout this manuscript “confounding” refers to baseline confounding. Confounding occurring after treatment initiation from differential discontinuation of the intervention in the treatment group and comparison group is not addressed, but some consideration is given to post-initiation confounding and a possible related approach to addressing to its estimation is briefly discussed in Appendix 1.3.b). In this manuscript and the associated appendices, I describe the general framework and detailed specifics of a new method designed to use amplified confounding to estimate total residual confounding and an unconfounded treatment effect estimate.

The basic logic of this method is straightforward, but its performance in practice has yet to be confirmed. Testing of this method on both simulated and real-world data is clearly needed. Under specific circumstances, this method may theoretically provide a quantitative estimate of total residual confounding, including from unmeasured factors. Whether and how often this is attainable in practice remains to be determined. This manuscript also illustrates, however, that even when this method is not able to provide a precise quantitative estimate of residual confounding, it may provide a very helpful qualitative estimate of the likely direction and general size of residual confounding. This manuscript is intended to provide detailed information to the research community to facilitate the rapid evaluation of the practical feasibility of this proposed approach.

Method

Step 1 – Create nested propensity score models and generate treatment effect estimates

The “Amplified Confounding-based Confounding Estimation (ACCE) Method” depends on the use of two propensity score models, one (“Model 1”) nested in the other (“Model 2”) so that Model 2 contains all the Model 1 covariates plus an additional variable or variables. Importantly, these added variable(s) should be sufficiently associated with treatment exposure to produce discernible confounding amplification. That is, the variables introduced to the model should further predict treatment exposure sufficiently to substantively increase differences between the treatment groups in the prevalences of those confounding factors that are not present in the either model.

Step 2 – Estimate both the proportional amplification of confounding and the quantitative change in the treatment effect estimate between Model 1 and Model 2

In principle, the original confounding existing prior to amplification can be estimated by extrapolation backwards if the proportional amount of confounding amplification and quantitative change in the treatment effect occurring between two propensity score models can be estimated with precision. For example, 2-fold confounding amplification that changed the observed treatment effect odds ratio (OR) from 1.10 in Model 1 to 1.21 in Model 2 (a difference in coefficients of 0.09531) would imply that residual confounding initially existed in Model 1 at such a magnitude as to entirely explain the initial, Model 1 treatment effect (β = 0.09531, or approximately OR = 1.10). (Please see Endnote A, provided at the end of the manuscript, for more detail). That is, doubling the residual confounding doubled the observed treatment effect estimate, implying all the original treatment effect estimate was due to residual confounding. Attention is needed during the method’s implementation, however, to ensure that changes between the two models distinct from confounding amplification are minimized to the extent feasible (Appendix 1).

The method requires an ability to estimate the proportional amount of confounding amplification occurring between two propensity score models. Two very different approaches suggest themselves. One approach would be to estimate amplification from existing or future simulation research based on particular metrics of exposure prediction. An example of this approach is research published5 using the linear measure of exposure prediction, R2. This work demonstrated that, for propensity score stratification or matching approaches, a linear relationship exists between unexplained variance in exposure and confounding amplification across the range of R2 = 0.04 to 0.56. This simulation study5, using a propensity score based on a linear probability model, also made the important demonstration that different unmeasured confounders appear to be amplified to a highly similar degree. Whether this is true in real-world datasets, or is simply a byproduct of this simulation, clearly merits further investigation. (Further discussion is provided in Appendix 2.2). Additional research is clearly needed to determine if a similarly predictable relationship exists for other metrics of exposure prediction (such as those proposed for logistic regression6,7), and whether apparent nonlinearities between the prediction of exposure and confounding amplification at more extreme ranges of prediction5 can be addressed quantitatively.

A second approach would be to adopt an “internal marker” strategy: deliberately withholding a measured covariate from both models to allow the increase in its imbalance between treatment groups in Model 2 to serve as an approximate indicator of the proportional confounding amplification that has occurred. It is possible, however, that the “internal marker” strategy might consistently yield at least a slight degree of underestimate of the amount of confounding amplification (Appendix 2.1).

A key assumption of this ACCE method is that residual confounding attributable to different confounders is uniformly or relatively uniformly amplified in Model 2 compared to Model 1. This important characteristic has been observed in the initial simulation that this method draws upon5, but some possibility still exists that the quantitative predictability of amplification that was observed may be merely a consequence of the particular conditions of this simulation (Appendix 2.2).

Step 3 – Adjust for the association between the introduced variable and outcome

The addition of a variable(s) to Model 2 will almost always alter the amount of residual confounding present compared to Model 1, independent of its effect producing confounding amplification (i.e., true instrumental variables are rare). A challenge arises in that it is not the total residual confounding in Model 1 (the quantity being sought) that is amplified in Model 2, only the fraction of that residual confounding that remains after introduction of the introduced variable. Because of this, adjustments are needed that reflect the confounding attributable to the introduced variable. However, to estimate total residual confounding through this method, such adjustments must occur to two quantities: 1) the change in the treatment effect estimate between Model 1 and Model 2, and 2) the Model 1 treatment effect estimate.

To make these adjustments, I propose obtaining coefficients for the introduced variable from regression models of the outcome that include all other propensity score covariates. (Please see Endnote B for more detail). This regression coefficient for the introduced variable may be biased by partially reflecting the associations with outcome of those unmeasured confounders that are correlated with the introduced variable. However, the adjustment that is needed at this step of the Method needs to reflect both the change resulting from both the improved balance in the introduced variable and from the less extensive changes in the balance of correlated variables that result. To the extent that correlations between the introduced variable and unmeasured confounders produce biases in the introduced variable-outcome association that are similar in size to the amount of increased balance occurring in these covariates, an adjustment that partially reflects the unmeasured covariates could actually be advantageous. The degree of similarity in how correlation affects the introduced variable-outcome association compared to how such correlation affects the balance between treatment groups for the correlated variables is currently uncertain. This is an area worthy of further research.

Once this introduced variable regression coefficient(s) is estimated, the Bross equation8 is used to estimate the confounding attributable to the introduced variable(s) and its correlates in both Model 1 and Model 2. (The Bross equation8, which recently has been used by Schneeweiss and colleagues in their high-dimensional propensity score algorithm9, quantifies the amount of confounding attributable to a confounder by combining the strength of the association between the covariate and outcome and the imbalance in the covariate between the treatment groups. Please see the demonstration of its use in Appendix Table 1). The amount of such confounding in Model 1 is then subtracted from the amount in Model 2 to produce an estimate of the amount of the change in the treatment effect estimate between Model 1 and Model 2 that is attributable to increased balance in the introduced variable(s) and its correlates. This estimate then is subtracted from the overall treatment effect estimate change from Model 1 to Model 2 to produce the quantity being amplified (the residual confounding in Model 1 separate from the introduced variable). (Please see Endnote C for more detail).

The degree to which this step functions successfully to separate the effect of confounding amplification from any change in the treatment effect estimate attributable directly to the improved balance in the introduced variable has yet to be determined, especially if the introduced variable is correlated with other uncontrolled confounders. However, the theoretical potential to perform the proposed adjustment suggests that this method possibly might provide a quantitatively or qualitatively accurate estimate of an unconfounded treatment effect in circumstances in which instrumental variable analysis may not be possible. At a minimum, the method may prove to provide a relatively accurate estimate of an unconfounded treatment effect in the special case in which the introduced variable is suspected to be largely uncorrelated with important unmeasured confounders. Stated in other words, unlike instrumental variable analysis, it is possible that associations between the exposure-predicting introduced variables and outcome simply complicate, but do not preclude, the use of the method. Further research, however, is clearly needed to determine whether this is the case.

Step 4 – Calculate the unconfounded treatment effect estimate

The final step involves two substeps. First, divide the result from Step 3 (the change in the treatment effect estimate from Model 1 to Model 2, adjusted to remove the change produced by increased balance in the introduced variable(s) and its correlates) by the amount of confounding amplification. This calculation derives by extrapolation an estimate of the total residual confounding in Model 1 except for the confounding attributable to the yet-to-be-introduced variable(s). Finally, subtract both that extrapolated estimate of residual confounding and the confounding attributable to the yet-to-be-introduced variable from the Model 1 treatment effect estimate. (Please see Endnote C for more detail). The result is, in general principle, an estimate of the unconfounded treatment effect.

The accuracy of this estimate, however, is not yet established. The largest uncertainty in this estimate, as discussed above, likely involves the accuracy of the adjustments proposed in Steps 3 and 4 in the context of unmeasured confounders correlated with the introduced variable. In addition, the consistent predictability of confounding amplification needs to be further established. The degree to which other differences between the models can be sufficiently minimized to prevent them from biasing the quantitative estimate of confounding amplification also deserves investigation.

Other research needs include: 1) determining whether random variability particularly reduces the method’s usefulness in smaller samples; 2) developing a methodology, such as bootstrapping, to estimate the variance for the final effect estimates; and 3) investigating whether multiple variables can be introduced together if needed to produce sufficient amplification. Nevertheless, the potential significance of a method that may produce estimates of total residual confounding and unconfounded treatment effects from nonrandomized studies should spur research into the method’s feasibility.

Results

Hypothetical example

Consider an example in which the (confounded) Model 1 treatment effect estimate equals OR = 1.265 (with an R2 of 0.25), the (confounded) Model 2 treatment effect estimate equals OR = 1.2985 (with an R2 of 0.50), the introduced variable has an association of approximately OR = 1.05 with outcome, an 80% prevalence in the treated group and 20% prevalence in the comparison group in Model 1, and a 52% prevalence the treated group and 48% prevalence in the comparison group in Model 2. (This example assumes a linear propensity score model but a logistic regression outcome model. Please see Endnote E for more detail). What is observed is an increase in the treatment effect estimate away from the null in Model 2. This change away from the null occurs despite tight control in Model 2 (but not Model 1) of a variable (the “introduced variable”) that is not only highly predictive of exposure but is also, to some degree, a confounder that would have been expected to have biased the treatment-outcome association at least modestly away from the null in Model 1. This suggests, in the absence of confounding amplification, that the tight control of this covariate in Model 2 would ordinarily result in a less biased treatment effect estimate moving towards, not away from, the null. Furthermore, given the mere 1.5-fold amplification of confounding that would be expected to result (0.75 remaining variance unexplained in Model 1 versus 0.50 variance remaining unexplained in Model 2, or 0.75/0.50 = 1.5), the fact that this modest confounding amplification is sufficient to move the treatment effect estimate away from the null despite tight control of a confounder with an OR = 1.05 implies that a substantial proportion of the Model 1 effect estimate is attributable to confounding (biasing away from the null). Specifically, these findings would imply that more than half of the original, sizeable “treatment effect” estimate (OR = 1.265) was attributable to residual confounding, and would suggest a genuine unconfounded treatment effect estimate of only OR = 1.10. (Please see Supplementary Table 1 for complete calculations).

Thus, despite the fact that the treatment effect estimates for Model 1 and Model 2 are both confounded, knowledge of the amount of expected confounding amplification allows the comparison of the effect estimates of models (with appropriate adjustments) to yield an estimate of an unconfounded treatment effect.

Application to published data

The study of Patrick et al.10 provides sufficient detail to fortuitously provide a partial opportunity to test some aspects of the ACCE methodology on real-world data. Obviously, this study was not constructed to illustrate the ACCE Method; therefore it is being used post hoc to explore the potential of the method. As a result, the data provided include several additional uncertainties beyond those that would accompany a deliberate implementation of the ACCE Method. However, by permitting an examination of the performance of even a partial version of the ACCE Method, this study illustrates the potential value this method may have as a probe indicating whether substantial residual confounding is likely (and its likely direction), even in circumstances in which a firm quantitative estimate of residual confounding is not able to be derived.

Patrick et al.10 derived a substantial number of propensity scores during their analyses of the association between statins and both all-cause mortality and hip fracture outcomes. Of note, two of the propensity scores used (for both outcomes) included an important pair in which one propensity score was nested within a slightly larger propensity score identical to the original propensity score except for the addition of a single covariate (glaucoma diagnosis). Glaucoma diagnosis was considered to be a potential instrumental variable in these analyses. First, glaucoma diagnosis was associated extremely strongly with treatment exposure (since the treatment group compared to statins for both analyses consisted of users of medications for glaucoma). Patients with a glaucoma diagnosis had an odds ratio for statin exposure of 0.07 (that is, patients with glaucoma diagnosis had approximately a 14× greater odds of being in the comparison treatment group than the statin treatment group). Second, it is plausible (although not provable) that glaucoma diagnosis lacks a substantial association with the outcomes of all-cause mortality and hip fracture, and thus may be functioning as an instrumental variable or near-instrumental variable. (Although not termed an “instrumental variable” originally10, such a term was used for glaucoma diagnosis in these analyses in a subsequent manuscript describing these findings11).

Patrick et al.10 reported both effect estimates and a measure of prediction (the c statistic) for the original (“Model 1”) model and after adding the “introduced variable” (i.e., glaucoma diagnosis) (“Model 2”). This permits an examination of the valuable qualitative findings that might result even when the ACCE Method is unable to produce a precise quantitative estimate of residual confounding. In this somewhat artificial case, the partial version of the ACCE Method that can be implemented is unlikely to produce precise quantitative estimates of residual confounding for several reasons, including the fact that the relationship of the c statistic to confounding amplification has yet to be explored, unlike the relationship between R2 and confounding amplification. In addition, the partial version of the method that can be implemented does not include the possible checks of model similarity in confounding control, patient sample, and intervention delivered (e.g., dose) described in Appendix 1. Of particular importance, this partial, illustrative version of the method does not include any adjustment to account for the association of the introduced variable (glaucoma diagnosis) with outcome (using information estimated from a full multivariate regression containing the other propensity score covariates). This lack of adjustment somewhat limits this example, since even a small association with outcome of a covariate with such an imbalance in prevalence between the treatment groups may contribute substantively to overall confounding. In fact, the manuscript notes that the minimally-adjusted hazard ratio (HR) for glaucoma diagnosis (adjusted for age, age2, and sex) is >1.175 or <1/1.175 for both outcomes. (The actual age and-sex-adjusted HR observed is HR≈0.85 for both outcomes [Amanda Patrick, Personal Communication]). What is lacking, however, is the glaucoma diagnosis HR adjusted for all the covariates in the propensity score model, rather than just age and sex. (This adjustment would involve including a total of 143 covariates for the mortality analysis and 120 covariates for the hip fracture analysis10). This fully-adjusted HR would provide information about whether or not the age and sex-adjusted glaucoma diagnosis HR might be related to aspects of care-seeking, care access, health attitudes, or other factors that might be also represented by other covariates (leaving a much lesser or close-to-null association for glaucoma diagnosis in the actual analysis). Most importantly, this fully-adjusted association would provide the quantity needed to help calculated the estimate of the unconfounded treatment effect estimate for Model 1 (Steps 3 and 4 of the method).

Interpretation of the published results using a highly partial version of the ACCE Method

Despite the limitation of not having a fully-adjusted regression coefficient for the glaucoma diagnosis-outcome association, as well as the other substantial limitations mentioned above, application of even this highly partial version of the ACCE Method appears to provide useful qualitative estimates of residual confounding for these two analyses (all-cause mortality and hip fracture).

Table 1A shows that in the all-cause mortality analyses, addition of the introduced variable (glaucoma diagnosis) moves the treatment effect estimate away from the null by a modest amount. This implies that the total residual confounding (including residual confounding from unmeasured factors) likely biases, but only very modestly, towards observing a larger effect size for statins than is genuinely present. This result is consistent with the effect estimate derived from available randomized data. In contrast, Table 1B shows that in the hip fracture analyses, addition of the same introduced variable changes the observed treatment effect HR from 0.76 to 0.69. This is a much more sizeable change in the treatment effect estimate, implying a larger quantity of underlying residual confounding biasing the estimate away from the null. If glaucoma diagnosis is in fact a near-instrumental variable, the results would imply that the unconfounded hip fracture treatment effect estimate is considerably closer to null, the approximate value that the authors expect to be the genuine treatment effect based on randomized data12.

Table 1. Application of the qualitative version of the ACCE Method to published data (Patrick et al., 2011).

A. Nested Models differing by single variable with observed strong association with exposure and expected minimal association with outcome.

Exposure-Predicting
Introduced Variable
InterventionOutcomeOriginal Model


(“Model 1”)
Larger Model
(by a single covariate)

(“Model 2”)
C statistics
Model 1Model 2
Glaucoma DiagnosisReceipt of
statin

(vs. receipt of
antiglaucoma
medication)
All-cause
mortality
PS Model restricted
to variables with
a +/- 20%
association with
the outcome
PS Model restricted to
variables with a +/- 20%
association with outcome
PLUS
Introduced variable
(glaucoma diagnosis)
0.820.90
Results
Model 1
Hazard Ratio

(Central Estimate)
Expected Genuine
Treatment Effect

(Based on
Randomized Trial
Meta-analyses)
Model 2
Hazard Ratio

(Central estimate)
Residual Confounding Suggested by Model 2
Treatment Effect Estimate compared to
Model 1 Treatment Effect Estimate
0.84HR ≈ 0.85 or less
(i.e., closer to the null)a
0.82bModestc, in the direction away from the null
(i.e., towards a more protective apparent effect
than likely genuinely exists)

That is, the ACCE Method suggests the likely
direction and general size of residual confounding
(and thus that the genuine treatment effect is
likely closer to the null than the initial Model 1
estimate), even in the absence of a precise
quantitative estimate of residual confounding.

a Reference 10, Table 2 and Discussion.

b Reference 10, Results section text (4th paragraph).

c For residual confounding not to be modest (relative to treatment effect estimate) either 1) the introduced variable would have to have a substantial association with increased mortality risk. (This seems rather unlikely, since the age and sex-adjusted HR is in the protective direction [HR ≈ 0.85; M. Patrick, personal communication], but cannot be rigorously excluded), or 2) the amplification would have to be distinctly minor (e.g., approximately 1.25×). It is assumed here that amplification from the c statistic occurs in similar fashion as with R2 in the simulation of Reference 5; that is, that the change in the remaining unexplained variance of exposure predicts amplification. This has not been established for the c statistic (and it is generally appreciated that the c statistic is not a very desirable metric for comparisons between models). Nevertheless, while we do not know the amplification precisely, amplification would appear to have to be much less than that observed by Reference 5 in similar ranges of exposure prediction using R2 for the partial ACCE method applied here to predict a large amount of residual confounding in this analysis. Furthermore, whatever the amplification is, it is likely to be highly similar between Table 1A and Table 1B. Thus, the conclusion concerning the relative amount of unmeasured confounding in the all-cause mortality compared to the hip fracture analyses given in Table 1B is likely to be valid (as long as the fully-adjusted glaucoma diagnosis association does not differ markedly for the two outcomes).

Table 1. Application of the qualitative version of the ACCE Method to published data (Patrick et al., 2011) (continued).

B. Nested Models differing by single variable with observed strong association with exposure and expected minimal association with outcome.

Exposure-Predicting
Introduced Variable
InterventionOutcomeOriginal Model


(“Model 1”)
Larger Model
(by a single covariate)

(“Model 2”)
C statistics
Model 1Model 2
Glaucoma DiagnosisReceipt of
Statin

(vs. receipt of
antiglaucoma
medication)
All-cause
Mortality
PS Model
restricted to
variables with
a +/- 20%
association with
the outcome
PS Model restricted to
variables with a +/- 20%
association with outcome
PLUS
Introduced variable
(Glaucoma Diagnosis)
0.810.89
Results
Model 1 Hazard Ratio



(Central Estimate)
Expected Genuine
Treatment Effect

(Based on
Randomized Trial
Meta-analyses)
Model 2
Hazard Ratio

(Central Estimate)
Residual Confounding Suggested by Model 2
Treatment Effect Estimate compared to
Model 1 Treatment Effect Estimate
0.76HR ≈ 1.03a0.69bMore substantial than for all-cause mortality,
in the direction away from the null (i.e., towards
a more protective apparent effect than likely
genuinely exists)c.

That is, the ACCE Method suggests the likely
direction and general size of residual confounding
(and thus that the genuine treatment effect is
likely to be closer to the null than the initial
Model 1 estimate), even in the absence of a
precise quantitative estimate of residual
confounding.

a Reference 10, Table 2 and Discussion.

b Reference 10, Results section text (4th paragraph).

c Given that the all-cause mortality and hip fracture analyses have propensity score c statistics suggesting highly similar predictions of exposure, seemingly the only likely plausible scenario by which the all-cause mortality analysis could be more confounded than the hip fracture analysis is if the introduced variable of glaucoma diagnosis has a substantially stronger protective association with outcome after control for the other propensity score covariates than the association between glaucoma diagnosis and all-cause mortality. Since these results are not available (i.e., results from an extensive multivariate regression), such a possibility cannot be rigorously excluded. Some difference might even be plausible given that considerably less is known about the predictors of hip fracture (and what is known may be less represented in healthcare databases) than for all-cause mortality. It can be inferred, however, that the magnitude of this difference would need to be substantial for the ACCE Method to suggest that the hip fracture analysis is less confounded than the all-cause mortality analysis. In an actual implementation of the ACCE method, highly-adjusted multivariate regression of the introduced-variable-outcome association would be conducted involving all or many (if the number of outcomes did not permit all the covariates to be simultaneously included) of the propensity score covariates.

Even if glaucoma diagnosis is not functioning as a near-instrumental variable, as long as the full multivariate regression coefficients for glaucoma diagnosis are even somewhat similar between the models, these two analyses considered together suggest the presence of considerably more residual confounding in the hip fracture analysis than the all-cause mortality analysis. (Please see Endnote F for more details). This is a conclusion independently suggested by the randomized trial meta-analyses12,13 cited by the authors. That is, based on the differences between the propensity score findings compared to the randomized trial meta-analyses (i.e., the hip fracture HR differed much more from previous randomized findings than the all-cause mortality HR), the general supposition would be that the hip fracture analysis is likely to be considerably more confounded than the all-cause mortality analysis. The ACCE Method, even when applied in a very partial and qualitative form, suggests the same conclusion. In this fashion, the ACCE Method may prove useful for estimating at least the likely general size and direction of residual confounding in the many circumstances where substantial randomized trial data is not available to guide one’s interpretation. This capacity of the method to provide even a qualitative estimate of residual confounding may constitute an important analytic advance.

Discussion

This paper presents a relatively straightforward four-step method exploiting the phenomenon of confounding amplification to potentially provide quantitative estimates of total confounding and unconfounded treatment effects. To my knowledge, it has not been previously recognized that the phenomenon of confounding amplification, if predictable (as suggested by recent simulation5), provides a potential mechanism to estimate total residual confounding. The fundamental approach of deliberately introducing amplified confounding into an analysis to evaluate the total residual confounding existing prior to amplification appears to possess both clear logic and considerable promise. The method hinges on part on whether the recently observed predictability of confounding amplification is found to be a generally observed phenomenon; in addition, at this stage it is unclear whether the method will need particularly large sample sizes to be routinely useful in providing quantitative estimates. Nevertheless, although aspects of the method’s implementation and precise accuracy are not yet fully resolved, further research is clearly indicated given the potential value of a new approach that may advance efforts to remove confounding from nonrandomized treatment effect estimates.

Furthermore, even if subsequent research determines that the estimates from this approach typically are sufficiently imprecise as to limit the quantitative usefulness of the method, this general approach may have considerable value as a semi-quantitative or qualitative “probe” of whether a substantial amount of residual confounding likely exists. It is hoped that the description of the method provided here is sufficient to permit the larger research community to immediately begin participating in the validation and refinement of this novel approach.

Considerations for validation and further research

The ACCE method is fundamentally a conceptually simple approach, but one that may require some care in its implementation (e.g., in the need to structure the two models so as to minimize other changes that might influence the treatment effect estimate while obtaining sufficient confounding amplification). The value of this method will depend on how often in practice it provides a useful quantitative or qualitative estimate of residual confounding. Answering this question will involve more detailed and precise examination of both simulated and real-world data, and almost certainly will involve the contributions of multiple research teams.

Useful avenues for validation research likely include: 1) the predictability of the relationship between a particular metric measuring prediction of exposure and confounding amplification and/or the potential substitutability of an “internal marker” as an alternative approach; 2) approaches to, or circumstances that would, ensure other changes between the models (in patient sample, intervention received, and the degree of control achieved for measured, included confounders) are minimized; 3) confirming that multivariate regressions provide an accurate measure of the change in confounding resulting from balancing of the introduced variable in Model 2 (and thus permits adjustment for the direct and indirect contributions of the introduced variable(s) to confounding in Model 1); 4) determining how easily multiple introduced variables can be used if a single introduced variable does not produce sufficient confounding amplification; and 5) determining whether sufficiently precise results can be routinely obtained from the ACCE Method despite the effects of random variability in treatment effect estimates, since this method requires the accurate detection of what may be fairly small changes in treatment effect estimates. It may prove that, for this reason, this method may be most useful when applied to particularly large databases; however, some recent studies using propensity score-based stratification do suggest that quite subtle changes in relative risk or hazard ratio from application of slightly different propensity score models can be detected9,10. Finally, an obvious need exists for methodology to develop confidence limits around the effect estimates emerging from the ACCE method. The procedure of bootstrapping would be one obvious candidate approach.

Simulation studies, given that the genuine treatment-outcome association is able to be specified by the investigator, may be the most immediate approach to addressing these research needs and evaluating the performance of this method in general. (Such simulations would be similar to the recent simulation study initially observing that confounding amplification may be predictable5, and others that have considered the impacts of unmeasured confounding13,14). Real-world studies might investigate whether the method appears to accomplish the task of making results from nonrandomized studies better parallel results from randomized trials15.

Potential application of the method to comparative effectiveness and surveillance research

Regardless of its ultimate precision, this method may prove beneficial for nonrandomized comparative effectiveness research in general, as well as especially beneficial for studies in which substantial residual or unmeasured confounding is expected. For example, many studies of mental health and/or behavioral interventions might be expected to have substantial unmeasured confounding, since the important elements of the conversation between provider and patient that contributes to judgments of the severity of the patient’s condition and helps influence treatment decisions often may go unrecorded even in the patient’s chart, and thus becomes unmeasurable.

Another notable use would be to enhance medication surveillance efforts. By providing even a highly approximate estimate of unmeasured confounding in a few simple steps, the ACCE Method could help more accurately indicate which prominent “signals” (either in effectiveness or safety) observed during the screening of large datasets appear to be less confounded (and thus are a particular priority for further investigation).

Conclusions

This paper has outlined a relatively straightforward yet novel method to potentially obtain a quantitative estimate of total residual confounding. This total residual confounding estimate (which would include confounding from unmeasured as well as measured factors) then allows, in principle, for an estimate of unconfounded treatment effects to be calculated. This paper has described the steps involved in applying this method, offered a very preliminary examination of the performance of a simple, partial version of this method using published data, and outlined research needs for refinement and validation of this method. Given the importance of a method that may potentially help remove confounding from nonrandomized treatment effect estimates, further investigation of this method by multiple research groups is clearly warranted. Even if the ACCE method is eventually shown to have limitations or evolves from the form proposed here, the method’s general approach of deliberately amplifying confounding to reveal existing residual confounding may have enduring analytic value. The ACCE Method and its underlying logic therefore have the potential to constitute a substantial advance for nonrandomized intervention research, and follow-up research should be rapidly conducted.

Endnotes

  • A. Not addressed in this simple example is the fact that, in almost all implementation of this Method (i.e., all implementations other than introducing a true instrumental variable), these calculations would need to adjust for the association with outcome of the variable(s) introduced into Model 2 to produce the amplification. This is discussed subsequently in Steps 3 and 4.

  • B. These regressions could be performed either within treatment arms or across both treatment arms while including an indicator for treatment arm, as well as a covariate(s) for treatment arm-introduced variable interaction(s). Comparing the results of all these approaches may be useful.

  • C. A key area for additional investigation is whether the effects upon the treatment effect estimate of the increasing balance in Model 2 in variables correlated with the introduced variable is adequately reflected by the adjustment proposed in Steps 3 and 4. This proposed adjustment does separate the residual confounding associated, directly or indirectly, with the introduced variable (which is being controlled in Model 2 and therefore cannot amplify) from the residual confounding being amplified. However, whether this separation and calculation fully captures the change in confounding attributable to the resulting increase in control, even if modest, of unmeasured confounders correlated with the introduced variable is unclear. Even if this adjustment should prove only incompletely effective in capturing the change in confounding attributable to correlated covariates, it may be determined that sometimes this is a relatively small source of error. The method would be also expected to exhibit its strongest performance when introduced variable(s) can be chosen that are suspected to be largely uncorrelated with potential unmeasured confounders. Please see Appendix 2.2 for further discussion.

  • D. Subtraction of both these quantities is necessary because, as pointed out in Step 3, the process of adding the introduced variable to Model 2 means that the amplification that occurs in Model 2 is not amplification of all the residual Model 1 confounding, but only the remaining Model 1 residual confounding (i.e., minus the contribution of the introduced variable and its correlates). Therefore, the value for the original residual confounding in Model 1 that is extrapolated from the amplified value does not include the contribution of the yet-to-be-introduced variable(s) and its correlates. The contribution to Model 1’s original residual confounding that is attributable to the yet-to-be-introduced variable(s) and its correlates must be subtracted, along with the extrapolated remaining residual confounding, from the Model 1 treatment effect estimate to estimate an unconfounded treatment effect.

  • E. This example assumes a linear propensity score model but a logistic regression outcome model because the existing simulation demonstrating proportional confounding amplification is for a linear propensity score model5. Still to be determined is whether linear, rather than logistic, outcome models will need to be used for the ACCE method’s estimates to be the most accurate, due to the need to compare risks of outcome between Model 1 and Model 2. A requirement for linear outcome models, if it exists, would add complications; however, it may prove that these complications are relatively minor drawbacks in the context of permitting the ACCE’s Method’s estimation of residual confounding and an unconfounded treatment effect estimate. This is another worthwhile area for additional research. Meanwhile, the published data examples suggests the ACCE Method may contribute useful information to guide inferences from nonrandomized studies even when only outcomes from nonlinear analyses (i.e., hazard ratios from Cox regression) are available. However, these examples are premised on the assumption that the c statistic can serve at least as an approximate index of confounding amplification.

  • F. This comparison can be made in this straightforward fashion since for the two analyses, the change in the prediction of exposure (in this case the c statistic) was highly similar (all-cause mortality: Model 1 c = 0.82, Model 2 c = 0.90; hip fracture: Model 1 c = 0.81, Model 2 c = 0.89). Thus, the resulting confounding amplification would be expected to be generally similar.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 11 Aug 2014
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Smith EG. The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding [version 1; peer review: 2 approved]. F1000Research 2014, 3:187 (https://doi.org/10.12688/f1000research.4801.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 11 Aug 2014
Views
15
Cite
Reviewer Report 05 Jan 2015
Gregory Matthews, Department of Mathematics and Statistics, Loyola University Chicago, Chicago, IL, USA 
Approved
VIEWS 15
The authors present a manuscript describing a procedure that allows for the quantification of the total amount of residual confounding prior to bias amplification caused by propensity score models.  I believe the procedure described in reasonable, and my biggest concerns ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Matthews G. Reviewer Report For: The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding [version 1; peer review: 2 approved]. F1000Research 2014, 3:187 (https://doi.org/10.5256/f1000research.5125.r7091)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading
Views
33
Cite
Reviewer Report 27 Nov 2014
Mark Lunt, Arthritis Research UK Epidemiology Unit, University of Manchester, Manchester, UK 
Approved
VIEWS 33
This article outlines a very interesting approach to using propensity score methods to correct for unmeasured confounding. That was not the aim of the propensity score, and current methods are not able to do this, so it potentially represents a ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Lunt M. Reviewer Report For: The ACCE method: an approach for obtaining quantitative or qualitative estimates of residual confounding [version 1; peer review: 2 approved]. F1000Research 2014, 3:187 (https://doi.org/10.5256/f1000research.5125.r6843)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 29 Apr 2015
    Eric Smith, Psychiatrist, The Center for Organizational and Implementation Research (CHOIR) and the Mental Health Service Line of the Department of Veterans Affairs, Edith Nourse Rogers Memorial Medical Center, Bedford, MA 01730, USA
    29 Apr 2015
    Author Response
    I would like to thank both reviewers for their thoughtful, insightful, and encouraging reviews.  I particular appreciate their openness to a new methodology to attempt to estimate residual/unmeasured confounding.  I ... Continue reading

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 11 Aug 2014
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.