Keywords
Measurement Error, Bias, Measurement reliability, Type I Error, Covariate Adjustment
Variables such as dietary intake are measured with error yet frequently used in observational epidemiology. Although this limitation is sometimes noted, these variables are still often modeled as covariates without formal correction or genuine discussion of how measurement unreliability may weaken the validity of statistical conclusions. Further, larger sample sizes increase the power to detect spurious correlations (i.e., the bias). Counterintuitively, recent work suggested a non-monotonic relationship between confounder unreliability and how much controlling for the confounder reduces (or induces) bias when testing for an exposure-outcome association. If true, such non-monotonicity would be especially concerning for applications such as nutrition, where measurement reliability varies substantially and large sample sizes are common.
We offer detailed derivations of the squared partial correlation between the outcome and the exposure, controlling for the confounder. In our derivations, the measurement reliabilities of the exposure and the confounder are not arbitrarily constrained to be equal. Further, our theoretical results are investigated using simulations.
Reassuringly, these derivations and simulations show that the counterintuitive non-monotonic relationship between confounder unreliability and how much controlling for the confounder reduces (or induces) bias when testing for an exposure-outcome association is an artifact of the arbitrary constraint forcing the measurement reliabilities of exposures and confounders to be equal, a constraint that does not always hold.
The profound and manifold effects of measurement error on estimation and statistical conclusion validity in realistic scenarios indicate that merely mentioning measurement error as a limitation and then dispensing with it is not an adequate response. We also explore questions for optimal study design subject to resource constraints when considering reliability of exposures, covariates, and outcomes.
The changes made to the article were done mostly in response to the reviewers' comments and suggestions. We have revised the manuscript to integrate the energy-blood pressure example more consistently across the discussion of each model, as suggested. Specifically, we have added this example to clarify key concepts, including the role of measurement error, the implications of assuming different relationships between variables, and the evaluation of predictive ability. We also refined the discussion of optimal study design, highlighting cases where improving the measurement of confounders, such as sodium intake, may be more critical than improving the measurement of the primary exposure. Additionally, we have ensured that the notation used for variables is consistent throughout the text. These revisions, we think, improve the clarity, coherence, and applicability of the models presented. We also expanded our discussion to include recommendations for researchers working with secondary data subject to potential measurement error. We have revised the manuscript to incorporate some clarification regarding assumptions in nutritional epidemiology research. Specifically, we replaced the original statement with one that acknowledges that many high-quality studies avoid assuming a linear or monotonic dose-response by using splines or categorical exposure modeling. However, we also emphasize that numerous other studies continue to rely on these untrue yet seemingly implicit default assumptions. Finally, we included a sentence highlighting that the results obtained in this manuscript were obtained in the case of linear models and do not directly extend to non-linear or generalized models. We also include references discussing measurement error in non-linear models.
See the authors' detailed response to the review by Nicola Bondonno
See the authors' detailed response to the review by Tolulope Sajobi
The challenges of measuring dietary intake and other variables in observational epidemiology in nutrition and obesity research have been widely discussed.1–3 Data on nutrient or energy intake inform prevention and treatment guidance and policies for many health conditions. Fidelity of data to true nutrient and energy intake is essential to the ability to draw valid scientific conclusions about nutritional effects. It is not easy to obtain food intake information, however, particularly from a large number of individuals as is done for the highly reputable and influential National Health and Nutrition Examination Survey (NHANES). NHANES obtains nutrient and energy intake data through self-reported dietary recall questionnaires. This approach involves individuals reporting food and drinks consumed in the past 24 hours followed by detailed questions to accurately gauge the amounts of nutrients and energy consumed. Another such subjective approach that is popular is the food-frequency questionnaire, which asks individuals about the frequency of consumption of various foods.
Understandably, such information related to nutrient and energy intake is influenced by intentional or unintentional misreporting, which substantially impacts accuracy. For example, a study pooled data from five large studies that had examined the validity of food-frequency questionnaires and 24-h dietary recalls against recovery biomarkers. Across this diverse sample of Americans, subjective estimates of energy intake explained less than 10% of the variance in true energy intake.1 It appears that the self-reported measures systematically underestimate energy intake by hundreds of kcal per day, and the NHANES surveys may underreport energy intake by as much as 800 kcal per day.2,3
Accuracy of data should be a concern for any researcher, and this level of inaccuracy should be disconcerting. However, it is a common practice to acknowledge the limitations of such measurements and to then carry on with the research as though it has at least some validity despite these limitations. Some common practices to dismiss or minimize concerns around lack of accuracy of data include rationalization that self-reported data about energy or nutrient intake have good reproducibility, or that they could be adjusted by a common factor.4 There are serious counterarguments against such rationalizations. Settling for reproducibility of data instead of accuracy is like using a mismarked measuring tape. One can reproduce the result, but it is still inaccurate. Also, dietary underreporting varies with the type of food consumed, gender, age, smoking habits, social class, education level, dietary restraints, body mass index (BMI) of the respondents, and other life stage factors,5 which makes it implausible that one could adequately correct a given dataset by a single factor and arithmetic operation.
Considering the serious implications of using decidedly inaccurate information, some investigators have raised concerns that some limitations are so severe that we are better off not conducting certain research or not using certain methods at all until more accurate methods become available. For example, Lear wrote “One may wonder if we should stop nutritional research altogether until we can get it right”.4 While this may be an extreme rhetorical position, Dhurandhar et al.5 noted that in some cases, measurement properties are so bad that it is better to not perform the research at all than to use the poor measurement instruments (even if the instruments are the best available for a particular context). Putting it succinctly, on some occasions, something is not better than nothing.
Recently, we became aware of a particular finding regarding the effects of controlling for a confounding variable that is measured with error. Westfall and Yarkoni6 showed results that seemed to imply that there is not necessarily a monotonic relationship between (a) the degree of measurement error in a confounding variable and (b) the extent to which controlling for the measurement of the confounding variable reduces bias in testing for an exposure association with an outcome. Rather, the degree of bias in testing appears to increase as one moves from a completely unreliable measure of the confounder to a partially reliable measure and then decreases again as one approaches a perfectly reliable measure.
Note that in frequentist significance testing (the type of testing generally used in this journal), a test may be said to be biased when the sampling distribution of P values from applying the test is not uniform on the interval [0,1] under the null hypothesis.7 The main concern is that the “size of the test” is inflated such that, in probability, too many small P values are generated, leading to too many type I errors.
Another key point from the article by Westfall and Yarkoni6 that appears counterintuitive is that the larger the sample size, the higher the error rate. This is critical in nutritional epidemiology because greater importance is regularly given to the largest cohort studies, when in fact the largest studies may be the most susceptible to bias in significance testing, both because they have more power to detect all effects, including spurious ones, and because large-scale studies may be less able to measure variables well.8
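To make the sample-size point concrete, the following simulation sketch (ours, not from Westfall and Yarkoni; the path coefficients of 0.5, the confounder reliability of 0.6, and the two sample sizes are all illustrative assumptions) estimates the rejection rate for the test of an exposure X under a null in which X affects the outcome Y only through a shared cause Z, while the analysis controls for an error-prone measure Z′:

```python
# Illustrative simulation: type I error rate when controlling for an unreliable
# confounder measure.  All numeric settings are hypothetical choices, not values
# taken from the paper.
import math
import random

def simulate_rejection_rate(n, reps, reliability=0.6, seed=1):
    rng = random.Random(seed)
    # Measurement-error SD giving the chosen reliability for Z' (true Z has var 1).
    err_sd = math.sqrt((1 - reliability) / reliability)
    rejections = 0
    for _ in range(reps):
        z = [rng.gauss(0, 1) for _ in range(n)]
        # Under the null, X and Y are linked only through their shared cause Z.
        x = [0.5 * zi + rng.gauss(0, math.sqrt(0.75)) for zi in z]
        y = [0.5 * zi + rng.gauss(0, math.sqrt(0.75)) for zi in z]
        zp = [zi + rng.gauss(0, err_sd) for zi in z]  # error-prone confounder measure
        # OLS of Y on X and Z' via normal equations on centered variables.
        mx, mz, my = sum(x) / n, sum(zp) / n, sum(y) / n
        xc = [v - mx for v in x]
        zc = [v - mz for v in zp]
        yc = [v - my for v in y]
        sxx = sum(v * v for v in xc)
        szz = sum(v * v for v in zc)
        sxz = sum(a * b for a, b in zip(xc, zc))
        sxy = sum(a * b for a, b in zip(xc, yc))
        szy = sum(a * b for a, b in zip(zc, yc))
        det = sxx * szz - sxz * sxz
        b1 = (sxy * szz - szy * sxz) / det  # coefficient on X
        b2 = (szy * sxx - sxy * sxz) / det  # coefficient on Z'
        rss = sum((yi - b1 * xi - b2 * zi) ** 2 for yi, xi, zi in zip(yc, xc, zc))
        se_b1 = math.sqrt(rss / (n - 3) * szz / det)
        t = b1 / se_b1
        # Two-sided p-value via the normal approximation to the t distribution.
        p = 2 * (1 - 0.5 * (1 + math.erf(abs(t) / math.sqrt(2))))
        rejections += p < 0.05
    return rejections / reps

small_n_rate = simulate_rejection_rate(n=100, reps=300)
large_n_rate = simulate_rejection_rate(n=1000, reps=300)
print(small_n_rate, large_n_rate)
```

Because residual confounding leaves a nonzero partial association, both rates exceed the nominal 5%, and the rate for the larger sample size is substantially higher.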
Although we were able to identify some articles in the nutrition and obesity literature that cited the Westfall and Yarkoni article,9–17 none of them addressed the issue of modest reliability potentially being worse than poorer reliability. To the best of our ability to discern, this issue has not been recognized in the nutrition and obesity literature. Yet, this methodologic issue in which intermediary measurement reliability may be worse than either very low or very high reliability seems especially concerning for the nutritional epidemiology field, where even aspects of dietary intake that can be measured with some degree of reliability and validity often have very modest degrees of reliability or validity.18 Thus, this non-monotonicity problem may be a particular concern, and if true, might suggest the startling conclusion that in nutrition epidemiology it is better to not measure some covariates at all than to include them in the study, if they have only modest reliability.
Here we provide an explanation of the phenomenon in simple scenarios so that researchers, reviewers, and editors in the nutrition and obesity research communities can see that the conclusion about non-monotonicity of bias as a function of covariate reliability holds only under specific and arguably contrived circumstances. Awareness of this offers some modest reassurance and also interesting insights into the design, analysis, and interpretation of studies.
Consider the following extension to the simple linear classical measurement error model (Model 1):
Additionally, we depict Model 1 in Figure 1 (omitting the error terms).
In Figure 1, X represents the independent variable (i.e., exposure, predictor, the putatively causal factor of interest), which could be a measured nutrient, a treatment received or not received, a personal factor, or any other measurable quantity. Z represents a confounding variable which we assume is not observed. Again, Z could be any measurable variable, particularly an aspect of dietary intake. Random measurement error is represented by an additive error term. Here we consider the classic true score phenomenon, which is a simple conceptualization of measurement error. We denote the error-contaminated measure of Z as Z′, with Z′ being a simple additive function of Z and the error term, where the error term is independent of Z.
“Classical test theory, also known as true score theory, assumes that each person has a true score, T, that would be obtained if there were no errors in measurement. A person’s true score is defined as the expected score over an infinite number of independent administrations of the scale. Scale users never observe a person’s true score, only an observed score …. It is assumed that observed score … = true score (T) plus some error (E) …. It is also assumed that … the expected value of such random fluctuations (i.e., mean of the distribution of errors over a hypothetical infinite number of administrations on the same subject) is taken to be 0. In addition, random errors are assumed to be uncorrelated with a true score, with no systematic relationship between a person’s true score and whether that person has positive or negative errors”.19
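The true-score decomposition quoted above can be illustrated in a few lines of simulation (the variances below are arbitrary choices for illustration):

```python
# Minimal numerical illustration of classical test theory: observed score equals
# true score T plus random error E, with E independent of T and mean 0.
# Reliability is then var(T) / (var(T) + var(E)).  The variances are arbitrary.
import random
import statistics

rng = random.Random(7)
n = 100000
true_scores = [rng.gauss(0, 1) for _ in range(n)]        # T, variance 1
errors = [rng.gauss(0, 0.5) for _ in range(n)]           # E, variance 0.25, mean 0
observed = [t + e for t, e in zip(true_scores, errors)]  # observed = T + E

reliability = statistics.variance(true_scores) / statistics.variance(observed)
print(round(reliability, 2))  # close to 1 / (1 + 0.25) = 0.8
```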
Finally, Y represents the outcome variable (i.e., the dependent variable, the thing to be predicted).
It is noteworthy that:
(a) With minor modifications, the model depicted in Figure 1 can specify a situation of confounding, causal mediation, or correlated predictors.
(b) The resulting correlation matrix among the variables depicted will be unchanged as a result of these modifications.
(c) The same statistical test will be needed when testing for an association while controlling for confounding, testing for direct effects beyond mediated effects in a mediation model, or testing for incremental predictive validity.6,20,21
For example, in the model as depicted, Z is a confounder of the association between X and Y, and the test of interest is whether the squared partial correlation of X and Y after controlling for Z is not zero. However, if we reverse the direction of the arrow from Z to X (the arrow labeled with the coefficient γ), we are describing a mediation model22 in which Z is the hypothesized mediator and the test for the direct effect of X on Y can again be whether the squared partial correlation of X and Y after controlling for Z is not zero.23 Suppose for example we want to test whether energy intake (X) is associated with blood pressure (Y), after controlling for sodium intake (Z). X and Z represent their true (unobserved) intakes, while X′ and Z′ are the error-prone measures obtained from self-report. In Model 1, we assume that energy intake (X) is measured without error, but sodium intake (Z′) is the error-prone measurement of true sodium intake. As an example, Feng et al.24,p. 3336 assessed both the direct effects and “the potential indirect effect of sodium intake on blood pressure via body mass index.” That is, Feng et al. modeled BMI as the mediator of sodium intake on systolic and diastolic blood pressure and performed mediation analysis to evaluate the total effect, direct effect, and indirect effect via BMI.
Finally, if we replace the arrow from Z to X with a double-headed arc,25 we are not specifying the causal relation between X and Z, only that they are correlated. In this case, we can then test whether, after accounting for the predictive ability of Z, X can add to the ability to predict Y. In the context of our energy-sodium-blood pressure example, imagine we do not assume that sodium intake (Z) causally impacts energy intake (X) or vice versa. Instead, the two might be correlated, because people who consume high-sodium foods may also consume higher total energy. We then can ask whether after accounting for sodium’s predictive ability for blood pressure (Y), energy intake provides additional predictive value for blood pressure - how much variance in blood pressure each can explain beyond the other. As another example, Pichler et al.26,p. 616 aimed “to assess the accuracy and precision of a BIA [bioimpedance analysis] device and the relative contribution of BIA beyond the anthropometric parameters [emphasis added].”
Given points a, b, and c above, the implications of the methodological issue we are addressing apply to a broad swath of the research questions and hypotheses typically addressed in our field.
For simplicity, we consider that all variables are such that the use of ordinary least squares linear multiple regression (i.e., multivariable regression27) is appropriate. Extensions to other data forms are available.28 Suppose that investigators wish to test whether X is associated with (i.e., predicts) Y after controlling for Z. However, the investigators do not have measures of Z. Rather, they have measures of Z′, the error-contaminated or imperfect measurement of Z. That is why in Figure 1, Z′ is depicted in a rectangle whereas the other variables, which are assumed to be measured without error for this stylized example, are depicted in ellipses.
The question then becomes: will the type I error rate (sometimes called the false-positive rate) be maintained at its appropriate nominal level if we control for Z′ rather than Z, and how will any degree of bias in the significance testing be related to the reliability of Z′? From Figure 1, the reliability29 of Z′ is α². If controlling for Z′ leads to unbiased inference using standard frequentist significance testing at the 0.05 significance level, then (because in this hypothetical, X has no effect on Y) the test of the association of X with Y after controlling for Z′ would yield significant results only 5% of the time. The false power (i.e., type-1 error rate inflation above the nominal significance level) of the test of the association of X with Y after controlling for Z′ would be the degree of bias in the significance testing procedure under these circumstances. To calculate this degree of bias or false power, we can begin by constructing the correlation matrix depicted in Table 1.
|    | Z′ | X | Y |
|----|----|---|---|
| Z′ | 1 |  |  |
| X  | ρ_XZ′ | 1 |  |
| Y  | ρ_YZ′ | ρ_YX | 1 |
The correlation coefficients in the cells of the matrix in Table 1 are derived by simple application of the standard rules of path analysis.30 From there, we can calculate power by writing out the F-test31 of the association of X with Y after controlling for Z′. In our specific example, this yields (with N the sample size):
For any fixed sample size and specific model, the number of covariates and the sample size are constant, meaning that the rightmost term of the equation becomes irrelevant if one is only concerned with power as a function of measurement reliability, which is our context here. Therefore, recognizing that the non-centrality parameter and the power (false power or bias) will be monotonic functions of only the left term on the right side of equation 1, we can deal only with that term. This term is in fact none other than the squared partial correlation coefficient (in nonmathematical terms, a squared partial correlation coefficient can be defined as the proportion of the remaining variance in the dependent variable, after controlling for the covariates, that can be “explained” by the independent variable of interest32) of X and Y after controlling for Z′. It is important to note that this is the squared partial correlation coefficient and not the squared semi-partial correlation coefficient [in nonmathematical terms, a squared semi-partial correlation coefficient can be defined as the proportion of the total variance in the dependent variable that can be “explained” by the independent variable of interest, after controlling for the covariates32], because the difference in the denominators of the two coefficients can be helpful in understanding the phenomena of interest in this paper. The quantities needed for the partial correlation can then be expressed as functions of the elements of Table 1, as follows:
The squared partial correlation of Y and X controlling for (i.e., after partialing out) Z′ takes the standard form:

ρ²_YX·Z′ = (ρ_YX − ρ_YZ′ ρ_XZ′)² / [(1 − ρ_YZ′²)(1 − ρ_XZ′²)]
Substituting the right sides of equations 2 and 3 into the right side of equation 4 yields:
Equation 5 can then be re-expressed by substituting the zero-order correlations expressed as ‘ρ’s in terms of the path coefficients from Figure 1 as follows:
One can then calculate the squared partial correlation coefficient (the quantity in equation 6) as a function of the reliability of Z′. The reliability of Z′ is α². The first derivative of the squared partial correlation coefficient with respect to α (the square root of the reliability coefficient) is
The first derivative of the squared partial correlation coefficient with respect to α has no real roots within the open intervals (−1, 0) and (0, 1). The value of the derivative is negative for any α in (0, 1) and any fixed values of the other path coefficients in (−1, 1), indicating that the squared partial correlation is a decreasing function of α over (0, 1) (see Figure 2).
All symbolic calculations for the derivative and root solutions were performed in the computer algebra system, Wolfram Mathematica 12.0.
This shows that the squared partial correlation coefficient quantifying the association between X and Y conditional on Z′ is always decreasing in α over the half-closed interval [0,1). This means that the lower the value of α (i.e., the lower the reliability of the confounder measurement), the higher the (spurious) partial correlation of X and Y controlling for Z′ will be. This illustrates the classic concept of residual confounding. This is intuitively sensible and reassuring and refutes any implication that bias due to imperfect reliability of a confounding variable increases as the reliability increases before coming back down.
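A quick numerical check of this monotonicity (a sketch with hypothetical path values, not the paper's Mathematica derivation) applies the standard partial-correlation formula to the correlations implied by Model 1; γ = 0.6 for the Z → X path and b = 0.5 for the Z → Y path are arbitrary illustrative choices:

```python
# Numerical check that in Model 1 the spurious squared partial correlation of X
# and Y given Z' falls as the reliability of Z' (alpha squared) rises.  The path
# values gamma = 0.6 and b = 0.5 are arbitrary illustrative assumptions.
def sq_partial_model1(alpha, gamma=0.6, b=0.5):
    r_xy = gamma * b      # X and Y are linked only through Z (null hypothesis)
    r_xz = gamma * alpha  # correlation of X with the error-prone Z'
    r_yz = b * alpha      # correlation of Y with Z'
    num = (r_xy - r_xz * r_yz) ** 2
    den = (1 - r_xz ** 2) * (1 - r_yz ** 2)
    return num / den

grid = [i / 100 for i in range(100)]              # alpha in [0, 1)
values = [sq_partial_model1(a) for a in grid]
assert all(v1 >= v2 for v1, v2 in zip(values, values[1:]))  # monotone decreasing
print(values[0], values[-1])  # starts at (gamma * b) ** 2, falls toward 0
```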
In contrast, consider the following, just slightly more complex, model (Model 2):
Mapping Model 2 (Figure 3) to our energy-sodium-blood pressure example, we now assume that both energy intake and sodium levels are self-reported and prone to measurement error, so we observe X′ and Z′, but that blood pressure (Y) can still be assessed objectively. This then produces the 3×3 correlation matrix in Table 2.
|    | Z′ | X′ | Y |
|----|----|----|---|
| Z′ | 1 |  |  |
| X′ | ρ_X′Z′ | 1 |  |
| Y  | ρ_YZ′ | ρ_YX′ | 1 |
The analogues for equations 5 and 6 from Model 1 are now equations 7 and 8 for Model 2, which are:
Equation 7 can then be re-expressed by substituting the zero-order correlations expressed as ‘ρ’s in terms of the path coefficients from Figure 3 as follows:
For pedagogical purposes, let us follow Westfall and Yarkoni’s approach and constrain the measurement reliabilities of X′ and Z′ to be equal.
Then, equation 8 simplifies to:
One can then calculate the squared partial correlation coefficient (the quantity in equation 9) as a function of α, where α² is now the common reliability of Z′ and X′. Taking the first derivative of the squared partial correlation coefficient with respect to α, we obtain:
Setting the derivative equal to zero and solving for α, under the constraints that the path coefficients are all in the interval (−1, 1), results in a unique critical point in (0, 1). Because the squared partial correlation is an even function of α, there is a symmetric critical point in (−1, 0). The region where α < 0 is a reflection of the region where α > 0 across the y-axis; therefore, we can restrict our analysis to the region α ≥ 0. In this region, the derivative is positive for α below the critical point and negative for α above it within the interval (0, 1), verifying that the function increases before the critical point and decreases after. As a result, the squared partial correlation coefficient quantifying the association between X′ and Y conditional on Z′ is not monotonic in α over the half-closed interval [0, 1). This is illustrated in Figure 4.
This would seem to support Westfall and Yarkoni’s statement that “the effect of reliability on error rates is even less intuitive: there is a non-monotonic relationship, such that type 1 error approaches 5% [the nominal rate they set] when reliability nears 0 or 1, but is highest when reliability is moderate”.6 What is not made clear in their paper is: the reliability of what? Specifically, their statement is not true when referring to a single reliability coefficient. It is only true because they have constrained the reliability of the exposure measurement and the reliability of the confounder measurement to have the same value. If we decouple them (as there is no a priori reason that they must have the same value) and return to equation 8 instead of equation 9, we can see that, when Z is a confounder, the squared partial correlation coefficient (the quantity in equation 8) is a monotonic function of the reliability of Z′ alone, with no interior maximum.
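Both halves of this argument can be sketched numerically. In the sketch below (ours, not the paper's supplementary R code), the path values γ = b = 0.5 and the fixed X′ reliability of 0.6 are arbitrary illustrative assumptions, and the zero-order correlations follow the usual path-tracing rules:

```python
# Model 2 sketch: bias is non-monotonic under the equal-reliability constraint
# but monotonic once the two reliabilities are decoupled.  Path values and the
# fixed X' reliability are hypothetical illustrative choices.
import math

def sq_partial(rx, rz, gamma=0.5, b=0.5):
    # Zero-order correlations implied by path tracing, with rx and rz the
    # reliabilities of X' and Z'; under the null, X' relates to Y only via Z.
    r_xz = gamma * math.sqrt(rx * rz)
    r_xy = b * gamma * math.sqrt(rx)
    r_zy = b * math.sqrt(rz)
    num = (r_xy - r_xz * r_zy) ** 2
    den = (1 - r_xz ** 2) * (1 - r_zy ** 2)
    return num / den

# Westfall-Yarkoni constraint: both reliabilities equal r, so bias rises then falls.
constrained = [sq_partial(i / 100, i / 100) for i in range(100)]
peak = max(range(100), key=lambda i: constrained[i])
assert 0 < peak < 99  # interior peak: non-monotonic in the common reliability

# Decoupled: fix the reliability of X' and vary only the reliability of Z'.
decoupled = [sq_partial(0.6, i / 100) for i in range(100)]
assert all(a >= b for a, b in zip(decoupled, decoupled[1:]))  # monotone, no peak
```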
This leads to the insight that it is the relative values of the two reliabilities that determine this pattern. To make this clear, we re-express one reliability in terms of the other and then substitute into equation 8 to yield:
In reduced form, we have:
provided that
Using this expression, we can make the following observations. The expression is a product of positive terms and hence is positive. If we relax the assumptions made above, imposing only that certain coefficients are set at 1.0 and one correlation at −1, then we can show that the partial correlation above is monotone increasing with the reliability and converges to ½ as the reliability approaches 1 from the left. Figures 5 and 6 show the plots of the partial correlation and its first derivative, respectively, as functions of the reliability under these settings, both illustrating that the partial correlation is a non-decreasing function of the reliability. Further, it turns out that under the alternative settings considered, the partial correlation is also monotone in the reliability.51 We provide a small R code to reproduce the plots presented in this paper, including this alternative case.51
What these derivations and illustrations show is that the degree of ‘false power’ or bias can actually go up as one moves from a completely unreliable measure of a confounding variable to a modestly reliable measure of a confounding variable, as Westfall and Yarkoni opined, but only if one artificially constrains the reliability of the exposure measurement to also be increasing as a function of the reliability of the confounder measurement. This is because, as we showed previously,33 although the numerator of a ratio and the denominator of a ratio may both move toward their “proper” values monotonically as a function of a single variable, they may do so at different rates such that the ratio may change non-monotonically.
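A toy example makes the ratio point concrete (the two functions below are arbitrary and unrelated to the models above):

```python
# Two functions that each move monotonically toward their limiting values can
# still have a non-monotonic ratio.  Both n_func and d_func increase on [0, 1],
# yet their ratio peaks strictly inside the interval.
def n_func(r):
    return r              # monotonically increasing numerator

def d_func(r):
    return 0.1 + r ** 2   # monotonically increasing denominator

ratio = [n_func(i / 100) / d_func(i / 100) for i in range(101)]
peak = max(range(101), key=lambda i: ratio[i])
print(peak / 100)  # interior peak, near sqrt(0.1), roughly 0.32
assert 0 < peak < 100
```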
Many epidemiologic studies are observational (not interventional) and as such are not under experimental control.34 Such observational data are often prone to measurement error, missing data, and confounding. While it is well known that measurement error in a single exposure, X, under some very specific circumstances, leads to attenuated associations in simple linear regression, less is known about how measurement error influences estimation, testing validity, and optimal study design when investigators are interested in modeling the association between the exposure and an outcome, Y, after controlling for a confounder, Z, where all three variables are measured with error and complex error structures and analysis methods may prevail.
In nutritional studies involving estimation of the effects of dietary exposures on disease outcomes, linear regression calibration is the most commonly applied method.35 Linear calibration-based approaches to measurement error correction involve two-stage models in which replacement values for the true dietary exposure are simulated or obtained from its conditional distribution given the measured value, and the association between these calibrated measures of the exposure and the outcome is estimated in the second stage.36 Under linear calibration approaches, non-differential measurement error and uncorrelated errors between the dietary and reference instruments are assumed.35 In models where a confounder and an exposure are both measured with error, or when multiple covariates are measured with error, multivariable regression calibration is often used in nutritional studies.37 The multivariable regression calibration approach is performed under assumptions of random within-person errors or under combinations of random error. In general, the effect of measurement error in a single mis-measured or imprecisely observed exposure is to attenuate its effects on health outcomes. However, how measurement error in both a confounder and an exposure influences their estimated effects on the dependent variable is less clear. Measurement error can modify or mask the effects of a confounder when both the confounder and the exposure are error prone.34
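The two-stage logic of linear regression calibration described above can be sketched with simulated data. In this hedged sketch, the validation-study design, sample sizes, and effect sizes are all invented for illustration:

```python
# Sketch of two-stage linear regression calibration on simulated data.  The
# design (validation subsample with known true exposure) and all numeric
# settings are hypothetical illustrative choices.
import random

def ols_slope(x, y):
    # Simple-regression slope: cov(x, y) / var(x), on centered data.
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

rng = random.Random(3)
n = 20000
true_x = [rng.gauss(0, 1) for _ in range(n)]
x_obs = [x + rng.gauss(0, 1) for x in true_x]    # error-prone measure, reliability 0.5
y = [1.0 * x + rng.gauss(0, 1) for x in true_x]  # true slope is 1.0

# Stage 1: in a validation subsample where true X is known, regress X on X'.
m = 2000
calib = ols_slope(x_obs[:m], true_x[:m])
# Stage 2: replace X' by its calibrated prediction and refit the outcome model.
x_hat = [calib * v for v in x_obs]
naive = ols_slope(x_obs, y)      # attenuated toward 0 (about 0.5 here)
corrected = ols_slope(x_hat, y)  # approximately recovers the true slope
print(round(naive, 2), round(corrected, 2))
```

The naive slope illustrates the attenuation noted above, while the calibrated fit roughly restores the true coefficient under these simple assumptions (non-differential, uncorrelated errors).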
While efforts have been made to develop statistical approaches that correct for measurement error to reduce biases in the estimation of exposure effects, less has been done to determine how the presence of measurement error in an exposure or in both exposure and confounder influences optimal study designs. By optimal study design, we mean the design which minimizes some loss function (e.g., some weighted function of quantities of interest such as power, type 1 error rate, bias, financial cost, study duration, etc.) subject to user-defined constraints.38 Spiegelman and Gray39 developed cost-efficient and statistically powerful cohort study designs for continuous exposures measured with error in binary logistic regression models. The authors proposed three ways to optimize study designs in the presence of errors. These include the inclusion of an internal validation study, external validation of subsamples obtained from other studies, and the use of better exposure methods for assessment.39
Such thinking invites several questions for future research when we wish to test for the association between an independent variable and a dependent variable after controlling for a third variable, and all may be measured with error.
• A first question is a meta-research question40: What is the state of practice? When investigators apply measurement error correction procedures, do they typically do so for X, Z, Y, or some combination? Although systematic reviews of related questions have appeared,35 we are not aware of one that has answered this exact question.
• A second question is how does applying such measurement error corrections affect power and bias if applied to various combinations of X vs Y vs Z?
• Third, do the answers to the second question suggest under which circumstances, to what extent, and for which variables (X, Y, or Z) investigators should invest their resources in improving measurement reliability, if the possibility of increasing the reliability of each at the cost of finite funds exists? Such inquiries might lead to the counterintuitive finding that under some circumstances one should invest more in measuring the nuisance variables one is not interested in than the putative causal factors in which one is interested. For example, returning to the energy-sodium-blood pressure example, it may be optimal in certain scenarios to invest in a more accurate sodium-intake measure (Z′) rather than the energy-intake measure (X′) if it is suspected that sodium may drive the confounding effects on blood pressure.
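The third question can be illustrated with a toy budget-allocation sketch. Everything here is an invented assumption: the budget-to-reliability mapping, the path values, and the loss function (which, for simplicity, is only the spurious squared partial correlation that drives false power; a realistic loss function would also weigh power for true effects, cost, and duration, as noted above):

```python
# Toy design-optimization sketch: split a fixed budget between improving the
# reliability of the exposure measure X' and the confounder measure Z', choosing
# the split that minimizes spurious bias.  All functions and values are invented.
import math

def reliability_from_spend(s, base=0.3, gain=0.6):
    # Hypothetical diminishing-returns mapping from spend s in [0, 1] to reliability.
    return base + gain * (1 - math.exp(-3 * s))

def sq_partial(rx, rz, gamma=0.5, b=0.5):
    # Spurious squared partial correlation of X' and Y given Z' (path tracing,
    # null hypothesis of no direct X effect), as in the models above.
    r_xz = gamma * math.sqrt(rx * rz)
    r_xy = b * gamma * math.sqrt(rx)
    r_zy = b * math.sqrt(rz)
    num = (r_xy - r_xz * r_zy) ** 2
    den = (1 - r_xz ** 2) * (1 - r_zy ** 2)
    return num / den

budget = 1.0
best = min(
    (sq_partial(reliability_from_spend(s), reliability_from_spend(budget - s)), s)
    for s in [i / 100 for i in range(101)]
)
print(best[1])  # share of budget spent on X'; here the budget goes to Z'
```

Under this particular loss, spending on the confounder measure dominates, echoing the counterintuitive possibility raised in the bullet above; a different loss function could reverse the allocation.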
Without knowing the exact nature of any phenomenon or set of phenomena we are studying, we cannot know exactly which situation will prevail; if we did, we would not need to undertake the research. However, from general knowledge of the measurement properties of variables frequently used in nutrition and epidemiologic research, and of the magnitude of associations typically studied in nutrition and obesity epidemiology, we can say that the hypothetical scenarios evaluated above are within the realm of the typical association study. Therefore, it seems plausible that not only is measurement error a substantial concern in our models, but so is the risk of statements that may be patently untrue, such as the claim that an association actually becomes more significant when controlling for a plausible confounder, or that measurement error only attenuates effects. We thus may need to take observational research with more than a considerable grain of salt when drawing causal inferences. Further, the plausibility of our hypothetical example suggests the importance of not merely acknowledging measurement error, as most of us do in our articles on nutrition epidemiology, but actually building formal measurement error corrections into the analysis, as described elsewhere.41
Many quality papers in the field wisely do not assume a linear or monotonic dose-response of risk with intake of a food or nutrient and instead use splines or categorize exposures to avoid such assumptions (e.g., Refs. 42,43). However, many other papers in nutritional epidemiology research (e.g., Ref. 44) rely on implicit default assumptions that are untrue, namely that: 1) there is a linear or monotonic dose-response of risk with intake of a food or nutrient; 2) confounders act independently of one another, additively, linearly, and with predictable effect, when they clearly do not, which accentuates concerns about ignoring potential interactions among multiple confounders that are either not measured at all or not measured accurately45; and 3) unmeasured or poorly measured confounders can be dismissed as serious threats to statistical conclusion validity because it is implausible that a confounder or set of confounders could induce sufficient bias to produce the observed association, when in fact large samples provide large power to mistakenly detect small biases as actual associations or effects. Finally, our illustrations and derivations point out the importance of evaluating the quality of our procedures and improving them when possible. The results obtained here were developed in the context of linear models and are not readily extendable to the case of generalized linear models with non-linear link functions.46–50
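The third point can be illustrated with a small Monte Carlo sketch. The setup and parameter values here are our own illustrative assumptions, not the paper's simulation code: X has no causal effect on Y, confounding runs only through Z, and yet after adjusting for an error-prone Z′ the tiny residual bias is declared "statistically significant" once the sample is large enough.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_spurious_association(n, beta_z=0.5, rho=0.5,
                                  rel_x=0.6, rel_z=0.9, rng=rng):
    """Simulate one study of size n in which Y depends only on the
    confounder Z (X has NO causal effect), then test the partial
    association of Y with the error-prone X' controlling for the
    error-prone Z'. Returns (partial correlation, t-statistic)."""
    z = rng.standard_normal(n)
    x = rho * z + np.sqrt(1 - rho**2) * rng.standard_normal(n)   # corr(X,Z)=rho
    y = beta_z * z + np.sqrt(1 - beta_z**2) * rng.standard_normal(n)
    # Error-prone measurements with the given reliabilities:
    x_obs = np.sqrt(rel_x) * x + np.sqrt(1 - rel_x) * rng.standard_normal(n)
    z_obs = np.sqrt(rel_z) * z + np.sqrt(1 - rel_z) * rng.standard_normal(n)
    # Sample partial correlation of Y and X', controlling Z':
    r_yx = np.corrcoef(y, x_obs)[0, 1]
    r_yz = np.corrcoef(y, z_obs)[0, 1]
    r_xz = np.corrcoef(x_obs, z_obs)[0, 1]
    pr = (r_yx - r_yz * r_xz) / np.sqrt((1 - r_yz**2) * (1 - r_xz**2))
    t = pr * np.sqrt((n - 3) / (1 - pr**2))   # df = n - 3 for one covariate
    return pr, t

pr, t = simulate_spurious_association(200_000)
print(pr, t)   # tiny partial correlation, yet t far exceeds 1.96
```

Under these assumptions the population partial correlation is only about 0.02, yet at n = 200,000 the t-statistic lands around 10: a large sample confidently "detects" a bias, not an effect.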
As outlined at the beginning, the topic of measurement error in covariates has critical application for nutritional epidemiology around the practice of determining nutrient and energy intake and using it to develop nutritional guidance and policies. It is unclear what role interpretations based on inaccurate measurements have played in national and local health-related policies or health care guidance for individuals. It should be recognized that a great need exists for accurately determining nutrient or energy intake. This need will not become a priority if we continue to accept inaccurate data as "good enough" for the purpose. Therefore, moving forward, we need to discontinue the use of methods that generate decidedly inaccurate data. Next, we urge that concerted efforts be made to redirect resources toward developing methods that objectively measure nutrient and energy intake with high fidelity and over long periods of time. Until then, we need to be prepared to accept the counterintuitive conclusion that under some circumstances, in our efforts to improve the reliability or power of our methods, nothing is better than something.
Zenodo: rszoh/ReliabME: Version 1. https://zenodo.org/doi/10.5281/zenodo.12639728.51
Analysis code available from: https://github.com/rszoh/ReliabME/.
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
The authors acknowledge the reviewers for their comments, which helped improve the presentation of the paper. The authors also acknowledge Jennifer Holmes, ELS, for language editing and editorial assistance.