ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article

How adjusting elicited health utilities after the fact can adversely affect shared decision making

[version 1; peer review: 1 approved with reservations, 1 not approved]
PUBLISHED 19 Dec 2022
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Health Services gateway.

Abstract

Background: The elicitation of inconsistent health-state utility values (HSUVs) is a prevalent problem. There are two approaches to address this problem: (1) intervention during the elicitation process to ensure that patients estimate consistent HSUVs; (2) no intervention during the elicitation process and inconsistent HSUVs are adjusted after the fact. This paper studies three models recently proposed for adjusting inconsistent HSUVs and consistent HSUVs that some may consider unrealistic.
Analysis: The three models are analyzed using a sound theoretical framework: the mathematical equivalence of HSUVs elicited using the standard gamble and probabilities, the Fréchet bounds, and preference theory. It is proven that none of these models accounts for the Fréchet lower bound and health conditions that are preference substitutes.
Results: A clinical vignette proves these models may recommend treatments that result in premature death over treatments that cause acceptable adverse effects.
Conclusions: The three models are incorrect and may mislead patients and physicians to poor medical decisions. In the spirit of shared decision making, patients should be given the opportunity to reassess inconsistent HSUVs and confirm that the revised HSUVs reflect their preferences.

Keywords

Elicitation intervention, Fréchet bounds, Health-state utility values, Preference complements, Preference substitutes, Reasonableness test

Introduction

The elicitation of inconsistent health-state utility values (HSUVs) is a prevalent problem. A rational person should not prefer a joint health state (JHS) to any of the constituent single health states (SHSs). This property is termed logical consistency.1 For instance, at the individual level, 41% of the HSUVs for the JHS (incontinence & impotence) elicited by Dale et al.1 violated this property. Given the mathematical equivalence of HSUVs elicited using the standard gamble (SG) and probabilities, logical consistency corresponds to the Fréchet upper bound (FUB).2 It is highly likely that the percentage of inconsistent HSUVs was higher than 41% because Dale et al. did not consider the Fréchet lower bound (FLB).2 The later has significant implications for the disutility of multiple coexisting morbidities: the joint disutility cannot exceed immediate death (ID) or the sum of the individual disutilities.2

Two approaches to the problem of eliciting inconsistent HSUVs have been proposed:

  • 1. Prevent inconsistent elicited HSUVs through interviewers intervening whenever necessary to ensure that patients estimate consistent HSUVs.3

  • 2. No intervention during the elicitation process. Inconsistent elicited HSUVs are adjusted after the fact.4,5

The first approach requires trained and knowledgeable interviewers. The second approach requires realistic and mathematically valid models. However, the adjusted HSUVs may not accurately represent a patient’s preferences.

Triantaphyllou and Yanase4 (referred to as T-Y in this paper) proposed three models for adjusting inconsistent as well as consistent HSUVs which they say “may still not be realistic”:

  • Model (i): “Readjusting the original health utility values via an error minimization approach based on the monotonicity property.”

  • Model (ii): “Multiplicative functions for health states and a new model for adjusting the initial health-state utilities.”

  • Model (iii): “A combined approach for adjusting the initial health-state utilities.”

This study has a sound theoretical framework: the mathematical equivalence of HSUVs elicited using SG and probabilities.2 It uses probability theory and preference theory to prove that the T-Y models are incorrect. A simple clinical vignette (Box 1) demonstrates that these models can be misleading. Therefore, they are inappropriate for shared decision making (SDM) where reliable HSUVs are critical for patients and physicians to decide upon a preferred treatment.6

Box 1. Clinical vignette.

At their annual physical examination, patient HP is diagnosed to suffer from the asymptomatic health condition X. HP is otherwise in good health. HP is neither a trained nor innate probability assessor.

The physician is a proponent of SDM, and HP agrees to participate.

The physician informs HP that:

  • X would reduce their expected years of life from 15 to 7, unless treated

  • Treatment Tx has a 100% success rate in curing X

  • Tx has a 100% probability of two side effects: ai and bj.

  • HP agrees to partake in assessing the HSUVs for ai, bj and ai&bj.

This paper proceeds as follows. The theoretical framework used for analyzing the T-Y models is presented. These models are analyzed, and it is demonstrated that they are inappropriate for life-critical SDM. Concluding remarks are presented.

Methods

Theoretical framework

Health-state utility values

Health states are identified by health conditions (HCs) (also termed “attributes” and “dimensions”) and severity levels. Each combination of levels of HCs represents a unique health state. HSUVs measure the strength of a person’s subjective preferences for health-related quality of life (HRQL).7 They are cardinal values specified on the (ID = 0.0, perfect health (PH) = 1.0) scale.

Estimating HSUVs is challenging. People are affected by the information which they receive, emotional factors, and elicitation methods.7 HSUVs elicited by different methods may not agree and can affect treatment choices.8 The SG has a theoretical foundation in von Neumann-Morgenstern expected utility theory9, which establishes the validity of HSUVs elicited using the SG as a measure for HRQL.10 Using arbitrary scales for HSUVs can lead to serious errors.11,p.17

Probabilities: Fréchet inequalities

Fréchet12 proved that the joint probability of two events is bounded by the marginal probabilities of each event regardless of the dependence between them. For two events A and B,

(1)
max0.0PA+PB1.0PA&BminPAPB.

Few people realize they assign inconsistent values to joint probabilities. Osherson et al.13 state: “It is striking to observe, for example, how few people realize that it is inconsistent to attribute probabilities of 0.8 to each of two sentences and probability 0.5 to their conjunction.” From (1), the conjunction of 0.8 and 0.8 cannot be less than 0.6: PFLB0.80.8=0.8+0.81.0.

Mathematical equivalence of HSUVs and probabilities

In the SG, an individual is asked to make the hypothetical choice between living for T years with health stateai (health condition Awith severity level i) and a gamble with a binary outcome (probability p of living in PH for T years or ID with probability 1.0p). The probability p is varied until the individual is indifferent between living T years with ai and the gamble. The indifference probability paicorresponds to the individual’s HSUV for ai:10 Uai=pai. Given the mathematical equivalence of HSUVs elicited using the SG and probabilities, probability theory can be applied to the problem of identifying inconsistent HSUVs.2

Consistent HSUVs: Fréchet inequalities

Given that HSUVs are mathematically equivalent to probabilities, the Fréchet inequalities play an important role in identifying inconsistent HSUVs. Uai&bj is bounded by the FUB and FLB on conjunction irrespective of preference interactions:2

(2a)
max0.0Uai+Ubj1.0Uai&bjminUaiUbj.
(2b)
UFUBai&bj=minUaiUbj.
(2c)
UFLBai&bj=max0.0Uai+Ubj1.0.

The FUB (2b) ensures logical consistency.1 The FLB (2c) has significant implications for the disutility of multiple coexisting morbidities. The joint disutility cannot exceed ID or the sum of the individual disutilities:2

(3)
U¯FLBai&bj=min1.0U¯ai+U¯bj
where U¯1.0U.

Preference interactions

HCs can be mutually utility independent (MUI), preference complements (PCs), or preference substitutes (PSs).2 If a patient’s preference for condition A is independent of the level of condition B and vice versa, A and B are said to be MUI: UMUIai&bj=Uai×Ubj. If a patient believes that both A and B need to improve for their HRQL to improve, A and are said to be PCs. PC HSUVs are positively correlated: UMUIai&bj<UPCai&bjFUB. If a person believes that only A or only B needs to improve for their HRQL to improve, A and B are PSs. PS HSUVs are negatively correlated: FLBUPSai&bj<UMUIai&bj.

Quality-adjusted life-years

The linear quality-adjusted life-year (simply termed the QALY) is presently the principal model for medical decision making (MDM). The expected number of QALYs for living yi years in a health state xi which has a probability of occurrencepxi is10

(4)
EQpxixiyi=pxi×Uxi×yi+1.0pxi×yi.

Analysis of T-Y models

In the following subsections, the T-Y models4 are analyzed using the above theoretical framework and data shown in Table 1.14 The analysis was done using Microsoft Excel 2016.

Table 1. Elicited and adjusted HSUVs using T-Y models.

MethodsU (ai)U (bj)U (ai&bj)
No intervention0.620.730.15
Model (i)0.620.730.15
Model (ii)0.620.730.45
Model (iii)0.620.73 [0.45, 0.62]

Model (i):4 “Readjusting the original health utility values via an error minimization approach based on the monotonicity property.”

The monotonicity property requires that a rational individual should not prefer a JHS to any of the constituent health states. Hence, Model (i) satisfies the FUB (2b). For consistency with probability theory, HSUVs elicited using the SG are also required to satisfy the FLB (2c). Model (i) does not address the FLB. For instance, it does not identify the HSUVs Uai=0.62,Ubj=0.73, Uai&bj=0.15 as inconsistent: Uai&bj<UFLB0.620.73=0.35=0.62+0.731.0.

Model (ii):4 “Multiplicative functions for health states and a new model for adjusting the initial health-state utilities.”

Model (ii) posits Uiiai&bj=Uai×Ubj. This assumes that health conditions ai and bj are MUI.

Keeney & Raiffa advocated assuming MUI with the significantly important qualifier15,p. 244: “the utility independence assumptions are appropriate in many realistic problems”. Moreover, their focus was principally on decision making outside of the medical domain. More recently, Howard and Abbas wrote16,p. 578

“We have several issues with this type of ‘utility independence’ reasoning … Enforcing these ‘utility independence’ assumptions result in functional forms that are simple, but quite frequently they will not represent the preference of the decision maker.”

Experimental studies have concluded that the multiplicative model is not a suitable model for JHSUVs.17 MUI is a strong assumption that is usually inappropriate for HSUVs.18

Model (iii):4 A combined approach for adjusting the initial health-state utilities

Model (iii) posits that JHSs have a level of utility independence controlled by a parameter 0.0γ0.1. Thus, the JHSUVs lie between the MUI HSUV and the FBU (2b) and they do not account for HCs that are PSs2. For instance, given Uai=0.62 and Ubj=0.73, Model (iii) predicts 0.45Uiiiai&bj0.62. This is wrong: HCs can be PSs, in which case 0.35UPSai&bj<0.45.

T-Y4 recommend using Models (ii) and (iii) for JHSUVs that “would easily pass the previous monotonicity test but could still be considered as not realistic.” This can mislead clinicians to recommend and patients to choose unwanted treatments. Case in point, a patient who wants to avoid treatments with HSUVs 0.45 chooses treatment TX based on Uiiai&bj and Uiiiai&bj0.45.

Results and discussion

The clinical vignette in Box 1 is analyzed assuming HSUVs that are elicited with and without intervention.

HSUVs elicited with intervention

Interviewers intervene when necessary to ensure that patient HP assesses consistent HSUVs which truly represent their preferences. Elicited single HSUVs (SHSUVs) are not always more correct than elicited JHSUVs.1 HP adjusts the JHSUV and SHSUVs as shown in Box 1: Uai=0.55, Ubj=0.62, Uai&bj=0.38. These HSUVs satisfy the FUB (= 0.55) and FLB (= 0.17).

HSUVs elicited without intervention

Interviewers do not intervene during the elicitation of HSUVs. The no-intervention elicited HSUVs shown in Table 1 violate the FLB (= 0.35). As discussed above, the T-Y models do not identify these HSUVs as inconsistent. T-Y recommend using Models (ii) and (iii) for JHSUVs that “could still be considered as not realistic.4 These models predict the significantly different JHSUVs shown in Table 1.

Decision analysis

For illustration, we consider the clinical vignette and data shown in Box 1 and Table 2, respectively. Patient HP has a complicated decision to make: “to be or not to be” treated with TX? The expected number of QALYs for each alternative and set of HSUVs is calculated using (4). Table 2 summarizes the results and recommendations. The HSUVs elicited with and without intervention provide contradictory recommendations:

  • HSUVs elicited with intervention. The prediction is: 15.0 YLs, 7.8 QALYs. The recommendation is “Yes Tx”.

  • HSUVs elicited without intervention and adjusted after the fact. Model (i) predicts 15 YLs and 2.25 QALYs. Models (ii) predicts 15 YLs and 6.75 QALYs. Based on the number of QALYs, Models (i,) and(ii) recommend “No Tx”. Model (iii) recommends either “No Tx” or “Yes Tx” depending on the control parameter γ.

Table 2. Impact of HSUV elicitation and T-Y models on treatment recommendation.

OptionsYLsElicitationP (ai& bj)U (ai & bj)QALYsRECsRT
No Tx7.001.001.007.00
Yes Tx15.00
Intervention1.000.527.80Yes TxP
No intervention
Model (i)1.000.152.25No TxF
Model (ii)1.000.456.75No TxF
Model (iii)1.00(0.45, 0.62][6.75, 9.30]?F

Kujawski et al.19 proposed an intuitive reasonableness test that decision models used for SMD should pass to qualify as SDM tools: “Can a treatment that results in premature death trump a treatment that causes acceptable adverse effects?” A “Yes” answer may mislead clinicians into recommending and patients into choosing decisions with unintended consequences. As shown in Table 2, the three T-Y models fail this test.

Conclusions

The elicitation of reliable HSUVs is critical to ensure medical decisions that patients truly prefer. As shown in this paper, the three T-Y models4,5 do not accurately account for individual preferences and the mathematical equivalence of HSUVs with probabilities elicited using the SG. Given consistent elicited HSUVs, it is not the function of clinicians to judge whether these are realistic or unrealistic.

A clinical vignette proves that the three T-Y models may recommend treatments that result in premature death over treatments that cause acceptable adverse effects. This is a sure sign that these models are faulty and can be misleading. Well-trained interviewers are still essential to elicit reliable HSUVs. Practical tools are being developed to assist with the assessment of HSUVs, e.g., Gambler II.20 The uncertainties of elicited HSUVs and calculated QALYs need to be addressed for sound SDM. Assuming point estimates causes false confidence in the analysis results.21,22

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 19 Dec 2022
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Kujawski E. How adjusting elicited health utilities after the fact can adversely affect shared decision making [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2022, 11:1533 (https://doi.org/10.12688/f1000research.128862.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 19 Dec 2022
Views
5
Cite
Reviewer Report 30 Oct 2024
Pierfelice Cutrufelli, University of Catania, Catania, Italy 
Approved with Reservations
VIEWS 5
As a clinician, I will focus my peer review on the clinical aspects of this paper while acknowledging that I am not qualified to evaluate its mathematical components.
I must first clarify that I am not qualified to assess ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Cutrufelli P. Reviewer Report For: How adjusting elicited health utilities after the fact can adversely affect shared decision making [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2022, 11:1533 (https://doi.org/10.5256/f1000research.141495.r235544)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
11
Cite
Reviewer Report 14 Feb 2024
Michal Lewandowski, Warsaw School of Economics, Warsaw, Poland 
Not Approved
VIEWS 11
Report on the article How adjusting elicited health utilities after the fact can adversely affect shared decision making,  written by Eduard Kujawski and submitted to F1000.

The article discusses the consistency of utility values of health states. ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Lewandowski M. Reviewer Report For: How adjusting elicited health utilities after the fact can adversely affect shared decision making [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2022, 11:1533 (https://doi.org/10.5256/f1000research.141495.r235536)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 19 Dec 2022
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.