Keywords
blinding, RCT, randomised controlled trials, placebo, surgery, procedures, non-pharmacological
blinding, RCT, randomised controlled trials, placebo, surgery, procedures, non-pharmacological
The aim of a trial is to produce unbiased evidence. As randomised controlled trials (RCTs) with a placebo arm control for many types of bias, they are regarded as the most reliable method of demonstrating treatment efficacy and provide the highest level of evidence1. RCTs of interventional procedures are rare2–5, partly because they are challenging6; however, they are not impossible to perform, even if they involve a placebo arm7. In this paper we will discuss why trials should be blinded and summarise the methods which have been used in the published placebo-controlled trials of interventional procedures to achieve blinding.
Blinding in interventional trials is often necessary because nowadays many procedures are performed to reduce pain and improve function and quality of life. Pain, function and quality of life are sometimes regarded as preferable outcome measures because they reflect patients’ needs and point of view8. However, as these outcomes depend on patients’ subjective perception, they are prone to bias and may lead to an exaggerated treatment effect in open-label trials9,10. Using subjective outcomes in an open-label study undermines its internal validity because it is not possible to determine how much of the reported effect is related to the investigated treatment and how much is related to bias.
Blinding means concealing the treatment allocation from patients and any other people involved in the trial who may bias the results of the trial by knowing which groups the patients were randomised to.
Blinding of patients prevents reporting bias in patient-reported measures. For example, it has been demonstrated that non-blinded patients exaggerate the effects size by 0.56 standard deviations and that the effect is even larger in trials on interventional procedures, such as acupuncture11,12. This bias may be caused by patients’ expectations of treatment effect and information given to them before the treatment13. Patients may also report symptoms depending on their “hunches” about treatment being effective or they may give answers they believe are “correct” or expected from them, for example, because it would have been impolite not to report improvement11. Therefore, it has been suggested that patients should be blinded whenever possible11.
Blinding of patients also reduces adherence bias, i.e. patients in the control group not following the protocol/treatment. It may also prevent so called “contamination of the control group”, i.e. seeking additional treatment/help outside the trial and receiving concomitant treatment. Blinding improves patient retention in the trial. Risk of attrition in blinded trials is about 4% whereas in non-blinded trials it is 7%11. Specifically in placebo-controlled surgical trials, subject retention is often reported as “excellent”14, and in our analysis the withdrawal rate was low (4%) and comparable between the treatment and the placebo arm15.
Unlike drug trials, in which the physician gives a tablet prepared somewhere else, the surgeon has to perform a specific procedure considered to be therapeutic; therefore, blinding of surgeons may not always be possible.
There have been attempts to blind surgeons, for example a surgeon inserted a catheter under fluoroscopic guidance and handed over the procedure to a technician who delivered the radiofrequency energy (or not) according to the allocation16. In other trials, a palatal implant delivery system was prepared by the manufacturer to either contain the implant or not, which allowed for blinding of surgeons17,18.
In 81% of placebo-controlled surgical trials both patients and assessors were blinded15. It has been demonstrated that, non-blinded assessors of subjective outcomes cause less bias in trials than non-blinded patients reporting their symptoms19. Blinding of assessors prevents observer-related bias, detection bias, and the Pygmalion effect. The Pygmalion effect refers to a situation when investigators looking for a particular response are predisposed to interpret the result in a way that shows the response they expect even if it is objectively absent. For example, a study by Hrobjartsson and colleagues demonstrated that non-blinded assessors were over-optimistic and “over-rated” patients in the treatment group rather than “under-rated” patients in the control group20. In some trials the assessment was done by people not involved in the surgery, for example blinded researchers, staff at another hospital that they were operated on at21, or by pathologist blinded to the treatment allocation22.
Apart from blinding patients and assessors, it is important that caregivers and clinical/research staff do not know patient treatment allocation, because their behaviour and attitudes may influence patient responses23–25. Patient-clinician interaction plays an important role in treatment response, and patients in trials do better as they get more attention and time from clinical staff than patients receiving standard care26,27. Therefore, the interactions between patients and the trial team should be standardised so that the “treatment context” (similar attention from doctors, expectations, and settings) are comparable between the groups.
A placebo control is necessary if we want to know whether improvement is really caused by the investigated procedure. It compares the intervention of interest with a procedure that seems identical but does not involve the crucial element of believed to be “the cure”. The aim of a placebo arm is to control for effects of receiving treatment not specifically related to the investigated intervention.
It is often difficult to determine what is a specific and what is a non-specific effect in a trial27,28, and to disentangle placebo response from response bias or the effect of patient-doctor interactions29. It is beyond the scope of this review to discuss definitions of placebo1,29. Whether something is or is not a placebo depends on the intervention and chosen outcome variables1, but in order for blinding to be successful, the control procedure has to be as similar as possible to the investigated procedure27. Interventional trials differ from drug trials as they require access to the anatomical structure of interest; therefore, they involve a skin incision or an insertion of a scope.
Many published surgical trials used general anaesthesia or heavy sedation, which made blinding easier because patients were unaware of the details of surgical procedures. In such trials, only the surgical wound had to be similar in both groups. Some studies did not add any placebo procedure but simply omitted part of interventional procedure, for example, in the trial by Stone and colleagues, all patients underwent a percutaneous coronary intervention and maximal medical therapy but only patients in the active arm also had percutaneous transmyocardial revascularisation30.
When light sedation or local anaesthesia are used, surgical staff have to simulate the actual intervention to preserve the blinding. The complexity of a surgical procedure can make blinding challenging, and ingenious ideas are required to make the real and placebo interventions indistinguishable.
If a procedure requires open surgery, then it will leave an obvious mark where the incision has been made, which will have to be imitated in the placebo group. There have been very few trials involving full skin incision, in both the surgical and placebo arms. In the seminal trials on internal mammary artery ligation31,32 a skin incision was made to expose the arteries in all patients but no ligation was made in the placebo group. Similarly, Guyuron and colleagues used a skin incision to expose superficial nerves and muscles, which were cut during the active surgery but, in the placebo group, the integrity of these structures was maintained33. Trials investigating transplantation of dopaminergic neurones as a treatment for Parkinson’s disease not only required skin incision but also burr holes in the skull34,35.
However, most of the published placebo-controlled surgical trials used minimally-invasive methods to access the structure of interest. For example, the placebo procedure involved laparoscopy but without ablation36, endoscopy without radiofrequency energy delivery37, bronchoscopy without radiofrequency energy delivery38, or bronchoscopy without valve placement39. Therefore, most of the studies required a small incision to mimic the portals created during the laparoscopy or arthroscopy, or to mimic the incision through which an intravascular catheter was inserted40. Interestingly, Sutton and colleagues used three incisions in both groups so that patients could not tell apart a diagnostic laparoscopy from a laparoscopic surgery; even though the third instrument port was not necessary in the placebo group41. Trials using endoscopy and bronchoscopy were even easier to blind as natural orifices were used to insert the scope, and the incision or actual procedure site was not visible to patients, caregivers, and assessors.
Typically, the preparation for the placebo procedure and the active procedure was as similar as possible and immitated the visual, auditory, and physical cues42–45. In order to mimick the sounds, surgeons were required to talk through the procedure steps46, ask for instruments47,48, use suction48 or ask for a laser or other device to be activated, even though it was not used in the placebo group49–53.
Clinical staff performing the intervention were screened from the patients’ view54 either by a surgical drape52 or by arranging the operating room in a way that the patient could not see the procedure44. In the trial by Stone and colleagues, patients were heavily sedated and wore opaque goggles30. In a trial by Maurer and colleagues, the manufacturer delivered tools that looked identical but did not contain an implant, which allowed for blinding of patients and clinical staff18.
Surgeons also attempted to immitate sensory cues, for example by manipulating the knee as if the actual arthroscopy were performed48, injecting saline to imitate tidal irrigation14, or by splashing saline on the knee to simulate lavage45. In a trial on meniscectomy, surgeons used a mechanised shaver (without the blade) pushing it firmly against the patella to simulate the sensations the patient would experience during the surgery48. In a trial on intragastric balloon for obesity, operators manipulated the endoscope as during the balloon insertion to create the sensation of resistance in the stomach53.
Even smell during the surgery was imitated to make the placebo procedure indistinguishable from surgery. For example, in the trial by Deviere and colleagues there were concerns that patients could have known the allocation because the copolymer used in the active arm had a distinct smell55. In trials on vertebroplasty, a container with cement was opened during placebo procedure to help with blinding42,56.
It is important that the procedure used for blinding does not have any therapeutic effect. For example, the results of the vertebroplasty trials42,56 were criticised because the elements of placebo procedure could have had an effect on the reported pain, namely, a potential pharmacological anaesthesia due to injection of an anaesthetic into the facet capsule and periosteum57.
On the other hand, the procedure used for blinding may have diagnostic use, as with diagnostic laparoscopy36,41,58 or diagnostic laparoscopy with biopsy59. In the trial by Sihvonen and colleagues, all participants underwent diagnostic arthroscopy, but only after they had been confirmed to be eligible for inclusion in the trial was the envelope with the assignment opened and the assignment revealed to the surgeon48.
Many trials specifically stated that the duration of procedure in the surgical and control arms were matched, either by immitating the elements of the surgical procedure or by keeping all patients in the operation room for the same duration of time34,38,45,47,48,60,61. However, in some trials, the placebo procedure was shortened in comparison to the actual surgery because it was believed it would have been ethically unacceptable to prolong the placebo intervention49,55.
Interventional treatment often requires additional procedures, such as diagnostic scans or medication to prevent infection43,56,62, blood clots40, transplant rejection63, or epileptic fits35. For example, in the trial by Freed and colleagues, both groups received identical preoperative evaluation, intraoperative sedation and pain control, underwent two PET scans and a MRI scan, and received phenytoin35. In some trials, the same medication was given in both groups, whereas in others unnecessary treatment was omitted or imitated, for example by injecting saline instead of antibiotics64.
The active and placebo procedure have to be indistinguishable but they also have to be stable and standardised. Standardisation of the procedure itself may be difficult but is important because surgeons vary in their experience, and gain experience throughout the trial.
Some of the changes observed in a trial may not be related to the treatment or the placebo intervention, but may be caused by the natural course of the disease, spontaneous remissions or fluctuations in the severity of symptoms or regression to the mean27,65. Some changes may be a result of just being in a trial either because of lifestyle changes that are part of the protocol such as self-monitoring using diaries or avoiding alcohol or due to so called “Hawthorne effect”, which refers to change in the behaviour when people, both patients and doctors, know that they are being observed27. Finally, it has been demonstrated that adhering to a protocol improves the performance of doctors, and that patients who adhere to treatment regimes have better outcomes66. Therefore, it is important to standardise pre- and post-operative care, and the explanations given to the patients. For example, in a trial by Sihvonen and colleagues all procedures were standardised and recorded on video; the post-operative care was also standardised, and all patients received the same exercise programme and walking aids48.
Most trials blinded the assessors while the surgeon and other staff in the operating room were aware of the group assignment, and did not participate in further treatment, post-operative care or follow-up of the patient36,48,67. In a trial by Thomsen and colleagues, the post-operative care and assessment was done at a different hospital than the surgery21. In a trial by Cotton and colleagues a post-operative care was provided by the referring physician, who was blinded when deciding on treatment, and when this was not sufficient, by the evaluating physician at the study site (who was also blinded)67.
There are other types of bias that are specific to surgical trials. For example, in trials on upper gastrointestinal tract bleeding, the endoscopic procedure was not performed if the rate of blood loss was too fast, or the endoscopy was judged to be life-threatening and posed an unacceptable risk68–70. In other trials, some patients were excluded because they could not tolerate endoscopy, or due to anatomical conditions that made the surgery impossible to carry out (laser could not be aimed at the bleeding arteries)70. Alternatively, some patients were not included in a trial because they were no longer eligible, for example because the bleeding had stopped70 or they no longer reported the symptoms on the day of the study71.
Blinding in trials of interventional procedures is possible and many creative methods have been used to maintain the blinding. Interventional procedures are challenging to blind, but the effort is worthwhile because of the obvious benefits, such as avoiding bias, as well as the less evident benefits, such as avoiding patient drop-out in the control arm.
This work has received funding from the NIHR Oxford Musculoskeletal Biomedical Research Unit.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the topic of the review discussed comprehensively in the context of the current literature?
Partly
Are all factual statements correct and adequately supported by citations?
Yes
Is the review written in accessible language?
Yes
Are the conclusions drawn appropriate in the context of the current research literature?
Yes
References
1. Moustgaard H, Bello S, Miller FG, Hróbjartsson A: Subjective and objective outcomes in randomized clinical trials: definitions differed in methods publications and were often absent from trial reports.J Clin Epidemiol. 2014; 67 (12): 1327-34 PubMed Abstract | Publisher Full TextCompeting Interests: No competing interests were disclosed.
Is the topic of the review discussed comprehensively in the context of the current literature?
Yes
Are all factual statements correct and adequately supported by citations?
Yes
Is the review written in accessible language?
Yes
Are the conclusions drawn appropriate in the context of the current research literature?
Yes
References
1. Walach H, Loef M: Using a matrix-analytical approach to synthesizing evidence solved incompatibility problem in the hierarchy of evidence.J Clin Epidemiol. 2015; 68 (11): 1251-60 PubMed Abstract | Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Research methodology, clinical and other trials
Is the topic of the review discussed comprehensively in the context of the current literature?
Yes
Are all factual statements correct and adequately supported by citations?
Yes
Is the review written in accessible language?
Yes
Are the conclusions drawn appropriate in the context of the current research literature?
Yes
References
1. Berkman ND, Santaguida PL, Viswanathan M, Morton SC: The Empirical Evidence of Bias in Trials Measuring Treatment Differences. PubMed AbstractCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Orthopaedics, Epidemiology
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | |||
---|---|---|---|
1 | 2 | 3 | |
Version 2 (revision) 30 Jan 18 |
|||
Version 1 08 Sep 17 |
read | read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)