ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Brief Report

Use of routinely collected data in a UK cohort of publicly funded randomised clinical trials

[version 1; peer review: 1 approved with reservations]
PUBLISHED 04 May 2020
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Health Services gateway.

Abstract

Routinely collected data about health in medical records, registries and hospital activity statistics is now routinely collected in an electronic form. The extent to which such sources of data are now being routinely accessed to deliver efficient clinical trials, is unclear. The aim of this study was to ascertain current practice amongst a United Kingdom (UK) cohort of recently funded and ongoing randomised controlled trials (RCTs) in relation to sources and use of routinely collected outcome data.
Recently funded and ongoing RCTs were identified for inclusion by searching the National Institute for Health Research journals library. Trials that have a protocol available were assessed for inclusion and those that use or plan to use routinely collected health data for at least one outcome were included. Routinely collected data sources and outcome information were extracted.
A total of 279 studies were identified with 102 eligible for data extraction. An Electronic Health Record (EHR) was the sole source of outcome data for at least one outcome in 46 trials. The most frequent sources are Hospital Episode Statistics (HES) and Office for National Statistics (ONS), with the most common outcome data to be extracted being on mortality, hospital admission, and health service resource use.
Our study has found that around half of publicly funded trials in a UK cohort plan to collect outcome data from routinely collected data sources. This is much higher than the figure of 8% found in a cohort of 189 RCTs published since 2000, the majority of were carried out in North America (McCord et al., 2019).

Keywords

Electronic Health Records, Data linkage, EHR, NIHR HTA, Randomised Clinical Trial, Randomised Controlled Trial, RCT, Registry, Routinely collected data

Introduction

Routinely collected data about health in medical records, registries and hospital activity statistics is now routinely collected in an electronic form. Progress in achieving connectivity, data linkage and security now offers the possibility of better use of this data for research purposes. For example, recent evidence shows the utility of long-term follow-up of trial patients through the electronic health record (EHR) (Fitzpatrick et al., 2018). Innovative data-enabled study designs can answer pressing knowledge gaps in research evidence. However, the extent to which such sources of data are now being routinely employed in research to deliver efficient clinical trials, potentially at a wide scale, is unclear.

The aim of this study was to ascertain current practice amongst a United Kingdom (UK) cohort of recently funded and ongoing randomised controlled trials (RCTs) in relation to sources and use of routinely collected outcome data. We define routinely collected health data to be data collected without specific a priori research questions developed prior to using the data for research.

Methods

Inclusion criteria

The following inclusion criteria were used:

  • 1. Ongoing RCT of any type including feasibility or pilot work, funded by the National Institute for Health Research (NIHR) Health Technology Assessment (HTA) programme;

  • 2. use of routinely collected health data for at least one study outcome; and

  • 3. availability of a protocol.

Search methods

A search of the NIHR Journals Library was undertaken to find protocols registered as of 25/10/2019. The search fields and terms used to select were:

  • 1. Search term: ‘Random’

  • 2. Research type: ‘Primary research’

  • 3. Programme: ‘HTA’

  • 4. Status: ‘Research in progress’

If the final published report was shown alongside the protocol this was taken to mean that the RCT was not ongoing but the status had not been updated to ‘Published’, and the study was excluded.

In the absence of a protocol, the study was excluded. For studies with multiple protocol versions, the most recently available version was used.

Data extraction

One person (AM) extracted the information and categorised each EHR, with a second person (PW) checking classifications and explanations. The information extracted was as follows: Lead Investigator surname, year started, ISRCTN, project title, study type, use of routinely collected health data for at least one study outcome, availability of a protocol, any details of EHR data quality assessment prior to use, EHR name, reasons for sourcing outcome data from EHR, specific outcomes and outcome type where clear data to be used will come from named EHRs.

Results

Figure 1 shows the study flow diagram. There were 102 eligible trials available for further study.

c12d65d4-0d8c-4875-b3a6-4ced5be3c5f6_figure1.gif

Figure 1. PRISMA flow diagram.

Table 1 shows the reasons for collecting trial outcome data from routine sources. The EHR was the sole source of outcome data for at least one outcome in 46 trials (categories 3, 4 and 6 in Table 1). In five of these 46 protocols there was reference to prior feasibility work confirming aspects of the quality of the data to be sufficient for the main trial. Of the 102 trials, 14 (categories 7a-7d in Table 1) planned to assess the feasibility of using the EHR data sources during the trial, although details of the assessment were often lacking. Raw data for Figure 1 and Table 1 and Table 2 are available (see Underlying data, McKay et al. (2020)).

Table 1. Reasons for sourcing outcome data from EHRs in 102 studies.

Multiple categories can apply to a single study.

CategoriesTotal
(1)(1a) 'Supplementing data collection for withdrawn patients (consent asked for at time of withdrawal)'7
(1b) 'Supplementing data collection for lost-to-follow-up patients'8
(1c) 'Supplementing data collection for withdrawn patients (consent NOT ASKED FOR at time of withdrawal)'2
(1e) 'Continued data collection for withdrawn patients (consent asked for at time of withdrawal)'1
(2)(2) 'Supplementing data collection for unobtainable/missing data'3
(3)(3a) 'As the sole source of all outcome data'0
(3b) 'As the sole source of all outcome data except for data related to protocol adherence and adverse event
reporting being collected using CRFs'
0
(4)(4) 'As the sole source of some outcome data'43
(5)(5a) 'As a source of some outcome data, alongside other sources for the same outcome data (e.g. CRF)'51
(5b) 'As a source of some outcome data, but collected by CRF if unable to access data'3
(6)(6a) 'Registry trial: As the sole source of outcome data with purpose-built Module to collect remaining
outcome data'
1
(6b) 'Registry trial: All outcome data collected through multiple EHRs except for questionnaire data'1
(6c) 'Registry trial: All outcome data collected through multiple EHRs except for some baseline data,
questionnaire data and other patient-reported data'
1
(7)(7a) 'EHR data compared to trial collected data as part of feasibility assessment criteria'11
(7b) 'EHR data compared to trial collected data as a main trial secondary outcome'1
(7c) 'EHR data compared to trial collected data and then collect long-term follow-up data as part of trial'1
(7d) 'EHR data compared to trial collected data and then collect long-term follow-up data after trial has been
completed'
1
(7e) 'Representativeness of randomised patients compared with all eligible patients using EHR data as part
of feasibility assessment criteria'
1
(8)(8a) 'Participants flagged with NHS Digital/other: Check health status of patient prior to contacting in case
patient has died'
2
(8b) 'Participants flagged with NHS Digital/other: Check health status/notification of any deaths, causes'12
(9)(9) 'Set up mechanisms for long-term follow-up'4
(10)(10) 'Patients asked to provide written consent for continuation in the study once have regained capacity.
Those who prefer not to be actively involved in the study follow-up, then asked to provide co
1
Total155

Table 2. Categories of EHR sources of outcome data in 46 studies where this was the sole source for at least one outcome.

SourceNumber (%)
(i) Primary care data (all regional equivalents)8 (17%)
(ii) HES (and/or regional equivalents)27 (59%)
(iii) ONS (and/or regional equivalents)27 (59%)
(iv) Data collected specifically for patient group
or healthcare intervention (to include patient
registries, ICNARC, ambulance, etc)
26 (57%)
(v) Other5 (11%)

Table 2 shows the sources of outcome data to be used in these 46 studies. The most frequent sources are Hospital Episode Statistics (HES) and Office for National Statistics (ONS), with the most common outcome data to be extracted being on mortality, hospital admission, and health service resource use (see Underlying data, Data Set 5; McKay et al. (2020)). The full list of data sources is given in Extended data, Supplementary Table 1 (McKay et al., 2020).

Discussion

Our study has found that around half of publicly funded trials in a UK cohort plan to collect outcome data from routinely collected data sources. This is much higher than the figure of 8% found in a cohort of 189 RCTs published since 2000, the majority of which were carried out in North America (McCord et al., 2019).

Very few trial teams described any assessments of data quality from EHRs in the protocol. Work is ongoing that should determine whether such information should be reported in the trial publication (Kwakkenbos et al., 2018). An extension to the SPIRIT guidelines for EHR-supported trials is soon to be initiated, and will determine whether this information should be included in the trial protocol. As a minimum, it is recommended that trialists provide evidence in any funding application about the quality of the data from the EHR.

Data availability

Underlying data

Figshare: Use of routinely collected data in a UK cohort of publicly funded randomised clinical trials. https://doi.org/10.6084/m9.figshare.12185193 (McKay et al., 2020).

This project contains the following underlying data:

  • Data_Set_1_Details_and_Figure_1_v1.0.csv. (Study identifiers and raw data used for Figure 1.)

  • Data_Set_2_Table_1_v1.0.csv. (Raw data used for Table 1.)

  • Data_set_3_Supp_Table_1_v1.0.csv. (Raw data used for Supplementary Table 1.)

  • Data_set_4_Table_2_v1.0.csv. (Raw data used for Table 2.)

  • Data_set_5_Outcomes_using_EHR_data_v1.0.csv. (Raw data showing details of outcomes using EHR data.)

Extended data

Figshare: Use of routinely collected data in a UK cohort of publicly funded randomised clinical trials. https://doi.org/10.6084/m9.figshare.12185193 (McKay et al., 2020).

This project contains the following extended data:

  • Supplementary Table 1 - EHR sources of outcome data v1.0.pdf. (Supplementary Table 1.)

Data are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

Comments on this article Comments (0)

Version 3
VERSION 3 PUBLISHED 04 May 2020
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
McKay AJ, Jones AP, Gamble CL et al. Use of routinely collected data in a UK cohort of publicly funded randomised clinical trials [version 1; peer review: 1 approved with reservations]. F1000Research 2020, 9:323 (https://doi.org/10.12688/f1000research.23316.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 04 May 2020
Views
32
Cite
Reviewer Report 13 May 2020
Sharon Love, MRC Clinical Trials Unit, University College London, London, UK 
Approved with Reservations
VIEWS 32
This is a review of the protocols of RCTs, currently in progress, funded by NIHR, UK. RCTs were selected from NIHR HTA funding stream list if they claimed to be using routinely collected data for at least one study outcome. ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Love S. Reviewer Report For: Use of routinely collected data in a UK cohort of publicly funded randomised clinical trials [version 1; peer review: 1 approved with reservations]. F1000Research 2020, 9:323 (https://doi.org/10.5256/f1000research.25738.r63052)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 01 Jun 2020
    Andrew McKay, Liverpool Clinical Trials Centre, University of Liverpool, a member of Liverpool Health Partners, Liverpool, UK
    01 Jun 2020
    Author Response
    Major comments: Thanks for your important comments. We have made these clearer within the article update to version 2.
     
    Major comments part 1: We have now made it clear that the ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 01 Jun 2020
    Andrew McKay, Liverpool Clinical Trials Centre, University of Liverpool, a member of Liverpool Health Partners, Liverpool, UK
    01 Jun 2020
    Author Response
    Major comments: Thanks for your important comments. We have made these clearer within the article update to version 2.
     
    Major comments part 1: We have now made it clear that the ... Continue reading

Comments on this article Comments (0)

Version 3
VERSION 3 PUBLISHED 04 May 2020
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.