Keywords
Electronic Health Records, Data linkage, EHR, NIHR HTA, Randomised Clinical Trial, Randomised Controlled Trial, RCT, Registry, Routinely collected data
This article is included in the Health Services gateway.
Electronic Health Records, Data linkage, EHR, NIHR HTA, Randomised Clinical Trial, Randomised Controlled Trial, RCT, Registry, Routinely collected data
Routinely collected data about health in medical records, registries and hospital activity statistics is now routinely collected in an electronic form. Progress in achieving connectivity, data linkage and security now offers the possibility of better use of this data for research purposes. For example, recent evidence shows the utility of long-term follow-up of trial patients through the electronic health record (EHR) (Fitzpatrick et al., 2018). Innovative data-enabled study designs can answer pressing knowledge gaps in research evidence. However, the extent to which such sources of data are now being routinely employed in research to deliver efficient clinical trials, potentially at a wide scale, is unclear.
The aim of this study was to ascertain current practice amongst a United Kingdom (UK) cohort of recently funded and ongoing randomised controlled trials (RCTs) in relation to sources and use of routinely collected outcome data. We define routinely collected health data to be data collected without specific a priori research questions developed prior to using the data for research.
A search of the NIHR Journals Library was undertaken to find protocols registered as of 25/10/2019. The search fields and terms used to select were:
1. Search term: ‘Random’
2. Research type: ‘Primary research’
3. Programme: ‘HTA’
4. Status: ‘Research in progress’
If the final published report was shown alongside the protocol this was taken to mean that the RCT was not ongoing but the status had not been updated to ‘Published’, and the study was excluded.
In the absence of a protocol, the study was excluded. For studies with multiple protocol versions, the most recently available version was used.
One person (AM) extracted the information and categorised each EHR, with a second person (PW) checking classifications and explanations. The information extracted was as follows: Lead Investigator surname, year started, ISRCTN, project title, study type, use of routinely collected health data for at least one study outcome, availability of a protocol, any details of EHR data quality assessment prior to use, EHR name, reasons for sourcing outcome data from EHR, specific outcomes and outcome type where clear data to be used will come from named EHRs.
Figure 1 shows the study flow diagram. There were 102 eligible trials available for further study.
Table 1 shows the reasons for collecting trial outcome data from routine sources. The EHR was the sole source of outcome data for at least one outcome in 46 trials (categories 3, 4 and 6 in Table 1). In five of these 46 protocols there was reference to prior feasibility work confirming aspects of the quality of the data to be sufficient for the main trial. Of the 102 trials, 14 (categories 7a-7d in Table 1) planned to assess the feasibility of using the EHR data sources during the trial, although details of the assessment were often lacking. Raw data for Figure 1 and Table 1 and Table 2 are available (see Underlying data, McKay et al. (2020)).
Multiple categories can apply to a single study.
Table 2 shows the sources of outcome data to be used in these 46 studies. The most frequent sources are Hospital Episode Statistics (HES) and Office for National Statistics (ONS), with the most common outcome data to be extracted being on mortality, hospital admission, and health service resource use (see Underlying data, Data Set 5; McKay et al. (2020)). The full list of data sources is given in Extended data, Supplementary Table 1 (McKay et al., 2020).
Our study has found that around half of publicly funded trials in a UK cohort plan to collect outcome data from routinely collected data sources. This is much higher than the figure of 8% found in a cohort of 189 RCTs published since 2000, the majority of which were carried out in North America (McCord et al., 2019).
Very few trial teams described any assessments of data quality from EHRs in the protocol. Work is ongoing that should determine whether such information should be reported in the trial publication (Kwakkenbos et al., 2018). An extension to the SPIRIT guidelines for EHR-supported trials is soon to be initiated, and will determine whether this information should be included in the trial protocol. As a minimum, it is recommended that trialists provide evidence in any funding application about the quality of the data from the EHR.
Figshare: Use of routinely collected data in a UK cohort of publicly funded randomised clinical trials. https://doi.org/10.6084/m9.figshare.12185193 (McKay et al., 2020).
This project contains the following underlying data:
Data_Set_1_Details_and_Figure_1_v1.0.csv. (Study identifiers and raw data used for Figure 1.)
Data_Set_2_Table_1_v1.0.csv. (Raw data used for Table 1.)
Data_set_3_Supp_Table_1_v1.0.csv. (Raw data used for Supplementary Table 1.)
Data_set_4_Table_2_v1.0.csv. (Raw data used for Table 2.)
Data_set_5_Outcomes_using_EHR_data_v1.0.csv. (Raw data showing details of outcomes using EHR data.)
Figshare: Use of routinely collected data in a UK cohort of publicly funded randomised clinical trials. https://doi.org/10.6084/m9.figshare.12185193 (McKay et al., 2020).
This project contains the following extended data:
Data are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Partly
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Partly
Competing Interests: With others, I have conducted a review of the use of routinely collected health data by using release lists from registries which has been accepted for publication but is not yet published.
Reviewer Expertise: Trial conduct, particularly monitoring and the use of routinely collected health data.
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 3 (revision) 12 Mar 21 |
||
Version 2 (revision) 01 Jun 20 |
read | read |
Version 1 04 May 20 |
read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)