ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Software Tool Article
Revised

DRETools: A tool-suite for differential RNA editing detection

[version 2; peer review: 1 approved, 1 approved with reservations, 1 not approved]
PUBLISHED 19 Sep 2018
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Bioinformatics gateway.

Abstract

Recent tools to detect RNA editing have expanded our understanding of epitranscriptomics, linking changes in RNA editing to both disease and normal cellular processes. However, the research community currently lacks tools for determining if change in RNA editing or "differential editing" has occurred. To meet this need, we present DRETools, a command-line tool-set for finding differential editing among samples, editing islands, and editing sites.

Keywords

epitranscriptomics, RNA-seq, RNA editing, differential RNA editing, editing-per-kilobase, EPK

Revised Amendments from Version 1

In this revised manuscript, a brief description of RNAEditor was added in the Methods section. Furthermore, the required hardware configuration for running DRETools, along with run times when analyzing each testing sample, were added.

See the authors' detailed response to the review by Yicheng Zhao

Introduction

RNA editing is a class of epitranscriptomic post-transcriptional modification found throughout metazoa consisting of the abundant conversion of adenosine-to-inosine (A-to-I) by ADARs (adenosine deaminases acting on RNA) and rare conversion of cytosine-to-uridine (C-to-U) by APOBEC (apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like)1. RNA editing is particularly interesting as it is detectable as A-to-G and C-to-T mismatches to the reference genome within standard RNA-sequencing data via specialized computational pipelines2. An increasing number of studies link changes in editing at specific sites or clusters-of-sites to diseases, such as epilepsy and atherosclerosis3,4. Yet, no software for detecting differential editing is available. To meet this need, we present DRETools5: 1) to calculate units that help reduce sample-bias, similar to FPKM for RNA expression; and 2) to find differentially edited sites and editing islands (i.e., clusters of editing sites)6. Further, we showcase two examples of finding differential editing and related tasks with DREtools7

Methods

Implementation

DRETools can be run via command-line by typing “dretools”, which will print the main help menu. The main help menu contains a list of operations that are available from dretools with short descriptions of each operation’s purpose. To run an operation, type dretools followed by the operation name. Further detail on each operation, including available command-line arguments and usage examples, can be found by running an operation with the --help argument. On the main help menu, operations are organized into sub-headings based on similar functions. Further detail of each sub-heading and corresponding operations can be found in the following sections.

DRETools requires the output from RNA detection software. Here we used RNAEditor6 to detect editing sites in standard RNA-seq data. RNAEditor uses a specialized alignment and variant calling pipeline to find potential editing sites and then uses filters to remove false positives. In addition to RNAEditor, there are a number of other editing detection tools available. DRETools is usable with any of these tools that work in a similar matter and produces a VCF file containing editing sites and a BAM file containing aligned reads. Details regarding the usage of RNAEditor (e.g. analysis pipelines, configuration files, and non-downloadable reference files) can be found in the archived data5.

Units

One fundamental problem between groups of samples is a lack of standardized units for describing editing within samples, editing islands, and sites. To this end, DRETools implements Editing Per Kilobase (EPK) based on “overall editing” (OE)8. EPK builds upon OE by considering both A-to-G and C-to-T transitions, excludes editing sites with 100% edited bases as potential mutations, and scaling by 103 for readability (similar to FPKM). EPK is calculated by dividing the total number of “edited” bases by the total number of bases overlapping known editing sites and multiplying by 103. In addition to samples, DRETools can compute EPKs for editing islands and sites. Sample-wise editing can be computed with the “sample-epk” function and can be thought of as the global-editing-rate, whereas, the EPK of islands and sites can be computed with "region-epk" and "edsite-epk" respectively, and thought of as the “local-editing-intensity”.

Differential editing

Recently, a method was developed to find differentially edited sites between epileptic or control mouse hippocampi3. However, methods capable of comparing different tissues are also needed. The problem is that unless the global-editing-rates are similar, we cannot determine if changes are due to differing global-editing-rates or other phenomena, such as competition with N6-methyladenosine (m6A)9. Furthermore, ADARs have been described to edit both specific sites in some cases and non-specifically within small regions in other cases10. Therefore, in addition to individual editing sites, looking at the clusters of editing is also of interest. DRETools addresses both these issues by allowing the normalization of both the global-editing-rate and site or island local-editing-intensity in EPK and testing for differential editing using a linear model (LM) with the formula: "logFeatureEPK ~ logSampleEPK + featureLength + averageReadDepth" (features can be sites or islands), which adjusts expectations for what constitutes differential editing.

Merge and stats

DRETools also includes various helper functions. For example, the merge section contains functions to find editing islands6 and create consensus sets of editing sites by merging sites from multiple samples. Finally, the stats heading contains functions that calculate useful information about editing at the sample, gene, and site levels, such as the editable area or the number of editing sites falling in 3’/5’-untranslated regions, introns, or exons.

Operation

Minimum requirements for DRETools are 8 gigabytes (GB) of RAM, a 100 GB hard drive, and an operating system with a Bash command-line interface, R version 3.3+, and Python version 3.5+. The first two operations required on average (n=5) 4.2 minutes (min) and 449 megabytes of RAM memory (MB) for edsite-merge and 4.4 min and 192 MB for find-islands. The benchmarks of remaining operations are primarily dependent on the BAM files used for computation. The BAM files used here ranged from 3.2-29 GB and 31-282 million reads. Performance was as follows: sample-epk (6-42 min, 40-54 MB), edsite-epk (6-41 min, 40-310 MB), region-epk (7-35 min, 40-323 MB), edsite-diff (0.41-3.49 min and 534.09-2280 MB), and region-diff (0.05-0.23 min, 190.31-522 MB).

Results

To illustrate the utility of DRETools, we surveyed differential editing in human umbilical vein endothelial cells (HUVEC) transfected with either an siRNA against ADAR1 or against a random sequence (control)4 and the immortalized cell lines GM12787 and K56211. First we surveyed sample-wise editing using the function “sample-epk.” (Figure 1A,B). Using EPK reduces variation within groups compared to the usage of number of editing sites. For example, the coefficient of variance drops from 0.21 to 0.05 for the silenced ADAR1 group and 0.52 to 0.01 for the control group. Similarly, when comparing the immortalized cell lines, the coefficient of variance is reduced from 0.57 to 0.25 and 0.46 to 0.11, respectively (Figure 1C, D).

4f9a29ce-6922-470f-b377-62af2058730b_figure1.gif

Figure 1.

(A) The number of editing sites in HUVEC control and silenced ADAR1 groups (p=0.77). NS, p>0.05. (B) HUVEC control and silenced ADAR1 (siADAR1) represented in EPK (p=0.7.8E-5). **p<0.0001. (C) The number of editing sites detected in GM12787 and K562 cells (p=1.2E-3). *p<0.05. (D) Editing in GM12787 and K562 cells represented in EPK (p=2.5E-6). **p<0.00011E-4. (EH) Histograms detailing the distribution of p-values when testing for differential editing in a site- or island-wise manner. The site-wise comparison between: (E) siADAR1 and control; and (F) GM12787 and K562 cells. The island-wise comparison between: (G) siADAR1 and control; and (H) GM12787 and K562 cells.

Next, we compared the EPKs of editing islands within the immortalized cell lines using “epk-region”. Using EPK to represent editing islands as opposed to the number of edited bases reduces the coefficient of variance from 0.60 ± 0.21 to 0.31 ± 0.11 (p=2E-30). Finally, we tested for differential editing using the functions “region-diff” for islands and “site-diff” for editing sites (Figure 1E–H). Comparing silenced ADAR1 to the control, the LM yielded a uniform distribution of p-values. In contrast, when using t-test applied to the same data, the distribution of p-values is shifted to the left and exhibits greater skew. However, in the immortalized cell lines, p-values calculated by the LM are more leftward skewed while p-values from the t-test became more uniformly distributed. This provides evidence that the LM can effectively reduce type I errors when testing for differential editing. For example, the LM correctly recognizes that most of the differences between the silenced ADAR1 and control groups arise from the reduction of the global-editing-rate in the silenced samples. Whereas the t-test, which does not consider the global-editing-rates, finds many differentially edited sites and islands. Conversely, when comparing the immortalized cell lines, despite the large difference in EPK, many differentially edited sites and islands are detected. While deeper biological validation is needed to be certain, these could be instances of some other phenomena, such as m6A9, affecting the editing in individual sites or islands.

Conclusions

DRETools is a command-line tool suite for finding differentially edited sites and islands. It allows users to calculate units that reduce sample-bias and find differentially edited sites and islands even when the global-editing-rate of groups being compared is different. Furthermore, it also includes a variety of other features for exploring RNA editing. These make DRETools a valuable tool for further investigating epitranscriptomics.

Data availability

All RNA-seq data are publically available and were downloaded from the NCBI SRA database12. The HUVEC data sets were generated by Stellos et al., 20164 and the GM12787 and K562 cells by the ENCODE project11. Lists of accession numbers, pipelines used to generate analyses, and intermediate files generated are archived on Zenodo7.

Software availability

Source code available from: http://dretools.bitbucket.io/.

Data and analysis pipelines: https://zenodo.org/record/14006485.

Source code at time of publication: https://zenodo.org/record/14000057.

License: The software, and data and analysis pipelines are available under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 30 Aug 2018
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Weirick T, Trainor P, Rouchka E et al. DRETools: A tool-suite for differential RNA editing detection [version 2; peer review: 1 approved, 1 approved with reservations, 1 not approved]. F1000Research 2018, 7:1366 (https://doi.org/10.12688/f1000research.16026.2)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 2
VERSION 2
PUBLISHED 19 Sep 2018
Revised
Views
40
Cite
Reviewer Report 04 Oct 2018
Graziano Pesole, Department of Biosciences,Biotechnology and Biopharmaceutics,  University of Bari Aldo Moro (UBAM), Bari, Italy 
Not Approved
VIEWS 40
This manuscript represents a valuable contribution addressing a timely and relevant problem in epitranscriptomics analysis. Indeed, several RNA editing analysis studies have been reported so far in the literature but reliable standard operative procedures for evaluating "differential editing" are still ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Pesole G. Reviewer Report For: DRETools: A tool-suite for differential RNA editing detection [version 2; peer review: 1 approved, 1 approved with reservations, 1 not approved]. F1000Research 2018, 7:1366 (https://doi.org/10.5256/f1000research.17882.r38801)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Version 1
VERSION 1
PUBLISHED 30 Aug 2018
Views
27
Cite
Reviewer Report 13 Sep 2018
Yicheng Zhao, Brown Foundation Institute of Molecular Medicine, University of Texas Health Science Center at Houston (UTHealth), Houston, TX, USA 
Approved
VIEWS 27
Currently, more research has focused on human RNA editing. This software is very useful in detecting human differentially edited sites and islands. However, I suggest the authors to add some brief description about RNAEditor in the method section, which will ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Zhao Y. Reviewer Report For: DRETools: A tool-suite for differential RNA editing detection [version 2; peer review: 1 approved, 1 approved with reservations, 1 not approved]. F1000Research 2018, 7:1366 (https://doi.org/10.5256/f1000research.17503.r37750)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 19 Sep 2018
    Shizuka Uchida, Institute of Molecular Cardiology, University of Louisville, Louisville, 40202, USA
    19 Sep 2018
    Author Response
    Thank you very much for your valuable comments. We have now added a brief description of RNAEditor in the Method section. Furthermore, we have included the required hardware configuration for running ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 19 Sep 2018
    Shizuka Uchida, Institute of Molecular Cardiology, University of Louisville, Louisville, 40202, USA
    19 Sep 2018
    Author Response
    Thank you very much for your valuable comments. We have now added a brief description of RNAEditor in the Method section. Furthermore, we have included the required hardware configuration for running ... Continue reading
Views
38
Cite
Reviewer Report 13 Sep 2018
Ernesto Picardi, University of Bari, Bari, Italy;  Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR), Bari, Italy 
Approved with Reservations
VIEWS 38
The manuscript by Weirick et al. introduces a tool-suite to calculate differential RNA editing.

The interest towards RNA editing is rapidly growing and, thus, similar tools to improve the investigation of RNA editing in different experimental conditions ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Picardi E. Reviewer Report For: DRETools: A tool-suite for differential RNA editing detection [version 2; peer review: 1 approved, 1 approved with reservations, 1 not approved]. F1000Research 2018, 7:1366 (https://doi.org/10.5256/f1000research.17503.r37753)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 30 Aug 2018
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.