Selecting differential splicing methods: Practical considerations

Ben  J. Draper; Mark J. Dunning; David C. James

doi:10.12688/f1000research.155223.1

Home Browse Selecting differential splicing methods: Practical considerations

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Review

Selecting differential splicing methods: Practical considerations

[version 1; peer review: 2 approved with reservations]

Ben J. Draper ¹, Mark J. Dunning², David C. James¹

PUBLISHED 08 Jan 2025

Author details Author details

¹ Department of Chemical and Biological Engineering, Mappin St., The University of Sheffield, Sheffield, S1 3JD, UK
² Bioinformatics Core Bioinformatics Core, The Faculty of Health,, The University of Sheffield, Sheffield, S10 2HQ, UK

Ben J. Draper
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Mark J. Dunning
Roles: Investigation, Resources, Supervision, Validation, Writing – Review & Editing

David C. James
Roles: Funding Acquisition, Project Administration, Supervision

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Bioinformatics gateway.

Abstract

Alternative splicing is crucial in gene regulation, with significant implications in clinical settings and biotechnology. This review article compiles bioinformatics RNA-seq tools for investigating differential splicing; offering a detailed examination of their statistical methods, case applications, and benefits. A total of 22 tools are categorised by their statistical family (parametric, non-parametric, and probabilistic) and level of analysis (transcript, exon, and event). The central challenges in quantifying alternative splicing include correct splice site identification and accurate isoform deconvolution of transcripts. Benchmarking studies show no consensus on tool performance, revealing considerable variability across different scenarios. Tools with high citation frequency and continued developer maintenance, such as DEXSeq and rMATS, are recommended for prospective researchers. To aid in tool selection, a guide schematic is proposed based on variations in data input and the required level of analysis. Additionally, advancements in long-read RNA sequencing are expected to drive the evolution of differential splicing tools, reducing the need for isoform deconvolution and prompting further innovation.

Keywords

Bioinformatics, Alternative Splicing, RNASeq, Transcriptomics, Differential Expression

Corresponding author: Ben J. Draper

Competing interests: No competing interests were disclosed.

Grant information: This work was supported by funding from Lonza Biologics Inc & The University of Sheffield.
https://www.lonza.com/
I extend my gratitude for their financial assistance in facilitating this research project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2025 Draper BJ et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Draper BJ, Dunning MJ and James DC. Selecting differential splicing methods: Practical considerations [version 1; peer review: 2 approved with reservations]. F1000Research 2025, 14:47 (https://doi.org/10.12688/f1000research.155223.1) First published: 08 Jan 2025, 14:47 (https://doi.org/10.12688/f1000research.155223.1) Latest published: 30 May 2025, 14:47 (https://doi.org/10.12688/f1000research.155223.2)

Introduction

Alternative splicing (AS) can be best described as fine-tuning gene expression by rearranging exons and introns in pre-mRNA. With 90-95% of human multi-exon genes estimated to possess some form of alternative splicing, it is a widespread regulatory process in cellular biology.¹ The cell utilises a large ribonucleoprotein (RBP) complex known as the spliceosome which is guided to target sites through the interaction of sequence elements (splice sites, enhancers & silencers and the polypyrimidine tract) and/or splicing factors. Pre-mRNA splicing can also occur without the splicesome as in the case of self-splicing group I & II introns, tRNA splicing and trans-splicing.² This ultimately results in genome-wide transcript diversity and subsequently, measurable changes to protein functionality.

Previous research has uncovered the phenotypic consequences of alternative splicing in disease. In humans, clinical research has shown alternative splicing (AS) as a key instigator in several forms of cancer and neurodegenerative disorders.^3–5 One notable discovery in Microtubule-associated protein tau’s (MAPT) possession of mis-spliced isoforms causing abnormal TAU accumulation progressing to Alzheimer’s disease.⁶ In cancer, numerous mis-spliced variants of tumour suppressors, apoptotic and angiogenic proteins have been discovered to contribute to tumour progression.^7,8 Beyond clinical research, the utility of alternative transcripts for bioengineering purposes has been explored. For example, an alternatively spliced version of the transcription factor X-box binding protein 1 (XBP1) coexpressed in production cell lines has been shown to increase productivity in the biomanufacturing of recombinant proteins.^9–11 In bio-agriculture, the CRISPR-mediated directed evolution of SF3B1 mutants (a spliceosomal component) in rice has improved crop traits through better resistance to splicing inhibitors.¹² Increasingly, the value of AS in both clinical and biotechnology applications has been recognised; highlighting the need for robust bioinformatics pipelines to identify variants.

For prospective researchers to investigate AS, the transcriptomic data is usually generated using next-generation sequencing. Short-read RNAseq is the most commonly used experimental technique to interrogate a transcriptome owing to its versatility and cost-effectiveness.^13,14 It involves sequencing short fragments of RNA molecules, providing insights into the respective expression levels of genomic features assembled from reference genomes. These features may be coding sequences, genes, transcripts, exons, introns, codons or even untranslated regions. A typical RNAseq pre-processing pipeline will consist of quality control (QC), read alignment & quantification before statistical analysis begins. QC assesses the quality of the raw fragmented reads using a standardised tool such as FastQC and trims low-quality reads or adaptor sequences.¹⁵ Then for alignment, a reference genome/transcriptome arranges the subsequent sequences into feature bins such as genes, transcripts, exons and coding sequences using software such as STAR or HISAT.^16,17 Alignment files (usually in the form of Sequence Alignment Maps: SAMs) can then be quantified to these features using a quantification tool such as HTSeq, Salmon or featureCounts usually normalising for library size and sequencing depth.^18–20 Depending on the purpose of analysis, normalisation may be scaled by total number of reads (CPM: Counts per Million), per length of transcript (TPM: Transcripts per Million), by paired-end fragments (RPKM: Fragments Per Kilobase of Transcript) or by using a median of ratios (DESeq2’s method).²¹ Commonly, a differential expression analysis will be performed at the gene or transcript level between groups of samples to identify statistically significant changes in expression. The pre-processing steps for RNA-seq have been extensively researched over many years, and there is a consensus within the community regarding the gold-standard set of tools. Projects like nf-core enable the execution of RNA-seq pre-processing pipelines with minimal intervention and limited bioinformatics expertise.²² However, these tend to be focused on the use-case of conventional differential expression rather than the more bespoke AS pipelines as discussed here.

A growing repertoire of tools now annotate and quantify changes to splicing events. Quantification of features such as splice sites, and exon/intron junctions found in alignment files are commonly used to annotate splicing events. Although the true repertoire of splicing events is difficult to capture, conventional processes can be categorised into distinct groups. The most common events are exon skipping, retained introns, mutually exclusive exons, alternative 5′ and 3′ splice sites. More complex regulatory events involve genomic features beyond exons and introns, such as alternative promoter and polyadenylation sites, which result in varying mRNA 5′ and 3′ UTR ends. However, these events are seldom included in most bioinformatics analyses, tools such as CAGER (Cap Analysis of Gene Expression) and DaPars (Dynamic Analysis of Alternative PolyAdenylation from RNA-Seq) are available for niche research.^23,24 Visualisation of AS is predicated upon the level of detail required in the analysis. If a highly detailed analysis of individual gene structure is needed, splice graphs, sashimi plots and junction maps are commonly used.^25,26 To visualize changes to groups of transcripts, typically MA and Volcano plots are used much the same way as in differential expression level analysis.²¹

Current statistical methods for differential splicing

Commonly, researchers are interested in comparisons of two or more groups of samples known as differential analyses. Differential gene/transcript expression (DGE/DTE) of genes or transcripts involves taking raw read count data, normalizing or scaling it, and calculating whether the changes in expression levels between different biological groups are statistically significant. Differential transcript/exon usage (DTU/DEU), however, uses gene-level group modelling to assess whether the proportional use of the feature (exon or transcript) is statistically significant. Differential splicing events (DSE) on the other hand use a diverse array of statistical methods to quantify and infer splicing events. A comprehensive summary of differential splicing tools is described in the supplementary table (Supplementary Table 1) and in the following sections.

Parametric & mixed methods

Differential expression analysis tools began in the early 2000s coinciding with the development of high throughput technologies such as microarrays. An early example was LIMMA (Linear Models for Microarray Data), developed by Gordon Smyth and colleagues in 2003, which utilises a linear regression framework and empirical Bayes techniques to identify differentially expressed genes.²⁷ Whilst initially only utilised for microarrays, the functionality thus extended to RNASeq data and has been one of the most cited RNASeq methods. As the field shifted from microarray technology to RNASeq, methods were developed such as DESeq (Differential Expression Analysis for Sequence Count Data) and edgeR to capture the nature of count data better and improve modelling.^21,27,28 A major change incorporated in DESeq2 was empirical Bayes-based shrinkage to improve gene-wise variance estimation enhancing accuracy ( Figure 1). Secondly, GLMs (Generalized Linear Models) replaced the simple linear models as these were shown to adapt well to non-normally distributed count-based data.²¹ The flexibility of GLMs allowed algorithms to effectively deal with issues such as overdispersion, shrinkage, heteroscedasticity and covariates. To date, GLMs are usually fitted to the NB (Negative Binomial) distribution which confers some strong advantages. The NB distribution effectively captures overdispersion (the empirical variability of counts) and can handle a large excess of zero values commonly seen in transcript or exon-level count data. However, limma, DESeq2 and edgeR were not developed to specifically address the challenges of identifying AS.

Figure 1. Timeline of statistical methods in differential splicing tool development.

Methods are categorized into parametric and non-parametric approaches, grouped by methodological families. The classification is based on the underlying statistical procedures used for modelling or hypothesis testing, as detailed in Supplementary Table 1. Note that some methods incorporate elements of both parametric and non-parametric frameworks, resulting in overlapping features.

In 2014, DEXSeq was introduced by Michael Love and colleagues, a framework based on DESeq2’s GLM NB model becoming the de-facto tool for parametric splicing-based analysis. Instead of analysing gene-level differential expression, DEXSeq identifies exons within genes that exhibit significant changes in their usage across conditions. This is particularly useful for studying the exonic composition of alternatively spliced transcripts. The development of tools such as DSGseq, rDiff-parametric, JunctionSeq and SeqGSEA has expanded the functionality of the GLM NB family of differential splicing tools.^29–32 DSGseq utilises a holistic approach considering splicing events not as individual elements but as comprehensive gene-wise splice graphs that more accurately reflect complex splicing dependencies.²⁹ The tool rDiff-parametric on the other hand utilises isoform-specific loci such as restricted exonic regions to identify significant differences in isoform composition.³⁰ The proposed advantage of this approach is in the smaller exonic regions rather than full isoform deconvolution. Assigning reads to isoforms is challenging because these transcripts are practically identical, making it difficult to definitively attribute a read from an overlapping region to a particular region without supplementary data. Therefore, full isoform deconvolution is significantly biased against genes with many isoform variants.³³

A few newer methods such as DRIMSeq and DTUrtle have progressed onto non-parametric or mixed Dirichlet Multinomial Models (DMM) which have been argued to capture better the complex variability of count data and better estimate isoform abundance^34,35 ( Figure 1 & Supplementary Table 1). Other methods such as IsoformSwitchAnalyzeR and some custom DEXSeq workflows now incorporate modularity allowing users a selection of bioinformatics tools for filtering, hypothesis testing and posterior calculations.^36,37 An example of the usage of parametric analysis was in the discovery of a chimeric fusion transcript of PRKACA and DNAJB1 in a rare liver tumour FL-HCC (fibrolamellar hepatocellular carcinoma) using DEXSeq’s differential exon usage framework.³⁸ The discovery of differential exon usage of PRKACA’s exons 2-10 and subsequent decreased usage of DNAJB1’s exons 2-3 led the researchers to identify a chimeric transcript in FL-HCC patients. This demonstrated the utility of smaller exon-based analysis in identifying differences in transcript structure which would not be detected in larger gene or transcript-based analysis alone.

Probabilistic & non-parametric methods

Non-parametric or probabilistic techniques such as MAJIQ, SUPPA, WHIPPET and rMATS frequently utilize Bayesian inference and/or probabilistic methodologies.^26,39–41 By avoiding assumptions about the data’s underlying distribution, these methods enable more sophisticated modelling. Consequently, in contrast to the predominantly standardized parametric exon/transcript-based techniques, event-based methods often showcase a broader array of statistical approaches ( Figure 1). A few common features can be identified, however. Often the targets for event annotations are not labelled in gene-transfer format such as splice sites, exon/intron junctions and splicing quantitative trait loci (QTLs) which must be calculated. This then allows the “Percent spliced in” (PSI) to be calculated per exon, representing the ratio of the number of transcripts containing an alternative exon versus the total number of transcripts per any given splice site. By comparing PSI values, different splicing events can then be identified and explored through splice graphs and sashimi plots. An example of non-parametric tool usage was in the mapping of splicing events in the rice (Oryza sativa) transcriptome, revealing prevalent AS under deprived nutrient conditions.⁴² Importantly, this study utilised rMATs to reveal the underlying exon-intron structure of key nutrient transporter genes.

Some tools possess features for specific utility in certain scenarios. NOISeq is a non-parametric differential expression tool that is specifically designed to handle smaller numbers of biological replicates through its noise model.⁴³ For more complex modelling, tools such as GLiMMPs (Generalized Linear Mixed Model for Pedigree Data with Population Substructure) employ mixed-effects models to account for both fixed and random effects such as genetic family substructure.⁴⁴ Beyond splicing, the modular tool IsoformSwitchAnalyzeR facilitates analysis on spliced transcript quality such as Nonsense Mediated Decay (NMD) sensitivity, Intrinsically Disordered Regions (IDR) and protein domains.³⁶ Increasingly, deep learning-based approaches are being utilised to improve the accuracy of differential splicing predictions leveraging publicly available RNASeq data such as with DARTs and Bisbee.^45,46

Popularity & developer maintenance of methods

To assess the academic popularity of tools, a citation and developer engagement analysis of original research articles within the Web of Science (WoS) domain and the respective GitHub website domains (if applicable). The assessment spanned from 2010 to 2024 and encompassed 19 original papers on various differential splicing tools. Notably, the citation counts for these splicing tools were considerably lower compared to conventional RNA-Seq differential expression analysis tools. For instance, while the general purpose DGE/DTE tool DESeq2 amassed a total of 35,887 citations during the same period, citations for differential splicing tools ranged from 7 to 1300 (Figure 2). This discrepancy may pose challenges for researchers seeking resources and workflows specific to differential splicing analysis. Additionally, the importance of developer support cannot be understated, as it directly influences the usability and longevity of software tools. Notably, differential splicing tools such as DEXSeq, EBSeq, rMATS, SUPPA2, and MAJIQ^{26,39,40,47,48} have shown increasing usage and ongoing developer engagement, as evidenced by their growing citation counts and sustained support ( Figure 3; Figure 4). One possible explanation for the lower citation rates observed in exon/transcript-based methodologies could be the broader adoption of general-purpose differential expression workflows, like DESeq2 that can employ DTE.²¹ Researchers may prefer more explicit splicing event-based tools for targeted splicing analyses and defer to DTE for transcript-based analyses. While the nuances between DTU and DTE may not be a primary focus for many researchers, it is a distinction worth noting in the context of differential splicing analysis.

Figure 2. Citation counts of differential splicing tools (2010–2024) from Web of Science (WoS) Data.

Total citation counts for surveyed differential splicing tools (2010–2024) from the Web of Science Data Portal (WoS). Tools are categorized by analysis level: event, exon, or transcript. DRIMSeq’s original paper was excluded from the citation frequency analysis as it was not indexed in WoS. Certain data included herein are derived from Clarivate Web of Science. © Copyright Clarivate 2023. All rights reserved.Total citation counts for surveyed differential splicing tools (2010–2024) from the Web of Science Data Portal (WoS). Tools are categorized by analysis level: event, exon, or transcript. DRIMSeq’s original paper was excluded from the citation frequency analysis as it was not indexed in WoS. Certain data included herein are derived from Clarivate Web of Science. © Copyright Clarivate 2023. All rights reserved.

Figure 3. Citation trends of differential splicing tools (2010–2024) from Web of Science (WoS) Data.

Annual citation frequency for current differential splicing tools (2010–2024) from Web of Science (WoS). Tools are categorized by analysis level: event, exon, or transcript. DRIMSeq’s original paper is excluded as it is not indexed in WoS. Certain data included herein are derived from Clarivate Web of Science. © Copyright Clarivate 2023. All rights reserved.

Figure 4. Developer maintenance of differential splicing tools.

Annual GitHub repository commits (2010–2024) by category, highlighting community-led maintenance of differential splicing tools. Tools without GitHub pages (MAJIQ, MISO, DSGseq, and dSpliceType) were excluded from the analysis.

The decision between exon/transcript-level (typically parametric) and event-level (typically non-parametric) analyses hinges on several factors, including the particular scientific inquiry, data accessibility, and the level of granularity required to address the research goal. In certain scenarios, integrating both methodologies could offer a more holistic understanding of splicing control mechanisms and their biological significance.

Benchmarking of methods is difficult

To evaluate the quality of differential splicing bioinformatics tools, several benchmarks have been conducted to date. Benchmarking either the scientific accuracy or the computational power of methods can be challenging due to several factors. The main issue is the lack of ground truth to set as a reference to compare measurements to. Commonly, a small subset of experimentally validated splicing events is used as a gold standard to compare against. This was demonstrated in a recent systematic evaluation of 10 differential splicing tools in 2019, where a total of 62 qPCR-validated differentially spliced genes were tested.⁴⁹ The results from this benchmark revealed weak consensus over tool quality as the performance was markedly different across the 4 human and mouse cancer datasets. This demonstrates another issue with these evaluations: inherent heterogeneity in RNASeq data. Often, the performance of methods will depend on the upstream RNASeq pre-processing steps such as in library size, sequence depth, positional bias and annotation quality. To mitigate these issues, some papers use simulated data to i) increase the number of differentially spliced genes to reference and ii) achieve finer control over ground truth and variability within the data.^50–52 One such benchmark used RSEM-based simulated data based on a human prostate cancer dataset (GSE22260⁵³).⁵¹ Another comparison utilised a combination of experimental and simulated Arabidopsis heat shock RNASeq datasets using the Flux Simulator tool.⁵⁴ However, it is important to note that simulated data lacks the complexity of typical biological data. Confounding factors such as outliers, and technical/procedural biases cannot be modelled in current simulations.

The consensus drawn from these three benchmarks is that the performance of differential splicing tools exhibits considerable variability depending on the outlined factors. The ongoing evolution and upkeep of tools by developers introduce a time-dependent aspect to benchmarking. Community-led maintenance efforts consistently enhance the functionality and reliability of tools over time. Rather than aiming for a singular optimal tool for differential splicing analysis, researchers should contemplate employing a suite of tools tailored to address specific inquiries.

Method recommendations

A diagram outlining optimal tool selection is provided to guide prospective alternative splicing (AS) researchers ( Figure 5). Initially, researchers should evaluate the scope and objectives of their analysis. For instance, if the aim is to identify known transcripts, it is advisable to opt for a parametric transcript-based tool like DEXSeq or DRIMSeq and execute a DTU study following Michael Love’s protocol.³⁷ Nonetheless, variations in experimental parameters such as sample size or covariate inclusion may necessitate alternative approaches.

Figure 5. Guideline for differential splicing tool selection based on experimental parameters.

Decision tree for differential splicing analysis, categorized by three branches based on the level of analysis. Transcript-based methods are represented in blue, exon-based methods in pink, and event-based methods in yellow.

If the objective is to uncover novel transcripts, an exon-based parametric approach might be better suited. This choice circumvents the challenges associated with isoform deconvolution and the breadth of transcript annotation, given the smaller exonic regions. For general-purpose differential exon usage (DEU) analysis, DEXSeq remains the preferred protocol due to its robust and flexible statistical methods, as well as its actively maintained software.²¹ However, again intricacies within the data may prompt the usage of more specialised alternatives. Transcript and exon-based methods offer top-down visualizations such as MA/Volcano plots, heatmaps and proportional transcript/exon graphs. If the analysis aims to visualise the movement of exons/introns and splice sites, then an event-based protocol would be more appropriate. Generally, tools such as rMATs, SUPPA2 and MISO offer comprehensive and detailed splicing event analysis.^39,40,55

Commonly, sashimi plots are the best method to visualise splice junctions from aligned data with events annotated, although this can also be plotted separately in IGV.⁵⁶ For user-friendly visualization, MAJIQ offers a summative HTML-based visualizer for complex events such as exitrons or orphan junctions.²⁶ Another factor to consider is the annotation quality of the organism/tissue being studied. If researchers are not confident in the quality of annotations and would like annotation-free analysis, methods such as LeafCutter are a good alternative to conventional methods.^57,58 Overall, event-based methods are more suited to advanced programmers owing to their use of command-line tools over interpreters that use IDEs (Integrated development environments). For most analyses, however, a DEU or DTU-based analysis is recommended for simple interpretability and robustness. Optional steps for AS-specific analyses can also be performed to enhance the data quality. For example, Portcullis enables the accurate filtering of false splice junctions that are often incorrectly characterized by common aligners.⁵⁹

Discussion

While the repertoire of tools to accommodate differential splicing analysis has grown in the past two decades, they are ultimately limited by the capabilities of the RNASeq technology available to date. Since 2010 however, the development of nanopore sequencing technology such as Oxford Nanopore Technologies (ONT) and PacBio’s single-molecule real-time (SMRT) has facilitated the development of long-read RNAseq.^60–62 Long read lengths typically fall within the range of 10kb to 100kb, with ultra-long read lengths now up to 1-2 Mb.⁶³ The main benefit this technology confers is the ability to bypass the aforementioned deconvolution issue stemming from multiple mapping and reconstruct full-length transcript isoforms in a single read. This can not only more accurately identify known transcripts but also novel or splice variants as well as fusion genes. Most current parametric DS tools can therefore be utilised in long-read-based analyses. A recent study utilized IsoformSwitchAnalyzeR’s DEXSeq-based DTU workflow on ONT long reads, demonstrating the capability of current long-standing methods on long-read data.^21,36,64 This was facilitated through long-read custom annotation of the transcriptome using TALON to identify novel transcripts.⁶⁵ Additionally, specific novel technologies such as LIQA have been developed to analyse long reads.⁶⁶

Long-read RNAseq still possess notable disadvantages, however. Early on, long-read RNASeq possessed error rates of 10-20%.^67,68 The development of HiFi sequencing by PacBio using circular consensus sequencing has since reduced the error rate to a reported 0.5%.⁶⁹ While the development of deep-learning algorithms such as DeepConsensus has sought to push HiFi accuracy further bringing it on par to short read.⁷⁰ However, this is still highly dependent on the depth of sequencing. The most efficient error correction method involves hybridising the analysis with short-read RNAseq methods.⁷¹ This ultimately means that while accuracy can now be brought to close to 99.5%, error correction drives the cost of long-read RNAseq methods up significantly. The field is progressing towards optimal error correction and is now focusing on lowering costs which is currently the largest hurdle for practical use for common research.

As interest in alternative splicing grows, researchers have access to an expanding array of tools. Advances in statistical methods and longer RNA sequencing read lengths are overcoming technical limitations. This leads to more precise transcript alignment and reduces the need for complex computational steps. With workflows becoming streamlined and modular, platforms like Nextflow enable researchers to create tailored pipelines for their specific goals and data types.⁷² These developments promise a brighter future for alternative splicing analysis, facilitating a deeper exploration of transcriptomic regulation and its functional significance.

Ethical approval and consent statement

Ethical approval and consent were not required.

Data availability statement

Underlying data

No data associated with this article.

Extended data

Zenodo: Selecting differential splicing methods: Practical considerations https://doi.org/10.5281/zenodo.14293573.⁷³

The repository contains the following underlying data:

• Supplementary Table 1.docx: Statistical details on differential splicing tools.
• citations_2023.csv: WoS citation count for differential splicing tools.
• citations_year_plot_new.R: R script to visualise citation trends.
• github_repos_txt: Github repository locations cloned on 20.02.2024.
• github_repos.R: Github maintenance analysis and visualisation.
• citations_2023.xlsx

Software availability statement

• https://github.com/bjdraper/Selecting-differential-splicing-methods-Practical-Considerations---R-Scripts-and-Data
• Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 1.0).

Archived software available from: 10.5281/zenodo.14293573

github_repos.R

citations_year_plot_new.R

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

References

1. Baralle FE, Giudice J: Alternative splicing as a regulator of development and tissue identity. Nat. Rev. Mol. Cell Biol. 2017 Jul; 18(7): 437–451. PubMed Abstract | Publisher Full Text | Free Full Text
2. Matera AG, Wang Z: A day in the life of the spliceosome. Nat. Rev. Mol. Cell Biol. 2014 Feb; 15(2): 108–121. PubMed Abstract | Publisher Full Text | Free Full Text
3. Singh B, Eyras E: The role of alternative splicing in cancer. Transcription. 2017 Mar 15; 8(2): 91–98. PubMed Abstract | Publisher Full Text | Free Full Text
4. Bonnal SC, López-Oreja I, Valcárcel J: Roles and mechanisms of alternative splicing in cancer — implications for care. Nat. Rev. Clin. Oncol. 2020 Aug; 17(8): 457–474. PubMed Abstract | Publisher Full Text
5. Zhang Y, Qian J, Gu C, et al.: Alternative splicing and cancer: a systematic review. Signal Transduct. Target. Ther. 2021 Feb 24; 6(1): 1–14. Publisher Full Text
6. Kar A, Kuo D, He R, et al.: Tau Alternative Splicing and Frontotemporal Dementia. Alzheimer Dis. Assoc. Disord. 2005; 19(Suppl 1): S29–S36. PubMed Abstract | Publisher Full Text | Free Full Text
7. Yanagisawa M, Huveldt D, Kreinest P, et al.: A p120 Catenin Isoform Switch Affects Rho Activity, Induces Tumor Cell Invasion, and Predicts Metastatic Disease. J. Biol. Chem. 2008 Jun 27; 283(26): 18344–18354. PubMed Abstract | Publisher Full Text | Free Full Text
8. McEvoy J, Ulyanov A, Brennan R, et al.: Analysis of MDM2 and MDM4 Single Nucleotide Polymorphisms, mRNA Splicing and Protein Expression in Retinoblastoma. PLoS One. 2012 Aug 20; 7(8): e42739. PubMed Abstract | Publisher Full Text | Free Full Text
9. Cain K, Peters S, Hailu H, et al.: A CHO cell line engineered to express XBP1 and ERO1-Lα has increased levels of transient protein expression. Biotechnol. Prog. 2013 Jun; 29(3): 697–706. PubMed Abstract | Publisher Full Text
10. Johari YB, Estes SD, Alves CS, et al.: Integrated cell and process engineering for improved transient production of a “difficult-to-express” fusion protein by CHO cells. Biotechnol. Bioeng. 2015; 112(12): 2527–2542. PubMed Abstract | Publisher Full Text
11. Torres M, Dickson AJ: Reprogramming of Chinese hamster ovary cells towards enhanced protein secretion. Metab. Eng. 2022 Jan; 69(69): 249–261. Publisher Full Text
12. Butt H, Eid A, Momin AA, et al.: CRISPR directed evolution of the spliceosome for resistance to splicing inhibitors. Genome Biol. 2019 Apr 30; 20(1): 73. PubMed Abstract | Publisher Full Text | Free Full Text
13. Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet. 2009 Jan; 10(1): 57–63. PubMed Abstract | Publisher Full Text | Free Full Text
14. Stark R, Grzelak M, Hadfield J: RNA sequencing: the teenage years. Nat. Rev. Genet. 2019 Nov; 20(11): 631–656. Publisher Full Text
15. Babraham Bioinformatics - FastQC A Quality Control tool for High Throughput Sequence Data.[cited 2022 Aug 7]. Reference Source
16. Dobin A, Davis CA, Schlesinger F, et al.: STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013 Jan 1; 29(1): 15–21. PubMed Abstract | Publisher Full Text | Free Full Text
17. Kim D, Langmead B, Salzberg SL: HISAT: a fast spliced aligner with low memory requirements. Nat. Methods. 2015 Apr; 12(4): 357–360. PubMed Abstract | Publisher Full Text | Free Full Text
18. Patro R, Duggal G, Love MI, et al.: Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods. 2017 Apr; 14(4): 417–419. PubMed Abstract | Publisher Full Text | Free Full Text
19. Putri GH, Anders S, Pyl PT, et al.: Analysing high-throughput sequencing data in Python with HTSeq 2.0. Bioinformatics. 2022 May 13; 38(10): 2943–2945. PubMed Abstract | Publisher Full Text | Free Full Text
20. Liao Y, Smyth GK, Shi W: featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014 Apr 1; 30(7): 923–930. PubMed Abstract | Publisher Full Text
21. Love MI, Huber W, Anders S: Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014 Dec 5; 15(12): 550. PubMed Abstract | Publisher Full Text | Free Full Text
22. Ewels PA, Peltzer A, Fillinger S, et al.: nf-core: Community curated bioinformatics pipelines. bioRxiv. 2019 [cited 2024 Mar 15]; p. 610741. Publisher Full Text
23. Kawaji H, Lizio M, Itoh M, et al.: Comparison of CAGE and RNA-seq transcriptome profiling using clonally amplified and single-molecule next-generation sequencing. Genome Res. 2014 Apr; 24(4): 708–717. PubMed Abstract | Publisher Full Text | Free Full Text
24. Xia Z, Donehower LA, Cooper TA, et al.: Dynamic analyses of alternative polyadenylation from RNA-seq reveal a 3′-UTR landscape across seven tumour types. Nat. Commun. 2014 Nov 20; 5(1): 5274. PubMed Abstract | Publisher Full Text | Free Full Text
25. Katz Y, Wang ET, Silterra J, et al.: Quantitative visualization of alternative exon expression from RNA-seq data. Bioinformatics. 2015 Jul 1; 31(14): 2400–2402. PubMed Abstract | Publisher Full Text | Free Full Text
26. Vaquero-Garcia J, Barrera A, Gazzara MR, et al.: A new view of transcriptome complexity and regulation through the lens of local splicing variations. Valcárcel J, editor. elife. 2016 Feb 1; 5(5): e11752. PubMed Abstract | Publisher Full Text | Free Full Text
27. Ritchie ME, Phipson B, Wu D, et al.: Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015 Apr 20; 43(7): e47. PubMed Abstract | Publisher Full Text | Free Full Text
28. Robinson MD, McCarthy DJ, Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010 Jan 1; 26(1): 139–140. PubMed Abstract | Publisher Full Text | Free Full Text
29. Wang W, Qin Z, Feng Z, et al.: Identifying differentially spliced genes from two groups of RNA-seq samples. Gene. 2013 Apr 10; 518(1): 164–170. PubMed Abstract | Publisher Full Text
30. Drewe P, Stegle O, Hartmann L, et al.: Accurate detection of differential RNA processing. Nucleic Acids Res. 2013 May 1; 41(10): 5189–5198. PubMed Abstract | Publisher Full Text | Free Full Text
31. Hartley SW, Mullikin JC: Detection and visualization of differential splicing in RNA-Seq data with JunctionSeq. Nucleic Acids Res. 2016 Sep 6; 44(15): e127. PubMed Abstract | Publisher Full Text
32. Wang X, Cairns MJ: SeqGSEA: a Bioconductor package for gene set enrichment analysis of RNA-Seq data integrating differential expression and splicing. Bioinforma Oxf. Engl. 2014 Jun 15; 30(12): 1777–1779. PubMed Abstract | Publisher Full Text
33. Hiller D, Jiang H, Xu W, et al.: Identifiability of isoform deconvolution from junction arrays and RNA-Seq. Bioinformatics. 2009 Dec 1; 25(23): 3056–3059. PubMed Abstract | Publisher Full Text | Free Full Text
34. Nowicka M, Robinson MD: DRIMSeq: a Dirichlet-multinomial framework for multivariate count outcomes in genomics. F1000Research. 2016 Dec 6; 5: 1356. PubMed Abstract | Publisher Full Text | Free Full Text
35. Tekath T, Dugas M: Differential transcript usage analysis of bulk and single-cell RNA-seq data with DTUrtle. Bioinformatics. 2021 Nov 1; 37(21): 3781–3787. PubMed Abstract | Publisher Full Text | Free Full Text
36. Vitting-Seerup K, Sandelin A: IsoformSwitchAnalyzeR: analysis of changes in genome-wide patterns of alternative splicing and its functional consequences. Bioinformatics. 2019 Nov 1; 35(21): 4469–4471. PubMed Abstract | Publisher Full Text
37. Love MI, Soneson C, Patro R: Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification. F1000Res. 2018 [cited 2023 Feb 16]. Publisher Full Text Reference Source
38. Honeyman JN, Simon EP, Robine N, et al.: Detection of a Recurrent DNAJB1-PRKACA Chimeric Transcript in Fibrolamellar Hepatocellular Carcinoma. Science. 2014 Feb 28; 343(6174): 1010–1014. PubMed Abstract | Publisher Full Text | Free Full Text
39. Trincado JL, Entizne JC, Hysenaj G, et al.: SUPPA2: fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. Genome Biol. 2018 Mar 23; 19(1): 40. PubMed Abstract | Publisher Full Text | Free Full Text
40. Shen S, Park JW, Lu Z, et al.: rMATS: Robust and flexible detection of differential alternative splicing from replicate RNA-Seq data. Proc. Natl. Acad. Sci. 2014 Dec 23; 111(51): E5593–E5601. Publisher Full Text
41. Sterne-Weiler T, Weatheritt RJ, Best AJ, et al.: Efficient and Accurate Quantitative Profiling of Alternative Splicing Patterns of Any Complexity on a Laptop. Mol. Cell. 2018 Oct 4; 72(1): 187–200.e6. PubMed Abstract | Publisher Full Text
42. Dong C, He F, Berkowitz O, et al.: Alternative Splicing Plays a Critical Role in Maintaining Mineral Nutrient Homeostasis in Rice (Oryza sativa). Plant Cell. 2018 Oct 1; 30(10): 2267–2285. PubMed Abstract | Publisher Full Text | Free Full Text
43. Tarazona S, Furió-Tarí P, Turrà D, et al.: Data quality aware analysis of differential expression in RNA-seq with NOISeq R/Bioc package. Nucleic Acids Res. 2015 Dec 2; 43(21): e140. PubMed Abstract | Publisher Full Text
44. Zhao K, Lu Z, Xiang, et al.: GLiMMPS: robust statistical model for regulatory variation of alternative splicing using RNA-seq data. Genome Biol. 2013 Jul 22; 14(7): R74. PubMed Abstract | Publisher Full Text | Free Full Text
45. Halperin RF, Hegde A, Lang JD, et al.: Improved methods for RNAseq-based alternative splicing analysis. Sci. Rep. 2021 May 24; 11(1): 10740. PubMed Abstract | Publisher Full Text | Free Full Text
46. Zhang Z, Pan Z, Ying Y, et al.: Deep-learning augmented RNA-seq analysis of transcript splicing. Nat. Methods. 2019 Apr; 16(4): 307–310. PubMed Abstract | Publisher Full Text | Free Full Text
47. Anders S, Reyes A, Huber W: Detecting differential usage of exons from RNA-seq data. Genome Res. 2012 Oct 1; 22(10): 2008–2017. PubMed Abstract | Publisher Full Text | Free Full Text
48. Leng N, Dawson JA, Thomson JA, et al.: EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments. Bioinformatics. 2013 Apr 15; 29(8): 1035–1043. PubMed Abstract | Publisher Full Text | Free Full Text
49. Mehmood A, Laiho A, Venäläinen MS, et al.: Systematic evaluation of differential splicing tools for RNA-seq studies. Brief. Bioinform. 2020 Dec 1; 21(6): 2052–2065. PubMed Abstract | Publisher Full Text | Free Full Text
50. Liu R, Loraine AE, Dickerson JA: Comparisons of computational methods for differential alternative splicing detection using RNA-seq in plant systems. BMC Bioinformatics. 2014 Dec 16; 15(1): 364. PubMed Abstract | Publisher Full Text | Free Full Text
51. Merino GA, Conesa A, Fernández EA, et al.: A benchmarking of workflows for detecting differential splicing and differential expression at isoform level in human RNA-seq studies. Brief. Bioinform. 2019 Mar 25; 20(2): 471–481. PubMed Abstract | Publisher Full Text
52. Jiang M, Zhang S, Yin H, et al.: A comprehensive benchmarking of differential splicing tools for RNA-seq analysis at the event level. Brief. Bioinform. 2023 May 1; 24(3): bbad121. PubMed Abstract | Publisher Full Text
53. Kannan K, Wang L, Wang J, et al.: Recurrent chimeric RNAs enriched in human prostate cancer identified by deep sequencing. Proc. Natl. Acad. Sci. 2011 May 31; 108(22): 9172–9177. PubMed Abstract | Publisher Full Text | Free Full Text
54. Griebel T, Zacher B, Ribeca P, et al.: Modelling and simulating generic RNA-Seq experiments with the flux simulator. Nucleic Acids Res. 2012 Nov 1; 40(20): 10073–10083. PubMed Abstract | Publisher Full Text
55. Katz Y, Wang ET, Airoldi EM, et al.: Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nat. Methods. 2010 Dec; 7(12): 1009–1015. PubMed Abstract | Publisher Full Text | Free Full Text
56. Robinson JT, Thorvaldsdóttir H, Winckler W, et al.: Integrative genomics viewer. Nat. Biotechnol. 2011 Jan; 29(1): 24–26. PubMed Abstract | Publisher Full Text | Free Full Text
57. Li YI, Knowles DA, Humphrey J, et al.: Annotation-free quantification of RNA splicing using LeafCutter. Nat. Genet. 2018 Jan; 50(1): 151–158. PubMed Abstract | Publisher Full Text | Free Full Text
58. Benegas G, Fischer J, Song YS: Robust and annotation-free analysis of alternative splicing across diverse cell types in mice. Eyras E, Manley JL, editors. elife. 2022 Mar 1; 11: e73520. PubMed Abstract | Publisher Full Text | Free Full Text
59. Mapleson D, Venturini L, Kaithakottil G, et al.: Efficient and accurate detection of splice junctions from RNA-seq with Portcullis. GigaScience. 2018 Dec 1; 7(12): giy131. PubMed Abstract | Publisher Full Text | Free Full Text
60. Eid J, Fehr A, Gray J, et al.: Real-Time DNA Sequencing from Single Polymerase Molecules. Science. 2009 Jan 2; 323(5910): 133–138. Publisher Full Text
61. Derrington IM, Butler TZ, Collins MD, et al.: Nanopore DNA sequencing with MspA. Proc. Natl. Acad. Sci. 2010 Sep 14; 107(37): 16060–16065. PubMed Abstract | Publisher Full Text | Free Full Text
62. Feng Y, Zhang Y, Ying C, et al.: Nanopore-based Fourth-generation DNA Sequencing Technology. Genomics Proteomics Bioinformatics. 2015 Feb 1; 13(1): 4–16. PubMed Abstract | Publisher Full Text | Free Full Text
63. Jain M, Koren S, Miga KH, et al.: Nanopore sequencing and assembly of a human genome with ultra-long reads. Nat. Biotechnol. 2018 Apr; 36(4): 338–345. PubMed Abstract | Publisher Full Text | Free Full Text
64. Wright DJ, Hall NAL, Irish N, et al.: Long read sequencing reveals novel isoforms and insights into splicing regulation during cell state changes. BMC Genomics. 2022 Jan 10; 23(1): 42. PubMed Abstract | Publisher Full Text | Free Full Text
65. Wyman D, Balderrama-Gutierrez G, Reese F, et al.: A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification. bioRxiv. 2020 [cited 2023 Sep 7]; p. 672931. Publisher Full Text
66. Hu Y, Fang L, Chen X, et al.: LIQA: long-read isoform quantification and analysis. Genome Biol. 2021 Jun 17; 22(1): 182. PubMed Abstract | Publisher Full Text | Free Full Text
67. Carneiro MO, Russ C, Ross MG, et al.: Pacific biosciences sequencing technology for genotyping and variation discovery in human data. BMC Genomics. 2012 Aug 5; 13(1): 375. PubMed Abstract | Publisher Full Text | Free Full Text
68. Jain M, Fiddes IT, Miga KH, et al.: Improved data analysis for the MinION nanopore sequencer. Nat. Methods. 2015 Apr; 12(4): 351–356. PubMed Abstract | Publisher Full Text | Free Full Text
69. Hon T, Mars K, Young G, et al.: Highly accurate long-read HiFi sequencing data for five complex genomes. Sci. Data. 2020 Nov 17; 7(1): 399. PubMed Abstract | Publisher Full Text | Free Full Text
70. Baid G, Cook DE, Shafin K, et al.: DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer. Nat. Biotechnol. 2023 Feb; 41(2): 232–238. PubMed Abstract | Publisher Full Text
71. Amarasinghe SL, Su S, Dong X, et al.: Opportunities and challenges in long-read sequencing data analysis. Genome Biol. 2020 Feb 7; 21(1): 30. PubMed Abstract | Publisher Full Text | Free Full Text
72. Di Tommaso P, Chatzou M, Floden EW, et al.: Nextflow enables reproducible computational workflows. Nat. Biotechnol. 2017 Apr; 35(4): 316–319. PubMed Abstract | Publisher Full Text
73. Draper BJ: Selecting differential splicing methods: Practical Considerations - R Scripts and Data. Zenodo. 2024. Publisher Full Text

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 08 Jan 2025

Author details Author details

Ben J. Draper
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Mark J. Dunning
Roles: Investigation, Resources, Supervision, Validation, Writing – Review & Editing

David C. James
Roles: Funding Acquisition, Project Administration, Supervision

Competing interests

No competing interests were disclosed.

Grant information

This work was supported by funding from Lonza Biologics Inc & The University of Sheffield.
https://www.lonza.com/
I extend my gratitude for their financial assistance in facilitating this research project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (2)

version 2

Revised

Published: 30 May 2025, 14:47

https://doi.org/10.12688/f1000research.155223.2

version 1

Published: 08 Jan 2025, 14:47

https://doi.org/10.12688/f1000research.155223.1

© 2025 Draper BJ et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Draper BJ, Dunning MJ and James DC. Selecting differential splicing methods: Practical considerations [version 1; peer review: 2 approved with reservations]. F1000Research 2025, 14:47 (https://doi.org/10.12688/f1000research.155223.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Version 1

VERSION 1

PUBLISHED 08 Jan 2025

Views

Reviewer Report 25 Apr 2025

Charlotte Capitanchik, UK DRI at King’s College London, London, UK

Approved with Reservations

https://doi.org/10.5256/f1000research.170367.r375800

The article by Draper and colleagues presents a well-researched and well-organised overview of differential splicing analysis tools for short-read RNA sequencing, which will be valuable for many researchers. The practical focus is excellent, I especially like the section on developer maintenance of tooling, which is often neglected in software review articles. The decision tree in Figure 5 is also a great framework for navigating this crowded space and I’m sure will be useful to many.

Despite this, in many places the use of language is imprecise: words are missing or used incorrectly, sentences are too vague—perhaps from being over-simplified in an attempt to make the text more understandable. I also find some of the recommendations are not adequately supported by evidence. Please find some comments that I hope will be helpful, below:

Major comments

The method benchmarking section needs some attention. ‘Scientific accuracy’ should be replaced by just ‘accuracy’. ‘Computational power’ should be specified, presumably you mean performance characteristics such as memory usage and run time. “Lack of ground truth to set as a reference to compare to..” is unnecessary, just say a lack of ground truth splicing quantifications. Library size and sequencing depth are the same thing? Library size and positional bias are qualities of the RNA-seq data itself, not the bioinformatic pre-processing steps. Annotation quality is not a ‘pre-processing’ step - this is a feature of the organism under study. “Achieve finer control over the ground truth” could be rephrased to something more meaningful like ‘to explore the impact of increasing variability between replicates, changing replicate numbers, sequencing depth’…..etc. “The consensus drawn from these three benchmarks is that the performance of differential splicing tools exhibits considerable variability depending on the outlined factors.” This seems weak, perhaps a more nuanced conclusion can be drawn - when data is very good, deep sequencing, low variability, which tool performs best? Which tools have the lowest run times, compute requirements .etc. I don’t think it’s sufficient to simply say its complicated - please dive more into the details here. “The ongoing evolution and upkeep of tools by developers introduce a time-dependent aspect to benchmarking. Community-led maintenance efforts consistently enhance the functionality and reliability of tools over time.” I’m not sure what this means, please clarify.
The discussion brings in developments in long read sequencing and is quite nice - I would suggest making this a section of its own and expanding on what is already written. Alternatively, I would consider cutting it back a bit and changing the title of the article to reflect a focus on short-read sequencing data. Perhaps one point for the discussion is that whilst new methods are developing, there remains hundreds of thousands of publicly available short read RNA sequencing datasets through which novel biological insights can still be made.
In the section of recommendations - “For instance, if the aim is to identify known transcripts, it is advisable to opt for a parametric transcript-based tool like DEXSeq or DRIMSeq and execute a DTU study following Michael Love’s protocol.37” The citation does not support the statement - why shouldn’t researchers opt for an exon-based approach when the transcriptomic annotations are good? Also in this section, DEXSeq is recommended for DEU analysis, when rMATs is the most highly cited splicing tool and provides accurate quantifications of exon usage. “Overall, event-based methods are more suited to advanced programmers owing to their use of command-line tools over interpreters that use IDEs (Integrated development environments). For most analyses, however, a DEU or DTU-based analysis is recommended for simple interpretability and robustness.” I don’t understand, how is DEX-Seq easier to use than rMATs (for example), or MAJIQ which has extensive graphical reporting? Are you saying this because DEX-Seq is an R package so you can use RStudio? - this doesn’t seem like a particularly helpful argument - I can run rMATs or MAJIQ or any of these using a bash script in Visual Studio Code which is also an IDE…

Minor comments

Alternative splicing (AS) abbreviation is given several times throughout text and sometimes used, sometimes not - please be consistent.
“The proposed advantage of this approach is in the smaller exonic regions rather than full isoform deconvolution.” Rephrase for clarity, presumably you mean by focusing on regions unique to distinct isoforms the tool avoids the issue of assigning ambiguous reads to isoforms.
"A few newer methods such as DRIMSeq and DTUrtle have progressed onto non-parametric or mixed Dirichlet Multinomial Models (DMM)" ‘progression’ suggests there is some kind of hierarchy, you can just say that these models ‘use’ other distributions
“More complex regulatory events involve genomic features beyond exons and introns, such as alternative promoter and polyadenylation sites, which result in varying mRNA 5′ and 3′ UTR ends. However, these events are seldom included in most bioinformatics analyses, tools such as CAGER (Cap Analysis of Gene Expression) and DaPars (Dynamic Analysis of Alternative PolyAdenylation from RNA-Seq) are available for niche research” I wouldn’t say these events are more complex from a biological standpoint. The issue in analysis of alternative TSS use and APA is that short read sequencing with typical library preparation methods (e.g. random hexamer priming) won’t have good coverage of exact transcript 5’ and 3’ ends. Therefore typically library preparations with mRNA cap capture (CAGE) or 3’ end sequencing (eg. Quantseq) are used when this is the analysis goal. Also, to be a pedant, 3’UTRs and 5’UTRs are exons.
The discussion mentions Nextflow, and nf-core is mentioned earlier, but it might be nice to specifically mention the efforts of nf-core/rnasplice. As you know, one of the benefits of these pipelines is that everything is containerised so you don’t have to mess about installing everything. Generally speaking, one of the barriers to using tools can be installation - of the presented tools there is quite a range of levels of developer investment in making the tools easy to install. Some are in bioconda or bioconductor and have containers, others you have to contact the authors to get permission to download (MAJIQ!) - this might be related to the amount of citations that tools get. It would be nice (but not necessary) to address this too.

Is the topic of the review discussed comprehensively in the context of the current literature?

Yes
Are all factual statements correct and adequately supported by citations?

Partly
Is the review written in accessible language?

Partly
Are the conclusions drawn appropriate in the context of the current research literature?

Partly

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: bioinformatics, splicing, RNA biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Author Response 16 Jun 2025

Ben Draper, Department of Chemical and Biological Engineering, Mappin St., The University of Sheffield, Sheffield, S1 3JD, UK

16 Jun 2025

Author Response

Dear Dr. Capitanchik,

Thank you for your thorough and insightful review of our manuscript. Your comments on the benchmarking, recommendations, long-read discussion, and minor edits have significantly improved the ... Continue reading Dear Dr. Capitanchik,

Thank you for your thorough and insightful review of our manuscript. Your comments on the benchmarking, recommendations, long-read discussion, and minor edits have significantly improved the manuscript’s clarity and accuracy. We have incorporated minimal changes to address your concerns, ensuring the revisions align with your suggestions while maintaining the manuscript’s focus.

For the benchmarking section, we corrected terminology (“accuracy” instead of “scientific accuracy,” “computational performance” instead of “computational power”), clarified that library size, positional bias, and annotation quality are dataset characteristics, not pre-processing steps, and rephrased vague terms (e.g., “lack of ground truth” to “lack of comprehensive ground truth splicing quantifications”). We added specific examples drawn from the benchmarks (e.g., DEXSeq, rMATS, NOISEQ) to strengthen the conclusions.

In the recommendations section, we clarified our positions on programming environments, developer and community support, making sure not to discriminate harshly against command line-based tools. We believe this is still worth mentioning, however, as in our experience, the programming platform matters for accessibility.

In the discussion, we opted for shortening the long-read section for brevity and making the article more focused towards short-read. We have added a few minor points in agreement with Dr. Donega. We agree with all the minor edits (e.g., consistent AS acronym usage, clarified rDiff-parametric, revised TSS/APA) that were made as requested.

We believe these changes should address your concerns effectively.

Best wishes

Ben J. Draper
University of Sheffield
Dear Dr. Capitanchik,

Thank you for your thorough and insightful review of our manuscript. Your comments on the benchmarking, recommendations, long-read discussion, and minor edits have significantly improved the manuscript’s clarity and accuracy. We have incorporated minimal changes to address your concerns, ensuring the revisions align with your suggestions while maintaining the manuscript’s focus.

For the benchmarking section, we corrected terminology (“accuracy” instead of “scientific accuracy,” “computational performance” instead of “computational power”), clarified that library size, positional bias, and annotation quality are dataset characteristics, not pre-processing steps, and rephrased vague terms (e.g., “lack of ground truth” to “lack of comprehensive ground truth splicing quantifications”). We added specific examples drawn from the benchmarks (e.g., DEXSeq, rMATS, NOISEQ) to strengthen the conclusions.

In the recommendations section, we clarified our positions on programming environments, developer and community support, making sure not to discriminate harshly against command line-based tools. We believe this is still worth mentioning, however, as in our experience, the programming platform matters for accessibility.

In the discussion, we opted for shortening the long-read section for brevity and making the article more focused towards short-read. We have added a few minor points in agreement with Dr. Donega. We agree with all the minor edits (e.g., consistent AS acronym usage, clarified rDiff-parametric, revised TSS/APA) that were made as requested.

We believe these changes should address your concerns effectively.

Best wishes

Ben J. Draper
University of Sheffield
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 16 Jun 2025

Ben Draper, Department of Chemical and Biological Engineering, Mappin St., The University of Sheffield, Sheffield, S1 3JD, UK

16 Jun 2025

Author Response

Dear Dr. Capitanchik,

Thank you for your thorough and insightful review of our manuscript. Your comments on the benchmarking, recommendations, long-read discussion, and minor edits have significantly improved the ... Continue reading Dear Dr. Capitanchik,

Thank you for your thorough and insightful review of our manuscript. Your comments on the benchmarking, recommendations, long-read discussion, and minor edits have significantly improved the manuscript’s clarity and accuracy. We have incorporated minimal changes to address your concerns, ensuring the revisions align with your suggestions while maintaining the manuscript’s focus.

For the benchmarking section, we corrected terminology (“accuracy” instead of “scientific accuracy,” “computational performance” instead of “computational power”), clarified that library size, positional bias, and annotation quality are dataset characteristics, not pre-processing steps, and rephrased vague terms (e.g., “lack of ground truth” to “lack of comprehensive ground truth splicing quantifications”). We added specific examples drawn from the benchmarks (e.g., DEXSeq, rMATS, NOISEQ) to strengthen the conclusions.

In the recommendations section, we clarified our positions on programming environments, developer and community support, making sure not to discriminate harshly against command line-based tools. We believe this is still worth mentioning, however, as in our experience, the programming platform matters for accessibility.

In the discussion, we opted for shortening the long-read section for brevity and making the article more focused towards short-read. We have added a few minor points in agreement with Dr. Donega. We agree with all the minor edits (e.g., consistent AS acronym usage, clarified rDiff-parametric, revised TSS/APA) that were made as requested.

We believe these changes should address your concerns effectively.

Best wishes

Ben J. Draper
University of Sheffield
Dear Dr. Capitanchik,

Thank you for your thorough and insightful review of our manuscript. Your comments on the benchmarking, recommendations, long-read discussion, and minor edits have significantly improved the manuscript’s clarity and accuracy. We have incorporated minimal changes to address your concerns, ensuring the revisions align with your suggestions while maintaining the manuscript’s focus.

For the benchmarking section, we corrected terminology (“accuracy” instead of “scientific accuracy,” “computational performance” instead of “computational power”), clarified that library size, positional bias, and annotation quality are dataset characteristics, not pre-processing steps, and rephrased vague terms (e.g., “lack of ground truth” to “lack of comprehensive ground truth splicing quantifications”). We added specific examples drawn from the benchmarks (e.g., DEXSeq, rMATS, NOISEQ) to strengthen the conclusions.

In the recommendations section, we clarified our positions on programming environments, developer and community support, making sure not to discriminate harshly against command line-based tools. We believe this is still worth mentioning, however, as in our experience, the programming platform matters for accessibility.

In the discussion, we opted for shortening the long-read section for brevity and making the article more focused towards short-read. We have added a few minor points in agreement with Dr. Donega. We agree with all the minor edits (e.g., consistent AS acronym usage, clarified rDiff-parametric, revised TSS/APA) that were made as requested.

We believe these changes should address your concerns effectively.

Best wishes

Ben J. Draper
University of Sheffield
Competing Interests: No competing interests were disclosed. Close
Report a concern

Views

Reviewer Report 22 Apr 2025

Stefano Donega, National Institute on Aging, Bethesda, USA

Approved with Reservations

https://doi.org/10.5256/f1000research.170367.r375793

In this review, Dr. Draper, Dunning, and James provide a very good comprehensive and well-written summary of the tools applied to the investigation of alternative splicing, with a thorough literature perspective and a detailed guide to choosing the most appropriate statistical methods and software. While I really appreciated and enjoyed reading the manuscript, I would like to provide a few comments that I believe would enhance and elevate the quality of the work:

In general, the entire manuscript discusses methods that directly apply to short-read platforms. Therefore, I think this should be better highlighted both in the manuscript title and throughout the whole review.
The long-read platforms appear only in the discussion section. I recommend the authors dedicate a separate paragraph to them, independent of the discussion, while keeping the discussion to connect together the main findings investigated in the main text.

Now, I will provide some minor comments:

In a recent Nature Aging paper, Ferrucci et al. 2022 (Ref 1) discussed the "energy-splicing axis hypothesis on aging," which is worthy of mentioning in the introductory paragraph on the importance of splicing.
There have been efforts to clarify modern nomenclature in gene expression studies, and guidelines were recently proposed to increase precision and clarity when communicating about gene expression, most notably to reserve 'gene' for the DNA template and 'transcript' for the RNA transcribed from that gene (Cunningham ASG, et al., 2024 [Ref 2]). I suggest authors consider aligning some definitions found in the manuscript with these guidelines.
There is no mention of the possibility of combining short- and long-read sequencing to enhance quantity and quality of results. I strongly suggest the authors include in their review a section on "StringTie" which utilizes both short and long RNA-seq reads for transcript assembly to generate a hybrid strategy (Shumate A, et al., 2022 [Ref 3]).

After these improvements, I am confident this article will be highly cited in the field.

Is the topic of the review discussed comprehensively in the context of the current literature?

Yes
Are all factual statements correct and adequately supported by citations?

Yes
Is the review written in accessible language?

Yes
Are the conclusions drawn appropriate in the context of the current research literature?

Partly

References

1. Ferrucci L, Wilson DM, Donegà S, Gorospe M: The energy-splicing resilience axis hypothesis of aging.Nat Aging. 2022; 2 (3): 182-185 PubMed Abstract | Publisher Full Text
2. Cunningham ASG, Gorospe M: Striving for clarity in language about gene expression.Nucleic Acids Res. 2024; 52 (18): 10747-10753 PubMed Abstract | Publisher Full Text
3. Shumate A, Wong B, Pertea G, Pertea M: Improved transcriptome assembly using a hybrid of long and short reads with StringTie.PLoS Comput Biol. 2022; 18 (6): e1009730 PubMed Abstract | Publisher Full Text

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Aging, muscle, mitochondria, energy, hypoxia, exercise

CITE

Report a concern

Author Response 09 Aug 2025

Ben Draper, Department of Chemical and Biological Engineering, Mappin St., The University of Sheffield, Sheffield, S1 3JD, UK

09 Aug 2025

Author Response

Dear Dr. Donega,

First of all, thank you for reviewing the article. I appreciate the time you took and the constructive feedback you gave me to improve this work.
... Continue reading Dear Dr. Donega,

First of all, thank you for reviewing the article. I appreciate the time you took and the constructive feedback you gave me to improve this work.

The title was revised to “Selecting Differential Splicing Methods: Practical Considerations for Short-Read RNA Sequencing” to emphasise short-read platforms, and the abstract and introduction now explicitly state this focus. I am hesitant to expand and write a full section on long-read technology, as this isn't really my field of expertise. Therefore, we decided to streamline this section in line with Dr Capitanchik's recommendations while weaving in the hybridised approaches of current short-read technologies.

I agree with the minor points and have addressed these by including the recommended citations in the introduction and discussion.

These changes align the manuscript with your recommendations, maintaining its comprehensive scope while clarifying its primary focus on short-read RNA-seq.

Sincerely,

Ben J. Draper
University of Sheffield
Dear Dr. Donega,

First of all, thank you for reviewing the article. I appreciate the time you took and the constructive feedback you gave me to improve this work.

The title was revised to “Selecting Differential Splicing Methods: Practical Considerations for Short-Read RNA Sequencing” to emphasise short-read platforms, and the abstract and introduction now explicitly state this focus. I am hesitant to expand and write a full section on long-read technology, as this isn't really my field of expertise. Therefore, we decided to streamline this section in line with Dr Capitanchik's recommendations while weaving in the hybridised approaches of current short-read technologies.

I agree with the minor points and have addressed these by including the recommended citations in the introduction and discussion.

These changes align the manuscript with your recommendations, maintaining its comprehensive scope while clarifying its primary focus on short-read RNA-seq.

Sincerely,

Ben J. Draper
University of Sheffield
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 09 Aug 2025

Ben Draper, Department of Chemical and Biological Engineering, Mappin St., The University of Sheffield, Sheffield, S1 3JD, UK

09 Aug 2025

Author Response

Dear Dr. Donega,

First of all, thank you for reviewing the article. I appreciate the time you took and the constructive feedback you gave me to improve this work.
... Continue reading Dear Dr. Donega,

First of all, thank you for reviewing the article. I appreciate the time you took and the constructive feedback you gave me to improve this work.

The title was revised to “Selecting Differential Splicing Methods: Practical Considerations for Short-Read RNA Sequencing” to emphasise short-read platforms, and the abstract and introduction now explicitly state this focus. I am hesitant to expand and write a full section on long-read technology, as this isn't really my field of expertise. Therefore, we decided to streamline this section in line with Dr Capitanchik's recommendations while weaving in the hybridised approaches of current short-read technologies.

I agree with the minor points and have addressed these by including the recommended citations in the introduction and discussion.

These changes align the manuscript with your recommendations, maintaining its comprehensive scope while clarifying its primary focus on short-read RNA-seq.

Sincerely,

Ben J. Draper
University of Sheffield
Dear Dr. Donega,

First of all, thank you for reviewing the article. I appreciate the time you took and the constructive feedback you gave me to improve this work.

The title was revised to “Selecting Differential Splicing Methods: Practical Considerations for Short-Read RNA Sequencing” to emphasise short-read platforms, and the abstract and introduction now explicitly state this focus. I am hesitant to expand and write a full section on long-read technology, as this isn't really my field of expertise. Therefore, we decided to streamline this section in line with Dr Capitanchik's recommendations while weaving in the hybridised approaches of current short-read technologies.

I agree with the minor points and have addressed these by including the recommended citations in the introduction and discussion.

These changes align the manuscript with your recommendations, maintaining its comprehensive scope while clarifying its primary focus on short-read RNA-seq.

Sincerely,

Ben J. Draper
University of Sheffield
Competing Interests: No competing interests were disclosed. Close
Report a concern

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 08 Jan 2025

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2	3	4	5
Version 2 (revision) 30 May 25		read	read	read	read
Version 1 08 Jan 25	read	read

Stefano Donega, National Institute on Aging, Bethesda, USA
Charlotte Capitanchik, UK DRI at King’s College London, London, UK
Julia Olivieri, University of the Pacific, California, USA
Zhaoqi Liu, China National Center for Bioinformation, Beijing, China
Monica Salinas, Centre de Regulacio Genomica (Ringgold ID: 16372), Barcelona, Spain

Juan Valcarcel, The Barcelona Institute of Science and Technology, Barcelona, Spain

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

12 Views

28 Aug 2025 | for Version 2

Monica Salinas, Genome Biology, Centre de Regulacio Genomica (Ringgold ID: 16372), Barcelona, Catalonia, Spain

Juan Valcarcel, The Barcelona Institute of Science and Technology, Barcelona, Spain

12 Views Cite this report Responses(0)

Approved With Reservations

This review by Draper et al. provides a comprehensive and well-curated overview of the available tools for RNA-Seq-based alternative splicing (AS) analysis. While differential gene expression remains the most common application of RNA-Seq data, the authors successfully bring attention to the expanding repertoire of computational methods dedicated to AS. The manuscript is clearly written, well-structured, and the figures are informative; in particular the decision-tree diagram in Figure 5, which serves as a helpful visual guide for tool selection.

Requested revisions:

1) It is not entirely clear whether the distinction between exon-based and event-based approaches refers solely to whether different types of splicing events (e.g., retained introns, alternative splice sites, etc.)or only exons are considered; or if there is an additional level of distinction. A more explicit definition of this stratification would help improve clarity and guide readers in understanding the differences between both strategies.

2)Vast-tools, a widely used tool in the AS community, is notably absent from the current manuscript. The main publication describing Vast-tools has garnered >400 citations and the tool is actively maintained with comprehensive documentation available on GitHub. It meets the inclusion criteria set by the authors (ie citation impact and repository maintenance). Vast-tools enables PSI-based quantification of >700000 annotated splicing events and is part of a broader analytical framework that includes matt (for downstream analysis of AS event features) and vastDB (a comprehensive database of AS profiles). Including this tool in the review would provide a more complete representation of the field and would be useful for readers.
Associated references:

Vast-tools: https://doi.org/10.1101/gr.220962.117 (Ref 1)

Vast-tools GitHub: https://github.com/vastgroup/vast-tools

Matt: https://doi.org/10.1093/bioinformatics/bty606 (Ref 2)

Matt GitHub: https://gitlab.com/aghr/matt

VastDB: https://vastdb.crg.eu/wiki/Main_Page

3) Sequencing depth is a critical parameter that is often underappreciated in splicing analyses. The standard 30M reads per sample, typically sufficient for gene expression studies, are generally inadequate for reliable quantification of splicing events—particularly for low-abundance isoforms. This limitation should be emphasized more clearly in the manuscript, especially in the context of data generation and when selecting publicly available datasets. Many public RNA-Seq datasets were not originally designed for AS analysis, which can severely limit downstream interpretations when re-analysed.

4) About Tools Comparison:
4.1) While the statistical frameworks underlying each tool are briefly mentioned, a more detailed explanation of their analytical approaches (beyond simply naming the statistical test used) would enhance the review’s educational value, especially for non-expert readers.
4.2) The manuscript could benefit from a more explicit discussion on whether each tool relies on predefined annotations or performs de novo detection of splice junctions. This distinction is crucial depending on the biological question (e.g., whether the aim is to identify novel or neojunctions, or to compare known events across conditions).
4.3) Similarly, an overview comparing the output metrics used by the different tools would be highly informative and would help readers determine which software best suits their analysis goals. In those cases that do not rely on PSI values, which alternative metrics are used?

5) On page 5, when comparing expression normalization metrics such as CPM, TPM, and RPKM, a brief explanation of the contexts in which each metric is most appropriate would be helpful.

6)In the discussion on long-read sequencing technologies, it would be useful to indicate which of the reviewed tools are compatible with long-read RNA-Seq data, and which are restricted to short-read inputs. This information could, for example, be conveyed as an additional annotation in one of the existing figures (e.g., Figure 5), rather than in the main text.

Is the topic of the review discussed comprehensively in the context of the current literature?

Yes
Are all factual statements correct and adequately supported by citations?

Yes
Is the review written in accessible language?

Yes
Are the conclusions drawn appropriate in the context of the current research literature?

Yes

References

1. Tapial J, Ha K, Sterne-Weiler T, Gohr A, et al.: An atlas of alternative splicing profiles and functional associations reveals new regulatory programs and genes that simultaneously express multiple major isoforms. Genome Research. 2017; 27 (10): 1759-1768 Publisher Full Text
2. Gohr A, Irimia M: Matt : Unix tools for alternative splicing analysis. Bioinformatics. 2019; 35 (1): 130-132 Publisher Full Text

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Alternative splicing, RNA biology

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however we have significant reservations, as outlined above.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

8 Views

13 Aug 2025 | for Version 2

Zhaoqi Liu, China National Center for Bioinformation, Beijing, China

8 Views Cite this report Responses(0)

Approved

Summary:
This manuscript provides a comprehensive review of differential alternative splicing analysis methods primarily based on short-read RNA sequencing data. The authors cover a wide range of computational tools and statistical frameworks, including both parametric and non-parametric approaches, and discuss their applications, strengths, and limitations. Overall, the article offers a valuable resource for researchers aiming to select appropriate tools for AS analysis and to understand the landscape of current methodologies.

Minor Comments:

User Experience and Usability: It would strengthen the review to include a systematic comparison of installation complexity, dependency requirements, and user-friendliness across the reviewed tools. Furthermore, a discussion on each tool’s visualization capabilities—such as graphical user interfaces, scripting options—would greatly benefit non-expert users and enhance the practical guidance of the review.
Computational Resource Requirements: Given that large-scale RNA-Seq datasets are increasingly common, the authors are encouraged to comment on the computational efficiency of the tools, including CPU and memory demands, support for multi-threading or distributed computing. This would provide readers with a better understanding of which tools are most suitable for extensive datasets.
Future Directions: Besides short-read and long-read sequencing integration, more elaborate coverage of emerging trends would be valuable. Specifically, a deeper analysis of deep learning techniques for splicing event detection and interpretation should be considered. Additionally, current challenges and prospects in single-cell splicing analysis could be addressed in more detail, especially how short-read tools are applied in single-cell contexts by aggregating data into pseudo-bulk samples.

Conclusion:
This manuscript is well-organized and informative. Addressing the above points would further enhance its clarity, practical utility, and relevance to current and future research trends in alternative splicing analysis.

Is the topic of the review discussed comprehensively in the context of the current literature?

Yes
Are all factual statements correct and adequately supported by citations?

Yes
Is the review written in accessible language?

Yes
Are the conclusions drawn appropriate in the context of the current research literature?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Computational cancer genomics with a focus on RNA splicing dysregulation in human cancer

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

15 Views

29 Jul 2025 | for Version 2

Julia Olivieri, University of the Pacific, California, USA

15 Views Cite this report Responses(0)

Approved With Reservations

This paper provides a review of alternative splicing (AS) analysis software for short-read RNA sequencing data. Tool popularity and active maintenance are compared using metrics such as citation count and number of GitHub commits. The flowchart provided in Figure 5 is especially helpful for researchers determining which approach is best for their use case. This review is written in accessible language, and breaks the different approaches down into helpful categories, such as parametric vs nonparametric and differential transcript expression (DTE) vs differential exon usage (DEU) vs differential splicing events (DSE).

This paper will be a useful resource as alternative splicing analysis on short-read RNA sequencing data becomes more prevalent. However, I have several concerns with the paper that I recommend addressing.

Concerns that should be addressed to make the article scientifically sound:

The abstract states that 22 tools are analyzed. However, the tools are not fully consistent throughout. DESeq2 is mentioned in the supplemental table, although it is not specifically optimized for AS analysis, and similar tools such as LIMMA and edgeR are not included in the table. Also, WHIPPET is included in the table and mentioned in the text but not included in any of the figures, and Cuffdiff2 is mentioned in the table and Figure 1, but not elsewhere. Keeping the tool list consistent would aid the clarity of the paper.
DRIMSeq and DTUrtle are categorized as parametric models, but the text says that they use non-parametric models. This should be clarified.
Bisbee is stated to be a deep-learning method, but this is not supported by the citation.

Additional recommendations:

In Figure 5, the “isoform-based analysis” diamond breaks the flow of the chart: one cannot trace a path from the top left to this diamond. I recommend re-positioning it to fit into the flow chart more naturally. Also, I recommend adding arrows to SplicingCompass and dSpliceType in the flowchart based on when they should be used (unless they would never be recommended).
Understanding the difference between DGE/DTE, DTU/DEU, and DSE is critical to fully understanding the paper. A figure to go along with the in-text description would be very helpful to clarify this for the reader.
Visualization tools are mentioned several times in the manuscript. It would be helpful to include example plots, at least an example sashimi plot, because many readers are likely unfamiliar with these plot types.
The text states about nonparametric models that “By avoiding assumptions about the data’s underlying distribution, these methods enable more sophisticated modeling.” Rather than “enable,” I would say “require.”
It is stated that the lower citation counts of software for AS analysis compared to DGE analysis “may pose challenges for researchers seeking resources and workflows specific to differential splicing analysis.” I am not sure why fewer citations would pose challenges - I would suggest clarifying or changing the wording (I would understand if fewer options posed challenges, or if lower usage meant less developer support).
There are several typos, words missing, and grammatical errors that could be fixed with another round of proofreading. For example, in the supplemental table “riff-parametric” should be “rdiff-parametric” and “To assess the academic popularity of tools, a citation and developer engagement analysis of original research articles within the Web of Science (WoS) domain and the respective GitHub website domains (if applicable)” is a sentence fragment.
The section about long-read sequencing in the discussion feels somewhat out of place based on the short-read focus of the rest of the paper. I recommend shortening this section.

Is the topic of the review discussed comprehensively in the context of the current literature?

Yes
Are all factual statements correct and adequately supported by citations?

Partly
Is the review written in accessible language?

Yes
Are the conclusions drawn appropriate in the context of the current research literature?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Alternative splicing analysis, bioinformatics, biostatistics, single cell RNA sequencing

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

10 Views

27 Jun 2025 | for Version 2

Charlotte Capitanchik, UK DRI at King’s College London, London, UK

10 Views Cite this report Responses(0)

Approved

I find the updated article much improved.

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

bioinformatics, splicing, RNA biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

29 Views

25 Apr 2025 | for Version 1

Charlotte Capitanchik, UK DRI at King’s College London, London, UK

29 Views Cite this report Responses(1)

Approved With Reservations

The method benchmarking section needs some attention. ‘Scientific accuracy’ should be replaced by just ‘accuracy’. ‘Computational power’ should be specified, presumably you mean performance characteristics such as memory usage and run time. “Lack of ground truth to set as a reference to compare to..” is unnecessary, just say a lack of ground truth splicing quantifications. Library size and sequencing depth are the same thing? Library size and positional bias are qualities of the RNA-seq data itself, not the bioinformatic pre-processing steps. Annotation quality is not a ‘pre-processing’ step - this is a feature of the organism under study. “Achieve finer control over the ground truth” could be rephrased to something more meaningful like ‘to explore the impact of increasing variability between replicates, changing replicate numbers, sequencing depth’…..etc. “The consensus drawn from these three benchmarks is that the performance of differential splicing tools exhibits considerable variability depending on the outlined factors.” This seems weak, perhaps a more nuanced conclusion can be drawn - when data is very good, deep sequencing, low variability, which tool performs best? Which tools have the lowest run times, compute requirements .etc. I don’t think it’s sufficient to simply say its complicated - please dive more into the details here. “The ongoing evolution and upkeep of tools by developers introduce a time-dependent aspect to benchmarking. Community-led maintenance efforts consistently enhance the functionality and reliability of tools over time.” I’m not sure what this means, please clarify.
The discussion brings in developments in long read sequencing and is quite nice - I would suggest making this a section of its own and expanding on what is already written. Alternatively, I would consider cutting it back a bit and changing the title of the article to reflect a focus on short-read sequencing data. Perhaps one point for the discussion is that whilst new methods are developing, there remains hundreds of thousands of publicly available short read RNA sequencing datasets through which novel biological insights can still be made.
In the section of recommendations - “For instance, if the aim is to identify known transcripts, it is advisable to opt for a parametric transcript-based tool like DEXSeq or DRIMSeq and execute a DTU study following Michael Love’s protocol.37” The citation does not support the statement - why shouldn’t researchers opt for an exon-based approach when the transcriptomic annotations are good? Also in this section, DEXSeq is recommended for DEU analysis, when rMATs is the most highly cited splicing tool and provides accurate quantifications of exon usage. “Overall, event-based methods are more suited to advanced programmers owing to their use of command-line tools over interpreters that use IDEs (Integrated development environments). For most analyses, however, a DEU or DTU-based analysis is recommended for simple interpretability and robustness.” I don’t understand, how is DEX-Seq easier to use than rMATs (for example), or MAJIQ which has extensive graphical reporting? Are you saying this because DEX-Seq is an R package so you can use RStudio? - this doesn’t seem like a particularly helpful argument - I can run rMATs or MAJIQ or any of these using a bash script in Visual Studio Code which is also an IDE…

Minor comments

Alternative splicing (AS) abbreviation is given several times throughout text and sometimes used, sometimes not - please be consistent.
“The proposed advantage of this approach is in the smaller exonic regions rather than full isoform deconvolution.” Rephrase for clarity, presumably you mean by focusing on regions unique to distinct isoforms the tool avoids the issue of assigning ambiguous reads to isoforms.
"A few newer methods such as DRIMSeq and DTUrtle have progressed onto non-parametric or mixed Dirichlet Multinomial Models (DMM)" ‘progression’ suggests there is some kind of hierarchy, you can just say that these models ‘use’ other distributions
“More complex regulatory events involve genomic features beyond exons and introns, such as alternative promoter and polyadenylation sites, which result in varying mRNA 5′ and 3′ UTR ends. However, these events are seldom included in most bioinformatics analyses, tools such as CAGER (Cap Analysis of Gene Expression) and DaPars (Dynamic Analysis of Alternative PolyAdenylation from RNA-Seq) are available for niche research” I wouldn’t say these events are more complex from a biological standpoint. The issue in analysis of alternative TSS use and APA is that short read sequencing with typical library preparation methods (e.g. random hexamer priming) won’t have good coverage of exact transcript 5’ and 3’ ends. Therefore typically library preparations with mRNA cap capture (CAGE) or 3’ end sequencing (eg. Quantseq) are used when this is the analysis goal. Also, to be a pedant, 3’UTRs and 5’UTRs are exons.
The discussion mentions Nextflow, and nf-core is mentioned earlier, but it might be nice to specifically mention the efforts of nf-core/rnasplice. As you know, one of the benefits of these pipelines is that everything is containerised so you don’t have to mess about installing everything. Generally speaking, one of the barriers to using tools can be installation - of the presented tools there is quite a range of levels of developer investment in making the tools easy to install. Some are in bioconda or bioconductor and have containers, others you have to contact the authors to get permission to download (MAJIQ!) - this might be related to the amount of citations that tools get. It would be nice (but not necessary) to address this too.

Is the topic of the review discussed comprehensively in the context of the current literature?

Yes
Are all factual statements correct and adequately supported by citations?

Partly
Is the review written in accessible language?

Partly
Are the conclusions drawn appropriate in the context of the current research literature?

Partly

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

bioinformatics, splicing, RNA biology

Respond to this report

Responses (1)

Author Response

16 Jun 2025

Ben Draper, Department of Chemical and Biological Engineering, Mappin St., The University of Sheffield, Sheffield, S1 3JD, UK

Dear Dr. Capitanchik,

Thank you for your thorough and insightful review of our manuscript. Your comments on the benchmarking, recommendations, long-read discussion, and minor edits have significantly improved the manuscript’s clarity and accuracy. We have incorporated minimal changes to address your concerns, ensuring the revisions align with your suggestions while maintaining the manuscript’s focus.

For the benchmarking section, we corrected terminology (“accuracy” instead of “scientific accuracy,” “computational performance” instead of “computational power”), clarified that library size, positional bias, and annotation quality are dataset characteristics, not pre-processing steps, and rephrased vague terms (e.g., “lack of ground truth” to “lack of comprehensive ground truth splicing quantifications”). We added specific examples drawn from the benchmarks (e.g., DEXSeq, rMATS, NOISEQ) to strengthen the conclusions.

In the recommendations section, we clarified our positions on programming environments, developer and community support, making sure not to discriminate harshly against command line-based tools. We believe this is still worth mentioning, however, as in our experience, the programming platform matters for accessibility.

In the discussion, we opted for shortening the long-read section for brevity and making the article more focused towards short-read. We have added a few minor points in agreement with Dr. Donega. We agree with all the minor edits (e.g., consistent AS acronym usage, clarified rDiff-parametric, revised TSS/APA) that were made as requested.

We believe these changes should address your concerns effectively.

Best wishes

Ben J. Draper
University of Sheffield

View more View less

Competing Interests

No competing interests were disclosed.

Back to all reports

Reviewer Report

27 Views

22 Apr 2025 | for Version 1

Stefano Donega, National Institute on Aging, Bethesda, USA

27 Views Cite this report Responses(1)

Approved With Reservations

In general, the entire manuscript discusses methods that directly apply to short-read platforms. Therefore, I think this should be better highlighted both in the manuscript title and throughout the whole review.
The long-read platforms appear only in the discussion section. I recommend the authors dedicate a separate paragraph to them, independent of the discussion, while keeping the discussion to connect together the main findings investigated in the main text.

Now, I will provide some minor comments:

In a recent Nature Aging paper, Ferrucci et al. 2022 (Ref 1) discussed the "energy-splicing axis hypothesis on aging," which is worthy of mentioning in the introductory paragraph on the importance of splicing.
There have been efforts to clarify modern nomenclature in gene expression studies, and guidelines were recently proposed to increase precision and clarity when communicating about gene expression, most notably to reserve 'gene' for the DNA template and 'transcript' for the RNA transcribed from that gene (Cunningham ASG, et al., 2024 [Ref 2]). I suggest authors consider aligning some definitions found in the manuscript with these guidelines.
There is no mention of the possibility of combining short- and long-read sequencing to enhance quantity and quality of results. I strongly suggest the authors include in their review a section on "StringTie" which utilizes both short and long RNA-seq reads for transcript assembly to generate a hybrid strategy (Shumate A, et al., 2022 [Ref 3]).

After these improvements, I am confident this article will be highly cited in the field.

Is the topic of the review discussed comprehensively in the context of the current literature?

Yes
Are all factual statements correct and adequately supported by citations?

Yes
Is the review written in accessible language?

Yes
Are the conclusions drawn appropriate in the context of the current research literature?

Partly

References

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Aging, muscle, mitochondria, energy, hypoxia, exercise

Respond to this report

Responses (1)

Author Response

09 Aug 2025

Ben Draper, Department of Chemical and Biological Engineering, Mappin St., The University of Sheffield, Sheffield, S1 3JD, UK

Dear Dr. Donega,

First of all, thank you for reviewing the article. I appreciate the time you took and the constructive feedback you gave me to improve this work.

The title was revised to “Selecting Differential Splicing Methods: Practical Considerations for Short-Read RNA Sequencing” to emphasise short-read platforms, and the abstract and introduction now explicitly state this focus. I am hesitant to expand and write a full section on long-read technology, as this isn't really my field of expertise. Therefore, we decided to streamline this section in line with Dr Capitanchik's recommendations while weaving in the hybridised approaches of current short-read technologies.

I agree with the minor points and have addressed these by including the recommended citations in the introduction and discussion.

These changes align the manuscript with your recommendations, maintaining its comprehensive scope while clarifying its primary focus on short-read RNA-seq.

Sincerely,

Ben J. Draper
University of Sheffield

View more View less

Competing Interests

No competing interests were disclosed.

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] 1. Baralle FE, Giudice J: Alternative splicing as a regulator of development and tissue identity. Nat. Rev. Mol. Cell Biol. 2017 Jul; 18(7): 437–451. PubMed Abstract | Publisher Full Text | Free Full Text

[2] 2. Matera AG, Wang Z: A day in the life of the spliceosome. Nat. Rev. Mol. Cell Biol. 2014 Feb; 15(2): 108–121. PubMed Abstract | Publisher Full Text | Free Full Text

[3] 3. Singh B, Eyras E: The role of alternative splicing in cancer. Transcription. 2017 Mar 15; 8(2): 91–98. PubMed Abstract | Publisher Full Text | Free Full Text

[4] 4. Bonnal SC, López-Oreja I, Valcárcel J: Roles and mechanisms of alternative splicing in cancer — implications for care. Nat. Rev. Clin. Oncol. 2020 Aug; 17(8): 457–474. PubMed Abstract | Publisher Full Text

[5] 5. Zhang Y, Qian J, Gu C, et al.: Alternative splicing and cancer: a systematic review. Signal Transduct. Target. Ther. 2021 Feb 24; 6(1): 1–14. Publisher Full Text

[6] 6. Kar A, Kuo D, He R, et al.: Tau Alternative Splicing and Frontotemporal Dementia. Alzheimer Dis. Assoc. Disord. 2005; 19(Suppl 1): S29–S36. PubMed Abstract | Publisher Full Text | Free Full Text

[7] 7. Yanagisawa M, Huveldt D, Kreinest P, et al.: A p120 Catenin Isoform Switch Affects Rho Activity, Induces Tumor Cell Invasion, and Predicts Metastatic Disease. J. Biol. Chem. 2008 Jun 27; 283(26): 18344–18354. PubMed Abstract | Publisher Full Text | Free Full Text

[8] 8. McEvoy J, Ulyanov A, Brennan R, et al.: Analysis of MDM2 and MDM4 Single Nucleotide Polymorphisms, mRNA Splicing and Protein Expression in Retinoblastoma. PLoS One. 2012 Aug 20; 7(8): e42739. PubMed Abstract | Publisher Full Text | Free Full Text

[9] 9. Cain K, Peters S, Hailu H, et al.: A CHO cell line engineered to express XBP1 and ERO1-Lα has increased levels of transient protein expression. Biotechnol. Prog. 2013 Jun; 29(3): 697–706. PubMed Abstract | Publisher Full Text

[10] 10. Johari YB, Estes SD, Alves CS, et al.: Integrated cell and process engineering for improved transient production of a “difficult-to-express” fusion protein by CHO cells. Biotechnol. Bioeng. 2015; 112(12): 2527–2542. PubMed Abstract | Publisher Full Text

[11] 11. Torres M, Dickson AJ: Reprogramming of Chinese hamster ovary cells towards enhanced protein secretion. Metab. Eng. 2022 Jan; 69(69): 249–261. Publisher Full Text

[12] 12. Butt H, Eid A, Momin AA, et al.: CRISPR directed evolution of the spliceosome for resistance to splicing inhibitors. Genome Biol. 2019 Apr 30; 20(1): 73. PubMed Abstract | Publisher Full Text | Free Full Text

[13] 13. Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet. 2009 Jan; 10(1): 57–63. PubMed Abstract | Publisher Full Text | Free Full Text

[14] 14. Stark R, Grzelak M, Hadfield J: RNA sequencing: the teenage years. Nat. Rev. Genet. 2019 Nov; 20(11): 631–656. Publisher Full Text

[15] 15. Babraham Bioinformatics - FastQC A Quality Control tool for High Throughput Sequence Data.[cited 2022 Aug 7]. Reference Source

[16] 16. Dobin A, Davis CA, Schlesinger F, et al.: STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013 Jan 1; 29(1): 15–21. PubMed Abstract | Publisher Full Text | Free Full Text

[17] 17. Kim D, Langmead B, Salzberg SL: HISAT: a fast spliced aligner with low memory requirements. Nat. Methods. 2015 Apr; 12(4): 357–360. PubMed Abstract | Publisher Full Text | Free Full Text

[18] 18. Patro R, Duggal G, Love MI, et al.: Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods. 2017 Apr; 14(4): 417–419. PubMed Abstract | Publisher Full Text | Free Full Text

[19] 19. Putri GH, Anders S, Pyl PT, et al.: Analysing high-throughput sequencing data in Python with HTSeq 2.0. Bioinformatics. 2022 May 13; 38(10): 2943–2945. PubMed Abstract | Publisher Full Text | Free Full Text

[20] 20. Liao Y, Smyth GK, Shi W: featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014 Apr 1; 30(7): 923–930. PubMed Abstract | Publisher Full Text

[21] 21. Love MI, Huber W, Anders S: Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014 Dec 5; 15(12): 550. PubMed Abstract | Publisher Full Text | Free Full Text

[22] 22. Ewels PA, Peltzer A, Fillinger S, et al.: nf-core: Community curated bioinformatics pipelines. bioRxiv. 2019 [cited 2024 Mar 15]; p. 610741. Publisher Full Text

[23] 23. Kawaji H, Lizio M, Itoh M, et al.: Comparison of CAGE and RNA-seq transcriptome profiling using clonally amplified and single-molecule next-generation sequencing. Genome Res. 2014 Apr; 24(4): 708–717. PubMed Abstract | Publisher Full Text | Free Full Text

[24] 24. Xia Z, Donehower LA, Cooper TA, et al.: Dynamic analyses of alternative polyadenylation from RNA-seq reveal a 3′-UTR landscape across seven tumour types. Nat. Commun. 2014 Nov 20; 5(1): 5274. PubMed Abstract | Publisher Full Text | Free Full Text

[25] 25. Katz Y, Wang ET, Silterra J, et al.: Quantitative visualization of alternative exon expression from RNA-seq data. Bioinformatics. 2015 Jul 1; 31(14): 2400–2402. PubMed Abstract | Publisher Full Text | Free Full Text

[26] 26. Vaquero-Garcia J, Barrera A, Gazzara MR, et al.: A new view of transcriptome complexity and regulation through the lens of local splicing variations. Valcárcel J, editor. elife. 2016 Feb 1; 5(5): e11752. PubMed Abstract | Publisher Full Text | Free Full Text

[27] 27. Ritchie ME, Phipson B, Wu D, et al.: Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015 Apr 20; 43(7): e47. PubMed Abstract | Publisher Full Text | Free Full Text

[28] 28. Robinson MD, McCarthy DJ, Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010 Jan 1; 26(1): 139–140. PubMed Abstract | Publisher Full Text | Free Full Text

[29] 29. Wang W, Qin Z, Feng Z, et al.: Identifying differentially spliced genes from two groups of RNA-seq samples. Gene. 2013 Apr 10; 518(1): 164–170. PubMed Abstract | Publisher Full Text

[30] 30. Drewe P, Stegle O, Hartmann L, et al.: Accurate detection of differential RNA processing. Nucleic Acids Res. 2013 May 1; 41(10): 5189–5198. PubMed Abstract | Publisher Full Text | Free Full Text

[31] 31. Hartley SW, Mullikin JC: Detection and visualization of differential splicing in RNA-Seq data with JunctionSeq. Nucleic Acids Res. 2016 Sep 6; 44(15): e127. PubMed Abstract | Publisher Full Text

[32] 32. Wang X, Cairns MJ: SeqGSEA: a Bioconductor package for gene set enrichment analysis of RNA-Seq data integrating differential expression and splicing. Bioinforma Oxf. Engl. 2014 Jun 15; 30(12): 1777–1779. PubMed Abstract | Publisher Full Text

[33] 33. Hiller D, Jiang H, Xu W, et al.: Identifiability of isoform deconvolution from junction arrays and RNA-Seq. Bioinformatics. 2009 Dec 1; 25(23): 3056–3059. PubMed Abstract | Publisher Full Text | Free Full Text

[34] 34. Nowicka M, Robinson MD: DRIMSeq: a Dirichlet-multinomial framework for multivariate count outcomes in genomics. F1000Research. 2016 Dec 6; 5: 1356. PubMed Abstract | Publisher Full Text | Free Full Text

[35] 35. Tekath T, Dugas M: Differential transcript usage analysis of bulk and single-cell RNA-seq data with DTUrtle. Bioinformatics. 2021 Nov 1; 37(21): 3781–3787. PubMed Abstract | Publisher Full Text | Free Full Text

[36] 36. Vitting-Seerup K, Sandelin A: IsoformSwitchAnalyzeR: analysis of changes in genome-wide patterns of alternative splicing and its functional consequences. Bioinformatics. 2019 Nov 1; 35(21): 4469–4471. PubMed Abstract | Publisher Full Text

[37] 37. Love MI, Soneson C, Patro R: Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification. F1000Res. 2018 [cited 2023 Feb 16]. Publisher Full Text Reference Source

[38] 38. Honeyman JN, Simon EP, Robine N, et al.: Detection of a Recurrent DNAJB1-PRKACA Chimeric Transcript in Fibrolamellar Hepatocellular Carcinoma. Science. 2014 Feb 28; 343(6174): 1010–1014. PubMed Abstract | Publisher Full Text | Free Full Text

[39] 39. Trincado JL, Entizne JC, Hysenaj G, et al.: SUPPA2: fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. Genome Biol. 2018 Mar 23; 19(1): 40. PubMed Abstract | Publisher Full Text | Free Full Text

[40] 40. Shen S, Park JW, Lu Z, et al.: rMATS: Robust and flexible detection of differential alternative splicing from replicate RNA-Seq data. Proc. Natl. Acad. Sci. 2014 Dec 23; 111(51): E5593–E5601. Publisher Full Text

[41] 41. Sterne-Weiler T, Weatheritt RJ, Best AJ, et al.: Efficient and Accurate Quantitative Profiling of Alternative Splicing Patterns of Any Complexity on a Laptop. Mol. Cell. 2018 Oct 4; 72(1): 187–200.e6. PubMed Abstract | Publisher Full Text

[42] 42. Dong C, He F, Berkowitz O, et al.: Alternative Splicing Plays a Critical Role in Maintaining Mineral Nutrient Homeostasis in Rice (Oryza sativa). Plant Cell. 2018 Oct 1; 30(10): 2267–2285. PubMed Abstract | Publisher Full Text | Free Full Text

[43] 43. Tarazona S, Furió-Tarí P, Turrà D, et al.: Data quality aware analysis of differential expression in RNA-seq with NOISeq R/Bioc package. Nucleic Acids Res. 2015 Dec 2; 43(21): e140. PubMed Abstract | Publisher Full Text

[44] 44. Zhao K, Lu Z, Xiang, et al.: GLiMMPS: robust statistical model for regulatory variation of alternative splicing using RNA-seq data. Genome Biol. 2013 Jul 22; 14(7): R74. PubMed Abstract | Publisher Full Text | Free Full Text

[45] 45. Halperin RF, Hegde A, Lang JD, et al.: Improved methods for RNAseq-based alternative splicing analysis. Sci. Rep. 2021 May 24; 11(1): 10740. PubMed Abstract | Publisher Full Text | Free Full Text

[46] 46. Zhang Z, Pan Z, Ying Y, et al.: Deep-learning augmented RNA-seq analysis of transcript splicing. Nat. Methods. 2019 Apr; 16(4): 307–310. PubMed Abstract | Publisher Full Text | Free Full Text

[47] 47. Anders S, Reyes A, Huber W: Detecting differential usage of exons from RNA-seq data. Genome Res. 2012 Oct 1; 22(10): 2008–2017. PubMed Abstract | Publisher Full Text | Free Full Text

[48] 48. Leng N, Dawson JA, Thomson JA, et al.: EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments. Bioinformatics. 2013 Apr 15; 29(8): 1035–1043. PubMed Abstract | Publisher Full Text | Free Full Text

[49] 49. Mehmood A, Laiho A, Venäläinen MS, et al.: Systematic evaluation of differential splicing tools for RNA-seq studies. Brief. Bioinform. 2020 Dec 1; 21(6): 2052–2065. PubMed Abstract | Publisher Full Text | Free Full Text

[50] 50. Liu R, Loraine AE, Dickerson JA: Comparisons of computational methods for differential alternative splicing detection using RNA-seq in plant systems. BMC Bioinformatics. 2014 Dec 16; 15(1): 364. PubMed Abstract | Publisher Full Text | Free Full Text

[51] 51. Merino GA, Conesa A, Fernández EA, et al.: A benchmarking of workflows for detecting differential splicing and differential expression at isoform level in human RNA-seq studies. Brief. Bioinform. 2019 Mar 25; 20(2): 471–481. PubMed Abstract | Publisher Full Text

[52] 52. Jiang M, Zhang S, Yin H, et al.: A comprehensive benchmarking of differential splicing tools for RNA-seq analysis at the event level. Brief. Bioinform. 2023 May 1; 24(3): bbad121. PubMed Abstract | Publisher Full Text

[53] 53. Kannan K, Wang L, Wang J, et al.: Recurrent chimeric RNAs enriched in human prostate cancer identified by deep sequencing. Proc. Natl. Acad. Sci. 2011 May 31; 108(22): 9172–9177. PubMed Abstract | Publisher Full Text | Free Full Text

[54] 54. Griebel T, Zacher B, Ribeca P, et al.: Modelling and simulating generic RNA-Seq experiments with the flux simulator. Nucleic Acids Res. 2012 Nov 1; 40(20): 10073–10083. PubMed Abstract | Publisher Full Text

[55] 55. Katz Y, Wang ET, Airoldi EM, et al.: Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nat. Methods. 2010 Dec; 7(12): 1009–1015. PubMed Abstract | Publisher Full Text | Free Full Text

[56] 56. Robinson JT, Thorvaldsdóttir H, Winckler W, et al.: Integrative genomics viewer. Nat. Biotechnol. 2011 Jan; 29(1): 24–26. PubMed Abstract | Publisher Full Text | Free Full Text

[57] 57. Li YI, Knowles DA, Humphrey J, et al.: Annotation-free quantification of RNA splicing using LeafCutter. Nat. Genet. 2018 Jan; 50(1): 151–158. PubMed Abstract | Publisher Full Text | Free Full Text

[58] 58. Benegas G, Fischer J, Song YS: Robust and annotation-free analysis of alternative splicing across diverse cell types in mice. Eyras E, Manley JL, editors. elife. 2022 Mar 1; 11: e73520. PubMed Abstract | Publisher Full Text | Free Full Text

[59] 59. Mapleson D, Venturini L, Kaithakottil G, et al.: Efficient and accurate detection of splice junctions from RNA-seq with Portcullis. GigaScience. 2018 Dec 1; 7(12): giy131. PubMed Abstract | Publisher Full Text | Free Full Text

[60] 60. Eid J, Fehr A, Gray J, et al.: Real-Time DNA Sequencing from Single Polymerase Molecules. Science. 2009 Jan 2; 323(5910): 133–138. Publisher Full Text

[61] 61. Derrington IM, Butler TZ, Collins MD, et al.: Nanopore DNA sequencing with MspA. Proc. Natl. Acad. Sci. 2010 Sep 14; 107(37): 16060–16065. PubMed Abstract | Publisher Full Text | Free Full Text

[62] 62. Feng Y, Zhang Y, Ying C, et al.: Nanopore-based Fourth-generation DNA Sequencing Technology. Genomics Proteomics Bioinformatics. 2015 Feb 1; 13(1): 4–16. PubMed Abstract | Publisher Full Text | Free Full Text

[63] 63. Jain M, Koren S, Miga KH, et al.: Nanopore sequencing and assembly of a human genome with ultra-long reads. Nat. Biotechnol. 2018 Apr; 36(4): 338–345. PubMed Abstract | Publisher Full Text | Free Full Text

[64] 64. Wright DJ, Hall NAL, Irish N, et al.: Long read sequencing reveals novel isoforms and insights into splicing regulation during cell state changes. BMC Genomics. 2022 Jan 10; 23(1): 42. PubMed Abstract | Publisher Full Text | Free Full Text

[65] 65. Wyman D, Balderrama-Gutierrez G, Reese F, et al.: A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification. bioRxiv. 2020 [cited 2023 Sep 7]; p. 672931. Publisher Full Text

[66] 66. Hu Y, Fang L, Chen X, et al.: LIQA: long-read isoform quantification and analysis. Genome Biol. 2021 Jun 17; 22(1): 182. PubMed Abstract | Publisher Full Text | Free Full Text

[67] 67. Carneiro MO, Russ C, Ross MG, et al.: Pacific biosciences sequencing technology for genotyping and variation discovery in human data. BMC Genomics. 2012 Aug 5; 13(1): 375. PubMed Abstract | Publisher Full Text | Free Full Text

[68] 68. Jain M, Fiddes IT, Miga KH, et al.: Improved data analysis for the MinION nanopore sequencer. Nat. Methods. 2015 Apr; 12(4): 351–356. PubMed Abstract | Publisher Full Text | Free Full Text

[69] 69. Hon T, Mars K, Young G, et al.: Highly accurate long-read HiFi sequencing data for five complex genomes. Sci. Data. 2020 Nov 17; 7(1): 399. PubMed Abstract | Publisher Full Text | Free Full Text

[70] 70. Baid G, Cook DE, Shafin K, et al.: DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer. Nat. Biotechnol. 2023 Feb; 41(2): 232–238. PubMed Abstract | Publisher Full Text

[71] 71. Amarasinghe SL, Su S, Dong X, et al.: Opportunities and challenges in long-read sequencing data analysis. Genome Biol. 2020 Feb 7; 21(1): 30. PubMed Abstract | Publisher Full Text | Free Full Text

[72] 72. Di Tommaso P, Chatzou M, Floden EW, et al.: Nextflow enables reproducible computational workflows. Nat. Biotechnol. 2017 Apr; 35(4): 316–319. PubMed Abstract | Publisher Full Text

[73] 73. Draper BJ: Selecting differential splicing methods: Practical Considerations - R Scripts and Data. Zenodo. 2024. Publisher Full Text

Selecting differential splicing methods: Practical considerations

Abstract

Keywords

Introduction

Current statistical methods for differential splicing

Parametric & mixed methods

Figure 1. Timeline of statistical methods in differential splicing tool development.

Probabilistic & non-parametric methods

Popularity & developer maintenance of methods

Figure 2. Citation counts of differential splicing tools (2010–2024) from Web of Science (WoS) Data.

Figure 3. Citation trends of differential splicing tools (2010–2024) from Web of Science (WoS) Data.

Figure 4. Developer maintenance of differential splicing tools.

Benchmarking of methods is difficult

Method recommendations

Figure 5. Guideline for differential splicing tool selection based on experimental parameters.

Discussion

Ethical approval and consent statement

Data availability statement

Underlying data

Extended data

Software availability statement

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated