StackbarExtended: a user-friendly stacked bar-plot representation incorporating phylogenetic information and microbiota differential abundance analysis

Thibault Cuisiniere; Manuela M Santos

doi:10.12688/f1000research.151662.1

Home Browse StackbarExtended: a user-friendly stacked bar-plot representation...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Software Tool Article

StackbarExtended: a user-friendly stacked bar-plot representation incorporating phylogenetic information and microbiota differential abundance analysis

[version 1; peer review: 1 approved, 1 approved with reservations]

Thibault Cuisiniere ^1,2, Manuela M Santos^1-3

PUBLISHED 09 Aug 2024

Author details Author details

¹ Nutrition and Microbiome Laboratory, Centre de recherche du Centre hospitalier de l'Université de Montréal (CRCHUM), Montréal, Québec, Canada
² Institut du cancer de Montréal, Montréal, Québec, Canada
³ Department of Medicine, Faculty of Medicine, Université de Montréal, Montréal, Québec, Canada

Thibault Cuisiniere
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Manuela M Santos
Roles: Funding Acquisition, Project Administration, Resources, Supervision, Validation, Writing – Original Draft Preparation, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

Abstract

Background

Microbial communities are mainly composed of bacteria, archaea, viruses and fungi, and are present in the gut, mouth, nose, skin, lungs, vagina, and bladder, among other places. In recent years, research has highlighted the critical role that these highly complex communities play in health and disease. Advances in sequencing technology have resulted in the development of high-dimensional data, which are challenging to effectively analyze and visualize. In this context, traditional stacked bar-plot visualizations, while widely used, fall short of conveying the fundamental phylogenic relationships between community members and are thus difficult to interpret.

Methods

StackbarExtended is implemented in native R, required version (≥ 4.0), and is platform independent, with its source code available on GitHub and archived on Zenodo.

Results

StackbarExtended allows for the plotting of relative abundance at user-defined taxonomic levels while displaying phylogenetic information using color gradients. Additionally, StackbarExtended integrates differential abundance statistics directly into the visualization process and performs clustering of low-abundance taxa.

Conclusions

StackbarExtended offers researchers a user-friendly tool for rapid visualization, presentation, and analysis of the microbiota composition.

Keywords

Microbiota, differential abundance analysis, visualization

Corresponding author: Thibault Cuisiniere

Competing interests: No competing interests were disclosed.

Grant information: This work was supported by a grant from the Natural Sciences and Engineering Research Council of Canada [NSERC, grant RGPIN-2024-05660] to MMS.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2024 Cuisiniere T and M Santos M. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Cuisiniere T and M Santos M. StackbarExtended: a user-friendly stacked bar-plot representation incorporating phylogenetic information and microbiota differential abundance analysis [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2024, 13:914 (https://doi.org/10.12688/f1000research.151662.1) First published: 09 Aug 2024, 13:914 (https://doi.org/10.12688/f1000research.151662.1) Latest published: 09 Aug 2024, 13:914 (https://doi.org/10.12688/f1000research.151662.1)

Introduction

Microbiota communities consisting of diverse microbial members, including bacteria, viruses, fungi, and other microorganisms (Berg et al., 2020) in the digestive tract, skin, respiratory and urogenital systems, and other locations (Kennedy and Chang, 2020), have emerged as a crucial factor in maintaining health and preventing disease (Hou et al., 2022). With advancements in sequencing technologies (Satam et al., 2023) there is an increasing amount of data available on the composition and function of microbiota communities (Hasan and Yang, 2019), but analyzing and visualizing these data remain challenging due to their complexity and high dimensionality (Panek et al., 2018).

Microbiota communities are organized into multiple phylogenetic levels, including Phylum, Family, Genus, and Species, with each level providing unique insights into the community’s structure and functional potential (Jandhyala et al., 2015). Integrating visualization techniques to simultaneously represent relative abundances across multiple taxonomic levels would enhance the accessibility and ease of interpretation.

Stacked bar-plot representations of the relative abundance of microorganisms remain one of the most commonly used visualizations to show the global composition of microbiota communities, as well as potential shifts in a given community (Liu et al., 2021; McMurdie and Holmes, 2013). However, current tools used to generate stacked bar visualizations in microbiota analysis do not capture phylogenetic relationships between microbial taxa (Peeters et al., 2021). This is because traditional stacked bar visualizations utilize random colors to represent different microbes within the same taxonomic level. Furthermore, differential abundance analysis, a key step in interpreting statistically significant compositional shifts and their biological implications, is typically conducted and shown separately.

To address these issues, we developed a novel R package that allows users to easily generate stacked bar-plots to visualize microbiota composition at a user-defined taxonomic level while integrating information on taxa phylogeny through the use of color gradients. In addition, statistically significant differences in relative abundance are indicated on the plot. These simple solutions allow for more information to be communicated within one single graphical representation.

Methods

Operation

StackbarExtended is implemented in native R, required version (≥ 4.0), and is platform independent, with its source code available on GitHub and archived on Zenodo (Cuisiniere, 2024a).

Implementation

StackbarExtended is designed for microbiota data analysis and visualization. It requires three main inputs: a taxonomic table providing phylogenetic information, a count table detailing the abundance of the taxa, and metadata containing biological or experimental group data (Navgire et al., 2022). These tables can be obtained through established pipelines such as DADA2 (Callahan et al., 2016), Mothur (Schloss et al., 2009), and QIIME2 (Bolyen et al., 2019) in conjunction with reference databases such as Greengenes (DeSantis et al., 2006) or SILVA (Quast et al., 2013). The package supports a phyloseq S4 class object as an input, promoting a data management approach that regroups the count, taxonomy, and metadata tables into a single entity, thus simplifying data handling and enhancing reproducibility. The Phyloseq S4 object can be obtained through the phyloseq() function of the phyloseq R package (McMurdie and Holmes, 2013).

One of the StackbarExtended functionalities is the ability to handle multiple taxonomic levels simultaneously, typically focusing on user-defined levels set by default to Family (X) and Phylum (X+1). This is achieved through the use of the tax_glom() function from the phyloseq package. Most importantly, StackbarExtended uses a color-coding mechanism where taxa at a specified level (X) are visually differentiated using user-defined color palettes that match those of their Phylum level (X+1), making visual identification of the taxonomic hierarchies straightforward. This is achieved by applying different shades of the same color to represent individual taxa (X) within the same Phylum (X+1), creating a gradient effect that maps the phylogenetic taxonomy.

Another challenge in analyzing microbiota data is the large number of taxa present in the dataset. In microbiota communities, a small number of taxa typically dominate, while numerous others are present at significantly lower abundance levels (Neu et al., 2021). In classic stacked bar-plot representations such as the ones included in Phyloseq (McMurdie and Holmes, 2013), MicrobiomeAnalyst (Chong et al., 2020) or MicroEco (Liu et al., 2021) pipelines, these low-abundance taxa are simply filtered out before plotting. StackbarExtended allows users to classify taxa based on their relative abundance, and, by default, the package plots only taxa representing more than 1% of the total abundance. In addition, low-abundance taxa (X) can be grouped into their respective “Others” taxonomic level and the information about their taxonomy (i.e. Phylum) is kept while taxa belonging to low abundance Phyla (X+1) are grouped into a general “Others” category. Thus, by still including the low-abundance taxa, StackbarExtended provides a more accurate representation of the taxa in a microbiota community.

Finally, StackbarExtended includes DESeq2 differential abundance analysis functionality (Love et al., 2014) which allows users to statistically assess the difference in taxa abundance between experimental groups and apply fdr corrections. Significant features (i.e. taxa with significant differences in abundance between 2 groups) are highlighted in the legend using bold font, and the significance levels (p-value) after fdr corrections are automatically shown using stars in the legend. When more than two treatment groups are compared, multiple pairwise comparisons can be computed and data-frames are produced with results providing information about the taxa identified as significant at each phylogenetic level analyzed through the DESeq2 analysis. This information includes the taxa names and phylogeny, their corresponding abundance levels, statistical metrics such as log2 fold-change, p-values and fdr-corrected p-values.

Use cases

To demonstrate the use of this package, we have provided ready-to-use example data “ps” from our previously published study (Cuisiniere, 2024a; Cuisiniere et al., 2021) which assessed shifts in mouse gut microbiota composition after antibiotic treatment (Figure 1).

Figure 1. StackbarExtended graphical output.

Mice (n = 9) received oral antibiotic treatment with neomycin and metronidazole for one week. Fecal samples were collected before (Day 0) and after antibiotic treatment (Day 7). 16S rRNA of DNA extracted from fecal samples was sequenced using the Illumina MiSeq platform. The 4 most abundant phyla and families are represented. Families with a mean abundance lower than 1% across the samples are regrouped into “Others”. Taxa represented in bold within the legends are statistically significant after fdr correction (*P < 0.05, **P < 0.01, ***P < 0.001). Data used are from the “ps” dataset and are included in the StackbarExtended R package.

Animal experiments were approved by the Institutional Animal Care committee of the Centre de recherche du Centre hospitalier de l’Université de Montréal (CRCHUM) in agreement with the guidelines of the Canadian Council of Animal Care. The study was carried out in compliance with the ARRIVE guidelines (Cuisiniere, 2024b). No criteria were set for including and excluding animals during the experiment or data points during the analysis. No exclusions of animals, experimental units, or data points were applied for the analysis.

In order to obtain the data, four-week-old female C57Bl/6 mice were purchased from Charles River Laboratories (Saint-Constant, QC, Canada). Constant efforts were made to minimize the suffering of the animals. Nine mice were kept under controlled specific pathogen free (SPF) conditions in the CRCHUM animal facility at a temperature of 22°C, 45-60% humidity with a light-dark cycle of 12-12. They were housed at three mice per cage with ad libitum access to chow and water. Cages were enriched with nesting material. Mice were allowed one week of acclimation following arrival to the CRCHUM animal facility, after which oral antibiotics consisting of metronidazole (1 mg.ml⁻¹, Hospira, St-Laurent, QC, Canada) and neomycin (1 mg.ml⁻¹, Sigma, St-Louis, MO, USA) were added to the drinking water for one week. Fecal samples were collected before (Day 0) and after antibiotic treatment (Day 7), snap-frozen and stored at -80°C. Mice were then euthanized using CO₂ followed by cervical dislocation. DNA was extracted using the Qiagen DNeasy PowerSoil^® kit (Qiagen, Toronto, ON) and quantified using a spectrophotometer (DeNovix DS-11 FX, Wilmington, DE). The 16S ribosomal RNA (rRNA) library preparation and sequencing was performed using the Illumina MiSeq platform at Genome Québec targeting the V3-V4 (Primers: 341F, 805R) region of the 16S rRNA gene. Forward and reverse, raw, demultiplexed 16S rRNA reads were denoised, chimera filtered, and clustered into sequence variants using the Dada2 package (version 1.16) (Callahan et al., 2016) in R (version 4.0.1). Reads were trimmed at the first instance of a quality score less than or equal to 2 or removed if they contained ambiguous nucleotides (N) or if two or more errors were expected based on the quality of the trimmed read. After taxonomic assignment using Silva training set v132 (Quast et al., 2013), ASV (Table 1), taxonomy (Table 2) and metadata (Table 3) were combined into a phyloseq object (McMurdie and Holmes, 2013).

Table 1. Subset of the ASV table used.

Rows represent each sample and columns represent individual ASVs. Each cell indicates the count of a particular ASV in a specific sample.

	ASV1	ASV2	ASV3	ASV4	ASV5
15186T0	2035	2717	160	851	346
15189T7	4119	2552	3	41	209
15187T0	2757	868	552	15	389
15190T7	4173	3	0	27	1
15188T0	2535	1230	87	445	434

Table 2. Subset of the taxonomy table used.

Rows represent individual ASVs, and columns represent each taxonomic rank. Each cell contains the taxonomic name at that rank for the corresponding ASV.

	Kingdom	Phylum	Class	Order	Family	Genus	Species
ASV1	Bacteria	Verrucomicrobia	Verrucomicrobiae	Verrucomicrobiales	Akkermansiaceae	Akkermansia	muciniphila
ASV2	Bacteria	Bacteroidetes	Bacteroidia	Bacteroidales	Tannerellaceae	Parabacteroides	NA
ASV3	Bacteria	Bacteroidetes	Bacteroidia	Bacteroidales	Bacteroidaceae	Bacteroides	vulgatus
ASV4	Bacteria	Bacteroidetes	Bacteroidia	Bacteroidales	Tannerellaceae	Parabacteroides	distasonis
ASV5	Bacteria	Bacteroidetes	Bacteroidia	Bacteroidales	Rikenellaceae	Alistipes	NA

Table 3. Subset of the metadata table used.

Rows contain metadata associated with each sample.

	timepoint	Mouse.Id	SampleID	concentration.ng.ul	antibiotic
15186T0	Day 0	15186	15186T0	89.764	0
15189T7	Day 7	15189	15189T7	7.702	1
15187T0	Day 0	15187	15187T0	35.692	0
15190T7	Day 7	15190	15190T7	18.386	1
15188T0	Day 0	15188	15188T0	21.619	0

Users can then use the following code to create graphical representation of the gut microbiota composition at the phylum and family levels comparing mice before and after antibiotic treatment and performing differential abundance with fdr correction.

# The plot and the output tables are stored into a single list object

my_plot <- plot_microbiota(
  ps_object = ps,
  exp_group = 'timepoint',
  sample_name = 'SampleID',
  hues = c("Purples", "Blues", "Greens", "Oranges"),
  differential_analysis = T,
  sig_lab = T,
  fdr_threshold = 0.05
)

print (my_plot$plot)

In addition to the graphical representation, two output tables (one for each level) are created containing statistical information concerning the differentially abundant taxa (Table 4). The tables can be accessed as follows:

#Display statistically significant taxa at the phylum level
print(my_plot$significant_table_main)

#Display statistically significant taxa at the family level
print(my_plot$significant_table_sub)

Table 4. Subset of data frame object output of the StackBarExtended R package.

The data frame output contains the results columns: baseMean, log2FoldChange, lfcSE, stat, pvalue (unajusted p-values) and padj (fdr-corrected p-values), and also includes metadata columns of related taxonomic information. The lfcSE gives the standard error of the log2FoldChange. For the Wald test, stat is the Wald statistic: the log2FoldChange divided by lfcSE, which is compared to a standard Normal distribution to generate a two-tailed pvalue. For the likelihood ratio test (LRT), stat is the difference in deviance between the reduced model and the full model, which is compared to a chi-squared distribution to generate a p-value.

	baseMean	log2FoldChange	lfcSE	stat	pvalue	padj	Kingdom	Phylum	Class	Order	Family
ASV1	5761.0	2.2	0.44	5.1	3.65E-07	5.48E-07	Bacteria	Verrucomicrobia	Verrucomicrobiae	Verrucomicrobiales	Akkermansiaceae
ASV2	3577.7	1.1	0.50	2.1	0.037113271	0.038726891	Bacteria	Bacteroidetes	Bacteroidia	Bacteroidales	Tannerellaceae
ASV5	205.1	-2.2	0.75	-2.9	0.003503241	0.004003703	Bacteria	Bacteroidetes	Bacteroidia	Bacteroidales	Rikenellaceae
ASV8	228.5	-5.9	0.89	-6.6	3.62E-11	7.25E-11	Bacteria	Proteobacteria	Deltaproteobacteria	Desulfovibrionales	Desulfovibrionaceae
ASV9	880.8	5.9	0.52	11.3	1.75E-29	2.10E-28	Bacteria	Proteobacteria	Gammaproteobacteria	Betaproteobacteriales	Burkholderiaceae

The primary objective of our example dataset is to identify which taxa at the Family and Phylum levels were impacted by antibiotic treatment in mice gut microbiota. This analysis is crucial for understanding how antibiotic interventions alter microbial communities. The graphical output (Figure 1) provided a clear assessment, revealing that among the four most abundant Phyla, three were significantly affected by the antibiotic treatment. Specifically, significant alterations were observed in Verrucomicrobia, Bacteroidetes, and Proteobacteria (fdr < 0.05), indicating a substantial shift in the microbial composition due to antibiotic exposure. Among the abundant Families (>1% of the total abundance) within these Phyla, 10 out of 11 exhibited significant alterations (fdr < 0.05). On average, the six remaining low-abundance Phyla, grouped into the “Others” category, accounted for 2.5% of the total relative abundance. Similarly, within the abundant Phyla, the 14 low-abundance Families accounted for 2.4% of the total relative abundance.

The two generated output tables provide detailed statistical information about the differentially abundant taxa at each level, including log2 fold-change, exact p-values, and fdr-corrected p-values. This comprehensive data allows for identification of taxa significantly impacted by the treatment. Notably, four additional low-abundance Phyla - Cyanobacteria, Deferribacteres, Tenericutes, and Actinobacteria - were found to be significantly affected (fdr < 0.05). Furthermore, a total of 24 families were identified as statistically different (fdr < 0.05) (Cuisiniere, 2024a).

These findings are consistent with previous studies that have reported significant shifts in gut microbiota composition following antibiotic treatment (Fishbein et al., 2023). Hence, this use case demonstrated the advantage of using StackbarExtended to present a clear and interpretable graphical representation while retaining the capacity to perform detailed statistical analysis.

Discussion

StackbarExtended offers the opportunity to enhance the widely utilized stacked bar graphical representation by incorporating information about taxonomy and statistical significance in regard to differentially abundant taxa. Furthermore, it enables users to retain data on rare taxa while maintaining phylogeny information. These functionalities have been implemented into a user-friendly R package, StackbarExtended, which is freely accessible on GitHub. The package facilitates the processing of large microbiota datasets and produces publication-ready and information-rich graphical representations with a high level of personalization. StackbarExtended is particularly useful to biology and molecular biology students, fellows and researchers working in microbiota analysis, and its output visualizations are suitable for publications and time-limited presentations at conferences and seminars requiring quick interpretation of displayed data.

Software availability

Source code available from: https://github.com/ThibaultCuisiniere/StackbarExtended.

Archived software available from: https://doi.org/10.5281/zenodo.11166800 (Cuisiniere, 2024a).

License: This R package and underlying data are freely available under the Gnu Public License (GPL-3).

Ethics and consent

Animal experiments were approved on April 3^rd 2019 by the Institutional Animal Care committee of the Centre de recherche du Centre hospitalier de l’Université de Montréal (CRCHUM) in agreement with the guidelines of the Canadian Council of Animal Care, approval number C19006MSs. The study was carried out in compliance with the ARRIVE guidelines (Cuisiniere, 2024b).

Data availability

Underlying data

The dataset analyzed in this study is stored in the data/directory of the StackbarExtended package.

Reporting guidelines

ARRIVE checklist available from: https://zenodo.org/records/12583605 (Cuisiniere, 2024b).

Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication).

Acknowledgements

The authors thanks Marco Constante (Department of Medicine, Farncombe Family Digestive Health Research Institute, McMaster University, Hamilton, Ontario, Canada) for his insight in developing the StackbarExtended package. We also thank Claire Gerkins and Claire McCartney (Nutrition and Microbiome Laboratory, Centre de recherche du Centre hospitalier de l’Université de Montréal (CRCHUM), Montréal, Québec, Canada) for their help in editing the manuscript.

References

Berg G, et al.: Microbiome definition re-visited: old concepts and new challenges. Microbiome. 2020; 8(1): 103. PubMed Abstract | Publisher Full Text | Free Full Text
Bolyen E, et al.: Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 2019; 37(8): 852–857. PubMed Abstract | Publisher Full Text | Free Full Text
Callahan BJ, et al.: DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods. 2016; 13(7): 581–583. PubMed Abstract | Publisher Full Text | Free Full Text
Chong J, et al.: Using MicrobiomeAnalyst for comprehensive statistical, functional, and meta-analysis of microbiome data. Nat. Protoc. 2020; 15(3): 799–821. PubMed Abstract | Publisher Full Text
Cuisiniere T: StackbarExtended. Zenodo. 2024a. Publisher Full Text
Cuisiniere T: tackbarExtended - ARRIVE checklist. Zenodo. 2024b. Reference Source
Cuisiniere T, et al.: Oral iron supplementation after antibiotic exposure induces a deleterious recovery of the gut microbiota. BMC Microbiol. 2021; 21(1): 259. PubMed Abstract | Publisher Full Text | Free Full Text
DeSantis TZ, et al.: Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 2006; 72(7): 5069–5072. PubMed Abstract | Publisher Full Text | Free Full Text
Fishbein SRS, Mahmud B, Dantas G: Antibiotic perturbations to the gut microbiome. Nat. Rev. Microbiol. 2023; 21(12): 772–788. PubMed Abstract | Publisher Full Text
Hasan N, Yang H: Factors affecting the composition of the gut microbiota, and its modulation. PeerJ. 2019; 7: e7502. PubMed Abstract | Publisher Full Text | Free Full Text
Hou K, et al.: Microbiota in health and diseases. Signal Transduct. Target. Ther. 2022; 7(1): 135. PubMed Abstract | Publisher Full Text | Free Full Text
Jandhyala SM, et al.: Role of the normal gut microbiota. World J. Gastroenterol. 2015; 21(29): 8787–8803. PubMed Abstract | Publisher Full Text | Free Full Text
Kennedy MS, Chang EB: The microbiome: Composition and locations. Prog. Mol. Biol. Transl. Sci. 2020; 176: 1–42. PubMed Abstract | Publisher Full Text | Free Full Text
Liu C, et al.: microeco: an R package for data mining in microbial community ecology. FEMS Microbiol. Ecol. 2021; 97(2). PubMed Abstract | Publisher Full Text
Love MI, Huber W, Anders S: Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15(12): 550. PubMed Abstract | Publisher Full Text | Free Full Text
McMurdie PJ, Holmes S: phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One. 2013; 8(4): e61217. PubMed Abstract | Publisher Full Text | Free Full Text
Navgire GS, et al.: Analysis and Interpretation of metagenomics data: an approach. Biol. Proced. Online. 2022; 24(1): 18. PubMed Abstract | Publisher Full Text | Free Full Text
Neu AT, Allen EE, Roy K: Defining and quantifying the core microbiome: Challenges and prospects. Proc. Natl. Acad. Sci. USA. 2021; 118(51). PubMed Abstract | Publisher Full Text | Free Full Text
Panek M, et al.: Methodology challenges in studying human gut microbiota - effects of collection, storage, DNA extraction and next generation sequencing technologies. Sci. Rep. 2018; 8(1): 5143. PubMed Abstract | Publisher Full Text | Free Full Text
Peeters J, et al.: Exploring the Microbiome Analysis and Visualization Landscape. Front. Bioinform. 2021; 1: 774631. PubMed Abstract | Publisher Full Text | Free Full Text
Quast C, et al.: The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013; 41(Database issue): D590–D596. PubMed Abstract | Publisher Full Text
Satam H, et al.: Next-Generation Sequencing Technology: Current Trends and Advancements. Biology (Basel). 2023; 12(7). Publisher Full Text
Schloss PD, et al.: Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl. Environ. Microbiol. 2009; 75(23): 7537–7541. PubMed Abstract | Publisher Full Text | Free Full Text

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 09 Aug 2024

Author details Author details

¹ Nutrition and Microbiome Laboratory, Centre de recherche du Centre hospitalier de l'Université de Montréal (CRCHUM), Montréal, Québec, Canada
² Institut du cancer de Montréal, Montréal, Québec, Canada
³ Department of Medicine, Faculty of Medicine, Université de Montréal, Montréal, Québec, Canada

Thibault Cuisiniere
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Manuela M Santos
Roles: Funding Acquisition, Project Administration, Resources, Supervision, Validation, Writing – Original Draft Preparation, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

This work was supported by a grant from the Natural Sciences and Engineering Research Council of Canada [NSERC, grant RGPIN-2024-05660] to MMS.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (1)

version 1

Published: 09 Aug 2024, 13:914

https://doi.org/10.12688/f1000research.151662.1

Copyright

© 2024 Cuisiniere T and M Santos M. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Cuisiniere T and M Santos M. StackbarExtended: a user-friendly stacked bar-plot representation incorporating phylogenetic information and microbiota differential abundance analysis [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2024, 13:914 (https://doi.org/10.12688/f1000research.151662.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Version 1

VERSION 1

PUBLISHED 09 Aug 2024

Views

22

Reviewer Report 29 Aug 2024

Monica Steffi Matchado, Ludwig-Maximilians-Universitat Munchen Medizinische Fakultat (Ringgold ID: 54187), Munich, Bavaria, Germany

Approved with Reservations

https://doi.org/10.5256/f1000research.166324.r312937

Overview:
This manuscript describes a new R package, StackbarExtended, which combines phylogenetic information and statistical significance into a single graphical output. This will visually improve microbiota composition in better understanding. The technique solves conventional stacked bar plot shortcomings, most ... Continue reading

Overview:
This manuscript describes a new R package, StackbarExtended, which combines phylogenetic information and statistical significance into a single graphical output. This will visually improve microbiota composition in better understanding. The technique solves conventional stacked bar plot shortcomings, most notably the separation of differential abundance analysis from visual depiction and the absence of evolutionary context. The manuscript is well-written, the methodology is sound, and the use case effectively demonstrates the package's capabilities.
Major comments:
1. The package appears to focus only to users who are familiar with R programming knowledge. There is no discussion on how the package might be made more accessible to a broader audience, such as through a user-friendly interface or integration with other platforms. This limits the potential impact and adoption of the tool in the wider research community. Are there any plans to develop a graphical user interface (GUI) or Shiny app to make the package more accessible to users who may not be proficient in R? This could significantly broaden the tool’s usability, particularly among biologists without extensive programming experience.
2. The manuscript presents examples only up to the Family level, raising concerns about how the package would handle more detailed taxonomic levels, such as Genus. Given that a single Phylum can contain numerous Genera, the proposed colour scheme might become overly complex and difficult to interpret at these lower levels. The manuscript does not address how the tool will manage the increased number of taxa or how it will prevent the visualization from becoming visually cluttered and messy. This is a critical issue that needs to be resolved to ensure that the tool remains effective at higher taxonomic resolutions.
3. Incorporating a measure of prevalence into the visualization would be a valuable addition to the StackbarExtended package, particularly since the package does not require users to filter out low-abundance taxa. Showing the prevalence of taxa across sample groups can provide important context for interpreting the data, especially when dealing with low-abundance taxa that might still play significant ecological or biological roles despite their low relative abundance.
4. If possible, consider implementing interactive plots where users can hover over a bar segment to see detailed information, including the prevalence of that taxon across the sample groups. This approach would make the visual representation more informative without overcrowding the static plot.
5. Also consider an interactive feature where users can click on the "Others" category in the stacked bar plot to reveal a separate table or plot detailing the low-abundance taxa would significantly enhance the usability and depth of analysis in StackbarExtended. This feature would allow users to explore the "hidden" diversity within their microbiota data without overcrowding the primary visualization.
6. Incorporating the ability for users to customize the threshold for filtering (default 1%) for including taxa in the visualization is essential, especially given the diversity of microbial environments that researchers may be working with. While the default 1% threshold may be suitable for many studies, environments characterized by low-abundance microbial communities could require a lower threshold to capture significant taxa that would otherwise be excluded.

Conclusion:
Overall, StackbarExtended is a good contribution to the field of microbiota research. The manuscript is well-structured, and the tool itself is innovative and highly applicable. Addressing the major comments will significantly strengthen the manuscript.

Is the rationale for developing the new software tool clearly explained?

Partly
Is the description of the software tool technically sound?

Partly
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Partly
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Partly
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Microbiome and Bioinformatics

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Respond or Comment

Views

12

Reviewer Report 14 Aug 2024

Zheng Sun, Harvard Medical School, Boston, Massachusetts, USA

Approved

https://doi.org/10.5256/f1000research.166324.r312942

Interesting tool! It provides a clear and effective visualization of microbiome data, particularly by using similar colors to represent genus and its corresponding higher family, which enhances human interpretability. To further improve the tool, I suggest the following: (1) Adapt ... Continue reading

Interesting tool! It provides a clear and effective visualization of microbiome data, particularly by using similar colors to represent genus and its corresponding higher family, which enhances human interpretability. To further improve the tool, I suggest the following: (1) Adapt the tool to be easily applied to WMS data by integrating a phylogenetic tree from the GTDB/RefSeq database; (2) Replace lfcSE with AMCOM to better address the compositionality issue; (3) Expand the color palette and include detailed guidance on its use in the help documentation.

Is the rationale for developing the new software tool clearly explained?

Yes
Is the description of the software tool technically sound?

Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Yes
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: microbiome methodology development

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Respond or Comment

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 09 Aug 2024

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 1 09 Aug 24	read	read

Zheng Sun, Harvard Medical School, Boston, USA
Monica Steffi Matchado, Ludwig-Maximilians-Universitat Munchen Medizinische Fakultat (Ringgold ID: 54187), Munich, Germany

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

Back to all reports

Reviewer Report

22 Views

29 Aug 2024 | for Version 1

Monica Steffi Matchado, Ludwig-Maximilians-Universitat Munchen Medizinische Fakultat (Ringgold ID: 54187), Munich, Bavaria, Germany

22 Views Cite this report Responses(0)

Approved With Reservations

Overview:
This manuscript describes a new R package, StackbarExtended, which combines phylogenetic information and statistical significance into a single graphical output. This will visually improve microbiota composition in better understanding. The technique solves conventional stacked bar plot shortcomings, most notably the separation of differential abundance analysis from visual depiction and the absence of evolutionary context. The manuscript is well-written, the methodology is sound, and the use case effectively demonstrates the package's capabilities.
Major comments:
1. The package appears to focus only to users who are familiar with R programming knowledge. There is no discussion on how the package might be made more accessible to a broader audience, such as through a user-friendly interface or integration with other platforms. This limits the potential impact and adoption of the tool in the wider research community. Are there any plans to develop a graphical user interface (GUI) or Shiny app to make the package more accessible to users who may not be proficient in R? This could significantly broaden the tool’s usability, particularly among biologists without extensive programming experience.
2. The manuscript presents examples only up to the Family level, raising concerns about how the package would handle more detailed taxonomic levels, such as Genus. Given that a single Phylum can contain numerous Genera, the proposed colour scheme might become overly complex and difficult to interpret at these lower levels. The manuscript does not address how the tool will manage the increased number of taxa or how it will prevent the visualization from becoming visually cluttered and messy. This is a critical issue that needs to be resolved to ensure that the tool remains effective at higher taxonomic resolutions.
3. Incorporating a measure of prevalence into the visualization would be a valuable addition to the StackbarExtended package, particularly since the package does not require users to filter out low-abundance taxa. Showing the prevalence of taxa across sample groups can provide important context for interpreting the data, especially when dealing with low-abundance taxa that might still play significant ecological or biological roles despite their low relative abundance.
4. If possible, consider implementing interactive plots where users can hover over a bar segment to see detailed information, including the prevalence of that taxon across the sample groups. This approach would make the visual representation more informative without overcrowding the static plot.
5. Also consider an interactive feature where users can click on the "Others" category in the stacked bar plot to reveal a separate table or plot detailing the low-abundance taxa would significantly enhance the usability and depth of analysis in StackbarExtended. This feature would allow users to explore the "hidden" diversity within their microbiota data without overcrowding the primary visualization.
6. Incorporating the ability for users to customize the threshold for filtering (default 1%) for including taxa in the visualization is essential, especially given the diversity of microbial environments that researchers may be working with. While the default 1% threshold may be suitable for many studies, environments characterized by low-abundance microbial communities could require a lower threshold to capture significant taxa that would otherwise be excluded.

Conclusion:
Overall, StackbarExtended is a good contribution to the field of microbiota research. The manuscript is well-structured, and the tool itself is innovative and highly applicable. Addressing the major comments will significantly strengthen the manuscript.

Is the rationale for developing the new software tool clearly explained?

Partly
Is the description of the software tool technically sound?

Partly
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Partly
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Partly
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Microbiome and Bioinformatics

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

12 Views

14 Aug 2024 | for Version 1

Zheng Sun, Harvard Medical School, Boston, Massachusetts, USA

12 Views Cite this report Responses(0)

Approved

Interesting tool! It provides a clear and effective visualization of microbiome data, particularly by using similar colors to represent genus and its corresponding higher family, which enhances human interpretability. To further improve the tool, I suggest the following: (1) Adapt the tool to be easily applied to WMS data by integrating a phylogenetic tree from the GTDB/RefSeq database; (2) Replace lfcSE with AMCOM to better address the compositionality issue; (3) Expand the color palette and include detailed guidance on its use in the help documentation.

Is the rationale for developing the new software tool clearly explained?

Yes
Is the description of the software tool technically sound?

Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Yes
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

microbiome methodology development

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

[1] Berg G, et al.: Microbiome definition re-visited: old concepts and new challenges. Microbiome. 2020; 8(1): 103. PubMed Abstract | Publisher Full Text | Free Full Text

[2] Bolyen E, et al.: Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 2019; 37(8): 852–857. PubMed Abstract | Publisher Full Text | Free Full Text

[3] Callahan BJ, et al.: DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods. 2016; 13(7): 581–583. PubMed Abstract | Publisher Full Text | Free Full Text

[4] Chong J, et al.: Using MicrobiomeAnalyst for comprehensive statistical, functional, and meta-analysis of microbiome data. Nat. Protoc. 2020; 15(3): 799–821. PubMed Abstract | Publisher Full Text

[5] Cuisiniere T: StackbarExtended. Zenodo. 2024a. Publisher Full Text

[6] Cuisiniere T: tackbarExtended - ARRIVE checklist. Zenodo. 2024b. Reference Source

[7] Cuisiniere T, et al.: Oral iron supplementation after antibiotic exposure induces a deleterious recovery of the gut microbiota. BMC Microbiol. 2021; 21(1): 259. PubMed Abstract | Publisher Full Text | Free Full Text

[8] DeSantis TZ, et al.: Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 2006; 72(7): 5069–5072. PubMed Abstract | Publisher Full Text | Free Full Text

[9] Fishbein SRS, Mahmud B, Dantas G: Antibiotic perturbations to the gut microbiome. Nat. Rev. Microbiol. 2023; 21(12): 772–788. PubMed Abstract | Publisher Full Text

[10] Hasan N, Yang H: Factors affecting the composition of the gut microbiota, and its modulation. PeerJ. 2019; 7: e7502. PubMed Abstract | Publisher Full Text | Free Full Text

[11] Hou K, et al.: Microbiota in health and diseases. Signal Transduct. Target. Ther. 2022; 7(1): 135. PubMed Abstract | Publisher Full Text | Free Full Text

[12] Jandhyala SM, et al.: Role of the normal gut microbiota. World J. Gastroenterol. 2015; 21(29): 8787–8803. PubMed Abstract | Publisher Full Text | Free Full Text

[13] Kennedy MS, Chang EB: The microbiome: Composition and locations. Prog. Mol. Biol. Transl. Sci. 2020; 176: 1–42. PubMed Abstract | Publisher Full Text | Free Full Text

[14] Liu C, et al.: microeco: an R package for data mining in microbial community ecology. FEMS Microbiol. Ecol. 2021; 97(2). PubMed Abstract | Publisher Full Text

[15] Love MI, Huber W, Anders S: Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15(12): 550. PubMed Abstract | Publisher Full Text | Free Full Text

[16] McMurdie PJ, Holmes S: phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One. 2013; 8(4): e61217. PubMed Abstract | Publisher Full Text | Free Full Text

[17] Navgire GS, et al.: Analysis and Interpretation of metagenomics data: an approach. Biol. Proced. Online. 2022; 24(1): 18. PubMed Abstract | Publisher Full Text | Free Full Text

[18] Neu AT, Allen EE, Roy K: Defining and quantifying the core microbiome: Challenges and prospects. Proc. Natl. Acad. Sci. USA. 2021; 118(51). PubMed Abstract | Publisher Full Text | Free Full Text

[19] Panek M, et al.: Methodology challenges in studying human gut microbiota - effects of collection, storage, DNA extraction and next generation sequencing technologies. Sci. Rep. 2018; 8(1): 5143. PubMed Abstract | Publisher Full Text | Free Full Text

[20] Peeters J, et al.: Exploring the Microbiome Analysis and Visualization Landscape. Front. Bioinform. 2021; 1: 774631. PubMed Abstract | Publisher Full Text | Free Full Text

[21] Quast C, et al.: The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013; 41(Database issue): D590–D596. PubMed Abstract | Publisher Full Text

[22] Satam H, et al.: Next-Generation Sequencing Technology: Current Trends and Advancements. Biology (Basel). 2023; 12(7). Publisher Full Text

[23] Schloss PD, et al.: Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl. Environ. Microbiol. 2009; 75(23): 7537–7541. PubMed Abstract | Publisher Full Text | Free Full Text

StackbarExtended: a user-friendly stacked bar-plot representation incorporating phylogenetic information and microbiota differential abundance analysis

Abstract

Background

Methods

Results

Conclusions

Keywords

Introduction

Methods

Operation

Implementation

Use cases

Figure 1. StackbarExtended graphical output.

Table 1. Subset of the ASV table used.

Table 2. Subset of the taxonomy table used.

Table 3. Subset of the metadata table used.

Table 4. Subset of data frame object output of the StackBarExtended R package.

Discussion

Software availability

Ethics and consent

Data availability

Underlying data

Reporting guidelines

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated