Inferring and simulating a gene regulatory network for the sympathoadrenal differentiation from single-cell transcriptomics in human.

Olivier Gandrillon

doi:10.12688/f1000research.164530.1

Home Browse Inferring and simulating a gene regulatory network for the sympathoadrenal...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Inferring and simulating a gene regulatory network for the sympathoadrenal differentiation from single-cell transcriptomics in human.

[version 1; peer review: 1 approved with reservations]

Olivier Gandrillon ^1,2

PUBLISHED 11 Sep 2025

Author details Author details

¹ Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratoire de Biologie et de Modélisation de la Cellule, 46 allée d’Italie Site Jacques Monod, Lyon, France
² Inria, Villeurbanne, France

Olivier Gandrillon
Roles: Conceptualization, Formal Analysis, Software, Writing – Original Draft Preparation, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Bioinformatics gateway.

Abstract

Background

Neuroblastoma is a malignant childhood cancer with significant inter- and intrapatient heterogeneity arising from the abnormal differentiation of neural crest cells into sympathetic neurons. The lack of actionable mutations limits therapeutic options, highlighting the need to better understand the molecular mechanisms that drive this differentiation. Although RNA velocity has provided some insights, modeling regulatory relationships is limited.

Methods

To address this, we applied our integrated gene regulatory network (GRNs) inference (CARDAMOM) and simulation (HARISSA) tools using a published single-cell RNAseq dataset from human sympathoadrenal differentiation.

Results

Our analysis identified a 97-gene GRN that drives the transition from Schwann cell precursors to chromaffin cells and sympathoblasts, highlighting dynamic interactions such as self-reinforcing loops and toggle switches. The simulation of that GRN was able to reproduce very satisfactorily the experimentally observed gene expression distributions.

Conclusions

Altogether, these findings demonstrate the utility of our GRN model framework for inferring GRN structure, even in the absence of a time-resolved dataset.

Keywords

Single cell / neuroblastomas / sympathoblast differentiation / GRN /

Corresponding author: Olivier Gandrillon

Competing interests: No competing interests were disclosed.

Grant information: This work was supported by the Ligue Contre le Cancer (comités du Rhône et de l’Ardèche).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2025 Gandrillon O. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Gandrillon O. Inferring and simulating a gene regulatory network for the sympathoadrenal differentiation from single-cell transcriptomics in human. [version 1; peer review: 1 approved with reservations]. F1000Research 2025, 14:910 (https://doi.org/10.12688/f1000research.164530.1) First published: 11 Sep 2025, 14:910 (https://doi.org/10.12688/f1000research.164530.1) Latest published: 11 Sep 2025, 14:910 (https://doi.org/10.12688/f1000research.164530.1)

Introduction

Neuroblastoma is a childhood cancer responsible for approximately 15 % of cancer-related deaths in children aged 0 to 4 years. These tumors typically develop near the adrenal glands and are characterized by significant inter- and intrapatient heterogeneity.¹ Treatments are still limited, with few actionable mutations, and very few drugs have been clinically validated,² although CAR-T cells targeting the GD2 antigen represent a promising prospect.³ Neuroblastoma is an archetypal example of a tumor arising through alterations in the differentiation process, more precisely during the differentiation of migratory neural crest cells into sympathetic neurons.^4–7 Therefore, there is an urgent need for a better understanding of the mechanisms driving normal development from neural crest and mesodermal lineages,⁸ to better understand the origin of different neuroblastoma subtypes, their internal heterogeneity, and interpatient variation in malignancy.

Initial studies were performed in mice.⁹ Although mouse models provide in-depth, experimentally supported insights into developmental processes, inconsistencies between mouse and human development can hinder our understanding of certain diseases. For example, transitions between cell fates during the differentiation of migratory neural crest cells into sympathetic neurons appear to occur in a different order in humans versus mice.¹⁰ Furthermore, neuroblastomas do not appear naturally in mice, and current genetic models do not recapitulate the full spectrum of natural properties of the disease.¹¹

Therefore, it is critical to identify the human-specific aspects of development, which has led to a recent surge in studies of normal development at the single-cell level in humans.^10,12,13 To go beyond a static description of the differentiation process, the authors of the aforementioned papers have resorted to the use of RNA velocity analysis to obtain a more dynamic view.^10,13 RNA velocity is driven by a kinetic model of RNA dynamics and works by distinguishing between unspliced and spliced mRNA counts and fitting gene-specific parameters based on an on-off deterministic transcription model. This allows the RNA velocity to predict the direction of future changes in spliced mRNA levels.^14–16 However, although RNA velocity uses a time-dependent gene expression model, time has no explicit biological interpretation.¹⁷ Furthermore, each gene is modelled independently, and regulatory relationships are ignored.¹⁵

We recently described a model for Gene Regulatory Networks (GRNs) that alleviates most of these drawbacks and provides detailed access to a fully mechanistic view of gene regulation during a differentiation process.^18,19 In this model, each gene is modelled as a piecewise deterministic Markov process (PDMP), describing bursts in the production of mRNA and the resulting synthesis of proteins.^20,21 Genes are coupled via an interaction function that describes how the proteomic field feeds back into burst frequency. The coupling of a GRN inference algorithm (CARDAMOM) with a simulation algorithm (HARISSA) allowed us to demonstrate the potential of this approach for inferring executable GRN models that can reproduce the observed experimental data.

In the present work, we describe the use of our integrated approach to infer the GRN driving sympathoadrenal differentiation in humans based on single-cell RNA-seq data.¹⁰ Our analysis identified a 97 genes-based GRN involved in the transition from Schwann cell precursors to chromaffin cells and sympathoblasts. Their study revealed dynamic gene interactions such as self-reinforcing loops and toggle switches.

Material and methods

Data preparation

The raw data is available at GEO GSE14782.

The processed adrenal.human.seurat.scrublet.rds dataset used in this study can be found at http://pklab.med.harvard.edu/artem/adrenal/data/Seurat/.

This represents single-cell transcriptomic analysis using 10x chromium on isolated individual cells from dissected adrenal glands with surrounding tissue from human embryos at 6, 8, 9, 11, 12, and 14. The initial matrix displays the expression of 25787 genes across 72571 cells. Only cells harboring a cell doublet probability (Scrublet score) <0.2 were kept.

The cell types were annotated by the authors based upon gene expression pattern.

To elucidate the sympathoadrenal cell fate transitions in humans at a higher resolution, we focused on the 3,901 cells of SCP, chromaffin, and sympathetic fates, omitting annotated cell cycle genes. As advocated by the authors, we removed a small potentially contaminating population of cells expressing the STAR gene. This resulted in a final matrix displaying the expression level of 20807 genes in 3759 cells at six developmental time points. We verified that the UMAP generated from this dataset was in perfect agreement with Figure 2a in the original paper (see Figure 1a). UMAP was computed using the Uwot package.²² All the UMAPs were embedded within the experimental UMAP using a combination of the ret_model and umap_transform functions of the uwot package.

Figure 1. UMAP representation of the dataset.

a and b: the initial 3759 cells x 20807 genes matrix. c: the final 1800 cell x 97 genes dataset after time sampling (see Figure 2). In a and c cells are color-coded according to their cell fate. In b, cells are color-coded according to their sampling time.

Stemness score

The stemness score was computed per cell as signature_SCP/signature_SCP + signature_sympatoblast + signature_chromaffin, by summing up the gene expression level for the following signature genes:

- SCPgenes: PLP1, FOXD3, FABP7, S100B, NGFR, ERBB3, MBP, MPZ, COL2A1, POSTN, MOXD1, and GAS7
- Sympatoblast genes: STMN2, HAND2, ELAVL4, STMN4, ISL1, PRPH, ELAVL2, and HMX1.
- chromaffin_genes: CHGA, CHGB, INSM1, PENK, PNMT and SLC35D3.

GRN inference and representation

First, we cloned the original CARDAMOM git repository at https://github.com/ogandril/cardamom. All computations were performed at the IN2P3 computing center (https://cc.in2p3.fr). The scripts used for launching CARDAMOM on a distant server are available at https://github.com/ogandril/Launching_Cardamom. All interaction values were scaled by a factor of 10 and thresholded at a value of four to maintain only the most relevant interactions.

The resulting inter.csv interaction matrix was opened with R²³ and plotted using the qgraph package.²⁴ A list of transcription factors was obtained from https://github.com/saezlab/CollecTRI.²⁵ Violin plots were drawn using the ggbeeswarm package.

GRN simulation

The simulation was performed using the CARDAMOM build-in version of HARISSA, a software for performing GRN simulations based on an underlying stochastic dynamical model driven by the transcriptional bursting phenomenon.²⁶

The dynamics of each gene is given by the following ordinary differential equations:

{\overset{`}{M}}_{i} = s_{0, i} E_{i} - d_{0, i} M_{i}

{\overset{`}{P}}_{i} = s_{1, i} M_{i} - d_{1, i} P_{i}

where M_i is the number of mRNA molecules for gene i, P_i is the number of proteins for gene I, E_i is the state of the promoter, s_0,i is the mRNA synthesis rate, d_0,i is the mRNA degradation rate, s_1,i is the protein synthesis rate, and d_1,i is the protein degradation rate.

E_i can randomly switch from 0 to 1 (activation) with a protein-dependent rate k_on,i (P) and from 1 to 0 (inactivation) with a constant rate k_off,i.

We define the protein-dependent rate function k_on,i as in Ref. 26.

k_{on, i} (P) = \frac{k_{min, i} + k_{max, i} exp (β_{i} + \sum_{j} θ_{j, i} P_{j})}{1 + exp (β_{i} + \sum_{j} θ_{j, i} P_{j})}

such that the promoter activation rate is between k_min,i and k_max,i. The parameter β_i represents the basal activity of gene i, and the parameter θ_j;i encodes the value of the j → i interaction (see below for its dertemination).

Because we simulated a mechanistic model, one needs to provide HARISSA with half-lives for both mRNAs (d_0,i) and proteins (d_1,i). Experimentally determined values for mRNAs and proteins were extracted from²⁷ and,²⁸ respectively. Genes that were not found in these datasets were attributed to the mean value of experimentally determined half-lives. Degradation rates were obtained using the following formula: degradation rate = ln(2)/half-life. The half-lives of proteins were limited by a parameter called the cell cycle, which was set to 20h.

Color code for a rapid estimation of the fit quality was as follows: given all Kantorovich distances for all genes at all times, we considered that 40 % of the values should be considered as correct (after a careful examination of the marginals) and therefore colored in green. The remaining 60% was split into two, the first half colored in orange, and the last half colored red.

Dynamical GRN representation

In CARDAMOM,we compute at each time point t_i with i ≥ 1 the value of each interaction, using

θ_{i} ≔ \underset{θ}{argmin {} \sum_{c = 1}^{N_{i}} ∥ k_{on}^{θ} (P_{c}) - k_{P_{c}} ∥^{2}} + λ ∥ θ - θ_{i - 1} ∥^{2}

where Ni is the number of cells at time t_i, P_c the amount of protein in cell c,

k_{P_{c}}

the frequency mode and

θ_{0} = 0

.

So we do obtain at each time t_i a variation in the network

∆_{i} ≔ | θ_{i} - θ_{i - 1} |,

which quantifies how much each interaction estimate has been impacted by the cells measures at that time compared to all previous times. We can therefore, for any interaction between genes k and l, computes the times for which the largest impact was observed using

t^{k, l} ≔ \underset{i = 1, \dots, T}{argmax} ∆_{i}^{k, l} .

T being the latest time point available.

The dynamical interactions $Θ_{i}^{k, l}$ can therefore be defined as:

Θ_{i}^{k, l} = θ_{T}^{k, l} 1_{t^{k, l} = i} .

1 being the indicator function.

Results

Organizing the cells

Our first task was to organize the cells according to their differentiation status, as our GRN inference is anchored in the time-dependent evolution of the differentiation process. Cells were collected from human embryos at 6, 8, 9, 11, 12, and 14 weeks post-conception. First, we assessed whether this cell collection time could be a relevant framework for generating time-reconstructed data. As shown in Figure 1b, this was not the case, since all cell types were present at all collection times. This is in line with the fact that in the human adrenal gland, the transition from Schwann cell precursor (SCP) to sympathoadrenal states persists for several weeks,¹⁰ which leads to the expectation that cells at all stages of differentiation should be present at all collection times.

Therefore, we were faced with a different situation from the one we used to benchmark CARDAMOM, where the cell sampling time was in accordance with the actual process time.¹⁸

Consequently, we organized the cells according to their stemness scores ( Figure 2a).

Figure 2. Cell ordering by their stemness score.

a: all cells are displayed. b: Focus on the first 1300 cells, where the “process time” are indicated. The process time is defined along six time point with sliding windows of 300 cells overlapping by 100 cells.

For this, we assumed a differentiation sequence starting with the SCP, giving rise to chromaffin cells and sympathoblasts through a bifurcation. Such a scheme is in line with the conclusions of the original paper from which we obtained dataset¹⁰ and is shared by most^6,12,13,29 but not all^4,30 authors working on sympathoadrenal differentiation in humans. This led to the definition of the stemness score (see Materials and Methods) as the ratio of SCP-signature gene expression, evolving from 1 (only the SCP signature genes are expressed) to zero (none of the SCP signature genes are expressed). This allowed us to define the “process time” ( Figure 2b) using an overlapping sliding window of 300 cells. This was defined only on the first 1300 cells since the stemness score was almost down to zero at the end of that sequence, and because we wanted to capture in detail the abrupt decrease in the stemness score. The use of an overlapping window is justified by the fact that differentiation should be considered a discontinuous nonlinear process (see e.g., Ref. 31), and therefore should allow cells to go back and forth in the gene expression space. This resulted in a 20807 genes x 1800 cells matrix.

Gene selection

We then explored the most relevant gene set to be used for the GRN inference. Because our inference and simulation scheme CARDAMOM is fundamentally rooted in exploiting the power of distributions,^18,32 we first computed a Kantorovitch distance³³ for all genes for all delta-times. Similarly, since most delta-entropic genes were shown to represent the most relevant genes for the differentiation process,³⁴ we also computed delta-entropy for all genes for all delta-times.

The combination of these two lists ( Figure 3) provided us with a list of 97 genes ( Table 1) that were used for inferring the GRN.

Figure 3. Gene selection procedure.

We first computed a Kantorovitch distance for all genes between 2 consecutive time points. We then took the union of the 133 genes harboring the largest distance for each delta-time, which left us with a list of 200 genes (some genes might be among the most distant between two or more time points), labelled MaxKD. Very similarly, we computed an entropy value for all genes between 2 consecutive time points. We then took the union of the 50 genes harboring the largest entropy for each delta-time, which left us with a list of 376 genes labelled MostDE. The union of those two lists gave us a list of 97 genes ( Table 1) that were used for inferring the GRN.

Table 1. The 97 genes that were used for inferring the GRN.

ACTG1	DBH	IGFBP2	PPIA	RTN1
ALDH1A1	DLK1	ITM2B	PRPH	S100A10
ANXA2	EDNRB	LGALS1	PTMA	S100A6
APOE	EEF1A2	MAP 1B	PTMS	S100B
ATP5F1E	EEF2	MDK	PTN	SCG2
B2M	EIF4A2	MEG3	PTPRZ1	SPARC
BASP1	FAU	MEG8	QKI	STMN2
BEX1	FOS	METRN	RAMP1	TH
BTF3	GAL	MPZ	RGS4	TIMP3
C4orf48	GAP43	MT-CYB	RGS5	TMSB10
CARTPT	GAPDH	NEFL	RPL10	TPM1
CD24	H2AFZ	NPY	RPL12	UBC
CHGA	HAND2	NRXN1	RPL30	UCHL1
CHGB	HAND2-AS1	OLFML2A	RPL32	VCAN
CNTN1	HINT1	PCSK1N	RPL34	VIM
COL18A1	HMGB2	PENK	RPL41	YBX1
COL1A1	HMGN2	PHOX2A	RPS13	ZEB2
COL1A2	HNRNPA1	PLP1	RPS2
COL5A2	HTATSF1	PNMT	RPS29
CRYAB	IFITM3	POSTN	RPS4X

This left us with a matrix displaying the expression levels of 97 genes in 1800 cells distributed over six time points. We verified that the information contained within this matrix was sufficient to correctly capture the differentiation process of interest, as assessed by UMAP representation ( Figure 1c). It is important to note that these genes were chosen purely based on their expression patterns, irrespective of their putative biological functions (see Discussion).

GRN inference and analysis

We then applied the CARDAMOM algorithm to infer the GRN structure ( Figure 4a), resulting in a relatively densely connected network.

Figure 4. Representation of the inferred GRN structure.

a: the full inferred GRN. Activations are displayed in green, inhibitions are displayed in red. The strength of the interaction is reflected by the arrow thickness. On the right is shown the color-code that has been applied to the GRN nodes. The stimulus is highlighted in yellow. b: violin plot representing the number of outgoing or incoming interactions per gene. c: the actual values of the outgoing interactions are shown for the four genes with the highest number of interactions.

First, we explored the network connectivity. We observed a striking difference between the distribution of the numbers of outgoing and incoming nodes ( Figure 4b). The incoming nodes were distributed according to a normal (i.e., Gaussian) distribution centered around a mean of three, whereas the outgoing nodes displayed a very heavy long tail toward higher values, as well as a large number of null values (i.e., leaves, that is, genes only receiving and not sending any signals within the GRN). The long tail was mostly due to 4 genes (CHGA, CHGB, STMN2, and HMGB2). The intensity of all outgoing interactions for these four genes is shown in Figure 4c. HMGB2 displayed a specific pattern with quasi-exclusively inhibitory interactions, whereas the other three genes displayed a relatively similar number of positive and negative interactions. It should be noted that all but HMGB2 are part of the defining gene signatures (see Materials and Methods).

One key feature of our inference process is the possibility of decomposing the overall GRN into dynamical subparts, where each edge appears at the time point transition for which it was detected with the strongest intensity by the inference algorithm ( Figure 5;¹⁸ see Materials and Methods for a formal definition).

Figure 5. Representation of the time-dependent evolution of the inferred GRN structure.

a: t1 → t2; b: t2 → t3; c: t3 → t4; d: t4 → t5; e: t5 → t6. For the time definition, see Figure 2. For the color-code applied to the GRN nodes, see Figure 4a. For the definition of the dynamical interactions displayed, see Material and methods.

The dynamics start with a small number of genes being turned on by the stimulus ( Figure 5a). In our previous studies, the stimulus was explicitly identified as an exogenous medium change/addition.^18,35 Here, it should be understood as a complex influence exerted by the cell environment, whether hormonal or through cell-cell interactions, which induces SCP cell differentiation. It is needed to “set the GRN in motion” and push the cells out of their quasi-steady state.

There was a much larger number of interactions important for the next time point ( Figure 5b), including a very large set of genes being repressed by HMGB2. All known SCP-specific genes were repressed at that point, which was expected. A CHGA-CHGB self-reinforcing loop activates genes from the chromaffin lineage, while a few sympathoblast-specific genes (STMN2, HAND2) are activated.

At the next time point, an RTN1-PENK toggle switch was apparently supported by positive loops on both sides ( Figure 5c). We can see a lot of genes being repressed by CHGA and CHGB genes. This includes the repression of MPZ-and PLP1 SCP-specific genes by CGHB, a chromaffin-specific gene.

The two latest time points showed a very small number of interactions ( Figure 5d and 5e).

It is clear that the two time points where there is a large number of interactions correspond to the times at which one observes a steep variation in the stemness score ( Figure 2). Altogether, this time decomposition offers a clear illustration of a signal propagating through the GRN in the form of waves of gene activation and repression.³⁵

Finally, we extracted from the full GRN a subset of the genes connected by specific dynamical motifs, that is, either cross-positive self-reinforcing loops or mutually repressive toggle switches ( Figure 6). Among these, only one direct toggle switch can be observed, the one linking PENK to RTN1 genes. All other motifs were positive. One can see a group of genes (COL5A2, PTPRZ1, EDRNB, POSTN), including one SCP-specific gene (POSTN) which reinforce together and are repressed concomitantly by chromaffin specific genes (CHGA and CHGB) and a sympathoblast-specific gene (STMN2).

Figure 6. Representation of a subset of the GRN showing genes involved in specific dynamical motifs.

GRN simulation

One of the main advantages of our approach is that it produces executable GRN models,³⁶ the time-dependent evolution of which can be simulated, and the resulting simulated dataset can be compared with the experimentally observed dataset.

There are many ways in which the results of a simulation can be assessed. Using a UMAP representation allows for quick evaluation of the quality of the fit. This was first used to calibrate the process times t1–t5 in real time ( Figure 7).

Figure 7. Adjusting the process time to real time values.

In a, the UMAP is computed on the experimental dataset and color coded according to the process times (see Figure 2). b and c: the UMAP is computed on the simulated dataset and color coded according to the model time, in hours.

We first tested an evenly spaced time frame, where each time point was set 4 h apart ( Figure 7b). All the cells were instantly projected from their t1 position to the final position of the process. Therefore, we attempted different adjustments and obtained the best visual fit using a time sequence extending up to 80 h ( Figure 7c), which was used for further simulations.

We then assessed the extent to which the simulation captured the time-dependent evolution of a few genes whose expression was characteristic of the three cell types we were trying to reproduce ( Figure 8).

Figure 8. UMAP representation of the dataset used for GRN inference (a-f ) and of the dataset obtained through GRN simulation (g-m).

The cells are colored coded according to: (i) time (a and g); (ii) ZEB2 expression level (b and h); (iii) CHGA expression level (c and i); (iv): CHGB expression level (d and k); (v): STMN2 expression level (e and l) and (vi): HMGB2 expression level (f and m). All gene expression are log-transformed (LGE= ln(x+1)).

Overall, the gene expression pattern, when observed through its UMAP projection, was quite well reproduced by our GRN simulation. Most notably, chromaffin/sympathoblast bifurcation was quite well captured by our model.

Another way to assess the quality of the fit is to investigate the distance between the distribution of gene expression values of the experimentally observed mRNA distribution and the simulated distribution. As shown in Figure 9, there was a close proximity between the two distributions, as assessed by a low Kantorovich distance for the four genes displayed. Altogether, although some refinement could be performed (see discussion), we considered the fit quality of the experimental dataset to be quite satisfactory.

Figure 9. Fit quality for each gene at each time point as assessed by the Kantorovich distance (Kanto.dist) between the experimentally observed mRNA distribution and the simulated distribution.

The marginals computed by the model are shown by bold-delimited bars, the experimenatlly-observed margnials are shown as plain bars in green.

We finally analyzed the run-to-run variability of the model, given the intrinsically stochastic behavior of our gene expression model (see “GRN simulation” upper). We therefore performed 10 runs of the GRN without specifying the seed, and analyzed the result by taking advantage of the clear separation of the three cell types in the 2D UMAP space ( Figure 1c). This allowed us to count the number of cells belonging to each cell type following the simulation ( Figure 10). We observed a very reproducible behavior of our model with only marginal differences between the number of cells for each cell type between different simulations.

Figure 10. The UMAP space was divided in three regions, representing SCP, sympathoblasts and chromaffin cells (see Figure 1c).

Shown the mean +/- SD of the percent of cells belonging to each cell category, as assessed by their 2D UMAP position, in 10 independent runs of the model.

Discussion

One of the difficulties we had to solve in the present work is that the sampling time is not equal to the process time because all differentiation steps appear at all collected time points. As previously mentioned, “our approach can be applied to any biological process in which time-stamped single-cell transcriptomic data are obtained after applying a given stimulus. When such time-stamped snapshots are not available [which is the case here], the algorithm could, in principle, take as input time-reconstructed data (i.e., artificially ordered snapshots). In this case, the quality of the inference strictly depends on the effectiveness of the time reconstruction algorithm.”¹⁸

Therefore, we assumed a differentiation sequence starting with the Schwann cell precursor (SCP), giving rise to chromaffin cells and sympathoblasts through a bifurcation. This allowed the organization of cells along a stemness score aimed at capturing the unfolding of the differentiation process. This differentiation sequence was demonstrated in the study in which we obtained our dataset.¹⁰ Such a sequence is shared by some authors^13,12,6 but not all.^4,30 In the last case, the authors assumed a differentiation sequence starting with sympathoblasts and giving rise to chromaffin cells and SCP through a bifurcation. The reason for such discrepancies is unclear and sheds a crude light on the complexity of ascertaining a differentiation sequence in vivo in humans. One must stress that our method relies on a given definition of the differentiation sequence and cannot be used to untangle different hypotheses.

Assuming this order in the differentiation sequence, we proposed a framework through which cells could be organized, genes could be selected, and a 97 genes-based GRN could be inferred.

When compared to the GRNs previously inferred on different differentiation sequences,^35,18 the structure of the adrenal differentiation GRN displays both similarities and differences. Among these differences, one should note that the depth of the GRN (i.e., the largest number of nodes connected in a row) is slightly larger⁶ than that for the erythroid differentiation sequence (depth of 3) or for the RA-induced ES cell differentiation (depth of 4). This is not imposed by our inference algorithm and can potentially infer a GRN of any length. In any case, this depth is limited by the duration of the differentiation sequence, which is in the same range (80 h) as that for the erythroid (72 h) or ES (96 h) differentiation sequences. If the network depth is too large, the signal is too damped and delayed to reproduce the experimental data accurately. Another difference was the role played by the stimulus; sympathoblast differentiation displayed a relatively modest effect (four genes affected directly by the stimulus), whereas it had a much more pronounced effect on ES (14 genes affected) and erythroid (29 genes affected) differentiation sequences.

A very clear similarity in all GRN behaviors is that there is an obvious wave-like pattern in the way the signal propagates through the network. In the case of sympathoblast differentiation, the more dense interaction times are seen at the t2-t3 and t3-t4 transitions points ( Figure 5), in line with the very strong and abrupt change in stemness observed at those time points ( Figure 2b).

The analysis of the subset of genes connected by specific dynamical motifs showed the existence of a group of genes, including two signature SCP-specific genes (POSTN and S100B) that reinforce together and are repressed by chromaffin-specific genes and a sympathoblast-specific gene. This is reminiscent of the “team” concept: a team is a group of genes that activates member genes belonging to the same team while inhibiting genes of other teams directly and/or indirectly. It has been shown to be a network design principle that can drive cell fate canalization in diverse decision-making processes³⁷ in a robust manner.³⁸ In our case, this could have been involved in the maintenance of an undifferentiated SCP phenotype, which would have to be turned down to allow differentiation to proceed.

This study aimed to infer and simulate a GRN operating during normal sympathoadrenal differentiation in humans. The main incentive behind the study of this specific differentiation sequence is to provide insights from human developmental studies relevant to cases of neural crest-related pathologies and cancers. Therefore, our work is potentially an appropriate step in the understanding the developmental origin of neuroblastoma. It has been suggested that some immature sympathetic cells or their immediate progenitors may be of tumor origin.⁵ Nevertheless, the current ambiguity regarding what should be considered in the actual differentiation sequence^4,6,12,13,30 complicates this debate. It is clear that the generation of neuroblastoma cells cannot be resumed as a simple differentiation blockade in an immature SCP state.

However, there are clear limitations to this approach. Although we describe an informed gene selection procedure, it is dependent upon the use of a given arbitrary threshold for the number of genes selected. The use of a double selection procedure (using both Kantorovich distances and entropy) might alleviate this effect, but we cannot rule out that selecting more genes would have led to the incorporation of biologically relevant genes. We would also like to emphasize that contrary to most, if not all, GRN inference algorithms our gene selection procedure is completely blind to the function of the selected genes and relies purely on the time-dependent evolution of their mRNA distributions. This implies that many genes selected are not transcription factors, a very clear difference from most GRN inference, where regulatory proteins within a GRN are restricted to transcription factors (TF), as in Refs. 39–41. Possible indirect interactions were completely ignored. A trivial example is the gene encoding a protein that induces the nuclear translocation of a constitutive TF. In this case, the regulator gene indirectly regulates TF target genes, and its effect is crucial for understanding GRN behavior. In our case, assuming a time-scale separation, we reasoned that only the genes that displayed a change in their expression pattern (i.e., their mRNA distribution at the single-cell level) within the time frame of the experimental sampling could be captured and used for GRN inference, based on single-cell transcriptomics. This means that most of the interactions that our GRN display are probably indirect and are molecularly mediated by changes that act at scales that are not captured by the sampling process (e.g., very fast protein phosphorylation). Therefore, edges in our GRN should be interpreted with caution, and elucidating their molecular nature would require dedicated efforts using different techniques and sampling strategies.

One of the main difficulties in correctly reproducing experimental data using a mechanistic model is that critical values regarding the dynamical behavior of the model, that is both mRNAs and proteins half-life, have to be feeded to the model, based either on literature or through a rough estimate. Therefore an algorithm that would infer the half-life values while inferring the GRN structure would represent an invaluable step forward compared to the actual CARDAMOM algorithm.

In summary, we have proposed an approach for inferring and simulating GRN based on single-cell transcriptomic data generated to sample an in vivo differentiation process. We showed that the absence of a time-stamped process in a dataset can be overcome by ad hoc measures. There is little doubt that, given the flood of such a dataset in the literature, the use of our approach could lead to a better understanding of the mechanisms operating during cell decision-making.

Ethics approval and consent to participate

Not applicable

Software availability

Source code available from: https://github.com/ogandril/cardamom and https://github.com/ogandril/Launching_ Cardamom

Archived software available from: https://doi.org/10.5281/zenodo.15389612 and https://doi.org/10.5281/zenodo.15389635

License: BSD 3-Clause License.

Acknowledgements

I am grateful to Elias Ventre, Ulysse Herbach, and Matteo Bouvier for their continuous support and help in setting up and improving the CARDAMOM/HARISSA framework.

I thank Elias Ventre and Sandrine Gonin-Giraud for their critical reading of the manuscript.

I thank the computational center of IN2P3 (Villeurbanne/France), where the computations were performed. I thank the BioSyL Federation and LabEx Ecofect (ANR-11-LABX-0048) of the University of Lyon for inspiring scientific events.

This work was not financed by the Association Pour la Recherche Contre le Cancer (ARC).

A preliminary version of this work is available at: https://www.biorxiv.org/content/10.1101/2025.03.21.644507v2.

References

1. Gomez RL, Ibragimova S, Ramachandran R, et al.: Tumoral heterogeneity in neuroblastoma. Biochim. Biophys. Acta Rev. Cancer. 2022; 1877(6): 188805. Publisher Full Text
2. Almstedt E, Elgendy R, Hekmati N, et al.: Integrative discovery of treatments for high-risk neuroblastoma. Nat. Commun. 2020; 11(1): 71. PubMed Abstract | Publisher Full Text | Free Full Text
3. Li CH, Sharma S, Heczey AA, et al.: Long-term outcomes of GD2-directed CAR-T cell therapy in patients with neuroblastoma. Nat. Med. 2025; 31: 1125–1129. Publisher Full Text
4. Zeineldin M, Patel AG, Dyer MA: Neuroblastoma: When differentiation goes awry. Neuron. 2022; 110(18): 2916–2928. PubMed Abstract | Publisher Full Text | Free Full Text
5. Ratner N, Brodeur GM, Dale RC, et al.: The “neuro” of neuroblastoma: Neuroblastoma as a neurodevelopmental disorder. Ann. Neurol. 2016; 80(1): 13–23. PubMed Abstract | Publisher Full Text | Free Full Text
6. Ponzoni M, Bachetti T, Corrias MV, et al.: Recent advances in the developmental origin of neuroblastoma: an overview. J. Exp. Clin. Cancer Res. 2022; 41(1): 92. PubMed Abstract | Publisher Full Text | Free Full Text
7. Patel AG, Ashenberg O, Collins NB, et al.: Dyer MA: A spatial cell atlas of neuroblastoma reveals developmental, epigenetic and spatial axis of tumor heterogeneity. bioRxiv. 2024. 2024.2001.2007.574538.
8. Del Valle I, Buonocore F, Duncan AJ, et al.: A genomic atlas of human adrenal and gonad development. Wellcome Open Res. 2017; 2: 25. PubMed Abstract | Publisher Full Text | Free Full Text
9. Furlan A, Dyachuk V, Kastriti ME, et al.: Adameyko I: Multipotent peripheral glial cells generate neuroendocrine cells of the adrenal medulla. Science. 2017; 357(6346). PubMed Abstract | Publisher Full Text | Free Full Text
10. Kameneva P, Artemov AV, Kastriti ME, et al.: Single-cell transcriptomics of human embryos identifies multiple sympathoblast lineages with potential implications for neuroblastoma origin. Nat. Genet. 2021; 53(5): 694–706. PubMed Abstract | Publisher Full Text | Free Full Text
11. Rasmuson A, Segerstrom L, Nethander M, et al.: Tumor development, growth characteristics and spectrum of genetic aberrations in the TH-MYCN mouse model of neuroblastoma. PLoS One. 2012; 7(12): e51297. PubMed Abstract | Publisher Full Text | Free Full Text
12. Kildisiute G, Kholosy WM, Young MD, et al.: Tumor to normal single-cell mRNA comparisons reveal a pan-neuroblastoma cancer cell. Sci. Adv. 2021; 7(6). PubMed Abstract | Publisher Full Text | Free Full Text
13. Jansky S, Sharma AK, Korber V, et al.: Single-cell transcriptomic analyses provide insights into the developmental origins of neuroblastoma. Nat. Genet. 2021; 53(5): 683–693. PubMed Abstract | Publisher Full Text
14. La Manno G, Soldatov R, Zeisel A, et al.: RNA velocity of single cells. Nature. 2018; 560(7719): 494–498. PubMed Abstract | Publisher Full Text | Free Full Text
15. Bergen V, Lange M, Peidli S, et al.: Generalizing RNA velocity to transient cell states through dynamical modeling. bioRxiv. 2019; 820936.
16. Gorin G, Fang M, Chari T: L P: RNA velocity unraveled. PLoS Comput. Biol. 2022; 18(9): e1010492. PubMed Abstract | Publisher Full Text | Free Full Text
17. Fang M, Gorin G, Pachter L: Trajectory inference from single-cell genomics data with a process time model. PLoS Comput. Biol. 2025; 21(1): e1012752. PubMed Abstract | Publisher Full Text | Free Full Text
18. Ventre E, Herbach U, Espinasse T, et al.: One model fits all: Combining inference and simulation of gene regulatory networks. PLoS Comput. Biol. 2023; 19(3): e1010962. PubMed Abstract | Publisher Full Text | Free Full Text
19. Ventre E: Reverse engineering of a mechanistic model of gene expression using metastability and temporal dynamics. In Silico Biol. 2020; 14: 89–113. Publisher Full Text
20. Herbach U, Bonnaffoux A, Espinasse T, et al.: Inferring gene regulatory networks from single-cell data: a mechanistic approach. BMC Syst. Biol. 2017; 11(1): 105. PubMed Abstract | Publisher Full Text | Free Full Text
21. Ventre E, Espinasse T, Bréhier C-E, et al.: Reduction of a stochastic model of gene expression: Lagrangian dynamics gives access to basins of attraction as cell types and metastabilty. J. Math. Biol. 2021; 83(5): 59. PubMed Abstract | Publisher Full Text
22. Melville J: uwot: The Uniform Manifold Approximation and Projection (UMAP) Method for Dimensionality Reduction.2024.
23. R: A language and environment for statistical computing. http
24. Epskamp S, Cramer AOJ, Waldorp LJ, et al.: qgraph: Network Visualizations of Relationships in Psychometric Data. J. Stat. Softw. 2012; 48: 1–18. Publisher Full Text
25. Muller-Dott S, Tsirvouli E, Vazquez M, et al.: Expanding the coverage of regulons from high-confidence prior knowledge for accurate estimation of transcription factor activities. Nucleic Acids Res. 2023; 51(20): 10934–10949. PubMed Abstract | Publisher Full Text | Free Full Text
26. Herbach U: Harissa: Stochastic Simulation and Inference of Gene Regulatory Networks Based on Transcriptional Bursting. Computational Methods in Systems Biology: 2023//2023. Cham: Springer Nature Switzerland; pp. 97–105.
27. Blumberg A, Zhao Y, Huang YF, et al.: Characterizing RNA stability genome-wide through combined analysis of PRO-seq and RNA-seq data. BMC Biol. 2021; 19(1): 30. PubMed Abstract | Publisher Full Text | Free Full Text
28. Li J, Cai Z, Vaites LP, et al.: Proteome-wide mapping of short-lived proteins in human cells. Mol. Cell. 2021; 81(22): 4722–4735.e5. PubMed Abstract | Publisher Full Text
29. Dong R, Yang R, Zhan Y, et al.: Single-Cell Characterization of Malignant Phenotypes and Developmental Trajectories of Adrenal Neuroblastoma. Cancer Cell. 2020; 38(5): 716–733.e6. PubMed Abstract | Publisher Full Text
30. Sehgal M, Nayak SP, Sahoo S, et al.: Mutually exclusive teams-like patterns of gene regulation characterize phenotypic heterogeneity along the noradrenergic-mesenchymal axis in neuroblastoma. Cancer Biol. Ther. 2024; 25(1): 2301802. PubMed Abstract | Publisher Full Text | Free Full Text
31. Moris N, Pina C, Arias AM: Transition states and cell fate decisions in epigenetic landscapes. Nat. Rev. Genet. 2016; 17(11): 693–703. PubMed Abstract | Publisher Full Text
32. Mar JC: The rise of the distributions: why non-normality is important for understanding the transcriptome and beyond. Biophys. Rev. 2019; 11: 89–94. PubMed Abstract | Publisher Full Text | Free Full Text
33. Vershik AM: Kantorovich metric: initial history and little-known applications. J. Math. Sci. 2006; 133(4): 1410–1417. Publisher Full Text
34. Dussiau C, Boussaroque A, Gaillard M, et al.: Hematopoietic differentiation is characterized by a transient peak of entropy at a single-cell level. BMC Biol. 2022; 20(1): 60. PubMed Abstract | Publisher Full Text
35. Bonnaffoux A, Herbach U, Richard A, et al.: WASABI: a dynamic iterative framework for gene regulatory network inference. BMC Bioinformatics. 2019; 20(1): 220. PubMed Abstract | Publisher Full Text | Free Full Text
36. Fisher J, Henzinger TA: Executable cell biology. Nat. Biotechnol. 2007; 25(11): 1239–1249. Publisher Full Text
37. Hari K, Ullanat V, Balasubramanian A, et al.: Landscape of epithelial-mesenchymal plasticity as an emergent property of coordinated teams in regulatory networks. elife. 2022; 11. PubMed Abstract | Publisher Full Text | Free Full Text
38. Shyam S, Nandan N, Jolly MK, Hari K: Topological conditions of gene regulatory networks enabling robust binary cell-fate decision-making. bioRxiv. 2025. 2025.2002.2007.636066.
39. Matsumoto H, Kiryu H, Furusawa C, et al.: SCODE: An efficient regulatory network inference algorithm from single-cell RNA-Seq during differentiation. Bioinformatics. 2017; 33: 2314–2321. PubMed Abstract | Publisher Full Text | Free Full Text
40. Sanchez-Castillo M, Blanco D, Tienda-Luna IM, et al.: A Bayesian framework for the inference of gene regulatory networks from time and pseudo-time series data. Bioinformatics. 2018; 34(6): 964–970. PubMed Abstract | Publisher Full Text
41. Ocone A, Haghverdi L, Mueller NS, et al.: Reconstructing gene regulatory dynamics from high-dimensional single-cell snapshot data. Bioinformatics. 2015; 31(12): i89–i96. PubMed Abstract | Publisher Full Text | Free Full Text

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 11 Sep 2025

Author details Author details

¹ Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratoire de Biologie et de Modélisation de la Cellule, 46 allée d’Italie Site Jacques Monod, Lyon, France
² Inria, Villeurbanne, France

Olivier Gandrillon
Roles: Conceptualization, Formal Analysis, Software, Writing – Original Draft Preparation, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

This work was supported by the Ligue Contre le Cancer (comités du Rhône et de l’Ardèche).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (1)

version 1

Published: 11 Sep 2025, 14:910

https://doi.org/10.12688/f1000research.164530.1

Copyright

© 2025 Gandrillon O. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Gandrillon O. Inferring and simulating a gene regulatory network for the sympathoadrenal differentiation from single-cell transcriptomics in human. [version 1; peer review: 1 approved with reservations]. F1000Research 2025, 14:910 (https://doi.org/10.12688/f1000research.164530.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Version 1

VERSION 1

PUBLISHED 11 Sep 2025

Views

7

Reviewer Report 23 Jun 2026

Olivia Johnson, Biostatistics and Medical Informatics, University of Wisconsin–Madison, Madison, Wisconsin, USA

Approved with Reservations

https://doi.org/10.5256/f1000research.181055.r489246

Overall summary:
The author applied a GRN inference algorithm, CARDAMOM, to a published single-cell RNAseq dataset from human sympathoadrenal differentiation and verified the capture of gene regulatory patterns through simulating scRNA-seq expression from the inferred GRN using HARISSA. They ... Continue reading

Overall summary:
The author applied a GRN inference algorithm, CARDAMOM, to a published single-cell RNAseq dataset from human sympathoadrenal differentiation and verified the capture of gene regulatory patterns through simulating scRNA-seq expression from the inferred GRN using HARISSA. They propose a new concept, “process” time of cells, similar to a pseudo time trajectory, by stemness score and assumed a differentiation sequence starting with Schwann cell precursor (SCP) and leading to chromaffin cells and sympathoblasts through a bifurcation. From the GRN, they identified dynamic interactions such as self-reinforcing loops and toggle switches in human sympathoadrenal differentiation.
The manuscript provides a well-justified explanation for the assumed differentiation sequence. Additionally, the discussion of the study's limitations is thorough, particularly noting the threshold-dependent nature of the gene selection. The paper is largely an application of existing tools, so methodologically does not contribute much, but the application is interesting.
The following points can improve the paper:

The Materials and Methods section would benefit from additional explanation of the mathematical framework underlying CARDAMOM and HARISSA. In particular, several variables and parameters appear without explicit definition. For example, the parameter λ in the 'Dynamical GRN representation' section is introduced in the equation but is never described in the text. Additionally, the distinction between experimentally observed mRNA measurements and the latent promoter states and protein concentrations used internally by the model is not immediately clear. A clearer explanation of how HARISSA represents transcriptional bursting and generates variables would improve accessibility for readers who are not already familiar with the framework.
In the 'Materials and Methods' section, there is a paragraph detailing a 'Color code for a rapid estimation of the fit quality'. This color-coding scheme does not appear to be utilized in any of the figures provided. The author should either include the corresponding figure or remove this paragraph from the Methods section if the analysis was intentionally omitted.
While defining a novel 'process time' using a custom stemness score nicely addresses the fact that the experimental collection times do not align with the differentiation trajectory, the manuscript would benefit from benchmarking this specific approach. The author should compare their stemness-based cell ordering against established pseudo time inference methods (e.g., Monocle3, Slingshot, Diffusion Pseudo time, Palantir, CellRank). Demonstrating whether these standard tools fail to reproduce the expected trajectory would rigorously justify the necessity of the proposed stemness process time.
The manuscript would benefit from a more detailed characterization of the relationship between the six newly defined 'process time' groups and the original developmental time points. While the UMAP visualization (Figure 1b) effectively demonstrates that the original cell collection times inadequately represent differentiation since all cell types are present at all collection times, it remains unclear what proportion of cells from each original time point contributes to each newly defined process-time bin. This information would help determine whether the method effectively reconstructs the original developmental time points or whether some original stages are disproportionately represented or excluded. In particular, because the dynamic analysis suggests that the majority of regulatory interactions are observed in the first three process-time groups, it would be interesting to determine which developmental weeks contribute most strongly to these groups. This analysis may provide additional biological insight into when meaningful regulatory changes occur during sympathoadrenal differentiation.
The use of overlapping cell populations between consecutive process-time groups could be affecting the inferred time-dependent GRNs. A useful sensitivity analysis would be to evaluate the robustness of the inferred networks to different degrees of overlap between consecutive groups. If the inferred interactions change substantially with overlap size, the conclusions regarding network dynamics may be less robust than suggested. Similarly, the observation that the final two process-time groups contain relatively few interactions raises the question of whether the choice of six groups is optimal. It would be informative to investigate how the inferred GRNs change when using fewer process-time groups. These analyses would help establish whether the reported temporal network evolution reflects underlying biology rather than choices made during the construction of the virtual time series.
When describing the resulting GRN simulation, the manuscript omits the exact number of simulated cells, meaning the full dimensions of the simulated matrix are never explicitly stated. We do not know if the author simulated exactly 1,800 cells to match the input. However, we can assume that the simulated matrix contains exactly 97 genes, because HARISSA is executing the "97 genes-based GRN" inferred by CARDAMOM. The author states that they assessed the simulation's ability to capture the evolution of 'a few genes whose expression was characteristic of the three cell types we were trying to reproduce', pointing to Figure 8. Figure 8 highlights ZEB2 alongside CHGA, CHGB, STMN2, and HMGB2. However, ZEB2 is absent from the signature gene lists provided in the 'Stemness score' Methods section and is coded generically as a Transcription Factor in the network diagrams (e.g., Figure 4a). The author should clarify which specific cell type ZEB2 is meant to characterize in this context or explicitly acknowledge that it is not part of the defining gene signatures, similarly to the clarification provided for HMGB2.
To assess the model's run-to-run variability (Figure 10), the author assigns simulated cells to the three cell fates (SCP, sympathoblast, and chromaffin cells) based exclusively on their spatial localization within divided regions of the 2D UMAP space. While the UMAP provides a useful visual summary, relying on 2D UMAP boundaries for quantitative cell type classification is imprecise due to the non-linear nature of UMAP projections. To ensure a more rigorous classification, the author could identify the identities of the simulated cells using the same metrics tied to the original experimental data. Specifically, simulated cells could be categorized quantitatively based on their expression levels of the specific signature gene lists or the established stemness score formula detailed earlier in the Materials and Methods.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Partly

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: GRN inference, single-cell technologies

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Respond or Comment

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 11 Sep 2025

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1
Version 1 11 Sep 25	read

Olivia Johnson, University of Wisconsin–Madison, Madison, USA

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

Back to all reports

Reviewer Report

7 Views

23 Jun 2026 | for Version 1

Olivia Johnson, Biostatistics and Medical Informatics, University of Wisconsin–Madison, Madison, Wisconsin, USA

7 Views Cite this report Responses(0)

Approved With Reservations

Overall summary:
The author applied a GRN inference algorithm, CARDAMOM, to a published single-cell RNAseq dataset from human sympathoadrenal differentiation and verified the capture of gene regulatory patterns through simulating scRNA-seq expression from the inferred GRN using HARISSA. They propose a new concept, “process” time of cells, similar to a pseudo time trajectory, by stemness score and assumed a differentiation sequence starting with Schwann cell precursor (SCP) and leading to chromaffin cells and sympathoblasts through a bifurcation. From the GRN, they identified dynamic interactions such as self-reinforcing loops and toggle switches in human sympathoadrenal differentiation.
The manuscript provides a well-justified explanation for the assumed differentiation sequence. Additionally, the discussion of the study's limitations is thorough, particularly noting the threshold-dependent nature of the gene selection. The paper is largely an application of existing tools, so methodologically does not contribute much, but the application is interesting.
The following points can improve the paper:

The Materials and Methods section would benefit from additional explanation of the mathematical framework underlying CARDAMOM and HARISSA. In particular, several variables and parameters appear without explicit definition. For example, the parameter λ in the 'Dynamical GRN representation' section is introduced in the equation but is never described in the text. Additionally, the distinction between experimentally observed mRNA measurements and the latent promoter states and protein concentrations used internally by the model is not immediately clear. A clearer explanation of how HARISSA represents transcriptional bursting and generates variables would improve accessibility for readers who are not already familiar with the framework.
In the 'Materials and Methods' section, there is a paragraph detailing a 'Color code for a rapid estimation of the fit quality'. This color-coding scheme does not appear to be utilized in any of the figures provided. The author should either include the corresponding figure or remove this paragraph from the Methods section if the analysis was intentionally omitted.
While defining a novel 'process time' using a custom stemness score nicely addresses the fact that the experimental collection times do not align with the differentiation trajectory, the manuscript would benefit from benchmarking this specific approach. The author should compare their stemness-based cell ordering against established pseudo time inference methods (e.g., Monocle3, Slingshot, Diffusion Pseudo time, Palantir, CellRank). Demonstrating whether these standard tools fail to reproduce the expected trajectory would rigorously justify the necessity of the proposed stemness process time.
The manuscript would benefit from a more detailed characterization of the relationship between the six newly defined 'process time' groups and the original developmental time points. While the UMAP visualization (Figure 1b) effectively demonstrates that the original cell collection times inadequately represent differentiation since all cell types are present at all collection times, it remains unclear what proportion of cells from each original time point contributes to each newly defined process-time bin. This information would help determine whether the method effectively reconstructs the original developmental time points or whether some original stages are disproportionately represented or excluded. In particular, because the dynamic analysis suggests that the majority of regulatory interactions are observed in the first three process-time groups, it would be interesting to determine which developmental weeks contribute most strongly to these groups. This analysis may provide additional biological insight into when meaningful regulatory changes occur during sympathoadrenal differentiation.
The use of overlapping cell populations between consecutive process-time groups could be affecting the inferred time-dependent GRNs. A useful sensitivity analysis would be to evaluate the robustness of the inferred networks to different degrees of overlap between consecutive groups. If the inferred interactions change substantially with overlap size, the conclusions regarding network dynamics may be less robust than suggested. Similarly, the observation that the final two process-time groups contain relatively few interactions raises the question of whether the choice of six groups is optimal. It would be informative to investigate how the inferred GRNs change when using fewer process-time groups. These analyses would help establish whether the reported temporal network evolution reflects underlying biology rather than choices made during the construction of the virtual time series.
When describing the resulting GRN simulation, the manuscript omits the exact number of simulated cells, meaning the full dimensions of the simulated matrix are never explicitly stated. We do not know if the author simulated exactly 1,800 cells to match the input. However, we can assume that the simulated matrix contains exactly 97 genes, because HARISSA is executing the "97 genes-based GRN" inferred by CARDAMOM. The author states that they assessed the simulation's ability to capture the evolution of 'a few genes whose expression was characteristic of the three cell types we were trying to reproduce', pointing to Figure 8. Figure 8 highlights ZEB2 alongside CHGA, CHGB, STMN2, and HMGB2. However, ZEB2 is absent from the signature gene lists provided in the 'Stemness score' Methods section and is coded generically as a Transcription Factor in the network diagrams (e.g., Figure 4a). The author should clarify which specific cell type ZEB2 is meant to characterize in this context or explicitly acknowledge that it is not part of the defining gene signatures, similarly to the clarification provided for HMGB2.
To assess the model's run-to-run variability (Figure 10), the author assigns simulated cells to the three cell fates (SCP, sympathoblast, and chromaffin cells) based exclusively on their spatial localization within divided regions of the 2D UMAP space. While the UMAP provides a useful visual summary, relying on 2D UMAP boundaries for quantitative cell type classification is imprecise due to the non-linear nature of UMAP projections. To ensure a more rigorous classification, the author could identify the identities of the simulated cells using the same metrics tied to the original experimental data. Specifically, simulated cells could be categorized quantitatively based on their expression levels of the specific signature gene lists or the established stemness score formula detailed earlier in the Materials and Methods.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Partly

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

GRN inference, single-cell technologies

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Respond to this report

Responses (0)

[1] 1. Gomez RL, Ibragimova S, Ramachandran R, et al.: Tumoral heterogeneity in neuroblastoma. Biochim. Biophys. Acta Rev. Cancer. 2022; 1877(6): 188805. Publisher Full Text

[2] 2. Almstedt E, Elgendy R, Hekmati N, et al.: Integrative discovery of treatments for high-risk neuroblastoma. Nat. Commun. 2020; 11(1): 71. PubMed Abstract | Publisher Full Text | Free Full Text

[3] 3. Li CH, Sharma S, Heczey AA, et al.: Long-term outcomes of GD2-directed CAR-T cell therapy in patients with neuroblastoma. Nat. Med. 2025; 31: 1125–1129. Publisher Full Text

[4] 4. Zeineldin M, Patel AG, Dyer MA: Neuroblastoma: When differentiation goes awry. Neuron. 2022; 110(18): 2916–2928. PubMed Abstract | Publisher Full Text | Free Full Text

[5] 5. Ratner N, Brodeur GM, Dale RC, et al.: The “neuro” of neuroblastoma: Neuroblastoma as a neurodevelopmental disorder. Ann. Neurol. 2016; 80(1): 13–23. PubMed Abstract | Publisher Full Text | Free Full Text

[6] 6. Ponzoni M, Bachetti T, Corrias MV, et al.: Recent advances in the developmental origin of neuroblastoma: an overview. J. Exp. Clin. Cancer Res. 2022; 41(1): 92. PubMed Abstract | Publisher Full Text | Free Full Text

[7] 7. Patel AG, Ashenberg O, Collins NB, et al.: Dyer MA: A spatial cell atlas of neuroblastoma reveals developmental, epigenetic and spatial axis of tumor heterogeneity. bioRxiv. 2024. 2024.2001.2007.574538.

[8] 8. Del Valle I, Buonocore F, Duncan AJ, et al.: A genomic atlas of human adrenal and gonad development. Wellcome Open Res. 2017; 2: 25. PubMed Abstract | Publisher Full Text | Free Full Text

[9] 9. Furlan A, Dyachuk V, Kastriti ME, et al.: Adameyko I: Multipotent peripheral glial cells generate neuroendocrine cells of the adrenal medulla. Science. 2017; 357(6346). PubMed Abstract | Publisher Full Text | Free Full Text

[10] 10. Kameneva P, Artemov AV, Kastriti ME, et al.: Single-cell transcriptomics of human embryos identifies multiple sympathoblast lineages with potential implications for neuroblastoma origin. Nat. Genet. 2021; 53(5): 694–706. PubMed Abstract | Publisher Full Text | Free Full Text

[11] 11. Rasmuson A, Segerstrom L, Nethander M, et al.: Tumor development, growth characteristics and spectrum of genetic aberrations in the TH-MYCN mouse model of neuroblastoma. PLoS One. 2012; 7(12): e51297. PubMed Abstract | Publisher Full Text | Free Full Text

[12] 12. Kildisiute G, Kholosy WM, Young MD, et al.: Tumor to normal single-cell mRNA comparisons reveal a pan-neuroblastoma cancer cell. Sci. Adv. 2021; 7(6). PubMed Abstract | Publisher Full Text | Free Full Text

[13] 13. Jansky S, Sharma AK, Korber V, et al.: Single-cell transcriptomic analyses provide insights into the developmental origins of neuroblastoma. Nat. Genet. 2021; 53(5): 683–693. PubMed Abstract | Publisher Full Text

[14] 14. La Manno G, Soldatov R, Zeisel A, et al.: RNA velocity of single cells. Nature. 2018; 560(7719): 494–498. PubMed Abstract | Publisher Full Text | Free Full Text

[15] 15. Bergen V, Lange M, Peidli S, et al.: Generalizing RNA velocity to transient cell states through dynamical modeling. bioRxiv. 2019; 820936.

[16] 16. Gorin G, Fang M, Chari T: L P: RNA velocity unraveled. PLoS Comput. Biol. 2022; 18(9): e1010492. PubMed Abstract | Publisher Full Text | Free Full Text

[17] 17. Fang M, Gorin G, Pachter L: Trajectory inference from single-cell genomics data with a process time model. PLoS Comput. Biol. 2025; 21(1): e1012752. PubMed Abstract | Publisher Full Text | Free Full Text

[18] 18. Ventre E, Herbach U, Espinasse T, et al.: One model fits all: Combining inference and simulation of gene regulatory networks. PLoS Comput. Biol. 2023; 19(3): e1010962. PubMed Abstract | Publisher Full Text | Free Full Text

[19] 19. Ventre E: Reverse engineering of a mechanistic model of gene expression using metastability and temporal dynamics. In Silico Biol. 2020; 14: 89–113. Publisher Full Text

[20] 20. Herbach U, Bonnaffoux A, Espinasse T, et al.: Inferring gene regulatory networks from single-cell data: a mechanistic approach. BMC Syst. Biol. 2017; 11(1): 105. PubMed Abstract | Publisher Full Text | Free Full Text

[21] 21. Ventre E, Espinasse T, Bréhier C-E, et al.: Reduction of a stochastic model of gene expression: Lagrangian dynamics gives access to basins of attraction as cell types and metastabilty. J. Math. Biol. 2021; 83(5): 59. PubMed Abstract | Publisher Full Text

[22] 22. Melville J: uwot: The Uniform Manifold Approximation and Projection (UMAP) Method for Dimensionality Reduction.2024.

[23] 23. R: A language and environment for statistical computing. http

[24] 24. Epskamp S, Cramer AOJ, Waldorp LJ, et al.: qgraph: Network Visualizations of Relationships in Psychometric Data. J. Stat. Softw. 2012; 48: 1–18. Publisher Full Text

[25] 25. Muller-Dott S, Tsirvouli E, Vazquez M, et al.: Expanding the coverage of regulons from high-confidence prior knowledge for accurate estimation of transcription factor activities. Nucleic Acids Res. 2023; 51(20): 10934–10949. PubMed Abstract | Publisher Full Text | Free Full Text

[26] 26. Herbach U: Harissa: Stochastic Simulation and Inference of Gene Regulatory Networks Based on Transcriptional Bursting. Computational Methods in Systems Biology: 2023//2023. Cham: Springer Nature Switzerland; pp. 97–105.

[27] 27. Blumberg A, Zhao Y, Huang YF, et al.: Characterizing RNA stability genome-wide through combined analysis of PRO-seq and RNA-seq data. BMC Biol. 2021; 19(1): 30. PubMed Abstract | Publisher Full Text | Free Full Text

[28] 28. Li J, Cai Z, Vaites LP, et al.: Proteome-wide mapping of short-lived proteins in human cells. Mol. Cell. 2021; 81(22): 4722–4735.e5. PubMed Abstract | Publisher Full Text

[29] 29. Dong R, Yang R, Zhan Y, et al.: Single-Cell Characterization of Malignant Phenotypes and Developmental Trajectories of Adrenal Neuroblastoma. Cancer Cell. 2020; 38(5): 716–733.e6. PubMed Abstract | Publisher Full Text

[30] 30. Sehgal M, Nayak SP, Sahoo S, et al.: Mutually exclusive teams-like patterns of gene regulation characterize phenotypic heterogeneity along the noradrenergic-mesenchymal axis in neuroblastoma. Cancer Biol. Ther. 2024; 25(1): 2301802. PubMed Abstract | Publisher Full Text | Free Full Text

[31] 31. Moris N, Pina C, Arias AM: Transition states and cell fate decisions in epigenetic landscapes. Nat. Rev. Genet. 2016; 17(11): 693–703. PubMed Abstract | Publisher Full Text

[32] 32. Mar JC: The rise of the distributions: why non-normality is important for understanding the transcriptome and beyond. Biophys. Rev. 2019; 11: 89–94. PubMed Abstract | Publisher Full Text | Free Full Text

[33] 33. Vershik AM: Kantorovich metric: initial history and little-known applications. J. Math. Sci. 2006; 133(4): 1410–1417. Publisher Full Text

[34] 34. Dussiau C, Boussaroque A, Gaillard M, et al.: Hematopoietic differentiation is characterized by a transient peak of entropy at a single-cell level. BMC Biol. 2022; 20(1): 60. PubMed Abstract | Publisher Full Text

[35] 35. Bonnaffoux A, Herbach U, Richard A, et al.: WASABI: a dynamic iterative framework for gene regulatory network inference. BMC Bioinformatics. 2019; 20(1): 220. PubMed Abstract | Publisher Full Text | Free Full Text

[36] 36. Fisher J, Henzinger TA: Executable cell biology. Nat. Biotechnol. 2007; 25(11): 1239–1249. Publisher Full Text

[37] 37. Hari K, Ullanat V, Balasubramanian A, et al.: Landscape of epithelial-mesenchymal plasticity as an emergent property of coordinated teams in regulatory networks. elife. 2022; 11. PubMed Abstract | Publisher Full Text | Free Full Text

[38] 38. Shyam S, Nandan N, Jolly MK, Hari K: Topological conditions of gene regulatory networks enabling robust binary cell-fate decision-making. bioRxiv. 2025. 2025.2002.2007.636066.

[39] 39. Matsumoto H, Kiryu H, Furusawa C, et al.: SCODE: An efficient regulatory network inference algorithm from single-cell RNA-Seq during differentiation. Bioinformatics. 2017; 33: 2314–2321. PubMed Abstract | Publisher Full Text | Free Full Text

[40] 40. Sanchez-Castillo M, Blanco D, Tienda-Luna IM, et al.: A Bayesian framework for the inference of gene regulatory networks from time and pseudo-time series data. Bioinformatics. 2018; 34(6): 964–970. PubMed Abstract | Publisher Full Text

[41] 41. Ocone A, Haghverdi L, Mueller NS, et al.: Reconstructing gene regulatory dynamics from high-dimensional single-cell snapshot data. Bioinformatics. 2015; 31(12): i89–i96. PubMed Abstract | Publisher Full Text | Free Full Text

Inferring and simulating a gene regulatory network for the sympathoadrenal differentiation from single-cell transcriptomics in human.

Abstract

Background

Methods

Results

Conclusions

Keywords

Introduction

Material and methods

Data preparation

Figure 1. UMAP representation of the dataset.

Stemness score

GRN inference and representation

GRN simulation

Dynamical GRN representation

Results

Organizing the cells

Figure 2. Cell ordering by their stemness score.

Gene selection

Figure 3. Gene selection procedure.

Table 1. The 97 genes that were used for inferring the GRN.

GRN inference and analysis

Figure 4. Representation of the inferred GRN structure.

Figure 5. Representation of the time-dependent evolution of the inferred GRN structure.

Figure 6. Representation of a subset of the GRN showing genes involved in specific dynamical motifs.

GRN simulation

Figure 7. Adjusting the process time to real time values.

Figure 8. UMAP representation of the dataset used for GRN inference (a-f ) and of the dataset obtained through GRN simulation (g-m).

Figure 9. Fit quality for each gene at each time point as assessed by the Kantorovich distance (Kanto.dist) between the experimentally observed mRNA distribution and the simulated distribution.

Figure 10. The UMAP space was divided in three regions, representing SCP, sympathoblasts and chromaffin cells (see Figure 1c).

Discussion

Ethics approval and consent to participate

Software availability

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated