Identification of informative genes and sub-pathways using Improved Differential Expression Analysis for Pathways (iDEAP) for cancer classification

Nurul Athirah Nasarudin; Mohd Saberi Mohamad; Zalmiyah Zakaria; Richard O. Sinnott; Fatma Al Jasmi; Noura Al Dhaheri

doi:10.12688/f1000research.132899.1

Research Article

[version 1; peer review: 1 approved, 1 approved with reservations]

Nurul Athirah Nasarudin¹, Mohd Saberi Mohamad ¹, Zalmiyah Zakaria², Richard O. Sinnott³, Fatma Al Jasmi¹, Noura Al Dhaheri¹

PUBLISHED 06 Nov 2023

Author details

¹ Health Data Science Lab, Department of Genetics and Genomics, College of Medicine and Health Sciences, United Arab Emirates University, Al Ain, Abu Dhabi, 17666, United Arab Emirates
² School of Computing, Faculty of Engineering, Universiti Teknologi Malaysia, Skudai, Johor, 81300, Malaysia
³ School of Computing and Information Systems, The University of Melbourne, Melbourne, Victoria, 3010, Australia

Nurul Athirah Nasarudin
Roles: Investigation, Software, Writing – Review & Editing

Mohd Saberi Mohamad
Roles: Conceptualization, Methodology, Writing – Review & Editing

Zalmiyah Zakaria
Roles: Supervision, Writing – Review & Editing

Richard O. Sinnott
Roles: Writing – Original Draft Preparation, Writing – Review & Editing

Fatma Al Jasmi
Roles: Data Curation, Formal Analysis, Investigation, Writing – Review & Editing

Noura Al Dhaheri
Roles: Formal Analysis, Validation, Writing – Review & Editing

Abstract

Background: Pathway-based analysis now focuses primarily on sub-pathway-based analysis, which aids in understanding biological reactions. Several studies have found that abnormalities in specific regions of a pathway are linked to the etiology of diseases. The Differential Expression Analysis for Pathways (DEAP) method is one such sub-pathway-based analysis method; it identifies a local region perturbed by complex diseases within larger pathway data. However, the method has low performance in identifying informative pathways and sub-pathways.
Methods: In this paper we propose an improved DEAP (iDEAP) method for enhanced identification of perturbed sub-pathways that achieves higher performance in the detection of differentially expressed pathways. Firstly, a search algorithm adapted from the Detect Module from Seed Protein (DMSP) algorithm was implemented as part of the DEAP method to search for informative sub-pathways. Secondly, the relation among sub-pathways was taken into account by averaging the maximum absolute DEAP scores of the sub-pathways to support the efficient identification of informative pathways. Three gene expression data sets were used in this research.
Results: The proposed method performs better than previous methods. When the identified genes were used to classify cancer and assessed by 10-fold cross validation accuracy, the improved method achieved higher accuracy for colorectal cancer (90%) and breast cancer (94%).
Conclusions: This shows that the proposed method effectively identifies informative genes related to the targeted phenotype. A biological validation was also conducted on the top five significant pathways and selected genes based on biological literature. The results from this analysis will be useful especially in the medical field for disease detection. In 10 years and beyond, computational biology will become ever more entwined with biomedical research and medicine.

Keywords

Sub-pathway-based analysis, iDEAP, Support Vector Machine, 10-fold Cross Validation, Cancer Classification

Corresponding author: Mohd Saberi Mohamad

Competing interests: No competing interests were disclosed.

Grant information: This work was supported by the United Arab Emirates University (UAEU) through Strategic Research Program [Vot: 12R111].

Copyright: © 2023 Nasarudin NA et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Nasarudin NA, Mohamad MS, Zakaria Z et al. Identification of informative genes and sub-pathways using Improved Differential Expression Analysis for Pathways (iDEAP) for cancer classification [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2023, 12:1433 (https://doi.org/10.12688/f1000research.132899.1) First published: 06 Nov 2023, 12:1433 (https://doi.org/10.12688/f1000research.132899.1) Latest published: 06 Nov 2023, 12:1433 (https://doi.org/10.12688/f1000research.132899.1)

Introduction

In the life sciences, emerging high-throughput technologies such as next-generation sequencing, -omics technology and microarrays allow the creation of massive amounts of high-dimensional biological data. Such data can cover the genome, transcriptome, epigenome, proteome, metabolome, molecular imaging, and molecular pathways. Many sophisticated analytic methods have been developed to broaden the biological interpretation of differentially expressed genes and pathways. The earliest approach was individual gene analysis (IGA), a gene-by-gene analysis that produced a list of altered genes using a cut-off threshold.¹ Subsequently, systems-level methodologies extended IGA to gene set analysis (GSA), which can identify gene sets in a more subtle way through a single-step process. Although GSA methods help researchers characterize groups of genes, they have limitations when applied to pathway datasets. Many GSA methods disregard the graphical structure of pathway data and can therefore omit critical information about the biological interactions between molecules, which can lead to inaccurate outcomes. Pathway topology-based analysis was introduced to overcome these limitations by considering the pathway structure. This analysis integrates gene ontology and pathway structure information to discover which pathways are associated with a particular phenotype. In addition, two hypotheses can be tested. First, entire pathways can be tested for differential expression. Second, informative sub-pathways can be identified that represent the entire pathway and carry much of the information associated with the differential expression.
The second hypothesis is a more recent evolution in topology-based analysis, as it can improve the specificity and sensitivity of the outcomes.² Previous studies stated that pathway structure information can provide relevant biological insights and contribute to the understanding of higher integrative levels of biological function, which are more complex and have many variations and characteristics.³ Recently, topology-based analysis has shifted towards sub-pathway-based analysis, which describes biological phenomena more precisely and hence can identify regions of the pathway that are dysregulated by diseases.⁴ In addition, previous studies have shown that abnormalities in sub-pathway regions of a pathway might contribute to the etiology of the disease.⁵ An overview of sub-pathway-based analysis is illustrated in Figure 1.

Figure 1. Overview illustration of sub-pathway-based analysis.

Several sub-pathway-based analysis methods have been developed that share the same goal of finding the portion of a pathway relevant to disease modeling, drug targeting and other objectives.⁶–⁹ However, most of these methods have constraints that need to be addressed. One challenge is how to examine the sub-pathways.¹⁰ Most sub-pathway methods search for sub-pathways independently, without implementing any search algorithm. Moreover, some researchers assume that the sub-pathway extraction strategy does not affect the results.² In addition, pathway structures can be complex and involve the combination of many sub-pathways and interactions. For these reasons, an efficient sub-pathway-based analysis method is essential to identify the specific region that is differentially expressed by utilizing all information within a given pathway. Therefore, this paper integrates the Detect Module from Seed Protein (DMSP) algorithm to identify informative sub-pathways, starting from specific nodes and expanding to the entire pathway network. The sub-pathway with the most informative genes is used to produce the best biological insight in identifying informative pathways related to the phenotype under investigation.

Numerous biological pathway databases now exist, such as the Kyoto Encyclopedia of Genes and Genomes (KEGG),¹¹ Gene Ontology (GO),¹² Biocarta (https://maayanlab.cloud/Harmonizome/dataset/Biocarta+Pathways) and many more. The majority of these pathway databases are not specific to particular biological contexts such as cancer. By implementing sub-pathway-based analysis, many informative pathways can be identified and leveraged to improve biological databases. In addition, knowledge of the genes within informative sub-pathways that are highly related to diseases can be applied in future studies such as cancer classification.¹³ Previous cancer studies have been largely clinical and have limited diagnostic ability.¹⁴ Usually, gene expression data obtained from microarray experiments are used to perform cancer classification with advanced machine learning methods. The use of gene expression data poses a challenge for cancer classification, especially when using traditional approaches.

Methods

Differential Expression Analysis for Pathways (DEAP) method

The Differential Expression Analysis for Pathways (DEAP) method is a sub-pathway analysis method that finds informative pathways based on the maximum absolute running sum score of sub-pathways. This method calculates the sub-pathway maximum absolute summation score by considering the interaction between nodes, where catalytic/inhibitory edges are taken as positive/negative summands. There are five primary steps in the DEAP method: data pre-processing, data mapping, identification of sub-pathways, absolute value calculation and statistical calculation of pathways, as shown in Figure 2. First, the gene expression data undergo a pre-processing stage to generate a null distribution using random rotations. Then, the algorithm maps the expression data onto the pathway graph. Next, the sub-pathways in the pathway are recursively identified from the root node to the leaf nodes. A recursive function calculates the maximum absolute running summation score for all sub-pathways within the pathway by considering whether the reactions are catalytic or inhibitory. Every reaction contributes a value, which is used in calculating the score of each sub-pathway. From the calculation, the sub-pathway with maximal differential expression is determined. Its maximum absolute value is returned as the DEAP score.

Figure 2. Flowchart of the Differential Expression Analysis for Pathway (DEAP) method.

The proposed improved method (iDEAP)

In order to extract a sub-pathway that is related to the targeted phenotype, many methods have been developed based around differentially expressed (DE) genes. The differential expression analysis for pathway (DEAP) method¹⁵ utilizes the information in biological pathways to identify important paths by integrating differential expression data. Despite the good performance of DEAP, the efficiency of sub-pathway identification can still be improved by integrating a search algorithm that starts from the most associated genes in the pathway. The DEAP method searches for possible sub-pathways within the pathway without taking other information into account, and it uses only a basic embedded search: the search starts from the root node, proceeds to the leaf nodes and goes through all the nodes until the end. The efficiency of searching can be improved by implementing a search algorithm to find informative pathways. Recent research has shown that the use of search algorithms can provide better results.¹⁶ In sub-pathway-based analysis, one of the main concerns is the precision in identifying sub-pathways within a given pathway. Sub-pathways have often been considered without taking into account biological relations within the pathway. Thus, non-informative sub-pathways could be erroneously identified during the analysis of data from specific biological contexts like cancer, because not every gene in the pathway is involved in biological processes and diseases like cancer. This research adds the DMSP search algorithm to the DEAP method to improve the efficiency of the identification of informative sub-pathways and genes. The proposed method is referred to as iDEAP.¹⁷ In the proposed method, the search process is improved to find the most informative sub-pathway, and an additional step averages all of the DEAP scores in order to consider all interactions between sub-pathways.
The workflow of the iDEAP method is shown in Figure 3, which also highlights the parts where the proposed algorithm is implemented and which therefore differ between DEAP and iDEAP. The code from previous work is combined with the proposed search algorithm and the average calculation code. Furthermore, new code has been developed specifically for data pre-processing, ensuring that the data are appropriately prepared.

Figure 3. Flowchart of the proposed improved Differential Expression Analysis for Pathway (iDEAP) method where the grey shaded area represents the improvement parts of Differential Expression Analysis for Pathway (DEAP) method.

Pre-processing data

The purpose of data pre-processing is to remove uninformative data that affect the results and to ensure the data is suitable for input. Firstly, the uninformative data are removed from the dataset, followed by a normalization process. This step is important to correct the expression data value according to different cellular inputs. Next, the geneID is converted to UniProtID as this analysis utilizes the UniProt identifier. Figure 4 shows the flowchart of the data pre-processing activity.

Figure 4. Flowchart of the data pre-processing step.

Mapping gene expression data onto pathway graph

In this phase, the gene expression data are integrated with the pathway graphs. All cellular components within the pathway are extracted as nodes in the corresponding graph. Each node contains multiple genes which have a different unique ID. This ID is used to derive the expression values from the gene expression data. The process of mapping is illustrated in Figure 5.
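The mapping step can be pictured with a small sketch. It is only an illustration under stated assumptions, not the authors' implementation: the function name `map_expression_to_nodes`, the node names, the gene IDs and the expression values are all hypothetical, and the pathway is reduced to a plain dictionary instead of a parsed pathway file.

```python
def map_expression_to_nodes(pathway_nodes, expression):
    """Attach expression values to each pathway node.

    pathway_nodes: dict of node name -> list of gene IDs in that node
    expression:    dict of gene ID -> list of expression values (one per sample)
    Returns a dict of node name -> {gene ID: expression values}; gene IDs
    absent from the expression data are skipped.
    """
    return {
        node: {g: expression[g] for g in genes if g in expression}
        for node, genes in pathway_nodes.items()
    }


# Hypothetical toy inputs (IDs and values are made up for illustration).
pathway_nodes = {
    "node_A": ["ID1", "ID2"],
    "node_B": ["ID3"],
    "node_C": ["ID4"],  # no expression measured for ID4
}
expression = {
    "ID1": [5.1, 4.8, 7.9],
    "ID2": [2.0, 2.2, 2.1],
    "ID3": [9.3, 9.0, 3.4],
}

mapped = map_expression_to_nodes(pathway_nodes, expression)
```

In this sketch a node whose genes are all missing from the expression data maps to an empty dictionary, which makes unmatched nodes easy to spot before scoring.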

Figure 5. Illustration of the mapping process.

Identification of sub-pathways using the proposed algorithm (DMSP algorithm)

The DMSP search algorithm was introduced in 2007 to integrate gene expression data and protein-protein interactions (PPI) in order to determine functional modules. This algorithm is able to determine interactions among a set of proteins in each graph. The general idea is to discover biologically relevant PPI subnetworks, whose proteins interact significantly, within a larger network. The algorithm is set up to identify functional modules starting from a ‘seed’ protein (the most informative protein) in the dataset.

In this work, a search algorithm was implemented to extract sub-pathways by taking topology information into account. This algorithm is adapted from the DMSP algorithm¹⁸ where the search process starts from the most interesting node. The package edgeR¹⁹ is used to detect the most differentially expressed genes to be appointed as the most interesting nodes in the pathway. The sub-pathways were extracted based on the perturbation caused by the most interesting nodes in the pathway as shown in Figure 6(b). The most interesting nodes are selected based on the value of the most differentially expressed genes.

Figure 6. Illustration of the sub-pathway identification process.

The search algorithm works in two phases. First, the search is conducted by selecting the internal and external nodes to form a sub-pathway. Second, the nodes in the sub-pathway are pruned to select highly connected sub-pathways, as shown in Figure 6(c) and (d). To obtain a compact sub-pathway, a node is removed from the sub-pathway if it satisfies the criterion N_internal > N_external, where N_internal is the internal node nearest to the interesting node and N_external is the external node. This process is repeated recursively for every sub-pathway. The process is illustrated in Figure 6 and its flow is shown in Figure 7.

Figure 7. Flowchart of the sub-pathway identification process, where N_external = total gene expression value for external nodes and N_internal = total value of gene expression for internal nodes.

Calculation of DEAP score for each sub-pathway

Given an edge, all other edges in the graph, and the expression values for all genes, we consider expression data under two conditions (healthy/cancer). $E (x)$ is defined as the difference, between the two conditions, of the logarithm of the arithmetic mean of the expression values associated with gene(s) x. Next, all edges in the sub-pathway are examined recursively, where max_recursive and min_recursive denote the maximum and minimum recursive scores found downstream of the current edge. The algorithm examines all edges in the sub-pathway set whose reactant node is the current edge’s product node. If there are no such edges, max_recursive and min_recursive are both set to $\sum_{y \in products} E (y)$, where y∈products refers to each gene, y, contained in the edge’s products. Otherwise, max_recursive and min_recursive are defined as the maximum and minimum scores, respectively, obtained from those edges. The maximum and minimum scores are calculated as follows:

$$\sum_{z \in reactants} E (z) + T (edge) \cdot {max}_{recursive} \qquad (1)$$

$$\sum_{z \in reactants} E (z) + T (edge) \cdot {min}_{recursive} \qquad (2)$$

where $T (edge)$ is the multiplier associated with the edge type (-1 for inhibition, 1 for catalysis) and z∈reactants refers to each protein, z, contained in the edge’s reactants. Finally, the maximum and minimum DEAP scores are returned for each sub-pathway. The process of establishing the DEAP score is shown in Figure 8.
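The recursion described in this section can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the DEAP or iDEAP source code: the names `Edge`, `differential_expression`, `node_sum` and `deap_edge_score` are hypothetical, the logarithm base (2) and the cancer-minus-healthy direction are assumptions because the text does not fix them, and the `visited` set is an added guard against cycles.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Edge:
    reactant: str  # name of the reactant node
    product: str   # name of the product node
    sign: int      # T(edge): +1 for catalysis, -1 for inhibition


def differential_expression(cancer, healthy):
    """E(x): difference of the log of the arithmetic mean expression between
    two conditions (base 2 and cancer-minus-healthy are assumptions)."""
    return math.log2(sum(cancer) / len(cancer)) - math.log2(sum(healthy) / len(healthy))


def node_sum(node, node_genes, E):
    """Sum of E over all genes contained in a node."""
    return sum(E[g] for g in node_genes[node])


def deap_edge_score(edge, edges, node_genes, E, visited=frozenset()):
    """Return (max score, min score) for the sub-pathway starting at `edge`."""
    reactant_sum = node_sum(edge.reactant, node_genes, E)
    visited = visited | {edge}  # assumption: never follow the same edge twice
    # Edges whose reactant node is the current edge's product node.
    downstream = [e for e in edges if e.reactant == edge.product and e not in visited]
    if not downstream:
        max_rec = min_rec = node_sum(edge.product, node_genes, E)
    else:
        results = [deap_edge_score(e, edges, node_genes, E, visited) for e in downstream]
        max_rec = max(r[0] for r in results)
        min_rec = min(r[1] for r in results)
    a = reactant_sum + edge.sign * max_rec  # equation (1)
    b = reactant_sum + edge.sign * min_rec  # equation (2)
    return max(a, b), min(a, b)


# Hypothetical toy pathway: A activates B; B inhibits C and activates D.
node_genes = {"A": ["a"], "B": ["b"], "C": ["c"], "D": ["d"]}
E = {"a": 1.0, "b": 0.5, "c": 2.0, "d": 1.0}
edges = [Edge("A", "B", 1), Edge("B", "C", -1), Edge("B", "D", 1)]

max_score, min_score = deap_edge_score(edges[0], edges, node_genes, E)  # (2.5, -0.5)
deap_score = max(abs(max_score), abs(min_score))  # 2.5
```

In the toy pathway the branch through D gives 0.5 + 1.0 = 1.5 and the branch through C gives 0.5 - 2.0 = -1.5, so the edge from A returns 1.0 + 1.5 = 2.5 as the maximum and 1.0 - 1.5 = -0.5 as the minimum.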

Figure 8. Flowchart of recursive function processing for the Differential Expression Analysis for Pathway (DEAP) score calculation.

Calculation of average DEAP Score (proposed additional step)

The goal of this research is to include all relations or interactions within a pathway to provide a finer resolution to represent relevant biological process related to a given target phenotype. To support this, all scores of the sub-pathways are calculated together and the average of the scores taken. The average scores are calculated based on the maximum score of each sub-pathway based on the equation below:

$$avg = \frac{\sum_{i = 1}^{n} {MaxScore}_{i}}{n} \qquad (3)$$

In this equation, MaxScore represents the maximum score of every sub-pathway in the pathway. The maximum score of the i-th sub-pathway, MaxScore_i, is calculated by the recursive function in DEAP. The average of the maximum scores, avg, is calculated by summing the MaxScore values over the sub-pathways of a pathway and dividing by n, the number of loops of the recursive function for that pathway.
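As a worked illustration of equation (3), the averaging step might look like the sketch below. The function name `average_max_score` and the example scores are hypothetical, and n is taken to be the number of MaxScore values supplied, which is an assumption about how the loops of the recursive function are counted.

```python
def average_max_score(max_scores):
    """Average of the per-sub-pathway maximum scores, as in equation (3).

    max_scores: list of MaxScore_i values, one per sub-pathway of a pathway.
    """
    if not max_scores:
        raise ValueError("a pathway needs at least one sub-pathway score")
    return sum(max_scores) / len(max_scores)


# Hypothetical maximum scores of three sub-pathways in one pathway.
avg = average_max_score([2.5, 1.5, 0.5])  # (2.5 + 1.5 + 0.5) / 3 = 1.5
```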

Statistics calculation

The average of the maximum score is used for the statistical calculation to evaluate the significance of a given pathway to the target phenotype. The statistics used here are based on maximum order statistics.¹⁵ The statistic calculation of the proposed method is shown by the following equation:

$$p = \frac{{avg}_{S_{i}}}{n} \qquad (4)$$

where avg_Si represents the average DEAP score of all sub-pathways within a pathway, based on the calculated average maximum score. As in the workflow of the previous method, a pathway was considered significant if the p-value from the test statistic was less than or equal to 0.05 (p ≤ 0.05). The significant pathways are used for performance measurements based on 10-fold cross validation classification.

Data sets

Since the research focuses on pathway-based analysis, two types of data sets were used: pathway data sets and gene expression data sets. Specifically, three gene expression datasets were used in this research, as summarized in Table 1. The three datasets, head and neck tumor,²⁰ colorectal cancer,²¹ and breast cancer,²² can be downloaded from the NCBI GEO database.

Table 1. Summary of the gene expression data sets.

Dataset	No. of samples	No. of genes	Classes	Reference
Head and neck tumor cell lines	7	22284	2 (normal/tumor)	²⁰
Colorectal cancer	12	54676	2 (normal/tumor)	²¹
Breast cancer	100	22284	2 (normal/tumor)	²²

A total of 177 graph-based pathway data sets were downloaded in Systems Biology Markup Language (SBML) format. For the interpretation, the pathways were broken into their protein components. Every pathway contained information about proteins, biochemical reactions, and other substrates. The pathway data set is used to group the gene expression data based on the sample pathways, so the gene expression data set was matched against the pathway data set. All the pathways were taken from the Protein Analysis Through Evolutionary Relationships (PANTHER) database and relate to regulatory and metabolic pathways.²³ The pathway data sets can be downloaded at http://www.pantherdb.org/downloads/.

Results/Use cases

The significant pathways identified related to targeted phenotype

Most sub-pathway analysis methods are assessed based on the number of differentially expressed pathways found. In order to verify the performance of the iDEAP method, this research applied three sets of biological data to the biological pathways and made comparisons with previous work. The proposed method was compared with previous methods, namely DEAP,¹⁵ SubSPIA,¹⁶ MinePath,²⁴ HiPathia,²⁵ and PsSubpathway.²⁶ Table 2 compares the iDEAP method with the previous methods based on the number of informative pathways found. As noted, significant pathways were selected based on a p-value less than or equal to 0.05 (p ≤ 0.05). In general, the iDEAP method found more significant pathways than the previous methods. This supports the effectiveness of the search algorithm for the identification of informative sub-pathways. In addition, the interactions and relations between sub-pathways provide important additional information that can help the medical sector detect diseases. By considering all interactions, the tendency to identify informative pathways related to the targeted phenotype is increased.

Table 2. Number of informative pathways found based on previous work and the proposed improved Differential Expression Analysis for Pathway (iDEAP) method.

Method	Head and neck tumor (no. of significant pathways)	Colorectal cancer (no. of significant pathways)	Breast cancer (no. of significant pathways)
iDEAP¹⁷	81	78	95
DEAP¹⁵	12	98	5
SubSPIA¹⁶	3	18	31
MinePath²⁴	17	64	26
HiPathia²⁵	30	43	25
PsSubpathway²⁶	26	51	77

The iDEAP method performed well for head and neck tumor and breast cancer but not for colorectal cancer, where it found fewer significant pathways than DEAP (78 versus 98). This is because the colorectal cancer data were not well suited to this research: most of the genes were not complementary to the pathway data, which reduced the sub-pathway interactions. Moreover, the large size of the colorectal cancer data could affect the results, since it might contain significant noise that obscures the informative data.

10-fold cross validation

The performance of the proposed method was evaluated in terms of accuracy through 10-fold cross validation classification. This measurement was used to assess the method using the informative genes identified in the sub-pathways. The genes were selected from sub-pathways with a p-value less than or equal to 0.05 and were classified with the support vector machine (SVM) algorithm under cross validation. The SVM algorithm is widely used for cancer genomic classification and prediction. Therefore, the classification accuracy can be used to analyze the effectiveness of the proposed method in identifying informative genes for the targeted phenotype. To obtain a consistent result, the classification was run 10 times and the average was calculated. The comparison of the average 10-fold cross validation (CV) classification accuracy between the iDEAP method and the DEAP method for all data sets is presented in Table 3.
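The evaluation protocol (10-fold cross validation, repeated 10 times, accuracy averaged) can be sketched as follows. This is an illustrative sketch only and not the authors' pipeline: to keep it dependency-free, the SVM classifier is replaced by a simple nearest-centroid stand-in, and the function names, the seed and the toy data are all hypothetical. Any classifier with the same `fit_predict(X_train, y_train, X_test)` shape could be substituted.

```python
import random
from statistics import mean


def kfold_indices(n, k, rng):
    """Shuffle sample indices and split them into k roughly equal folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[i::k] for i in range(k)]


def nearest_centroid_fit_predict(X_train, y_train, X_test):
    """Stand-in classifier (NOT an SVM): predict the class with the closest mean."""
    centroids = {}
    for label in set(y_train):
        rows = [x for x, y in zip(X_train, y_train) if y == label]
        centroids[label] = [mean(col) for col in zip(*rows)]

    def dist2(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))

    return [min(centroids, key=lambda c: dist2(x, centroids[c])) for x in X_test]


def repeated_kfold_accuracy(X, y, fit_predict, k=10, repeats=10, seed=0):
    """Average accuracy (%) over `repeats` runs of k-fold cross validation."""
    rng = random.Random(seed)
    run_accuracies = []
    for _ in range(repeats):
        correct = 0
        for fold in kfold_indices(len(X), k, rng):
            held_out = set(fold)
            train = [i for i in range(len(X)) if i not in held_out]
            preds = fit_predict([X[i] for i in train], [y[i] for i in train],
                                [X[i] for i in fold])
            correct += sum(p == y[i] for p, i in zip(preds, fold))
        run_accuracies.append(100.0 * correct / len(X))
    return mean(run_accuracies)


# Hypothetical, well separated toy data: 10 "normal" and 10 "tumor" samples.
X = [[0.1 * i, 0.1 * i] for i in range(10)] + [[10 + 0.1 * i, 10 + 0.1 * i] for i in range(10)]
y = [0] * 10 + [1] * 10
accuracy = repeated_kfold_accuracy(X, y, nearest_centroid_fit_predict)
```

Passing the classifier as a function keeps the cross validation loop independent of the model, so the same loop could be reused with an SVM implementation from a machine learning library.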

Table 3. Comparison of the average 10-fold cross validation classification accuracy of identified informative genes in the significant pathway between the Differential Expression Analysis for Pathway (DEAP) and improved Differential Expression Analysis for Pathway (iDEAP) methods.

Data	Without selection: 10-fold CV accuracy (%)	Selected by DEAP: no. of genes	Selected by DEAP: 10-fold CV accuracy (%)	Selected by iDEAP: no. of genes	Selected by iDEAP: 10-fold CV accuracy (%)
Head and Neck tumor	50.00	22	50.00	202	50.00
Colorectal cancer	85.00	347	67.50	197	90.00
Breast cancer	77.00	114	79.30	157	94.00

Biological validation

It is important to validate the results in the biological context against literature and databases to show the relevance of the research. This validation was undertaken manually after the experiments were conducted. All the informative genes and pathways identified by the proposed method were checked against the biological literature and databases. This study used Google Scholar to search the biological literature and GeneCards as the biological database.

The pathways were analyzed individually and consequently produced corresponding p-value and informative sub-pathways. According to Figure 9, the top five pathways with the corresponding p-value and the associated list of informative genes were selected and validated based on the biological database and literature in order to show biological relevance.

Figure 9. Illustration of selection of top five pathways from results for biological validation.

Head and neck tumor

Table 4 presents the top five pathways selected by the proposed method for the head and neck tumor data set. The first-ranked pathway was the Notch signaling pathway, which has been implicated in the regulation of self-renewal capacity, cell cycle exit, and neural stem cell survival.²⁷ Notch signaling is often associated with cancers, including head and neck squamous cell carcinoma (HNSCC).²⁸ In addition, a meta-analysis revealed that this pathway plays an important role in tumor development.²⁹ The proposed method selected three informative genes for classification, all of which were related to the development of head and neck tumors. CSL is a nuclear DNA-binding factor that interacts with an intracellular fragment of NOTCH.²⁹ The NOTCH4 gene was found to be significantly related to HEY1 gene activation in HNSCC, which promotes cell proliferation, cisplatin resistance, inhibition of apoptosis and cell-cycle dysregulation.³⁰ MAML1 regulates transcription of Notch target genes and interacts with muscle-specific genes like MEF2C as a fundamental coactivator of other cell signaling pathways.³¹

Table 4. Top five pathways using the improved Differential Expression Analysis for Pathway (iDEAP) method based on head and neck tumor cell lines data.

Pathway	p-value (<0.05)	Selected genes
Notch signaling pathway	0.000033	CSL,²⁹ NOTCH4,³⁰ MAML1³¹
TGF beta signaling pathway	0.001236	JUND,³⁹ EP300,⁴⁰ CITED1, CITED2,⁴¹ FOXH1, DCP1A, JUNB,⁴² SMAD5,³² FOSL1, SNIP1, SKIL,⁴³ DCP1B, SMURF1,⁴⁴ SMURF2, DCP1B
Ras signaling pathway	0.001412	PIK3CA,⁴⁵ PIK3C3,⁴⁶ PIK3CG,⁴⁶ PIK3CB,⁴⁷ PIK3CD, HRAS, NRAS,⁴⁷ KRAS⁴⁸
JAK STAT signaling pathway	0.004789	STAT5A, STAT5B, STAT3, STAT4, STAT1, STAT6,⁴⁹ JAK3,⁵⁰ JAK2, JAK1⁵¹
Interleukin signaling pathway	0.037188	STAT5A, STAT5B,⁴⁹ STAT2,⁵⁰ STAT3, STAT1, STAT4, STAT6,⁴⁹ MAPK6,⁵² MAPK1,⁵³ MAPK3,⁵⁴ MAPK7, MAPK15

The second-ranked pathway was the TGF beta signaling pathway. Transforming growth factor-β (TGF-β) is a homodimeric protein that is known as a multifunctional regulator in the target cell and plays a role in numerous types of cancer, including HNSCC.³² Defective TGF-β signaling within epithelial cells promotes the growth of tumors and increases inflammation in tumor stromal cells.³³ In the biological validation, seven of the 15 informative genes (JUND, EP300, CITED2, JUNB, SMAD5, SKIL, SMURF1) were identified as being related to the development of HNSCC.

The third-ranked pathway was the Ras signaling pathway. Ras is a family of small GTPase proteins that is commonly involved in cellular signal transduction.³⁴ The Ras signaling pathway is rarely related to HNSCC, but it has been shown to be highly significant to this cancer by a meta-analysis of differential protein expression networks.³⁵ In this pathway, eight informative genes were selected by the proposed method and all of them (PIK3CA, PIK3C3, PIK3CG, PIK3CB, PIK3CD, HRAS, NRAS, KRAS) were found to be involved in HNSCC.

The fourth-ranked pathway was the JAK STAT signaling pathway. This pathway is involved in cell division, cell death and, most importantly, tumor formation.³⁶ STAT signaling has been identified in the literature as playing an important role in cancer formation and progression.³⁷ In addition, the constituents of JAK STAT activation have been recognized in many cancers, including head and neck cancer.³⁸ In this pathway, nine genes were selected for the classification and all of them (STAT5A, STAT5B, STAT3, STAT4, STAT1, STAT6, JAK3, JAK2, JAK1) were associated with HNSCC based on the literature.

Colorectal cancer

Table 5 shows the top five ranked pathways identified by the iDEAP method for the colorectal cancer data. The first-ranked pathway was the FAS signaling pathway. FAS, also known as Apo1 or CD95, is a death domain-containing member of the tumor necrosis factor receptor (TNFR) family.⁵⁵ The FAS pathway was identified as functional in colorectal cancer, where it induced apoptosis.⁵⁶ Based on biological validation, only one gene (FASL) was identified as being related to the growth of colorectal cancer.

Table 5. Top five pathways using improved Differential Expression Analysis for Pathway (iDEAP) method based on colorectal cancer data.

Pathway	p-value (<0.05)	Selected genes
FAS signaling pathway	0.000182	FAS, FASL⁶⁶
5HT3 type receptor mediated signaling pathway	0.000851	VAMP8, STX3,⁶⁷ SNAP25, VAMP2,⁶⁸ VAMP3, SNAP29, SNAP23, VAMP1,⁶⁹ SLC6A4,⁷⁰ SLC18A1, SLC18A2
p53 pathway	0.005318	TP73,⁷¹ RCHY1,⁶⁹ TP53,⁷² TP63, MDM2,⁷³ MDM4⁷⁴
PDGF signaling pathway	0.036023	PDGFRL,⁷⁵ PDGFB,⁷⁶ PDGFA⁷⁷
JAK STAT signaling pathway	0.012335	STAT5A, STAT5B, STAT3, STAT4, STAT1, STAT6,⁶⁴ JAK3,⁷⁸ JAK2, JAK1⁶⁴

The second-ranked pathway was the 5HT3 type receptor-mediated signaling pathway. The 5HT3 receptor, also known as the 5-Hydroxytryptamine3 receptor, is a member of the cys-loop family of ligand-gated ion channels.⁵⁷ Although no study has proven a direct relationship between this pathway and colorectal cancer, the involvement of digestive function in colorectal cancer is common.⁵⁸ Furthermore, 5-HT3 subunits are expressed in the colon and the small intestine, and this expression is related to colorectal cancer.⁵⁹ The iDEAP method selected 11 genes for classification, and five of those genes (STX3, SNAP25, VAMP2, VAMP1, SLC6A4) were found to be involved in colorectal cancer.

The third-ranked pathway was the p53 pathway; p53 is a tumor suppressor protein in humans encoded by the TP53 gene.⁶⁰ A previous study showed that the p53 tumor suppressor was frequently found in colorectal cancer.⁶¹ In this pathway, six informative genes were selected by the proposed method, of which five (TP73, RCHY1, TP53, MDM2, MDM4) were related to the development of colorectal cancer.

The fourth ranked pathway was the platelet-derived growth factor (PDGF) signaling pathway. This pathway has been studied in cancer progression as PDGF can regulate many cellular processes.⁶² The PDGF signaling pathway consists of four ligands and two receptors involved in colorectal cancer progression.⁶³

The fifth ranked pathway was the JAK STAT signaling pathway, which is a chain of interactions between proteins involved in immune function, cell growth and tumor formation.⁶⁴ Previous research has reported that this pathway was differentially expressed in colorectal cancer tissues.⁶⁵ In this pathway, the proposed method selected nine genes for classification and all of these genes (STAT5A, STAT5B, STAT3, STAT4, STAT1, STAT6, JAK3, JAK2, JAK1) were found to be related to the development of colorectal cancer.

Breast cancer

For the breast cancer data set, the top five pathways are shown in Table 6. The top ranked pathway was the Ras signaling pathway. Ras is a family of related proteins belonging to the small GTPase class that is involved in cellular signal transduction. Mutations in Ras genes can cause unintended and overactive signaling inside the cell, which can ultimately lead to cancer.⁷⁹ A previous study showed that this pathway was persistently activated in nine widely studied human breast cancer cell lines.⁸⁰ The proposed method selected seven informative genes for classification, and all of them (PIK3C3, PIK3CG, PIK3CB, PIK3CD, HRAS, NRAS, KRAS) were found to be involved in the development of breast cancer.

Table 6. Top five pathways identified by the improved Differential Expression Analysis for Pathways (iDEAP) method based on breast cancer data.

Pathway	p-value (<0.05)	Selected genes
Ras signaling pathway	0.001182	PIK3C3,⁸⁸ PIK3CG,⁸⁹ PIK3CB,⁹⁰ PIK3CD,⁹¹ HRAS,⁹² NRAS, KRAS⁹³
Notch signaling pathway	0.002017	RBPJ,⁹⁴ NOTCH1,⁹⁵ NOTCH3,⁹⁶ MAML1, NOTCH2,⁹⁷ NOTCH4⁹⁸
JAK STAT signaling pathway	0.003416	CNR1,⁹⁹ GNAI3¹⁰⁰
Thyrotropin-releasing hormone receptor signaling pathway	0.006196	CACNB3, CACNA1E, CACNA1A,¹⁰¹ CACNB2, CACNA1B,¹⁰¹ CACNB1,¹⁰² CACNB4¹⁰³
Interleukin signaling pathway	0.029594	STAT5A,¹⁰⁴ STAT5B,¹⁰⁵ STAT2,¹⁰⁶ STAT3,¹⁰⁷ STAT1,¹⁰⁸ STAT4,¹⁰⁹ STAT6,¹¹⁰ MAPK6,¹¹¹ MAPK1, MAPK3,⁶⁴ MAPK7,¹¹² MAPK15

The second ranked pathway was the Notch signaling pathway, which is involved in the development of neural tissues, blood vessels, heart, pancreas, mammary gland, T lymphocytes, hematopoietic lineages, and other cell types.⁸¹ A previous study reported that the Notch signaling pathway plays a major role, with multiple functions, during breast tumor progression.⁸²

The third ranked pathway was the JAK STAT signaling pathway. The Janus kinase-signal transducer and activator of transcription (JAK-STAT) pathway plays a pivotal role in transmitting signals from cell-membrane receptors to the nucleus. Moreover, the JAK-STAT pathway is indispensable for numerous cytokines and growth factors that are responsible for crucial cellular processes such as hematopoiesis, lactation, and the development of the immune system and mammary glands.⁸³ A previous study revealed that chemoresistance in breast cancer was associated with the activation of JAK/STAT signaling and suggested that JAK2 may be useful in combating chemoresistance in breast cancer.⁸⁴ Two informative genes (CNR1, GNAI3) were selected for classification and both genes were identified as being involved in breast cancer progression.

The fourth ranked pathway was the Thyrotropin-releasing hormone receptor (TRHR) signaling pathway. The TRHR is a G protein-coupled receptor that binds the tripeptide thyrotropin-releasing hormone.⁸⁵ Dating back to the late 19th century, the administration of thyroid extract was often used in conjunction with oophorectomy as a treatment for breast cancer.⁸⁶ In this pathway, seven genes were selected by the proposed method and six of them (CACNB3, CACNA1E, CACNA1A, CACNA1B, CACNB1, CACNB4) were found to be involved in the progression of breast cancer.

Finally, the fifth ranked pathway was the Interleukin (IL) signaling pathway, which involves proteins of the interleukin family. This pathway regulates numerous biochemical events, including cellular proliferation and long-term survival. Previous studies have shown that many members of the interleukin family contribute to the progression of breast cancer. For example, the IL17 family consists of six protein members, including IL17B; the signaling pathway of IL17B and its receptor, IL17RB, plays a key role in the development and progression of breast cancer.⁸⁷ For this pathway, 12 informative genes were selected for classification, and 11 of them (STAT5A, STAT5B, STAT2, STAT3, STAT1, STAT4, STAT6, MAPK6, MAPK1, MAPK3, MAPK7) were identified as being involved in the development of breast cancer.

Conclusions/Discussion

Pathway-based analysis has led to a new era in genomic studies by integrating the benefits of gene-set analysis and enhancing them with prior information on gene interactions within pathways. However, early methods of pathway-based analysis relied on enrichment-based approaches and identified differentially expressed pathways without identifying the specific regions related to the target phenotype.² Their results are not entirely accurate or complete, since these methods do not consider the interactions within functional molecular pathways.¹¹³ Complex diseases such as cancers usually involve interactions between genes that cause the genes to be expressed differently compared with a single gene considered alone. To obtain more specific biological knowledge, pathway-based analysis needs to shift to sub-pathway-based analysis, which can identify regions of pathways that are dysregulated by diseases or involved in drug-related perturbations. Investigating sub-pathways is therefore more relevant, since it provides a finer-grained resolution that represents the underlying biological processes more accurately.¹¹⁴

One important feature of sub-pathway-based analysis is the ability to exploit the maximum interaction between nodes in pathways. In recent years, a series of methods has been developed to identify informative sub-pathways accurately. This research proposes an Improved Differential Expression Analysis for Pathways (iDEAP) method, which identifies informative sub-pathways and genes in pathways by considering all the interactions involved in the pathways. The iDEAP method extends the DEAP method in two ways: it implements the DMSP search algorithm to identify the informative sub-pathway, and it modifies the recursive scoring calculation to obtain the average DEAP score over all sub-pathways. The averaging is needed because the DEAP score of a single sub-pathway can lead to inaccurate interpretation, since the size and structure of pathways differ.¹⁵ A Support Vector Machine (SVM) classification algorithm was used to measure the performance of the proposed method based on the genes selected within significant pathways. Lastly, the identified pathways and genes were validated using GeneCards and the literature.
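The contrast between scoring a single sub-pathway and averaging over all sub-pathways can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy pathway, the expression values, and the names `EDGES`, `EXPRESSION`, `path_scores` and `summarize` are all hypothetical, and the scoring rule (a signed sum along each path, with inhibitory edges flipping the sign of the downstream contribution) is a simplified assumption about how a DEAP-style recursion might work.

```python
from statistics import mean

# Hypothetical toy pathway: node -> list of (child, sign).
# sign = +1 for an activating edge, -1 for an inhibitory edge.
EDGES = {
    "A": [("B", +1), ("C", -1)],
    "B": [("D", +1)],
    "C": [("D", +1)],
    "D": [],
}
# Made-up differential expression values (e.g. log fold changes).
EXPRESSION = {"A": 1.0, "B": 0.5, "C": -2.0, "D": 0.25}


def path_scores(node, edges, expression):
    """Recursively return the signed score of every path that starts at `node`."""
    scores = [expression[node]]  # the path consisting of `node` alone
    for child, sign in edges[node]:
        for downstream in path_scores(child, edges, expression):
            # An inhibitory edge (sign = -1) inverts the downstream contribution.
            scores.append(expression[node] + sign * downstream)
    return scores


def summarize(edges, expression):
    """Return (maximum, mean) of the absolute scores over all sub-pathways."""
    all_scores = []
    for node in edges:
        all_scores.extend(abs(s) for s in path_scores(node, edges, expression))
    return max(all_scores), mean(all_scores)


best, average = summarize(EDGES, EXPRESSION)
print(f"max single sub-pathway score: {best}")
print(f"average over all sub-pathways: {average}")
```

In this toy example one sub-pathway (A inhibiting C) dominates the maximum, while the average reflects all ten enumerated sub-pathways, which is the kind of size and structure effect the averaging step is intended to dampen.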

Data availability

Data used in this research is available in the Gene Expression Omnibus (GEO) database:

Gene Expression Omnibus: Radioresistant tumor response to interferons. Accession number: GDS3126. https://identifiers.org/geo:GDS3126.¹¹⁵

Gene Expression Omnibus: Early onset colorectal cancer: normal-appearing colonic mucosa. Accession number: GSE4107. https://identifiers.org/geo:GSE4107.¹¹⁶

Gene Expression Omnibus: Breast cancer relapse free survival. Accession number: GSE2034. https://identifiers.org/geo:GSE2034.¹¹⁷

Pathway data used in this research is available in the Protein Analysis Through Evolutionary Relationships database: PANTHER Pathway 3.6.6 http://www.pantherdb.org/downloads/.¹¹⁸

Software availability

Source code available from: https://github.com/NNasarudin/iDEAP

Archived source code at time of publication: https://doi.org/10.5281/zenodo.7816661.¹⁷

License: GNU Lesser General Public License v3.0.

References

1. Nam D, Kim SY: Gene-set approach for expression pattern analysis. Brief. Bioinform. 2008; 9(3): 189–197.
2. Ihnatova I, Popovici V, Budinska E: A critical comparison of topology-based pathway analysis methods. PLoS One. 2018; 13(1): 1–24.
3. Emmert-Streib F, Tripathi S, Matos Simoes RD: Harnessing the complexity of gene expression data from cancer: from single gene to structural pathway methods. Biol. Direct. 2012; 7(1): 25–44.
4. Bezerianos A, Dragomir A, Balomenos P: Computational methods for processing and analysis of biological pathways. Springer; 2017.
5. Li C, Han J, Yao Q, et al.: Subpathway-GM: Identification of metabolic subpathways via joint power of interesting genes and metabolites and their topologies within pathways. Nucleic Acids Res. 2013; 41(9): e101.
6. Judeh T, Johnson C, Kumar A, et al.: TEAK: topology enrichment analysis framework for detecting activated biological subpathways. Nucleic Acids Res. 2013; 41(3): 1425–1437.
7. Nam S, Chang HR, Kim KT, et al.: PATHOME: an algorithm for accurately detecting differentially expressed subpathways. Oncogene. 2014; 33(41): 4941–4951.
8. Vrahatis AG, Dimitrakopoulou K, Balomenos P, et al.: CHRONOS: A time-varying method for microRNA-mediated subpathway enrichment analysis. Bioinformatics. 2016; 32(6): 884–892.
9. Ning Z, Feng C, Song C, et al.: Topologically inferring active miRNA-mediated subpathways toward precise cancer classification by directed random walk. Mol. Oncol. 2019; 13(10): 2211–2226.
10. Li C, Li X, Miao Y, et al.: SubpathwayMiner: a software package for flexible identification of pathways. Nucleic Acids Res. 2009; 37(19): e131–e131.
11. Kanehisa M, Goto S: KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000; 28(1): 27–30.
12. Ashburner M, Ball CA, Blake JA, et al.: Gene ontology: tool for the unification of biology. Nat. Genet. 2000; 25(1): 25–29.
13. Mallavarapu T, Hao J, Kim Y, et al.: Pathway-based deep clustering for molecular subtyping of cancer. Methods. 2020; 173: 24–31.
14. Gatza ML, Lucas JE, Barry WT, et al.: A pathway-based classification of human breast cancer. Proc. Natl. Acad. Sci. 2010; 107(15): 6994–6999.
15. Haynes WA, Higdon R, Stanberry L, et al.: Differential expression analysis for pathways. PLoS Comput. Biol. 2013; 9(3): e1002967.
16. Li X, Shen L, Shang X, et al.: Subpathway analysis based on signaling-pathway impact analysis of signaling pathway. PLoS One. 2015; 10(7): e0132813.
17. Nasarudin NA, Mohamad MS, Zakaria Z, et al.: Improved Differential Expression Analysis for Pathways (iDEAP). 2023.
18. Maraziotis IA, Dimitrakopoulou K, Bezerianos A: Growing functional modules from a seed protein via integration of protein interaction and gene expression data. BMC Bioinformatics. 2007; 8(1): 1–15.
19. Robinson MD, McCarthy DJ, Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010; 26(1): 139–140.
20. Khodarev NN, Minn AJ, Efimova EV, et al.: Signal transducer and activator of transcription 1 regulates both cytotoxic and prosurvival functions in tumor cells. Cancer Res. 2007; 67(19): 9214–9220.
21. Hong Y, Ho KS, Eu KW, et al.: A susceptibility gene set for early onset colorectal cancer that integrates diverse signaling pathways: implication for tumorigenesis. Clin. Cancer Res. 2007; 13(4): 1107–1114.
22. Wang Y, Klijn JG, Zhang Y, et al.: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet. 2005; 365(9460): 671–679.
23. Thomas PD, Kejariwal A, Campbell MJ, et al.: PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification. Nucleic Acids Res. 2003; 31(1): 334–341.
24. Koumakis L, Kanterakis A, Kartsaki E, et al.: MinePath: mining for phenotype differential sub-paths in molecular pathways. PLoS Comput. Biol. 2016; 12(11): e1005187.
25. Hidalgo MR, Cubuk C, Amadoz A, et al.: High throughput estimation of functional cell activities reveals disease mechanisms and predicts relevant clinical outcomes. Oncotarget. 2017; 8(3): 5160–5178.
26. Han J, Han X, Kong Q, et al.: PsSubpathway: A software package for flexible identification of phenotype-specific subpathways in cancer progression. Bioinformatics. 2020; 36(7): 2303–2305.
27. Liu J, Sato C, Cerletti M, et al.: Notch signaling in the regulation of stem cell self-renewal and differentiation. Curr. Top. Dev. Biol. 2010; 92: 367–409.
28. Zhao YY, Yu GT, Xiao T, et al.: The Notch signaling pathway in head and neck squamous cell carcinoma: A meta-analysis. Adv. Clin. Exp. Med. 2017; 26(5): 881–887.
29. Sun W, Gaykalova DA, Ochs MF, et al.: Activation of the NOTCH pathway in head and neck cancer. Cancer Res. 2014; 74(4): 1091–1104.
30. Fukusumi T, Guo TW, Sakai A, et al.: The NOTCH4–HEY1 Pathway Induces Epithelial–Mesenchymal Transition in Head and Neck Squamous Cell Carcinoma. Clin. Cancer Res. 2018; 24(3): 619–633.
31. Ardalan Khales S, Ebrahimi E, Jahanzad E, et al.: MAML1 and TWIST1 co-overexpression promote invasion of head and neck squamous cell carcinoma. Asia Pac. J. Clin. Oncol. 2018; 14: e434–e441.
32. Pang X, Tang YL, Liang XH: Transforming growth factor β signaling in head and neck squamous cell carcinoma: Insights into cellular responses. Oncol. Lett. 2018; 16(4): 4799–4806.
33. White RA, Malkoski SP, Wang XJ: TGFβ signaling in head and neck squamous cell carcinoma. Oncogene. 2010; 29(40): 5437–5446.
34. Colicelli J: Human RAS superfamily proteins and related GTPases. Sci. STKE. 2004; 2004(250): re13-re13.
35. Sikdar S, Datta S, Datta S: Exploring the importance of cancer pathways by meta-analysis of differential protein expression networks in three different cancers. Biol. Direct. 2016; 11(1): 65.
36. Hu X, Li J, Fu M, et al.: The JAK/STAT signaling pathway: From bench to clinic. Signal Transduct. Target. Ther. 2021; 6(1): 402.
37. Garcia R, Jove R: Activation of STAT transcription factors in oncogenic tyrosine kinase signaling. J. Biomed. Sci. 1998; 5(2): 79–85.
38. Cedars E, Johnson DE, Grandis JR: Jak/STAT Signaling in Head and Neck Cancer. Molecular Determinants of Head and Neck Cancer. 2018; 155–184.
39. Mangone FR, Brentani MM, Nonogaki S, et al.: Overexpression of Fos-related antigen-1 in head and neck squamous cell carcinoma. Int. J. Exp. Pathol. 2005; 86(4): 205–212.
40. Riaz N, Morris LG, Lee W, et al.: Unraveling the molecular genetics of head and neck cancer through genome-wide approaches. Genes Dis. 2014; 1(1): 75–86.
41. Zhao Y, Fu D, Xu C, et al.: Identification of genes associated with tongue cancer in patients with a history of tobacco and/or alcohol use. Oncol. Lett. 2017; 13(2): 629–638.
42. Hyakusoku H, Sano D, Takahashi H, et al.: JunB promotes cell invasion, migration and distant metastasis of head and neck squamous cell carcinoma. J. Exp. Clin. Cancer Res. 2016; 35(1): 6.
43. Hagerstrand D, Tong A, Schumacher SE, et al.: Systematic interrogation of 3q26 identifies TLOC1 and SKIL as cancer drivers. Cancer Discov. 2013; 3: 1044–1057.
44. Khammanivong A, Gopalakrishnan R, Dickerson EB: SMURF1 silencing diminishes a CD44-high cancer stem cell-like population in head and neck squamous cell carcinoma. Mol. Cancer. 2014; 13(1): 260.
45. Qiu W, Schönleben F, Li X, et al.: PIK3CA mutations in head and neck squamous cell carcinoma. Clin. Cancer Res. 2006; 12(5): 1441–1446.
46. Giudice FS, Squarize CH: The determinants of head and neck cancer: Unmasking the PI3K pathway mutations. J. Carcinog. Mutagen. 2013.
47. Jung K, Kang H, Mehra R: Targeting phosphoinositide 3-kinase (PI3K) in head and neck squamous cell carcinoma (HNSCC). Cancers Head Neck. 2018; 3(1): 3.
48. Boeckx C, Weyn C, Bempt IV, et al.: Mutation analysis of genes in the EGFR pathway in Head and Neck cancer patients: implications for anti-EGFR treatment response. BMC. Res. Notes. 2014; 7(1): 337.
49. Lui VW, Xi S, Raymond CL, et al.: Activation of STAT5 contributes to tumor growth and epithelial-mesenchymal transition in head and neck cancer. 2006.
50. Lui VW, Hedberg ML, Li H, et al.: Frequent mutation of the PI3K pathway in head and neck cancer defines predictive biomarkers. Cancer Discov. 2013; 3: 761–769.
51. De Carvalho TG, De Carvalho AC, Maia DCC, et al.: Search for mutations in signaling pathways in head and neck squamous cell carcinoma. Oncol. Rep. 2013; 30(1): 334–340.
52. Elkhadragy L, Chen M, Miller K, et al.: A regulatory BMI1/let-7i/ERK3 pathway controls the motility of head and neck cancer cells. Mol. Oncol. 2017; 11(2): 194–207.
53. Reyes-Gibby CC, Wang J, Silvas MRT, et al.: MAPK1/ERK2 as novel target genes for pain in head and neck cancer patients. BMC Genet. 2016; 17(1): 40.
54. Naghavi AO, Ahmed KA, Kim Y, et al.: Head and Neck Cancer Genes Predictive of Radioresistance and Detriment to Local Control. Int. J. Radiat. Oncol. Biol. Phys. 2017; 99(2): S122–S123.
55. Strasser A, Jost PJ, Nagata S: The many roles of FAS receptor signaling in the immune system. Immunity. 2009; 30(2): 180–192.
56. Houghton JA, Harwood FG, Gibson AA, et al.: The fas signaling pathway is functional in colon carcinoma cells and induces apoptosis. Clin. Cancer Res. 1997; 3(12): 2205–2209.
57. Thompson AJ, Lummis SCR: 5-HT3 receptors. Curr. Pharm. Des. 2006; 12(28): 3615–3630.
58. Gershon MD, Tack J: The serotonin signaling system: from basic understanding to drug development for functional GI disorders. Gastroenterology. 2007; 132: 397–414.
59. Davies PA, Pistis M, Hanna MC, et al.: The 5-HT3B subunit is a major determinant of serotonin-receptor function. Nature. 1999; 397(6717): 359–363.
60. Richardson C, Zhang S, Hernandez Borrero LJ, et al.: Small-molecule CB002 restores p53 pathway signaling and represses colorectal cancer cell growth. Cell Cycle. 2017; 16(18): 1719–1725.
61. Fearon ER: Molecular genetics of colorectal cancer. Annu. Rev. Pathol. 2011; 6: 479–507.
62. Huang F, Wang D, Yao Y, et al.: PDGF signaling in cancer progression. Int. J. Clin. Exp. Med. 2017; 10(7): 9918–9929.
63. Mönch R: The Growth Factor PDGF and its Signaling Pathways in Colorectal Cancer (Doctoral dissertation, Universität Würzburg). 2017.
64. Slattery ML, Lundgreen A, John EM, et al.: MAPK genes interact with diet and lifestyle factors to alter risk of breast cancer: the Breast Cancer Health Disparities Study. Nutr. Cancer. 2015; 67(2): 292–304.
65. Uchiyama T, Takahashi H, Endo H, et al.: Role of the long form leptin receptor and of the STAT3 signaling pathway in colorectal cancer progression. Int. J. Oncol. 2011; 39(4): 935–940.
66. Zhang W, Ding EX, Wang Q, et al.: Fas ligand expression in colon cancer: a possible mechanism of tumor immune privilege. World J Gastroenterol: WJG. 2005; 11(23): 3632–3635.
67. Rivetti S, Lauriola M, Voltattorni M, et al.: Gene expression profile of human colon cancer cells treated with cross-reacting material 197, a diphtheria toxin non-toxic mutant. Int. J. Immunopathol. Pharmacol. 2011; 24(3): 639–649.
68. Grabowski P, Schönfelder J, Ahnert-Hilger G, et al.: Expression of neuroendocrine markers: a signature of human undifferentiated carcinoma of the colon and rectum. Virchows Arch. 2002; 441(3): 256–263.
69. Joyce T, Oikonomou E, Kosmidou V, et al.: A molecular signature for oncogenic BRAF in human colon cancer cells is revealed by microarray analysis. Curr. Cancer Drug Targets. 2012; 12(7): 873–898.
70. Savas S, Hyde A, Stuckless SN, et al.: Serotonin transporter gene (SLC6A4) variations are associated with poor survival in colorectal cancer patients. PLoS One. 2012; 7(7): e38953.
71. Zhou CZ, Qiu GQ, Fang Zhang LH, et al.: Loss of heterozygosity on chromosome 1 in sporadic colorectal carcinoma. World J. Gastroenterol. 2004; 10(10): 1431–1435.
72. Iacopetta B: TP53 mutation in colorectal cancer. Hum. Mutat. 2003; 21(3): 271–276.
73. Sugano N, Suda T, Godai TI, et al.: MDM2 gene amplification in colorectal cancer is associated with disease progression at the primary site, but inversely correlated with distant metastasis. Genes Chromosom. Cancer. 2010; 49(7): 620–629.
74. Suda T, Yoshihara M, Nakamura Y, et al.: Rare MDM4 gene amplification in colorectal cancer: The principle of a mutually exclusive relationship between MDM alteration and TP53 inactivation is not applicable. Oncol. Rep. 2011; 26(1): 49–54.
75. Flanagan JM, Healey S, Young J, et al.: Mapping of a candidate colorectal cancer tumor-suppressor gene to a 900-kilobase region on the short arm of chromosome 8. Genes Chromosom. Cancer. 2004; 40(3): 247–260.
76. Nakamura Y, Tanaka F, Yoshikawa Y, et al.: PDGF-BB is a novel prognostic factor in colorectal cancer. Ann. Surg. Oncol. 2008; 15(8): 2129–2136.
77. Manzat Saplacan RM, Balacescu L, Gherman C, et al.: The role of PDGFs and PDGFRs in colorectal cancer. Mediat. Inflamm. 2017; 2017: 1–9.
78. Lin Q, Lai R, Chirieac LR, et al.: Constitutive activation of JAK3/STAT3 in colon carcinoma tumors and cell lines: inhibition of JAK3/STAT3 signaling induces apoptosis and cell cycle arrest of colon carcinoma cells. Am. J. Pathol. 2005; 167(4): 969–980.
79. Goodsell DS: The molecular perspective: the ras oncogene. Oncologist. 1999; 4(3): 263–264.
80. Eckert LB, Repasky GA, Ülkü AS, et al.: Involvement of Ras activation in human breast cancer cell signaling, invasion, and anoikis. Cancer Res. 2004; 64(13): 4585–4592.
81. Miller MA, Zachary JF: Mechanisms and morphology of cellular injury, adaptation, and death. Pathologic Basis of Veterinary Disease. 6th ed. 2017; pp. 2–43.
82. Kontomanolis EN, Kalagasidou S, Pouliliou S, et al.: The Notch Pathway in Breast Cancer Progression. Sci. World J. 2018; 2018: 1–11.
83. Seif F, Khoshmirsafa M, Aazami H, et al.: The role of JAK-STAT signaling pathway and its regulators in the fate of T helper cells. Cell Commun. Signal. 2017; 15(1): 1–13.
84. Nascimento AS, Peres LL, Fari AV, et al.: Phosphoproteome profiling reveals critical role of JAK-STAT signaling in maintaining chemoresistance in breast cancer. Oncotarget. 2017; 8(70): 114756–114768.
85. Yamada M, Monden T, Konaka S, et al.: Assignment of human thyrotropin-releasing hormone (TRH) receptor gene to chromosome 8. Somat. Cell Mol. Genet. 1993; 19(6): 577–580.
86. Page F, Bishop W: Recurrent Carcinoma Of The Female Breast Entirely Disappearing Under The Persistent Use Of Thyroid Extract Continued For Eighteen Months. Lancet. 1898; 151(3900): 1460–1461.
87. Alinejad V, Dolati S, Motallebnezhad M, et al.: The role of IL17B-IL17RB signaling pathway in breast cancer. Biomed. Pharmacother. 2017; 88: 795–803.
88. Chalabi N, Satih S, Delort L, et al.: Expression profiling by whole-genome microarray hybridization reveals differential gene expression in breast cancer cell lines after lycopene exposure. Biochimica et Biophysica Acta (BBA)-Gene Structure and Expression. 2007; 1769(2): 124–130.
89. Zhang S, Liu J, Xu K, et al.: Notch signaling via regulation of RB and p-AKT but not PIK3CG contributes to MIA PaCa-2 cell growth and migration to affect pancreatic carcinogenesis. Oncol. Lett. 2018; 15(2): 2105–2110.
90. Nakanishi Y, Walter K, Spoerke JM, et al.: Activating mutations in PIK3CB confer resistance to PI3K inhibition and define a novel oncogenic role for p110β. Cancer Res. 2016; 76: 1193.
91. Kok K, Nock GE, Verrall EA, et al.: Regulation of p110δ PI 3-kinase gene expression. PLoS One. 2009; 4(4): e5145.
92. Miller FR, Soule HD, Tait L, et al.: Xenograft model of progressive human proliferative breast disease. JNCI: Journal of the National Cancer Institute. 1993; 85(21): 1725–1732.
93. Sánchez-Muñoz A, Gallego E, de Luque V, et al.: Lack of evidence for KRAS oncogenic mutations in triple-negative breast cancer. BMC Cancer. 2010; 10(1): 136.
94. Capaccione KM, Pine SR: The Notch signaling pathway as a mediator of tumor survival. Carcinogenesis. 2013; 34(7): 1420–1430.
95. Parr C, Watkins G, Jiang WG: The possible correlation of Notch-1 and Notch-2 with clinical outcome and tumor clinicopathological parameters in human breast cancer. Int. J. Mol. Med. 2004; 14(5): 779–786.
96. Dou XW, Liang YK, Lin HY, et al.: Notch3 Maintains Luminal Phenotype and Suppresses Tumorigenesis and Metastasis of Breast Cancer via Trans-Activating Estrogen Receptor-α. Theranostics. 2017; 7(16): 4041–4056.
97. Fu YP, Edvardsen H, Kaushiva A, et al.: NOTCH2 in breast cancer: association of SNP rs11249433 with gene expression in ER-positive breast tumors without TP53 mutations. Mol. Cancer. 2010; 9(1): 113.
98. Wang JW, Wei XL, Dou XW, et al.: The association between Notch4 expression, and clinicopathological characteristics and clinical outcomes in patients with breast cancer. Oncol. Lett. 2018; 15(6): 8749–8755.
99. Sarnataro D, Grimaldi C, Pisanti S, et al.: Plasma membrane and lysosomal localization of CB1 cannabinoid receptor are dependent on lipid rafts and regulated by anandamide in human breast cancer cells. FEBS Lett. 2005; 579(28): 6343–6349.
100. Kelly P, Moeller BJ, Juneja J, et al.: The G12 family of heterotrimeric G proteins promotes breast cancer invasion and metastasis. Proc. Natl. Acad. Sci. 2006; 103(21): 8173–8178.
101. Phan NN, Wang CY, Chen CF, et al.: Voltage-gated calcium channels: Novel targets for cancer therapy. Oncol. Lett. 2017; 14(2): 2059–2074.
102. Bravatà V, Cammarata FP, Forte GI, et al.: “Omics” of HER2-positive breast cancer. Omics. 2013; 17(3): 119–129.
103. Chung S, Low SK, Zembutsu H, et al.: A genome-wide association study of chemotherapy-induced alopecia in breast cancer patients. Breast Cancer Res. 2013; 15(5): R81.
104. Mukhopadhyay UK, Cass J, Raptis L, et al.: Dataset of STAT5A status in breast cancer. Data Brief. 2016; 7: 490–492.
105. Peck AR, Witkiewicz AK, Liu C, et al.: Low levels of Stat5a protein in breast cancer are associated with tumor progression and unfavorable clinical outcomes. Breast Cancer Res. 2012; 14(5): R130.
106. Yan GR, Xu SH, Tan ZL, et al.: Global identification of miR-373-regulated genes in breast cancer by quantitative proteomics. Proteomics. 2011; 11(5): 912–920.
107. Banerjee K, Resat H: Constitutive activation of STAT3 in breast cancer cells: A review. Int. J. Cancer. 2016; 138(11): 2570–2578.
108. Koromilas AE, Sexl V: The tumor suppressor function of STAT1 in breast cancer. Jak-Stat. 2013; 2(2): e23353.
109. Nunez AR: The role of the interleukin-12/STAT4 axis in breast cancer. 2016.
110. Gooch JL, Christy B, Yee D: STAT6 mediates interleukin-4 growth inhibition in human breast cancer cells. Neoplasia. 2002; 4(4): 324–331.
111. Al-Mahdi R, Babteen N, Thillai K, et al.: A novel role for atypical MAPK kinase ERK3 in regulating breast cancer cell morphology and migration. Cell Adhes. Migr. 2015; 9(6): 483–494.
112. Javaid S, Zhang J, Smolen GA, et al.: MAPK7 regulates EMT features and modulates the generation of CTCs. Mol. Cancer Res. 2015; 13: 934.
113. Tarca AL, Draghici S, Khatri P, et al.: A novel signaling pathway impact analysis. Bioinformatics. 2009; 25(1): 75–82.
114. Chen X, Xu J, Huang B, et al.: A sub-pathway-based approach for identifying drug response principal network. Bioinformatics. 2011; 27(5): 649–654.
115. G: GEO DataSet Browser. GEO DataSet Browser.n.d.Reference Source
116. n.d.. http
117. n.d.. http
118. n.d.. http

Competing interests

No competing interests were disclosed.

Grant information

This work was supported by the United Arab Emirates University (UAEU) through Strategic Research Program [Vot: 12R111].

Article Versions (1)

version 1

Published: 06 Nov 2023, 12:1433

https://doi.org/10.12688/f1000research.132899.1

Copyright

© 2023 Nasarudin NA et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

how to cite this article

Nasarudin NA, Mohamad MS, Zakaria Z et al. Identification of informative genes and sub-pathways using Improved Differential Expression Analysis for Pathways (iDEAP) for cancer classification [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2023, 12:1433 (https://doi.org/10.12688/f1000research.132899.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

Open Peer Review

Reviewer Report 01 Feb 2024

Nora Muda, Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Malaysia

Approved with Reservations

https://doi.org/10.5256/f1000research.145851.r232310

Dear Authors,

I have read your paper and would like to thank you all for your kind attention.

This manuscript is about the analysis of pathways, primarily focused on sub-pathway-based analysis. The authors proposed an extended method of DEAP by improving the identification of perturbed sub-pathways that achieves higher performance in the detection of differentially expressed pathways.

The comments that need to be highlighted in the manuscript are as follows:
1. Since the proposed statistics calculation is based on maximum order statistics, do you rank your maximum score before calculating it using the proposed method?
2. Is there any statistical theorem of maximum order statistics you are applying to your proposed method? It is good to provide it in your manuscript.
3. What does 'p' stand for in Equation 4?
4. Does 'n' have the same value in Eq. 3 and Eq. 4?
5. You calculate the average of the score twice, once in Eq. 3 and again in Eq. 4. The difference between these two calculations is not clear to me.
6. Has any comparison been made with other previous methods? So that we can see your proposed method is better than the previous one.

The above comments should, at a minimum, be checked and taken into consideration.

Is the work clearly and accurately presented and does it cite the current literature? Yes
Is the study design appropriate and is the work technically sound? Yes
Are sufficient details of methods and analysis provided to allow replication by others? Partly
If applicable, is the statistical analysis and its interpretation appropriate? Partly
Are all the source data underlying the results available to ensure full reproducibility? Yes
Are the conclusions drawn adequately supported by the results? Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Applied statistics, mathematical statistics, computational biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Reviewer Report 20 Dec 2023

Suhaila Zainudin, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia

Approved

https://doi.org/10.5256/f1000research.145851.r221362

The paper introduces an enhanced version of the Differential Expression Analysis for Pathways (DEAP) method, termed iDEAP, for improved identification of perturbed sub-pathways in biological reactions related to complex diseases. The method incorporates a search algorithm adapted from the Detect Module from Seed Protein (DMSP) algorithm and considers the relationship among sub-pathways to enhance the identification of differentially expressed pathways. The study applies three gene expression datasets and demonstrates that the proposed method outperforms previous approaches. In particular, when assessing identified genes for cancer classification through 10-fold cross-validation, the improved method shows higher accuracy for colorectal cancer (90%) and breast cancer (94%). The findings suggest that the iDEAP method effectively identifies informative genes associated with the targeted phenotype, and a biological validation of the top five significant pathways and selected genes supports its utility in disease detection. The paper concludes by highlighting the potential impact of computational biology on biomedical research and medicine in the future.

Is the work clearly and accurately presented and does it cite the current literature? Yes
Is the study design appropriate and is the work technically sound? Yes
Are sufficient details of methods and analysis provided to allow replication by others? Yes
If applicable, is the statistical analysis and its interpretation appropriate? Yes
Are all the source data underlying the results available to ensure full reproducibility? Yes
Are the conclusions drawn adequately supported by the results? Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: bioinformatics, data science

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.


[103] 103. Chung S, Low SK, Zembutsu H, et al.: A genome-wide association study of chemotherapy-induced alopecia in breast cancer patients. Breast Cancer Res. 2013; 15(5): R81. PubMed Abstract | Publisher Full Text | Free Full Text

[104] 104. Mukhopadhyay UK, Cass J, Raptis L, et al.: Dataset of STAT5A status in breast cancer. Data Brief. 2016; 7: 490–492. PubMed Abstract | Publisher Full Text | Free Full Text

[105] 105. Peck AR, Witkiewicz AK, Liu C, et al.: Low levels of Stat5a protein in breast cancer are associated with tumor progression and unfavorable clinical outcomes. Breast Cancer Res. 2012; 14(5): R130. PubMed Abstract | Publisher Full Text | Free Full Text

[106] 106. Yan GR, Xu SH, Tan ZL, et al.: Global identification of miR-373-regulated genes in breast cancer by quantitative proteomics. Proteomics. 2011; 11(5): 912–920. PubMed Abstract | Publisher Full Text

[107] 107. Banerjee K, Resat H: Constitutive activation of STAT3 in breast cancer cells: A review. Int. J. Cancer. 2016; 138(11): 2570–2578. PubMed Abstract | Publisher Full Text | Free Full Text

[108] 108. Koromilas AE, Sexl V: The tumor suppressor function of STAT1 in breast cancer. Jak-Stat. 2013; 2(2): e23353. PubMed Abstract | Publisher Full Text | Free Full Text

[109] 109. Nunez AR: The role of the interleukin-12/STAT4 axis in breast cancer. 2016.

[110] 110. Gooch JL, Christy B, Yee D: STAT6 mediates interleukin-4 growth inhibition in human breast cancer cells. Neoplasia. 2002; 4(4): 324–331. PubMed Abstract | Publisher Full Text | Free Full Text

[111] 111. Al-Mahdi R, Babteen N, Thillai K, et al.: A novel role for atypical MAPK kinase ERK3 in regulating breast cancer cell morphology and migration. Cell Adhes. Migr. 2015; 9(6): 483–494. PubMed Abstract | Publisher Full Text | Free Full Text

[112] 112. Javaid S, Zhang J, Smolen GA, et al.: MAPK7 regulates EMT features and modulates the generation of CTCs. Mol. Cancer Res. 2015; 13: 934. Publisher Full Text

[113] 113. Tarca AL, Draghici S, Khatri P, et al.: A novel signaling pathway impact analysis. Bioinformatics. 2009; 25(1): 75–82. PubMed Abstract | Publisher Full Text | Free Full Text

[114] 114. Chen X, Xu J, Huang B, et al.: A sub-pathway-based approach for identifying drug response principal network. Bioinformatics. 2011; 27(5): 649–654. PubMed Abstract | Publisher Full Text

[115] 115. G: GEO DataSet Browser. GEO DataSet Browser. n.d. Reference Source

[116] 116. n.d.. http

[117] 117. n.d.. http

[118] 118. n.d.. http

