Using complex networks for refining survival prognosis in prostate cancer patient

Massimiliano Zanin

doi:10.12688/f1000research.8282.1

Home Browse Using complex networks for refining survival prognosis in prostate...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Method Article

Using complex networks for refining survival prognosis in prostate cancer patient

[version 1; peer review: 2 approved]

Massimiliano Zanin^1,2

PUBLISHED 16 Nov 2016

Author details Author details

¹ Innaxis Foundation & Research Institute, Madrid, Spain
² Department of Electrical Engineering, Faculty of Sciences and Technology, Universidade Nova de Lisboa, Caparica, Portugal

OPEN PEER REVIEW

REVIEWER STATUS

Abstract

Complex network theory has been used, during the last decade, to understand the structures behind complex biological problems, yielding new knowledge in a large number of situations. Nevertheless, such knowledge has remained mostly qualitative. In this contribution, I show how information extracted from a network representation can be used in a quantitative way, to improve the score of a classification task. As a test bed, I consider a dataset corresponding to patients suffering from prostate cancer, and the task of successfully prognosing their survival. When information from a complex network representation is added on top of a simple classification model, the error is reduced from 27.9% to 23.8%. This confirms that network theory can be used to synthesize information that may not readily be accessible by standard data mining algorithms.

Keywords

Prostate cancer, survival prognosis, complex networks, classification

Corresponding author: Massimiliano Zanin

Competing interests: The author declares no competing interests.

Grant information: The author declares that no grants were involved in supporting this work.

Copyright: © 2016 Zanin M. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

How to cite: Zanin M. Using complex networks for refining survival prognosis in prostate cancer patient [version 1; peer review: 2 approved]. F1000Research 2016, 5:2675 (https://doi.org/10.12688/f1000research.8282.1) First published: 16 Nov 2016, 5:2675 (https://doi.org/10.12688/f1000research.8282.1) Latest published: 16 Nov 2016, 5:2675 (https://doi.org/10.12688/f1000research.8282.1)

Introduction

Constructing prognostic models for different types of cancers is a problem that is attracting increasing attention, due to the high impact that these models may have in the clinical treatment. This is clearly related to the movement of personalized medicine (Jain, 2005; Samani et al., 2010; Van't Veer & Bernards, 2008). As more and more data describing human biology are available, both for healthy and pathological conditions, coming from heterogeneous sources (e.g. from all the -omics fields), there is a well-founded hope that such data may be of help to improve the treatment of individual patients, personalizing the way drugs and therapies are provided.

When one ought to extract a model from a collection of data, the customary solution is to resort to data mining algorithms. In the case of cancer prognosis, this has resulted in the development of numerous models - see, for instance, Alexe et al., 2006; Gupta et al., 2011; Halabi et al., 2003; Halabi et al., 2014; Mangasarian & Wolberg, 2000 and Quaranta et al., 2005 for a few examples. Data mining nevertheless presents some drawbacks, the most important of which is the way features are analyzed. Elements are considered individually, or by being pairwise combined; yet, data mining does not provide a way to create a global picture of the available data.

In the last decade, a novel solution has been proposed. The complex network theory provides an elegant way for representing the structure created by the interactions between the elements of a complex system (Boccaletti et al., 2006; Strogatz, 2001). The result is encoded in an adjacency matrix, which can then be analyzed by means of multiple metrics (Costa et al., 2007). Applications span from the characterization of social networks, to the internet or the human brain (Costa et al., 2011).

In this contribution, I explore the possibility of using complex networks as an instrument for improving a model of survival prognosis of patients with metastatic castrate resistant prostate cancer (mCRPC) treated with docetaxel. In order to achieve this, I compare two models. The first one is a classification model, i.e. classifying between surviving and non-surviving patients, which only uses raw features like baseline lab results and patient vital signs. The second one combines such information with structural metrics extracted from a network representation of the same data. The hypothesis tested here is that complex networks should synthesize information present in the raw data in a new way that should reflect an improved classification score (Zanin et al., 2014b).

The paper is organized as follows: first, I describe the main methods of the analysis, with a special focus on the networks reconstruction methodology and the metrics used for their characterization, and the dataset considered here; afterwards, the results obtained are presented, i.e. the comparison of the two classification models; finally, some conclusions are drawn.

Methods

Network reconstruction

Reconstructing a network representation of a given system entails two steps. First, one needs to define the elements of such a system. This is usually constrained by the type of available data; thus, in this case, the nodes of the network are going to correspond to the different available biomarkers.

Second, one should detect when two of such elements are connected by some kind of relationship. If a priori knowledge is available, e.g. information about how different metabolites or proteins are connected in a pathway, such information can directly be mapped into the network. Alternatively, if a time evolution (i.e. a time series) is available for each element, functional links can be established between them, by means of metrics like correlations or causalities. Note that this last option entails two important problems: a time evolution should be available, which is not straightforward in the case of biomedical analyses; and that functional links represent the “co-evolution” of factors, while in some cases, and specifically in the diagnosis of a disease, it is more interesting to detect “deviations” from the expected (healthy) behavior.

Recently, a new methodology for network reconstruction has been proposed, which solves the two aforementioned problems (Zanin & Boccaletti, 2011; Zanin et al., 2014). Starting with a set of scalar values, pairs of elements are analyzed by firstly detecting if a standard relation is present between them in a set of control subjects; afterwards, data corresponding to new subjects are compared with such relation, and a link is created between two nodes if they present an abnormal deviation. The resulting object is called a parenclitic network, named after the Greek term for “deviation”, originally used by the Greek philosopher Epicurus to designate the spontaneous and unpredictable swerving of free-falling atoms (Zanin et al., 2014).

In mathematical terms, suppose n healthy subjects are described by a vector of features, such that the i-th of them is represented by f_i = (f_i,1, f_i,2, … , f_{i,n_f}). All the n_f features are mapped into nodes of the network, which is now described by an adjacency matrix A_{n_f×n_f}. As the final aim is to construct a network for each subject under study, suppose a new subject j, with its corresponding vector f_j, is introduced in the system. The reconstruction process should analyze each pair of features, denoted by k and l, to understand if they deviated from the expected (healthy) behavior. For the sake of simplicity, in this work we consider that the healthy relation can be obtained as a linear regression between both features:

f._,l = α_k,l + β_k,lf._,k + ∈_k,l.

Here, f._,k represents the vector of values of feature k for all healthy subjects, and α_k,l and β_k,l the two parameters of the best linear fit. Additionally, ∈_k,l is a vector containing all fit errors; note that a linear relation may not describe well the relationship between k and l, and that this vector will be key to understand its statistical significance. Now, suppose a new subject h is available, for which their health condition is unknown, and for which one wants to create the corresponding network representation. A link between nodes k and l is then created, with a weight equal to its distance from the previously detected normal relation:

$w_{k, l} = \frac{f_{h, l} - (α_{k_{,} l} + β_{k_{,} l} f_{h, k})}{σ_{k, l}},$

being σ_k,l the standard deviation of ∈_k,l. In other words, w_k,l represents the Z-score of the distance of the subject h with respect to the normal behavior of features k and l - large values of w_k,l, both positive and negative, indicate that the subject under analysis presents an abnormal behavior, which may be symptomatic of a disease. When the process is repeated for all pairs of features, the result is a parenclitic network for each patient.

Network interpretation

Intuitively, healthy subjects should be associated with random-like networks, as strong links may appear due to the intrinsic noise of biological processes, but should not form coherent structures; on the other hand, patients should present networks with non-trivial topologies. Also, the more a network is different from a random structure, the more severe the pathology is expected to be.

In order to transform the obtained networks into a representation suitable to be used in a data mining (classification) algorithm, first these have been binarized, i.e. links with a weight |w_k,l| ≤ 0.5 have been discarded. The threshold of 0.5 has manually been set, in order to obtain structures dense enough to support the subsequent analysis, but still being able to discard statistically insignificant connections. Afterwards, two topological (i.e. structural) properties have been considered:

Link density, defined as the number of links present in the network, divided by the number of all possible links. The higher the link density, the more pairs of features present an abnormal behavior.
Information content (Zanin et al., 2014). This metric assesses the presence of mesoscale structures, i.e. structures created by small groups of nodes, by evaluating the information lost when pairs of nodes are iteratively merged together. Low values of Information Content indicate a random-like structure; conversely, high values suggest a non-trivial topology, potentially fingerprint of a severe condition.

Classification

In order to evaluate the performance of a complex network representation with respect to a baseline, a classification between the two groups of patients (i.e. surviving vs. not surviving patients) is performed, and the resulting scores compared. Such classification is based on a support vector machine (SVM) model with linear kernel (Noble, 2006; Wang, 2005).

SVMs are binary linear classifiers that model concepts by creating hyperplanes in a multidimensional space, which can be used for both classification and regression (Cortes & Vapnik, 1995). A good separation is achieved by the hyperplane that has the largest distance to the nearest training-data point of any class, as this minimises the error. The SVM model has been chosen for two reasons: its good performance and diffusion in biomedical classification problems; and its simplicity: only linear relationships are mined, allowing a better identification of the contribution of the complex network representation.

The validation of the results has been performed using a 10-fold cross-validation (Friedman et al., 2001). The original sample of subjects is randomly partitioned into 10 equal sized subsamples. A single subsample is retained as the validation data for testing the model, and the remaining 9 subsamples are used as training data. The cross-validation process is then repeated 10 times, with each of the 10 subsamples used exactly once as the validation data. The average value of the error obtained in the 10 executions is used for estimating the error.

Initial dataset

The dataset considered here is part of the Prostate Cancer DREAM Challenge, including information from the prostate cancer clinical trials ASCENT-2 (Novacea, provided by Memorial Sloan Kettering Cancer Center) (Scher et al., 2011), VENICE (Sanofi) (Tannock et al., 2013), MAINSAIL (Celgene) (Petrylak et al., 2015), and ENTHUSE-33 (AstraZeneca) (Fizazi et al., 2013). Only the data included in the CoreTable have been considered, representing the core patient level data. They cover information about demographics, co-existing disease conditions, prior treatment of the tumor and other co-existing conditions, important baseline lab results and vital signs, lesion measure and early response to therapy. More information on the dataset can be found at https://www.synpase.org/ProstateCancerChallenge.

One of the limitations of the network reconstruction process previously described is that it can only handle numerical features. Thus, only those features fulfilling this condition have been selected. Additionally, binary features have been transformed into numbers, i.e. 1 for “yes” and 0 for “no”. The final data set included 92 features for each patient.

Afterwards, 2000 patients have been randomly selected, of which half of them did not survive cancer - as coded by the DEATH flag in the dataset. The rationale of selecting only a subset of patients is two-fold: first, to reduce the computational cost, and thus allow a more detailed analysis of results; and second, to ensure that the data set used in the classification task is balanced, i.e. it includes the same number of subjects in both classes. All other patients have been discarded.

Results

Standard scenario: raw features classification

Figure 1 presents the results obtained in the classification of patients using only raw features. As previously introduced, this classification will be the baseline against which the benefits of using complex networks will be evaluated. In order to reduce the computational cost of the analysis, and to reduce the risk of overfitting, a greedy feature selection algorithm has been executed. The three selected features were: LDH (Lactate Dehydrogenase level), TURP (prior transurethral resection of the prostate, binary value) and MHGEN (presence of general disorders, binary value). The probability distributions for the three features are presented in Figure 1 top and bottom left.

Figure 1. Classification with raw features.

Probability distributions of the LDH feature for surviving and not surviving patients (top left). Appearance probability of the features TURP and MHGEN, for surviving and not surviving patients (top right and bottom left). Classification score when considering LDH, LDH + TURP, and all three features (bottom right).

By using these three selected features, the classification score reaches 72.1% (Figure 1, bottom right). Adding more features does not yield substantial improvements.

Enhanced scenario: complex network features

In the second case, I consider the same original raw features, plus the two features synthesized from the complex network representation, as previously described. A network has been created for each subject, by using the information of surviving patients as baseline- in other words, surviving patients have been considered as healthy, following the convention previously described. In order to avoid overfitting, a new baseline has been calculated in each one of the 10 cross-validation rounds, ensuring no patient was included both in the training and in the classification steps. Finally, a greedy feature selection algorithm has been executed on the complete feature set, following the same process described previosuly.

Figure 2 presents the results obtained, both in terms of the network features probability distributions (top), and the classification score (bottom). It can be appreciated as the classification score improves, from 72.1% up to 76.2%; this corresponds to a decrease of 15% in the classification error.

Figure 2. Classification with complex network features.

(Top) Probability distributions of the link density and Information Content features, for surviving and not surviving patients. See main text for definitions. (Bottom) Classification score when considering LDH, LDH + link density, and all three features.

Conclusions

If complex networks have by and large been used to describe biomedical problems (Costa et al., 2011), much less attention has been devoted to their relation with prediction, i.e. to how the information they provide could be used in the construction of diagnosis models. In this contribution, I make a first step in this direction, by studying the following hypothesis: can the precision of a predictive model be improved, if information extracted from a complex network representation is fed to a data mining algorithm along with raw features?

I used, as a test bed, a data set describing patients suffering from prostate cancer, and a classification task in which patients are discriminated according to the expected prognosis (surviving vs. not surviving). The inclusion of complex network features, obtained through a parenclitic representation (Zanin & Boccaletti, 2011; Zanin et al., 2014), resulted in a small but significant reduction of the classification error (from 27.9% to 23.8%).

When comparing these results with the state of the art, as for instance (Halabi et al., 2003; Halabi et al., 2014), it is clear that they are still far away from representing an efficient prognostic instrument. Within the Prostate Cancer DREAM Challenge, the proposed method ranked 50 out of 51 in Subchallenge 1a (iAUC of 0.6171, against a reference of 0.7429 of the Halabi et al. method and 0.7915 of the winning team); and 27 out of 49 in Subchallenge 1b (RMSE of 214.39, against 194.41 of the winning team). Additionally, an error of the 23.8% in the survival probability is clearly intolerable for clinical applications.

It is also important to note that complex networks introduce a “black box” element in the analysis. As features are represented and analyzed in a topological way, i.e. focusing on the structure created by their relationships, it is not possible to identify which element(s) contribute the most to the final model. This complicates direct comparisons with standard prognostic models, and the design of therapeutic solutions.

In spite of the discussed drawbacks, we believe that the results here reported shed light on the importance of using complex networks in future prognostic models, as a way of synthesizing complex relationships in simple and numerical metrics.

Data availability

The Challenge datasets can be accessed at: https://www.projectdatasphere.org/projectdatasphere/html/pcdc

Challenge documentation, including the detailed description of the Challenge design, overall results, scoring scripts, and the clinical trials data dictionary can be found at: https://www.synapse.org/ProstateCancerChallenge

The code and documentation underlying the method presented in this paper can be found at: http://dx.doi.org/10.7303/syn4732239 (Zanin, 2016)

Competing interests

The author declares no competing interests.

Grant information

The author(s) declared that no grants were involved in supporting this work.

Acknowledgments

This publication is based on research using information obtained from www.projectdatasphere.org, which is maintained by Project Data Sphere, LLC. Neither Project Data Sphere, LLC nor the owner(s) of any information from the web site have contributed to, approved or are in any way responsible for the contents of this publication.

The author further acknowledges Sage Bionetworks, the DREAM organization, and Project Data Sphere for developing and supplying data for the Challenge.

Supplementary material

Additional software, e.g. the MATLAB code for calculating the Information Content of a network, can also be found at: www.mzanin.com

Faculty Opinions recommended

References

Alexe G, Alexe S, Axelrod DE, et al.: Breast cancer prognosis by combinatorial analysis of gene expression data. Breast Cancer Res. 2006; 8(4): R41. PubMed Abstract | Publisher Full Text | Free Full Text
Boccaletti S, Latora V, Moreno Y, et al.: Complex networks: Structure and dynamics. Phys Rep. 2006; 424(4–5): 175–308. Publisher Full Text
Cortes C, Vapnik V: Support-vector networks. Mach Learn. 1995; 20(3): 273–297. Publisher Full Text
Costa LD, Rodrigues FA, Travieso G, et al.: Characterization of complex networks: A survey of measurements. Adv Phys. 2007; 56(1): 167–242. Publisher Full Text
Costa LD, Oliveira ON Jr, Travieso G, et al.: Analyzing and modeling real-world phenomena with complex networks: a survey of applications. Adv Phys. 2011; 60(3): 329–412. Publisher Full Text
Fizazi K, Higano CS, Nelson JB, et al.: Phase III, randomized, placebo-controlled study of docetaxel in combination with zibotentan in patients with metastatic castration-resistant prostate cancer. J Clin Oncol. 2013; 31(14): 1740–1747. PubMed Abstract | Publisher Full Text
Friedman J, Hastie T, Tibshirani R: The elements of statistical learning. Springer, Berlin: Springer series in statistics. 2001: 1.
Gupta S, Kumar D, Sharma A: Data mining classification techniques applied for breast cancer diagnosis and prognosis. Indian Journal of Computer Science and Engineering (IJCSE). 2011; 2(2): 188–195. Reference Source
Halabi S, Small EJ, Kantoff PW, et al.: Prognostic model for predicting survival in men with hormone-refractory metastatic prostate cancer. J Clin Oncol. 2003; 21(7): 1232–1237. PubMed Abstract | Publisher Full Text
Halabi S, Lin CY, Kelly WK, et al.: Updated prognostic model for predicting overall survival in first-line chemotherapy for patients with metastatic castration-resistant prostate cancer. J Clin Oncol. 2014; 32(7): 671–677. PubMed Abstract | Publisher Full Text | Free Full Text
Jain KK: Personalised medicine for cancer: from drug development into clinical practice. Expert Opin Pharmacother. 2005; 6(9): 1463–1476. PubMed Abstract | Publisher Full Text
Lee YJ, Mangasarian OL, Wolberg WH: Breast cancer survival and chemotherapy: a support vector machine analysis. In Discrete Mathematical Problems with Medical Applications: DIMACS Workshop Discrete Mathematical Problems with Medical Applications, December 8–10, 1999, DIMACS Center. American Mathematical Soc. 2000; 55: 1. Reference Source
Noble WS: What is a support vector machine? Nat Biotechnol. 2006; 24(12): 1565–1567. PubMed Abstract | Publisher Full Text
Petrylak DP, Vogelzang NJ, Budnik N, et al.: Docetaxel and prednisone with or without lenalidomide in chemotherapy-naive patients with metastatic castration-resistant prostate cancer (MAINSAIL): a randomised, double-blind, placebo-controlled phase 3 trial. Lancet Oncol. 2015; 16(4): 417–425. PubMed Abstract | Publisher Full Text
Quaranta V, Weaver AM, Cummings PT, et al.: Mathematical modeling of cancer: the future of prognosis and treatment. Clin Chim Acta. 2005; 357(2): 173–179. PubMed Abstract | Publisher Full Text
Samani NJ, Tomaszewski M, Schunkert H: The personal genome--the future of personalised medicine? Lancet. 2010; 375(9725): 1497–1498. PubMed Abstract | Publisher Full Text
Scher HI, Jia X, Chi K, et al.: Randomized, open-label phase III trial of docetaxel plus high-dose calcitriol versus docetaxel plus prednisone for patients with castration-resistant prostate cancer. J Clin Oncol. 2011; 29(16): 2191–2198. PubMed Abstract | Publisher Full Text
Strogatz SH: Exploring complex networks. Nature. 2001; 410(6825): 268–276. PubMed Abstract | Publisher Full Text
Tannock IF, Fizazi K, Ivanov S, et al.: Aflibercept versus placebo in combination with docetaxel and prednisone for treatment of men with metastatic castration-resistant prostate cancer (VENICE): a phase 3, double-blind randomised trial. Lancet Oncol. 2013; 14(8): 760–768. PubMed Abstract | Publisher Full Text
van't Veer LJ, Bernards R: Enabling personalized cancer medicine through analysis of gene-expression patterns. Nature. 2008; 452(7187): 564–570. PubMed Abstract | Publisher Full Text
Wang L (Ed.): Support vector machines: theory and applications. Springer Science & Business Media. 2005; 177. Publisher Full Text
Zanin M, Boccaletti S: Complex networks analysis of obstructive nephropathy data. Chaos. 2011; 21(3): 033103. PubMed Abstract | Publisher Full Text
Zanin M, Alcazar JM, Carbajosa JV, et al.: Parenclitic networks: uncovering new functions in biological data. Sci Rep. 2014; 4: 5112. PubMed Abstract | Publisher Full Text | Free Full Text
Zanin M, Sousa PA, Menasalvas E: Information content: Assessing meso-scale structures in complex networks. EPL (Europhys Lett). 2014; 106(3): 30001. Publisher Full Text
Zanin M, Menasalvas E, Boccaletti S, et al.: Analysis of complex data by means of complex networks. In Technological Innovation for Collective Awareness Systems. Springer Berlin Heidelberg. 2014; 423: 39–46. Publisher Full Text
Zanin M: “Using networks representations to improve the prognosis of Prostate Cancer patients”. Synapse Storage, 2016. Publisher Full Text

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 16 Nov 2016

Author details Author details

¹ Innaxis Foundation & Research Institute, Madrid, Spain
² Department of Electrical Engineering, Faculty of Sciences and Technology, Universidade Nova de Lisboa, Caparica, Portugal

Competing interests

The author declares no competing interests.

Grant information

The author declares that no grants were involved in supporting this work.

Article Versions (1)

version 1

Published: 16 Nov 2016, 5:2675

https://doi.org/10.12688/f1000research.8282.1

Copyright

© 2016 Zanin M. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Zanin M. Using complex networks for refining survival prognosis in prostate cancer patient [version 1; peer review: 2 approved]. F1000Research 2016, 5:2675 (https://doi.org/10.12688/f1000research.8282.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Version 1

VERSION 1

PUBLISHED 16 Nov 2016

Views

7

Reviewer Report 27 Feb 2017

Diego Raphael Amancio, Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, Brazil

Approved

https://doi.org/10.5256/f1000research.8908.r18888

The manuscript describes the application of complex networks concepts to the problem of identifying survival patients suffering from prostate cancer. The manuscript is scientifically sound, and the conclusions are supported by the results.

The author, however, should ... Continue reading

The manuscript describes the application of complex networks concepts to the problem of identifying survival patients suffering from prostate cancer. The manuscript is scientifically sound, and the conclusions are supported by the results.

The author, however, should address the following issues raised below:

How performance depends on the weight chosen to binarize your data? Note that this step may affect considerably the performance.
Why traditional topological structures are not considered?
The author should further motivate the combination of traditional and network features, as similar approaches have been applied in other contexts¹^,².

References

1. Amancio DR: A Complex Network Approach to Stylometry.PLoS One. 2015; 10 (8): e0136076 PubMed Abstract | Publisher Full Text
2. Mota NB, Furtado R, Maia PP, Copelli M, et al.: Graph analysis of dream reports is especially informative about psychosis.Sci Rep. 2014; 4: 3691 PubMed Abstract | Publisher Full Text

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Respond or Comment

Views

8

Reviewer Report 24 Nov 2016

Osvaldo Anibal Rosso, Buenos Aires Institute of Technology (ITBA), Buenos Aires, Argentina

Approved

https://doi.org/10.5256/f1000research.8908.r17941

In the present manuscript the author explores the possibility of using complex networks as a tool for evaluation of the improving a model of survival prognosis of patients with metastatic castrate resistant prostate cancer treated with docetaxel. The main hypothesis ... Continue reading

In the present manuscript the author explores the possibility of using complex networks as a tool for evaluation of the improving a model of survival prognosis of patients with metastatic castrate resistant prostate cancer treated with docetaxel. The main hypothesis tested in the manuscript is that complex networks should synthesize information present in raw data in a new way that should reflect an improved classification score, proposed by the author and co-workers previously¹.
The manuscript is clearly and well written, and the conclusions are supported by the obtained results. I share the opinion of the author that the results of present research shed light on the importance of using complex networks in the future prognostic models. I recommend the indexing of the manuscript in the present form.

References

1. Zanin M, Alcazar JM, Carbajosa JV, Paez MG, et al.: Parenclitic networks: uncovering new functions in biological data.Sci Rep. 2014; 4: 5112 PubMed Abstract | Publisher Full Text

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Respond or Comment

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 16 Nov 2016

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 1 16 Nov 16	read	read

Osvaldo Anibal Rosso, Buenos Aires Institute of Technology (ITBA), Buenos Aires, Argentina
Diego Raphael Amancio, University of São Paulo, São Carlos, Brazil

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

Back to all reports

Reviewer Report

7 Views

27 Feb 2017 | for Version 1

Diego Raphael Amancio, Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, Brazil

7 Views Cite this report Responses(0)

Approved

The manuscript describes the application of complex networks concepts to the problem of identifying survival patients suffering from prostate cancer. The manuscript is scientifically sound, and the conclusions are supported by the results.

The author, however, should address the following issues raised below:

How performance depends on the weight chosen to binarize your data? Note that this step may affect considerably the performance.
Why traditional topological structures are not considered?
The author should further motivate the combination of traditional and network features, as similar approaches have been applied in other contexts¹^,².

References

1. Amancio DR: A Complex Network Approach to Stylometry.PLoS One. 2015; 10 (8): e0136076 PubMed Abstract | Publisher Full Text
2. Mota NB, Furtado R, Maia PP, Copelli M, et al.: Graph analysis of dream reports is especially informative about psychosis.Sci Rep. 2014; 4: 3691 PubMed Abstract | Publisher Full Text

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

8 Views

24 Nov 2016 | for Version 1

Osvaldo Anibal Rosso, Buenos Aires Institute of Technology (ITBA), Buenos Aires, Argentina

8 Views Cite this report Responses(0)

Approved

In the present manuscript the author explores the possibility of using complex networks as a tool for evaluation of the improving a model of survival prognosis of patients with metastatic castrate resistant prostate cancer treated with docetaxel. The main hypothesis tested in the manuscript is that complex networks should synthesize information present in raw data in a new way that should reflect an improved classification score, proposed by the author and co-workers previously¹.
The manuscript is clearly and well written, and the conclusions are supported by the obtained results. I share the opinion of the author that the results of present research shed light on the importance of using complex networks in the future prognostic models. I recommend the indexing of the manuscript in the present form.

References

1. Zanin M, Alcazar JM, Carbajosa JV, Paez MG, et al.: Parenclitic networks: uncovering new functions in biological data.Sci Rep. 2014; 4: 5112 PubMed Abstract | Publisher Full Text

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

[1] Alexe G, Alexe S, Axelrod DE, et al.: Breast cancer prognosis by combinatorial analysis of gene expression data. Breast Cancer Res. 2006; 8(4): R41. PubMed Abstract | Publisher Full Text | Free Full Text

[2] Boccaletti S, Latora V, Moreno Y, et al.: Complex networks: Structure and dynamics. Phys Rep. 2006; 424(4–5): 175–308. Publisher Full Text

[3] Cortes C, Vapnik V: Support-vector networks. Mach Learn. 1995; 20(3): 273–297. Publisher Full Text

[4] Costa LD, Rodrigues FA, Travieso G, et al.: Characterization of complex networks: A survey of measurements. Adv Phys. 2007; 56(1): 167–242. Publisher Full Text

[5] Costa LD, Oliveira ON Jr, Travieso G, et al.: Analyzing and modeling real-world phenomena with complex networks: a survey of applications. Adv Phys. 2011; 60(3): 329–412. Publisher Full Text

[6] Fizazi K, Higano CS, Nelson JB, et al.: Phase III, randomized, placebo-controlled study of docetaxel in combination with zibotentan in patients with metastatic castration-resistant prostate cancer. J Clin Oncol. 2013; 31(14): 1740–1747. PubMed Abstract | Publisher Full Text

[7] Friedman J, Hastie T, Tibshirani R: The elements of statistical learning. Springer, Berlin: Springer series in statistics. 2001: 1.

[8] Gupta S, Kumar D, Sharma A: Data mining classification techniques applied for breast cancer diagnosis and prognosis. Indian Journal of Computer Science and Engineering (IJCSE). 2011; 2(2): 188–195. Reference Source

[9] Halabi S, Small EJ, Kantoff PW, et al.: Prognostic model for predicting survival in men with hormone-refractory metastatic prostate cancer. J Clin Oncol. 2003; 21(7): 1232–1237. PubMed Abstract | Publisher Full Text

[10] Halabi S, Lin CY, Kelly WK, et al.: Updated prognostic model for predicting overall survival in first-line chemotherapy for patients with metastatic castration-resistant prostate cancer. J Clin Oncol. 2014; 32(7): 671–677. PubMed Abstract | Publisher Full Text | Free Full Text

[11] Jain KK: Personalised medicine for cancer: from drug development into clinical practice. Expert Opin Pharmacother. 2005; 6(9): 1463–1476. PubMed Abstract | Publisher Full Text

[12] Lee YJ, Mangasarian OL, Wolberg WH: Breast cancer survival and chemotherapy: a support vector machine analysis. In Discrete Mathematical Problems with Medical Applications: DIMACS Workshop Discrete Mathematical Problems with Medical Applications, December 8–10, 1999, DIMACS Center. American Mathematical Soc. 2000; 55: 1. Reference Source

[13] Noble WS: What is a support vector machine? Nat Biotechnol. 2006; 24(12): 1565–1567. PubMed Abstract | Publisher Full Text

[14] Petrylak DP, Vogelzang NJ, Budnik N, et al.: Docetaxel and prednisone with or without lenalidomide in chemotherapy-naive patients with metastatic castration-resistant prostate cancer (MAINSAIL): a randomised, double-blind, placebo-controlled phase 3 trial. Lancet Oncol. 2015; 16(4): 417–425. PubMed Abstract | Publisher Full Text

[15] Quaranta V, Weaver AM, Cummings PT, et al.: Mathematical modeling of cancer: the future of prognosis and treatment. Clin Chim Acta. 2005; 357(2): 173–179. PubMed Abstract | Publisher Full Text

[16] Samani NJ, Tomaszewski M, Schunkert H: The personal genome--the future of personalised medicine? Lancet. 2010; 375(9725): 1497–1498. PubMed Abstract | Publisher Full Text

[17] Scher HI, Jia X, Chi K, et al.: Randomized, open-label phase III trial of docetaxel plus high-dose calcitriol versus docetaxel plus prednisone for patients with castration-resistant prostate cancer. J Clin Oncol. 2011; 29(16): 2191–2198. PubMed Abstract | Publisher Full Text

[18] Strogatz SH: Exploring complex networks. Nature. 2001; 410(6825): 268–276. PubMed Abstract | Publisher Full Text

[19] Tannock IF, Fizazi K, Ivanov S, et al.: Aflibercept versus placebo in combination with docetaxel and prednisone for treatment of men with metastatic castration-resistant prostate cancer (VENICE): a phase 3, double-blind randomised trial. Lancet Oncol. 2013; 14(8): 760–768. PubMed Abstract | Publisher Full Text

[20] van't Veer LJ, Bernards R: Enabling personalized cancer medicine through analysis of gene-expression patterns. Nature. 2008; 452(7187): 564–570. PubMed Abstract | Publisher Full Text

[21] Wang L (Ed.): Support vector machines: theory and applications. Springer Science & Business Media. 2005; 177. Publisher Full Text

[22] Zanin M, Boccaletti S: Complex networks analysis of obstructive nephropathy data. Chaos. 2011; 21(3): 033103. PubMed Abstract | Publisher Full Text

[23] Zanin M, Alcazar JM, Carbajosa JV, et al.: Parenclitic networks: uncovering new functions in biological data. Sci Rep. 2014; 4: 5112. PubMed Abstract | Publisher Full Text | Free Full Text

[24] Zanin M, Sousa PA, Menasalvas E: Information content: Assessing meso-scale structures in complex networks. EPL (Europhys Lett). 2014; 106(3): 30001. Publisher Full Text

[25] Zanin M, Menasalvas E, Boccaletti S, et al.: Analysis of complex data by means of complex networks. In Technological Innovation for Collective Awareness Systems. Springer Berlin Heidelberg. 2014; 423: 39–46. Publisher Full Text

[26] Zanin M: “Using networks representations to improve the prognosis of Prostate Cancer patients”. Synapse Storage, 2016. Publisher Full Text

Using complex networks for refining survival prognosis in prostate cancer patient

Abstract

Keywords

Introduction

Methods

Network reconstruction

Network interpretation

Classification

Initial dataset

Results

Standard scenario: raw features classification

Figure 1. Classification with raw features.

Enhanced scenario: complex network features

Figure 2. Classification with complex network features.

Conclusions

Data availability

Competing interests

Grant information

Acknowledgments

Supplementary material

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated