TRGAted: A web tool for survival analysis using protein data in the Cancer Genome Atlas.

Nicholas Borcherding; Nicholas L. Bormann; Andrew P. Voigt; Weizhou Zhang

doi:10.12688/f1000research.15789.2

Home Browse TRGAted: A web tool for survival analysis using protein data in the...

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Software Tool Article

Revised

TRGAted: A web tool for survival analysis using protein data in the Cancer Genome Atlas.

[version 2; peer review: 2 approved]

Nicholas Borcherding^1-4^*, Nicholas L. Bormann⁵^*, Andrew P. Voigt⁴, Weizhou Zhang^1-4

^* Equal contributors

PUBLISHED 01 Oct 2018

Author details Author details

¹ Holden Comprehensive Cancer Center, University of Iowa, Iowa City, Iowa, 52245, USA
² Department of Pathology, University of Iowa, Iowa City, Iowa, 52245, USA
³ Cancer Biology Graduate Program, University of Iowa, Iowa City, Iowa, 52245, USA
⁴ Medical Scientist Training Program, University of Iowa, Iowa City, Iowa, 52245, USA
⁵ Department of Psychiatry, University of Iowa, Iowa City, Iowa, 52245, USA

Nicholas Borcherding
Roles: Conceptualization, Formal Analysis, Software, Writing – Original Draft Preparation, Writing – Review & Editing

Nicholas L. Bormann
Roles: Conceptualization, Formal Analysis, Software, Writing – Original Draft Preparation, Writing – Review & Editing

Andrew P. Voigt
Roles: Software, Writing – Original Draft Preparation

Weizhou Zhang
Roles: Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

Abstract

Reverse-phase protein arrays (RPPAs) are a highthroughput approach to protein quantification utilizing antibody-based micro-to-nano scale dot blot. Within the Cancer Genome Atlas (TCGA), RPPAs were used to quantify over 200 proteins in 8,167 tumor and metastatic samples. Protein-level data has particular advantages in assessing putative prognostic or therapeutic targets in tumors. However, many of the available pipelines do not allow for the partitioning of clinical and RPPA information to make meaningful conclusions. We developed a cloud-based application, TRGAted to enable researchers to better examine patient survival based on single or multiple proteins across 31 cancer types in the TCGA. TRGAted contains up-to-date overall survival, disease-specific survival, disease-free interval and progression-free interval information. Furthermore, survival information for primary tumor samples can be stratified based on gender, age, tumor stage, histological type, and subtype, allowing for highly adaptive and intuitive user experience. The code and processed data are open sourced and available on github and contains a tutorial built into the application for assisting users.

Keywords

Bioinformatics, Cancer Proteomics, Survival Analysis, TCGA

Corresponding author: Weizhou Zhang

Competing interests: No competing interests were disclosed.

Grant information: Funding for this project was provided from National Institute of Health F30 fellowship [CA206255] to N.B. and NIH Grant R01 [CA200673] to W.Z.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2018 Borcherding N et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Borcherding N, Bormann NL, Voigt AP and Zhang W. TRGAted: A web tool for survival analysis using protein data in the Cancer Genome Atlas. [version 2; peer review: 2 approved]. F1000Research 2018, 7:1235 (https://doi.org/10.12688/f1000research.15789.2) First published: 10 Aug 2018, 7:1235 (https://doi.org/10.12688/f1000research.15789.1) Latest published: 01 Oct 2018, 7:1235 (https://doi.org/10.12688/f1000research.15789.2)

Revised Amendments from Version 1

After receiving the very generous reviews, the new version of the manuscript reflects our attempts at improving the readability and flow of the manuscript. In doing so, we have tried to address the shared major concern of the reviewers in terms of grammatical and typographical errors. Additionally, as Dr. Zenklusen has suggested, we updated the description of the clinical and survival data source.

See the authors' detailed response to the review by Jean Claude Zenklusen
See the authors' detailed response to the review by Austin Gillen

Introduction

Improving prognostic prediction and the identification of potential therapeutic targets is of particular interest to clinicians. Quantification of messenger RNA at a genome-wide level has proven valuable in the discovery of gene expression profiles, which can serve as biomarkers for clinical outcomes in cancer¹. However, RNA quantification of tumor or patient cohorts is a proxy for protein level, with many cellular processes above transcription that ultimately regulate protein level. The availability of protein-level quantifications for the TCGA cohort allows for more relevant clinical outcome predictions compared to mRNA levels. Currently, TCGA-based applications provide entry-level analysis in correlational, differential, and survival modalities for the RPPA information. However, survival analysis in these applications rely on median- or mean-based survival data and do not allow for the use of clinical variables^2–4.

With these limitations in mind, we developed a new open-source web application, TRGAted (Figure 1). Built on the R shiny framework, TRGAted is an intuitive data analysis tool for parsing survival information based on over 200 proteins in 31 cancer types. TRGAted is comprised of processed RPPA information, survival information, and code, allowing users to run instances locally or modify the code with ease.

Figure 1. Diagram of the implementation of TRGAted.

Each file communicates within the R Shiny framework. On the user side (left, blue), users select pertinent cancer type, protein of interest, and clinical variables into the CSS-enabled user interface. This information is received by the server file enabling the subsequent run in R. On the server side (right, orange), the specific cancer type from the database, R packages, and functions are retrieved and executed. After execution, the server file provides both tabular and graphical output (purple) to the user interface.

Methods

Protein and survival data

Level 4 TCGA RPPA data for each cancer type was downloaded from the TCPA Portal developed by the MD Anderson Cancer Center⁴. Across all proteins, individual values were scaled using Z-scores. A summary of information available for each cancer datasets is in Table 1. Additionally, uveal melanoma (UVM) was excluded from the datasets due to a low number of samples with RPPA quantification (n=12). Clinical and survival information for each cancer data set was downloaded from recent work by Liu, et al.⁵. Overall survival, disease-specific survival, disease-free interval, and progression-free interval information was added to primary tumor RPPA quantifications for each cancer type. Unlike other cancer types, metastatic samples were retained for skin cutaneous melanoma (SKCM) RPPA-based dataset due to the highly metastatic nature of the disease. SKCM in the TRGAted application consists of 96 primary tumor samples and 258 metastatic samples. Of the 8,167 samples available in the TCPA, overall survival (OS) data was available for 7,714 patients, disease-specific survival (DSS) data was available for 7,240 patients, disease-free interval (DFI) data was available for 3,887 patients, and progression-free interval (PFI) data was available for 7,315 patients (Table 1).

Table 1. Survival information and protein summary available in TRGAted.

Cancer Type	Samples	OS	DSS	DFI	PFI	Proteins
Adrenocortical carcinoma (ACC)	46	46	46	28	46	221
Bladder Urothelial Carcinoma (BLCA)	344	344	330	153	344	223
Breast invasive carcinoma (BRCA)	901	873	855	750	873	224
Cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC)	171	171	168	112	171	220
Cholangiocarcinoma (CHOL)	30	30	29	21	30	219
Colon adenocarcinoma (COAD)	358	325	311	126	325	223
Diffuse Large B-cell Lymphoma (DLBCL)	33	33	33	19	33	219
Esophageal carcinoma (ESCA)	126	126	124	76	126	220
Glioblastoma multiforme (GBM)	205	136	123	0	136	223
Head and Neck squamous cell carcinoma (HNSC)	346	346	326	85	346	239
Kidney Chromophobe (KICH)	63	63	63	27	63	220
Kidney renal clear cell carcinoma (KIRC)	445	444	434	72	444	233
Kidney renal papillary cell carcinoma (KIRP)	208	207	205	127	207	221
Lower Grade Glioma (LGG)	427	426	420	114	426	220
Liver hepatocellular carcinoma (LIHC)	184	184	177	145	184	220
Lung adenocarcinoma (LUAD)	362	361	327	203	361	239
Lung squamous cell carcinoma (LUSC)	325	325	295	210	325	239
Mesothelioma (MESO)	61	61	45	10	61	220
Ovarian serous cystadenocarcinoma (OV)	411	405	377	199	407	224
Pancreatic adenocarcinoma (PAAD)	105	105	99	40	105	221
Pheochromocytoma and Paraganglioma (PCPG)	81	79	79	71	79	220
Prostate adenocarcinoma (PRAD)	351	351	350	233	351	220
Rectum adenocarcinoma (READ)	130	126	120	31	126	223
Sarcoma (SARC)	221	221	215	125	22	220
Skin Cutaneous Melanoma (SKCM)	354	349	346	0	349	223
Stomach adenocarcinoma (STAD)	392	357	334	207	357	220
Testicular Germ Cell Tumors (TGCT)	118	104	104	79	104	219
Thyroid carcinoma (THCA)	374	372	366	268	372	219
Thymoma (THYM)	90	90	90	9	90	219
Uterine Corpus Endometrial Carcinoma (UCEC)	404	404	403	325	404	223
Uterine Carcinosarcoma (UCS)	48	48	46	22	48	220

OS, overall survival; DSS, disease-specific survival; DFI, disease-free interval; PFI, progression-free interval.

Implementation

The TRGAted application was written and tested using R v3.5.1. The interactive plots are made using shiny (v1.1.0) and ggplot2 (v3.0.0). Plots can be downloaded as .png, .pdf, or .svg files. Data used to generate the individual plots can be downloaded as .csv files.

Operation: Minimum system requirements for running TRGAted locally are modest and include an Intel-compatible CPU and 1 gigabyte of RAM. Running TRGAted from the shiny server requires a modern browser and an internet connection.

Kaplan-Meier survival curves can be generated by selecting the cancer type, survival type and protein(s) of interest (Figure 2). Kaplan-Meier curves are generated using the survival (v2.41-3) and the survminer (v0.4-1) R packages. Multi-protein survival analysis utilizes mean values of protein probes, similar to gene-expression-based survival analysis platforms⁶. Hazard ratios for two-group comparisons, either median or optimal cut-off, utilize the Cox proportional hazards regression model in the survival R package; with the reported hazard ratio comparing high versus low protein groups. Optimal cut-off feature uses the surv_cutpoint function of the survminer package, calculating the minimal p-value based on the log-rank method. This function uses the maximally selected rank statistic (maxstat, v0.7-25) R package, which finds the maximal standardized two-sample linear rank statistic⁷. In order to find clinically or biologically meaningful biomarkers, the minimal proportion cutpoint, or the maximal disparity comparison, was set at 15% versus 85% of samples. Clinical variables dependent on the cancer type selected can be used to filter patients into user-defined groupings. Clinical information available across all types include: subtype, tumor stage, histological type, gender, age, response to primary therapy.

Figure 2. Generating survival curves.

The interface shows an example of an overall survival curve for the RAD50 protein in the basal subtype of breast cancer using the optimal cutpoint (A). Disease-specific survival, disease-free interval, and progression-free interval can also be selected (B). The cutpoint can be varied to separate samples based on protein level into quartiles, tertiles, medians or separating into two groups based on the lowest p-value (C).

TRGAted also allows for Cox proportional hazard modeling across all proteins in each cancer type or for a single protein across all cancer types. Hazard ratios and p-values are based on the Cox regression model. Values filtered from the volcano plots are proteins with –log10(p-values) less than 0.1 and hazard ratios greater than 20. These filters were implemented to improve visualization and to reduce artifacts of the analysis pipeline, respectively. The volcano plot can be graphed as linear or natural-log transformed, to assist in the visualization of good prognostic indicators. Visualizing the proportional comparison for the volcano plots is also available.

Use case

In order to demonstrate the functionality of TRGAted, we present a basic survival analysis examining the aggressive, highly-metastatic subtype of breast cancer, known as basal-like breast cancer. We found in this cancer, RAD50, involved in homologous recombination of DNA, as a novel poor prognostic marker.

Survival curves: Survival curves can be generated by selecting the cancer type, survival type, and protein or proteins of interest (Figure 2A). We also selected the subtype information to more closely examine basal-like breast cancer. Other survival types and clinical variables can be selected (Figure 2B). Samples can be divided into quartiles, tertiles, median or optimally for p-values based on the protein of interest (Figure 2C). Here we can see that the DNA repair protein, RAD50 is a poor prognostic marker for overall (Figure 2A) and disease-specific survival (Figure 2B) in basal-like breast cancer.

Across cancer: TRGAted can be used for biomarker discovery by examining the hazard ratios for all proteins available by cancer type or subtype, like basal-like breast cancer (Figure 3A). The volcano plot displays good prognostic markers on the left in blue and poor prognostic markers on the right in red. Having selected the optimal cutoff feature, a bar chart can also be generated to examine the proportion of samples in the high and low protein groups (Figure 3B). Protein labeling is adaptive for both the volcano plot and bar chart and will only label significant proteins (p-value ≤ 0.05). Here we see the RAD50 is one of the most significant predictors of poor overall survival in basal-like breast cancer (Figure 3A and B).

Figure 3. Visualizing all proteins across a single cancer type.

The interface shows an example of the visualization of Cox hazard ratio of each protein across the basal subtype of breast cancer (A). Good prognostic markers appear on the left in blue, while poor prognostic markers are on the right in red. The natural log transformation allows the graph to be centered at 0 and makes the visualization of good prognostic markers easier. Labeling for proteins can be adjusted to include more or less protein. Proportional comparisons for protein using the optimal cutpoint function is available as well (B).

Across protein: TRGAted can also be used to examine the survival outcomes of a protein of interest across multiple cancers. Here, RAD50 predicts poor survival in only five cancer types, prostate, adrenocortical, breast cancer, low-grade glioma, and head and neck cancers (Figure 4A). A summary of the hazard ratios can also be visualized by selecting for the barplot function (Figure 4B).

Figure 4. Visualizing all proteins across a single cancer type.

The interface shows an example of the visualization of Cox hazard ratio of for RAD50 across all 31 cancer types (A). This feature is similar to the Across Cancer tab with the ability to adjust labels and log-transform the Cox hazard ratios. Additionally, the hazard ratios for significant cancer types can be visualized using a bar chart (B).

Conclusions

TRGAted is an open-source survival analysis application designed to allow for quick and intuitive exploration of TCGA protein-level data. This survival analysis improves on current TCGA pipelines by providing greater diversity of clinical and survival options and relying on protein-level data. In addition to log-rank and Cox regression modeling, TRGAted allows users to download graphical displays and processed data for up to 7,714 samples across 31 cancer types. Built on the R shiny framework, a literate code architecture, the code for TRGAted is annotated and easily modified from our GitHub repository. Under the GNU General Public License v3.0, we encourage interested groups to modify TRGAted for greater usability. Downloading and modifying TRGAted is streamlined by the relatively small size of TRGAted, totally 27.2 megabytes for the application, processed data, and built-in instructional guide.

Data availability

Release 4.2 of the TCGA replicate-based normalized (level 4) RPPA data is available for 32 cancer types from the TCPA Portal at http://tcpaportal.org/tcpa/download.html. Processed data is available at https://github.com/ncborcherding/TRGAted.

Software availability

Source code is available from GitHub: https://github.com/ncborcherding/TRGAted/tree/v1.0.0

Archived source code at time of publication: http://doi.org/10.5281/zenodo.1323828⁸

License: GNU General Public License v3.0

Grant information

Funding for this project was provided from National Institute of Health F30 fellowship [CA206255] to N.B. and NIH Grant R01 [CA200673] to W.Z.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Acknowledgements

These results here are in whole or part based upon data generated by the TCGA Research Network.

Faculty Opinions recommended

References

1. Xi X, Li T, Huang Y, et al.: RNA Biomarkers: Frontier of Precision Medicine for Cancer. Noncoding RNA. 2017; 3(1): pii: E9. PubMed Abstract | Publisher Full Text | Free Full Text
2. Cerami E, Gao J, Dogrusoz U, et al.: The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012; 2(5): 401–4. PubMed Abstract | Publisher Full Text | Free Full Text
3. Gao J, Aksoy BA, Dogrusoz U, et al.: Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013; 6(269): pl1. PubMed Abstract | Publisher Full Text | Free Full Text
4. Li J, Lu Y, Akbani R, et al.: TCPA: a resource for cancer functional proteomics data. Nat Methods. 2013; 10(11): 1046–7. PubMed Abstract | Publisher Full Text | Free Full Text
5. Liu J, Lichtenberg T, Hoadley KA, et al.: An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell. 2018; 173(2): 400–416.e11. PubMed Abstract | Publisher Full Text | Free Full Text
6. Györffy B, Lanczky A, Eklund AC, et al.: An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat. 2010; 123(3): 725–31. PubMed Abstract | Publisher Full Text
7. Wright MN, Dankowski T, Ziegler A: Unbiased split variable selection for random survival forests using maximally selected rank statistics. Stat Med. 2017; 36(8): 1272–84. PubMed Abstract | Publisher Full Text
8. theHumanBorch: ncborcherding/TRGAted: First Release TRGAted (Version v1.0.0). Zenodo. 2018. http://www.doi.org/10.5281/zenodo.1323828

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 10 Aug 2018

Author details Author details

Nicholas Borcherding
Roles: Conceptualization, Formal Analysis, Software, Writing – Original Draft Preparation, Writing – Review & Editing

Nicholas L. Bormann
Roles: Conceptualization, Formal Analysis, Software, Writing – Original Draft Preparation, Writing – Review & Editing

Andrew P. Voigt
Roles: Software, Writing – Original Draft Preparation

Weizhou Zhang
Roles: Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

Funding for this project was provided from National Institute of Health F30 fellowship [CA206255] to N.B. and NIH Grant R01 [CA200673] to W.Z.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (2)

version 2

Revised

Published: 01 Oct 2018, 7:1235

https://doi.org/10.12688/f1000research.15789.2

version 1

Published: 10 Aug 2018, 7:1235

https://doi.org/10.12688/f1000research.15789.1

© 2018 Borcherding N et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Borcherding N, Bormann NL, Voigt AP and Zhang W. TRGAted: A web tool for survival analysis using protein data in the Cancer Genome Atlas. [version 2; peer review: 2 approved]. F1000Research 2018, 7:1235 (https://doi.org/10.12688/f1000research.15789.2)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Version 1

VERSION 1

PUBLISHED 10 Aug 2018

Views

Reviewer Report 26 Sep 2018

Austin Gillen, RNA Bioscience Initiative, University of Colorado School of Medicine, Aurora, CO, USA

Approved

https://doi.org/10.5256/f1000research.17235.r38311

In this manuscript, Borcherding, et al. describe an interactive web interface (implemented in R using Shiny) that allows for the visualization of cancer patient survival data from TCGA (The Cancer Genome Atlas) based on protein expression measured by RPPA (Reverse-Phase Protein Array). The software is easy to use, well documented, and flexible enough for most common use cases. The code is available in a public github repository, encouraging further development and expansion of the tool to suit users' needs. There are no major flaws in either the implementation of the tool or the associated manuscript, but two minor issues should be addressed:

1. As noted by reviewer 1, the manuscript should be carefully proofread for typographical errors and standard english grammar. For example: the common R package ggplot2 is referred to in the text as "ggplots2".

2. The web interface indicates that the TCPA data included with the package were downloaded on 2017/11/10, but the current TCPA release (4.2) was made available on 2018/07/18. This tool is substantially less useful if it is not updated when new source data is released. A plan for updating the packaged TCPA and survival data should be included in the manuscript (and implemented in the package). Automating this process as a function in the package would be ideal, but detailed instructions for updating the packaged data for local installations would be acceptable as well.

Is the rationale for developing the new software tool clearly explained?

Yes
Is the description of the software tool technically sound?

Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Yes
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 01 Oct 2018

Nicholas Borcherding, Holden Comprehensive Cancer Center, University of Iowa, Iowa City, 52245, USA

01 Oct 2018

Author Response

We would like to thank the reviewer for the great suggestions. We have recently submitted an updated version of the manuscript with more thorough editing. Additionally, we will work on ... Continue reading We would like to thank the reviewer for the great suggestions. We have recently submitted an updated version of the manuscript with more thorough editing. Additionally, we will work on implementing an automatic pull feature for the data to ensure the most up-to-date protein data available. The specific of this feature will be updated on the github repository and in the application itself when we implement the new pipeline. This was an excellent suggestion.
We would like to thank the reviewer for the great suggestions. We have recently submitted an updated version of the manuscript with more thorough editing. Additionally, we will work on implementing an automatic pull feature for the data to ensure the most up-to-date protein data available. The specific of this feature will be updated on the github repository and in the application itself when we implement the new pipeline. This was an excellent suggestion.
Competing Interests: No competing interest Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 01 Oct 2018

Nicholas Borcherding, Holden Comprehensive Cancer Center, University of Iowa, Iowa City, 52245, USA

01 Oct 2018

Author Response

We would like to thank the reviewer for the great suggestions. We have recently submitted an updated version of the manuscript with more thorough editing. Additionally, we will work on ... Continue reading We would like to thank the reviewer for the great suggestions. We have recently submitted an updated version of the manuscript with more thorough editing. Additionally, we will work on implementing an automatic pull feature for the data to ensure the most up-to-date protein data available. The specific of this feature will be updated on the github repository and in the application itself when we implement the new pipeline. This was an excellent suggestion.
We would like to thank the reviewer for the great suggestions. We have recently submitted an updated version of the manuscript with more thorough editing. Additionally, we will work on implementing an automatic pull feature for the data to ensure the most up-to-date protein data available. The specific of this feature will be updated on the github repository and in the application itself when we implement the new pipeline. This was an excellent suggestion.
Competing Interests: No competing interest Close
Report a concern

Views

Reviewer Report 03 Sep 2018

Jean Claude Zenklusen, National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA; Clinical Commissioning Groups (CCG), London, UK; National Institutes of Health (NIH) Clinical Center, Bethesda, MA, USA

Approved

https://doi.org/10.5256/f1000research.17235.r37844

This report by Borcherding et al. deals with the creation of a tool to visualize the impact of proteins represented in the Reverse Phase Protein Array (RPPA) on the survival of patients used in The Cancer Genome Atlas (TCGA). The tools are straight forward, uses a common standard (it is an R module) and thus has the potential of being highly utilized by the cancer research community. There are no major flaws with the module, the code is deposited in github, allowing easy access to users.

Two minor issues need to be corrected:

The manuscript will benefit from editing by a native English speaker. Phrasing and grammar are uncommon at times.
Reference 5 is referred as “updated TCGA clinical data”. This is incorrect. The paper referred to is an interpretation of the clinical data in the context of the Pan Can Atlas effort by the TCGA, but it is NOT the official clinical data. It is a derived product of it.

Is the rationale for developing the new software tool clearly explained?

Yes
Is the description of the software tool technically sound?

Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Yes
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 01 Oct 2018

Nicholas Borcherding, Holden Comprehensive Cancer Center, University of Iowa, Iowa City, 52245, USA

01 Oct 2018

Author Response

Thank you for your very kind review and suggestions. In the most recent submission, we have addressed your concerns in editing the manuscript and adding additional details on the source ... Continue reading Thank you for your very kind review and suggestions. In the most recent submission, we have addressed your concerns in editing the manuscript and adding additional details on the source of clinical information.
Thank you for your very kind review and suggestions. In the most recent submission, we have addressed your concerns in editing the manuscript and adding additional details on the source of clinical information.
Competing Interests: No competing interest Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 01 Oct 2018

Nicholas Borcherding, Holden Comprehensive Cancer Center, University of Iowa, Iowa City, 52245, USA

01 Oct 2018

Author Response

Thank you for your very kind review and suggestions. In the most recent submission, we have addressed your concerns in editing the manuscript and adding additional details on the source ... Continue reading Thank you for your very kind review and suggestions. In the most recent submission, we have addressed your concerns in editing the manuscript and adding additional details on the source of clinical information.
Thank you for your very kind review and suggestions. In the most recent submission, we have addressed your concerns in editing the manuscript and adding additional details on the source of clinical information.
Competing Interests: No competing interest Close
Report a concern

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 10 Aug 2018

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 2 (revision) 01 Oct 18
Version 1 10 Aug 18	read	read

Jean Claude Zenklusen, National Institutes of Health (NIH), Bethesda, USA; Clinical Commissioning Groups (CCG), London, UK; National Institutes of Health (NIH) Clinical Center, Bethesda, USA
Austin Gillen, University of Colorado School of Medicine, Aurora, USA

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

23 Views

26 Sep 2018 | for Version 1

Austin Gillen, RNA Bioscience Initiative, University of Colorado School of Medicine, Aurora, CO, USA

23 Views Cite this report Responses(1)

Approved

Is the rationale for developing the new software tool clearly explained?

Yes
Is the description of the software tool technically sound?

Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Yes
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Back to all reports

Reviewer Report

23 Views

03 Sep 2018 | for Version 1

23 Views Cite this report Responses(1)

Approved

The manuscript will benefit from editing by a native English speaker. Phrasing and grammar are uncommon at times.
Reference 5 is referred as “updated TCGA clinical data”. This is incorrect. The paper referred to is an interpretation of the clinical data in the context of the Pan Can Atlas effort by the TCGA, but it is NOT the official clinical data. It is a derived product of it.

Is the rationale for developing the new software tool clearly explained?

Yes
Is the description of the software tool technically sound?

Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?

Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?

Yes
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?

Yes

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] 1. Xi X, Li T, Huang Y, et al.: RNA Biomarkers: Frontier of Precision Medicine for Cancer. Noncoding RNA. 2017; 3(1): pii: E9. PubMed Abstract | Publisher Full Text | Free Full Text

[2] 2. Cerami E, Gao J, Dogrusoz U, et al.: The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012; 2(5): 401–4. PubMed Abstract | Publisher Full Text | Free Full Text

[3] 3. Gao J, Aksoy BA, Dogrusoz U, et al.: Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013; 6(269): pl1. PubMed Abstract | Publisher Full Text | Free Full Text

[4] 4. Li J, Lu Y, Akbani R, et al.: TCPA: a resource for cancer functional proteomics data. Nat Methods. 2013; 10(11): 1046–7. PubMed Abstract | Publisher Full Text | Free Full Text

[5] 5. Liu J, Lichtenberg T, Hoadley KA, et al.: An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell. 2018; 173(2): 400–416.e11. PubMed Abstract | Publisher Full Text | Free Full Text

[6] 6. Györffy B, Lanczky A, Eklund AC, et al.: An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat. 2010; 123(3): 725–31. PubMed Abstract | Publisher Full Text

[7] 7. Wright MN, Dankowski T, Ziegler A: Unbiased split variable selection for random survival forests using maximally selected rank statistics. Stat Med. 2017; 36(8): 1272–84. PubMed Abstract | Publisher Full Text

[8] 8. theHumanBorch: ncborcherding/TRGAted: First Release TRGAted (Version v1.0.0). Zenodo. 2018. http://www.doi.org/10.5281/zenodo.1323828

TRGAted: A web tool for survival analysis using protein data in the Cancer Genome Atlas.

Abstract

Keywords

Revised Amendments from Version 1

Introduction

Figure 1. Diagram of the implementation of TRGAted.

Methods

Protein and survival data

Table 1. Survival information and protein summary available in TRGAted.

Implementation

Figure 2. Generating survival curves.

Use case

Figure 3. Visualizing all proteins across a single cancer type.

Figure 4. Visualizing all proteins across a single cancer type.

Conclusions

Data availability

Software availability

Grant information

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated