<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.11830.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Bioinformatics</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>CASAS: Cancer Survival Analysis Suite, a web based application</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Rupji</surname>
                        <given-names>Manali</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-0520-1624</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Zhang</surname>
                        <given-names>Xinyan</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Kowalski</surname>
                        <given-names>Jeanne</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Winship Cancer Institute of Emory University, Atlanta, GA, 30322, USA</aff>
                <aff id="a2">
                    <label>2</label>Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, 30322, USA</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:jeanne.kowalski@emory.edu">jeanne.kowalski@emory.edu</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>15</day>
                <month>6</month>
                <year>2017</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2017</year>
            </pub-date>
            <volume>6</volume>
            <elocation-id>919</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>7</day>
                    <month>6</month>
                    <year>2017</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Rupji M et al.</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/6-919/pdf"/>
            <abstract>
                <p>We present CASAS, a shiny R based tool for interactive survival analysis and visualization of results. The tool provides a web-based one stop shop to perform the following types of survival analysis: quantile, landmark and competing risks, in addition to standard survival analysis. The interface makes it easy to perform such survival analyses and obtain results using the interactive Kaplan-Meier and cumulative incidence plots. Univariate analysis can be performed on one or several user specified variable(s) simultaneously, the results of which are displayed in a single table that includes log rank p-values and hazard ratios along with their significance. For several quantile survival analyses from multiple cancer types, a single summary grid is constructed. The CASAS package has been implemented in R and is available via 
                    <ext-link ext-link-type="uri" xlink:href="http://shinygispa.winship.emory.edu/CASAS/">http://shinygispa.winship.emory.edu/CASAS/</ext-link>. The developmental repository is available at 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/manalirupji/CASAS/">https://github.com/manalirupji/CASAS/</ext-link>.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>survival</kwd>
                <kwd>quantile survival</kwd>
                <kwd>landmark</kwd>
                <kwd>competing risk</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1">
                    <funding-source>Winship Cancer Institute of Emory University and NIH/NCI</funding-source>
                    <award-id>P30CA138292</award-id>
                </award-group>
                <funding-statement>Research reported in this publication was supported in part by the Biostatistics and Bioinformatics Shared Resource of Winship Cancer Institute of Emory University and NIH/NCI under award number P30CA138292. </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Kaplan-Meier (KM) estimates and the Cox Proportional Hazards model have gained huge popularity among clinicians when depicting survival trends and identifying prognostic biomarkers in cancer research. There is a range of commercial software (SAS, STATA, SPSS, PRISM) available for researchers to carry out survival analysis. However, these programs have several disadvantages; commercial software is proprietary and involves restricted usage with rigid outputs, which cannot be changed easily. Open source software such as R is gaining popularity, but the user needs to learn programming skills, which may be very time consuming for clinicians and biomedical researchers with limited programming exposure.</p>
            <p>Standard survival analysis involves a single cause of failure. However, in other cases, clinicians may encounter many other causes of failure in addition to a specific cause of interest. In such cases, a competing risk analysis needs to be carried out, where an individual is exposed to two or more causes of failure but its eventual failure is due to only one cause. While several packages are available to conduct competing risk survival analysis in R, making the right choice presents another layer of confusion to the user.</p>
            <p>As opposed to traditional KM or Cox regression analysis, typically a risk factor measured at baseline is examined for its association with survival thereafter. During follow-up, however, things may have changes, such that either the effect of a fixed baseline risk factor may vary over time, resulting in a weakening or strengthening of associations over time or the risk factor itself may vary over time. In the former case, such as effect is often seen in what appears to be significant differences in survival, not necessarily overall and among all survival times, but early on or at later survival times. We address such time-dependent effects on survival by creating two additional tools, one for landmark
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup> and another for quantile survival analysis
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>,
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>. As an example, the user may want to study the effect of chemotherapy on a specific cancer population and thus divides the data into a responder vs a non-responder group. The issue with this approach is that the responder cannot be deemed one, unless they survive until the time of response. In addition, being in the responder group gives them an unfair survival advantage leading to an immortal time bias. To overcome these issues, the investigator may need to perform a landmark analysis by removing the patients with an event (or censored) before the landmark time from the analysis.</p>
            <p>Most tools are available as separate packages. As an alternative, CASAS provides a comprehensive survival analysis suite of tools commonly encountered in cancer research. By providing a GUI interface, the user can readily perform any number of these analyses by simply uploading their data and selecting the variables relevant to the analysis.</p>
            <p>In summary, CASAS suite of tools is a one-stop shop for conducting survival analysis without requiring any prior programming knowledge. It is a web-based application that, as a single tool, can carry out KM plot, univariate HR, landmark analysis, quantile survival analysis and competing risk analysis. It allows a user to combine results from various studies or cancer types as well.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <sec>
                <title>KM survival analysis</title>
                <p>Standard survival analysis uses the NCCTG Lung Cancer data in the &#x2018;survival&#x2019; package in R
                    <sup>
                        <xref ref-type="bibr" rid="ref-4">4</xref>
                    </sup> and uses the &#x2018;survminer&#x2019; package to display the KM plots. Either categorical or continuous variables can be used for stratification. Continuous variables can be dichomotimized by either the 25
                    <sup>th</sup>, 50
                    <sup>th</sup>, 75
                    <sup>th</sup> percentile or the optimal cut point. Log-Rank test is used to estimate the overall differences between the survival curves (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>). A single overall survival curve without stratification can be plotted using the &#x2018;All patients&#x2019; option.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Kaplan-Meier (KM) survival analysis plot.</title>
                        <p>The interface shows an example KM plot using NCCTG Lung Cancer data in the &#x2018;survival&#x2019; package
                            <sup>
                                <xref ref-type="bibr" rid="ref-5">5</xref>
                            </sup>. The left side comprises a user menu and the right includes the result plots/tables. The KM plot is interactive and will change depending on the categorical variable selected. Continuous data can be divided using 25
                            <sup>th</sup> percentile, 50
                            <sup>th</sup> percentile, and 75
                            <sup>th</sup> percentile or the user may opt to use an optimal cut point based on martingale residuals using the &#x2018;survMisc&#x2019; package (also available with plot in a separate tab). The user could alternatively choose the &#x201c;All patients&#x201d; drop down. If the user selects the &#x2018;All patients&#x2019; dropdown, a single KM curve is created using all the data instead of two separate KM curves by the categorical variable of interest. User could also choose to output the number of patients at risk. Univariate survival association analysis based on cox proportional hazards model, can be output by entering one variable in the model each time. Multiple variables can be selected to generate the output table. In addition, the user can also test for the proportional hazard assumption by selecting &#x201c;Test for Proportional Hazards Assumption&#x201d; as &#x201c;Yes&#x201d;. An additional column with the p-value for Proportional Hazards Assumption will be displayed as the rightmost column.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12784/758e62cb-e2c4-453c-9a55-1f5a0edb85e2_figure1.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Competing risk survival analysis</title>
                <p>Competing risk survival analysis is based on Fine and Gray&#x2019;s Model
                    <sup>
                        <xref ref-type="bibr" rid="ref-6">6</xref>
                    </sup>, using the &#x2018;cmprsk&#x2019; package in R. 35 AML/ALL patients data who underwent Hematopoietic stem cell transplantation (HSCT) affected by either AML or ALL
                    <sup>
                        <xref ref-type="bibr" rid="ref-5">5</xref>
                    </sup> are used. Cumulative incidence plot was plotted using CumIncidence.R function available in the package. This tool also displays the Gray&#x2019;s p-value based on the competing risk code (
                    <xref ref-type="fig" rid="f2">Figure 2</xref>). Either categorical or continuous group variables/&#x2018;All patients&#x2019; can be used.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Competing risk analysis.</title>
                        <p>This tab includes competing risk analysis using the BMT data (
                            <ext-link ext-link-type="uri" xlink:href="http://www.stat.unipg.it/luca/R/">http://www.stat.unipg.it/luca/R/</ext-link>). The left panel includes options to select variables for univariate survival association analysis based on Fine and Gray&#x2019;s model, and includes options to select variables for cumulative incidence function analysis. For the Univariate Analysis, the user chooses variable of interest to enter into the analysis as well as the event and the censor code. To generate a Cumulative Incidence Function (CIF) plot and table, the user can choose either a categorical variable to compare or &#x2018;All Patients&#x2019; (Like in 
                            <xref ref-type="fig" rid="f1">Figure 1</xref>). Users can also select the appropriate time unit based on the data and the time points of interest, where applicable. Censor code can also be specified. The univariate result table or plot is displayed on the right panel.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12784/758e62cb-e2c4-453c-9a55-1f5a0edb85e2_figure2.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Landmark analysis</title>
                <p>Landmark analysis is based on a user input landmark time and uses the &#x2018;dynpred&#x2019; R package. The same dataset as in 
                    <xref ref-type="fig" rid="f1">Figure 1</xref> is used. The tool generates an overall KM plot and a landmark KM plot with log rank test p-values (
                    <xref ref-type="fig" rid="f3">Figure 3</xref>). The user can also opt for a CI curve instead of a KM plot and allows similar categorical/continuous variable inputs.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>Landmark analysis.</title>
                        <p>This tab includes landmark analysis using the same data as in 
                            <xref ref-type="fig" rid="f1">Figure 1</xref>. The left panel includes options to select variables for landmark analysis and the landmark time. The program calls the function dynpred to generate the new landmark dataset to generate either KM/CIF plots on the right panel. Users can also select the appropriate time unit based on the data.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12784/758e62cb-e2c4-453c-9a55-1f5a0edb85e2_figure3.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Quantile survival analysis</title>
                <p>Quantile survival analysis is based on method developed by 
                    <xref ref-type="bibr" rid="ref-2">2</xref>,
                    <xref ref-type="bibr" rid="ref-3">3</xref>, implemented in the &#x2018;cequre&#x2019; package in R. The example data used is the 553 patient Expression with Clinical data for TCGA &#x2013; BRCA patients given radiotherapy
                    <sup>
                        <xref ref-type="bibr" rid="ref-4">4</xref>
                    </sup>. Survival time difference between the dichotomized continuous or categorical variable will be estimated with 95% CI for each quantile (
                    <xref ref-type="fig" rid="f4">Figure 4</xref>). A forest plot to represent quantile wise differences between the means and the overall differences is also provided as output.</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Quantile survival analysis.</title>
                        <p>The quantile regression tab shows three plots based on TCGA BRCA data with patients who received radiation therapy. The first is the overall KM plot with number at risk table for overall survival and log rank test p-value. The second is the survival time difference with 95% CI between two dichotomized groups at the 10 quantiles Q1 to Q10 (defined as 10 percentile to 100 percentile by 10 percentile at mean survival time among all patients). The third plot is the summary using forest plot for the survival time difference at 10 quantiles. The overall in forest plot corresponds to the transformed HR and 95% CI for overall survival (log [1/HR]).</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12784/758e62cb-e2c4-453c-9a55-1f5a0edb85e2_figure4.gif"/>
                </fig>
            </sec>
        </sec>
        <sec>
            <title>Implementation</title>
            <p>The CASAS software is written in R and tested using version 3.3.0. The Interactive KM, CIF plots and data tables are made visible through a web browser using the shiny R package (
                <ext-link ext-link-type="uri" xlink:href="https://www.rstudio.com/shiny">www.rstudio.com/shiny</ext-link>).</p>
        </sec>
        <sec>
            <title>Operation</title>
            <p>Using a windows 7 Enterprise SP1 PC with a 32.0 GB RAM and an 3.30 GHz Intel
                <sup>&#x00ae;</sup> Xeon
                <sup>&#x00ae;</sup> Processor E5 Family, for a 228 patients NCCTG Lung Cancer dataset
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup> it took 11.89s to create an interactive KM plot and 2.74s to generate a landmark plot. For data of 35 patients who underwent Hematopoietic stem cell transplantation (HSCT) affected by either AML or ALL
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>, it took 2.66s to display the CIF plot. TCGA BRCA data for 553 patients who received radiotherapy took 6.52s. The developmental repository is available at 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/manalirupji/CASAS/">https://github.com/manalirupji/CASAS/</ext-link>.</p>
            <p>Archived source code as at the time of publication is available at: 
                <ext-link ext-link-type="uri" xlink:href="http://doi.org/10.5281/zenodo.802917">http://doi.org/10.5281/zenodo.802917</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>.</p>
        </sec>
        <sec sec-type="discussion">
            <title>Discussion</title>
            <p>CASAS is a suite of tools that allows a user to conduct various types of survival analysis through an interactive application in R. We show by example, various types of cancer survival analyses that can be performed based on the questions of interest. Our tool will serve as a platform for many physicians and researchers to conduct preliminary analyses before heading to statisticians to conduct advanced analyses.</p>
        </sec>
        <sec>
            <title>Data and software availability</title>
            <p>The CASAS web tool (
                <ext-link ext-link-type="uri" xlink:href="http://shinygispa.winship.emory.edu/CASAS/">http://shinygispa.winship.emory.edu/CASAS/</ext-link>) includes preprocessed example data under each tab. The user could use the example data or could upload a dataset of their choice in the same format as the example data. For Kaplan Meier survival analysis and Landmark analysis, NCCTG Lung Cancer data from 228 patients as available in the survival package is used
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>. The data can be accessed in R using data(lung). For quantile survival analysis, Level 3 RNASeqV2 Breast Cancer (BRCA) data was downloaded from the TCGA data portal
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. 553 patients that received radiotherapy and had survival information were used. Gene expression data for a specific biomarker gene was log2 transformed. The user has the choice to divide the data based on either the twenty-fifth percentile (set as default) or the fiftieth or the seventy-fifth, or an optimal cut point based on the martingale residuals. Similarly, for competing risk analysis, the user could choose the example data or upload their own. The example data consists of 35 patients with acute leukemia who underwent Hematopoietic stem cell transplantation (HSCT) affected by either AML or ALL
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup> (
                <ext-link ext-link-type="uri" xlink:href="http://www.stat.unipg.it/luca/R">http://www.stat.unipg.it/luca/R</ext-link>).</p>
            <p>The developmental repository is available at 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/manalirupji/CASAS/">https://github.com/manalirupji/CASAS/</ext-link>.</p>
            <p>Archived source code as at the time of publication: 
                <ext-link ext-link-type="uri" xlink:href="http://doi.org/10.5281/zenodo.802917">http://doi.org/10.5281/zenodo.802917</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>
            </p>
            <p>License: CASAS is available under the GNU public license (GPL-3).</p>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Anderson</surname>
                            <given-names>JR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cain</surname>
                            <given-names>KC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gelber</surname>
                            <given-names>RD</given-names>
                        </name>
</person-group>:
                    <article-title>Analysis of survival by tumor response.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Oncol.</italic>
</source>
                    <year>1983</year>;<volume>1</volume>(<issue>11</issue>):<fpage>710</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">6668489</pub-id>
                    <pub-id pub-id-type="doi">10.1200/JCO.1983.1.11.710</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Huang</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <article-title>Restoration of monotonicity respecting in dynamic regression.</article-title>
                    <source>

                        <italic toggle="yes">J Am Stat Assoc.</italic>
</source>
                    <year>2016</year>; (In press).
                    <pub-id pub-id-type="doi">10.1080/01621459.2016.1149070</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Huang</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <article-title>Quantile Calculus and Censored Regression.</article-title>
                    <source>

                        <italic toggle="yes">Ann Stat.</italic>
</source>
                    <year>2010</year>;<volume>38</volume>(<issue>3</issue>):<fpage>1607</fpage>&#x2013;<lpage>1637</lpage>.
                    <pub-id pub-id-type="pmid">20592942</pub-id>
                    <pub-id pub-id-type="doi">10.1214/09-AOS771</pub-id>
                    <pub-id pub-id-type="pmcid">2893401</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <collab>NCI and NHGRI</collab>:
                    <article-title>The Cancer Genome Atlas (TCGA) Data Portal</article-title>.  Accessed in December<year>2015</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://tcga-data.nci.nih.gov/docs/publications/tcga/?">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Loprinzi</surname>
                            <given-names>CL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Laurie</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wieand</surname>
                            <given-names>HS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Prospective evaluation of prognostic variables from patient-completed questionnaires. North Central Cancer Treatment Group.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Oncol.</italic>
</source>
                    <year>1994</year>;<volume>12</volume>(<issue>3</issue>):<fpage>601</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">8120560</pub-id>
                    <pub-id pub-id-type="doi">10.1200/JCO.1994.12.3.601</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fine</surname>
                            <given-names>JP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gray</surname>
                            <given-names>RJ</given-names>
                        </name>
</person-group>:
                    <article-title>A Proportional Hazards Model for the Subdistribution of a Competing Risk.</article-title>
                    <source>

                        <italic toggle="yes">J Am Stat Assoc.</italic>
</source>
                    <year>1999</year>;<volume>94</volume>(<issue>446</issue>):<fpage>496</fpage>&#x2013;<lpage>509</lpage>.
                    <pub-id pub-id-type="doi">10.2307/2670170</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <collab>manalirupji</collab>:
                    <article-title>manalirupji/CASAS: CASASv1.0.0.</article-title>
                    <source>

                        <italic toggle="yes">Zenodo.</italic>
</source>
                    <year>2017</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.802917">Data Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Scrucca</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Santucci</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Aversa</surname>
                            <given-names>F</given-names>
                        </name>
</person-group>:
                    <article-title>Competing risk analysis using R: an easy guide for clinicians.</article-title>
                    <source>

                        <italic toggle="yes">Bone Marrow Transplant.</italic>
</source>
                    <year>2007</year>;<volume>40</volume>(<issue>4</issue>):<fpage>381</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">17563735</pub-id>
                    <pub-id pub-id-type="doi">10.1038/sj.bmt.1705727</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report24053">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.12784.r24053</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Han</surname>
                        <given-names>Gang</given-names>
                    </name>
                    <xref ref-type="aff" rid="r24053a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r24053a1">
                    <label>1</label>Department of Epidemiology and Biostatistics, Texas A&amp;M University School of Public Health, College Station, TX, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>6</day>
                <month>7</month>
                <year>2017</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Han G</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport24053" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.11830.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>Authors of this article attempted to facilitate the routine survival analysis for clinicians and general health science professionals by building a web based survival analysis software with underlying R programming. Because survival data has increased usage in clinical research, this work is useful to fill out the gap of survival analysis for researchers without training in statistics and programming. But given a number of limitations, its practical value is questionable. Due to the following multiple missing components, this article does not look complete. This reviewer suggests major revision to improve its usability: &#x00a0; 
                <list list-type="order">
                    <list-item>
                        <p>The first missing component is the test of model assumptions. The validity of the proportional hazard assumption was not described.</p>
                    </list-item>
                    <list-item>
                        <p>The second missing piece is the detailed description of required data format. For example, some practitioners may label event/censor as &#x201c;1/0&#x201d;, while others will write &#x201c;not censored/censored.&#x201d; What is the requirement for the data file header? For missing values, some may leave it blank, others may write &#x201c;missing&#x201d; or &#x201c;unknown.&#x201d; These things are all trivial for statisticians but useful for clinical users to know</p>
                        <p> and bear in mind. The authors may add a separate section to emphasize the correct format.</p>
                    </list-item>
                    <list-item>
                        <p>The third missing piece is a comprehensive example. Although some figures were shown, potential users may look for a detailed example to follow. The authors had made the point that this software can perform certain survival analyses, but they failed to explain/illustrate how to perform these analyses. A major revision is necessary to answer the question of &#x201c;how&#x201d; in an example that looks similar to majority of the users&#x2019; data and analytical needs.</p>
                    </list-item>
                </list>
            </p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>No</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2894-24053">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Rupji</surname>
                            <given-names>Manali</given-names>
                        </name>
                        <aff>Emory University, USA</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>20</day>
                    <month>7</month>
                    <year>2017</year>
                </pub-date>
            </front-stub>
            <body>
                <p>We thank the reviewer for their thoughtful comments that have led to an improved paper.&#x00a0; Below, we respond individually to each comment.</p>
                <p>
                    <italic>1. The first missing component is the test of model assumptions. The validity of the</italic>
                </p>
                <p>
                    <italic>proportional hazard assumption was not described.</italic>
                </p>
                <p>We appreciate this reviewer&#x2019;s careful review of the tool and in bringing this point to our attention. A test for proportional hazards (PH) was included as part of the standard survival analysis option. We recognize that several methods exist for checking the proportional hazards assumption and results may vary depending on the method applied. In response to the reviewer&#x2019;s comment, we have included under the documentation tab, &#x201c;Read Me&#x201d;, that the proportional hazards assumption is examined by applying the method of Schoenfeld residuals. If this method indicates a violation of the proportional hazards assumption as indicated by a significant test of PH (versus alternative of non-PH) based on such residuals, then other methods should be examined that are outside the scope of this tool since results may vary depending on the method implemented (Hiller
                    <italic> et al. </italic>2015).&#x00a0; &#x00a0;&#x00a0;</p>
                <p>Ref:</p>
                <p>Hiller L, Marshall A, Dunn J.&#x00a0; Assessing violations of the proportional hazards assumption in cox regression:&#x00a0; does the chosen method matter?&#x00a0; Trials.&#x00a0; 2015; 16(suppl 2): P134.</p>
                <p>
                    <italic>2. The second missing piece is the detailed description of required data format.</italic>
                </p>
                <p>
                    <italic>For example, some practitioners may label event/censor as &#x201c;1/0&#x201d;, while others</italic>
                </p>
                <p>
                    <italic>will write &#x201c;not censored/censored.&#x201d; What is the requirement for the data file</italic>
                </p>
                <p>
                    <italic>header? For missing values, some may leave it blank, others may write</italic>
                </p>
                <p>
                    <italic>&#x201c;missing&#x201d; or &#x201c;unknown.&#x201d; These things are all trivial for statisticians but useful for</italic>
                </p>
                <p>
                    <italic>clinical users to know and bear in mind. The authors may add a separate section to emphasize the correct format</italic>
                </p>
                <p>We appreciate the reviewer&#x2019;s point of clarity here and have addressed their recommendation of including a separate paragraph that highlights the required format. This new paragraph is listed under the, &#x201c;Data and software availability&#x201d; section in the paper.</p>
                <p>
                    <italic>3. The third missing piece is a comprehensive example. Although some figures</italic>
                </p>
                <p>
                    <italic>were shown, potential users may look for a detailed example to follow. The</italic>
                </p>
                <p>
                    <italic>authors had made the point that this software can perform certain survival</italic>
                </p>
                <p>
                    <italic>analyses, but they failed to explain/illustrate how to perform these analyses. A</italic>
                </p>
                <p>
                    <italic>major revision is necessary to answer the question of &#x201c;how&#x201d; in an example that</italic>
                </p>
                <p>
                    <italic>looks similar to majority of the users&#x2019; data and analytical needs.</italic>
                </p>
                <p>In response to the reviewer&#x2019;s comment, we have updated the one example initially used for illustrating the quantile survival analysis to also illustrate the following: standard survival analysis, a test of proportional hazards assumption, and selection of a cutpoint for a continuous variable. Additionally, we have updated the specific dataset for landmark analysis, to include the time corresponding to the time-dependent cohort variable. We have left the other previous examples provided in the initial submission to supplement the &#x2018;how to&#x2019; of individual methods. While we recognize the importance of having a single example to streamline the methods of survival analysis, we also thought it important to emphasize that the methods are not dependent on each other and that they can be implemented as several stand-alone methods under one tool.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
