<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.9582.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Bioinformatics</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Building pathway graphs from BioPAX data in R</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 3 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Benis</surname>
                        <given-names>Nirupama</given-names>
                    </name>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Schokker</surname>
                        <given-names>Dirkjan</given-names>
                    </name>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Kramer</surname>
                        <given-names>Frank</given-names>
                    </name>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Smits</surname>
                        <given-names>Mari A.</given-names>
                    </name>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Suarez-Diez</surname>
                        <given-names>Maria</given-names>
                    </name>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5845-146X</uri>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Host Microbe Interactomics, Wageningen University and Research, Wageningen, The Netherlands</aff>
                <aff id="a2">
                    <label>2</label>Wageningen Livestock Research, Wageningen University and Research, Wageningen, The Netherlands</aff>
                <aff id="a3">
                    <label>3</label>Department of Medical Statistics, University Medical Center Goettingen, Goettingen, Germany</aff>
                <aff id="a4">
                    <label>4</label>Wageningen Bioveterinary Research, Wageningen University and Research, Wageningen, The Netherlands</aff>
                <aff id="a5">
                    <label>5</label>Systems and Synthetic Biology, Wageningen University and Research, Wageningen, The Netherlands</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:nirupama.benis@wur.nl">nirupama.benis@wur.nl</email>
                </corresp>
                <fn fn-type="con">
                    <p>NB built the new function and prepared the manuscript. DS guided the process and edited the manuscript. FK tested the function, added it to the package and contributed to the manuscript. MS contributed significantly to the manuscript. MSD guided the building of the function, tested it and edited the manuscript.</p>
                </fn>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>12</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2016</year>
            </pub-date>
            <volume>5</volume>
            <elocation-id>2414</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>2</day>
                    <month>12</month>
                    <year>2016</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Benis N et al.</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/5-2414/pdf"/>
            <abstract>
                <p>Biological pathways are increasingly available in the BioPAX format which uses an RDF model for data storage. One can retrieve the information in this data model in the scripting language R using the package 
                    <italic toggle="yes">rBiopaxParser</italic>, which converts the BioPAX format to one readable in R. It also has a function to build a regulatory network from the pathway information. Here we describe an extension of this function. The new function allows the user to build graphs of entire pathways, including regulated as well as non-regulated elements, and therefore provides a maximum of information. This function is available as part of the 
                    <italic toggle="yes">rBiopaxParser</italic> distribution from Bioconductor.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>rBiopaxParser</kwd>
                <kwd>R</kwd>
                <kwd>pathways</kwd>
                <kwd>BioPAX</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>This work has been financially supported by the Systems Biology Investment Programme of Wageningen University, KB-17-003.02-022. Frank Kramer&#x2019;s work is funded by the German Ministry of Education and Research (BMBF) grants FKZ01ZX1508 and FKZ031L0024A.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>The new version of the manuscript describes in more detail&#x00a0;the differences between the existing function and the new function now added to the package. The information gained with the new function is illustrated with biologically relevant material.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Biological pathways represent signalling and/or metabolic events involving protein and non-protein molecules. They are increasingly used in gene and protein expression studies to provide an aggregate score for gene sets encoding for defined biological events
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. Several pathway databases, either curated or not, have adopted the BioPAX [RRID:SCR_009881] (Biological Pathway Exchange) language as a standard for pathway representation using the RDF (Resource Description Framework) data model
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>.</p>
            <p>The structure of BioPAX is founded upon groupings, called classes, for physical entities and interactions with hierarchical networks of their sub-classes. Interactions between physical entities are represented such that conjoint interactions may form a specific pathway with defined, but different types of interactions between the involved physical entities. The BioPAX format is being actively developed, with BioPAX level 2 format focusing on metabolic pathways and BioPAX level 3 introducing full support for signalling pathways.</p>
            <p>SPARQL (Simple Protocol And RDF Query Language) is a query language able to retrieve and manipulate data stored in RDF. Pathway information is often combined with statistical data analysis using tools such as R
                <sup>
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>. The 
                <italic toggle="yes">rBiopaxParser</italic> [RRID:SCR_002744]
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup> is an R package to retrieve data stored in a BioPAX RDF format. It comes with several options that are useful to probe the data and extract specific information from it, for example participants of a pathway, stoichiometric conditions to be fulfilled for an interaction, etc.</p>
            <p>One such option is the 
                <italic toggle="yes">pathway2RegulatoryGraph</italic> (P2RG) function that converts a pathway into a graphical structure. This is extremely useful for visual representation and subsequent graph-based network analysis. The P2RG function returns the parts of a pathway that are regulated (activated or inhibited) by proteins or protein complexes; this is important to understand the role of regulated proteins in pathways. Here we present an adaptation of P2RG, denoted 
                <italic toggle="yes">pathway2Graph</italic> (P2G) which can be used to build a graph of the entire pathway, including the regulated as well as the non-regulated elements. This new function expands P2RG and can be used to investigate all different types of processes and connections of pathways instead of only studying the regulated elements of pathways. P2RG retrieves regulatory interactions, such as inhibitions and activations (shown in 
                <xref ref-type="fig" rid="f1">Figure 1</xref> as continuous edges). The new P2G additionally, retrieves protein modifications, such as translocations or complex formation, which are shown as discontinuous edges in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>Hypothetical pathway.</title>
                    <p>This cartoon of a pathway shows examples of nodes and edges that could be encountered in a BioPAX database. The nodes are proteins, complexes or other physical entities and the edges are interactions between the nodes, that represent either interactions among proteins or protein modifications. The solid edges are those detected by the P2RG function and the solid and dashed edges are detected by the P2G function.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11188/46718377-3928-4892-a2ff-e29e987478ec_figure1.gif"/>
            </fig>
            <p>P2G is specifically aimed at retrieving results from Reactome BioPAX level 3. In this paper we describe detailed information on this function which, we believe, will help rBiopaxParser users to better understand the graphs generated from pathway information. We have verified P2G results by directly querying the original BioPAX data using SPARQL.</p>
        </sec>
        <sec sec-type="materials | results">
            <title>Methods and results</title>
            <p>The classes of 
                <bold>PhysicalEntity</bold> and 
                <bold>Interaction</bold> that are used in Reactome v51 to represent information on pathways are shown in 
                <xref ref-type="fig" rid="f2">Figure 2</xref>. This graph was generated using the tool RDF2Graph
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup> on the Reactome Level 3 RDF file. The nodes in 
                <xref ref-type="fig" rid="f2">Figure 2</xref> represent classes and the edges show the possible relationships, called predicates, these classes could have in the database. As depicted in 
                <xref ref-type="fig" rid="f2">Figure 2</xref>, the node 
                <bold>Pathway</bold> could have one or more 
                <bold>PathwaySteps</bold> that consist of different types of 
                <bold>Interaction</bold> sub-classes. All the 
                <bold>Interaction</bold> nodes shown in 
                <xref ref-type="fig" rid="f2">Figure 2</xref> describe interactions between 
                <bold>PhysicalEntities</bold>, hence are connected to them by particular types of predicates as indicated in the edge labels. The 
                <bold>Interaction</bold> classes are interconnected because they can be dependent on each other. The 
                <bold>Control</bold> interaction and its sub-classes (
                <bold>Catalysis</bold> and 
                <bold>Modulation</bold>) represent signalling events. They regulate 
                <bold>BiochemicalReaction</bold> and 
                <bold>Degradation</bold> interactions which mostly represent metabolic reactions.</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>Figure 2. </label>
                <caption>
                    <title>Interplay of classes in Reactome BioPAX.</title>
                    <p>This figure shows a network of the Interaction and PhysicalEntity classes that are a part of any pathway in Reactome v51 BioPAX level 3. Nodes are classes and the directed edges are links between them in the database. The green nodes are the Pathway and PathwayStep classes, the blue nodes are Interaction classes and orange nodes are PhysicalEntity classes.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11188/46718377-3928-4892-a2ff-e29e987478ec_figure2.gif"/>
            </fig>
            <p>To create a regulatory graph, the P2RG function starts with the 
                <bold>Control</bold>, 
                <bold>Catalysis</bold> and 
                <bold>Modulation</bold> interactions that are either activating or inhibiting other interactions. This method provides a graph with plenty of information on the regulatory components of the pathway. The nodes of this graph are physical entities like 
                <bold>Proteins</bold> or 
                <bold>SmallMolecules</bold> and the directed edges are either activation or inhibition events. An example of such a reconstruction is shown in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>, where P2RG is able to retrieve the black nodes and the continuous edges. However, interactions can be missed if they are not regulated by the 
                <bold>Control</bold> interactions and could result in the loss of valuable information in the graphical representation of the pathway.</p>
            <p>The new function P2G can start with any type of interaction in order to obtain a graph with all possible physical entities involved in the pathway. Similar to the result of the P2RG function, the P2G function gives a graph with nodes that are physical entities, but the edges are not strictly activation or inhibition events. The directed edges could represent several types of events like translocation of a protein or cleavage of DNA, these are shown as dis-continuous edges in the cartoon in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>. The P2G function recognizes the continuous and the dis-continuous edges and thus retrieves the black as well as the red nodes shown in 
                <xref ref-type="fig" rid="f1">Figure 1</xref>. In some cases there is more than one documented connection between the same physical entities. In this case only the first connection is used as an edge in the final pathway graph.</p>
        </sec>
        <sec>
            <title>Comparison of two methods: P2G vs P2RG</title>
            <p>The Reactome database (v51) categorizes pathways into 27 branches. Here we worked with pathways that have more than one interaction, which resulted in 1,666 pathways. Using P2RG, graphs for 1,548 pathways were retrieved. By using the new P2G function, we were able to retrieve information on all 1,666 pathways. The highest number of pathways were obtained, using either method, in the &#x201c;Disease&#x201d; category (P2RG: 3,396 pathways, P2G: 4,888 pathways). In 85% of the cases, pathways retrieved using P2G consisted of more physical entities (nodes) than those retrieved using P2RG. 19% of the P2G retrieved pathways have at least twice the number of nodes, and 60% have at least twice the number of interactions between nodes (edges) as compared to the P2RG version, 
                <xref ref-type="fig" rid="f3">Figure 3</xref> is an example of this difference. Total numbers of nodes and edges in major Reactome categories are given in 
                <xref ref-type="table" rid="T1">Table 1</xref>. Missing information causes the appearance of disconnected graphs when reconstructing pathways. By using the new P2G function, the percentage of disconnected pathways is reduced by 9%. Additionally, P2G also has the option of only retrieving the largest connected component, for example with this option, in 
                <xref ref-type="fig" rid="f3">Figure 3</xref>. A only the top left part of the graph will be retrieved and the two other disconnected parts discarded. The pathways have directed edges because most of the interactions have direction. Edges without a direction are represented as bidirectional edges in the output of P2G.</p>
            <table-wrap id="T1" orientation="portrait" position="anchor">
                <label>Table 1. </label>
                <caption>
                    <title>Numbers of nodes and edges.</title>
                    <p>The number of nodes and edges of ten different pathways (Reactome Categories) are indicated as obtained after application of P2RG and P2G on the same set of BioPAX RDF information.</p>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Reactome Categories</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">P2RG
                                <break/>Nodes</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">P2RG
                                <break/>Edges</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">P2G
                                <break/>Nodes</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">P2G
                                <break/>Edges</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Binding and Uptake of Ligands
                                <break/>by Scavenger Receptors</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">68</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">56</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Cell-Cell communication</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">13</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">14</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">142</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">142</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Disease</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3,396</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">5,878</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">4,888</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">12,159</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Gene Expression</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">652</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">900</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,110</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,450</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Immune System</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,431</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,233</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,419</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">5,045</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Membrane Trafficking</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">86</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">121</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">181</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">382</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Metabolism</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3,082</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">5,922</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3,479</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">11,289</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Signaling Pathways</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,069</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3,274</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3,430</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">7,131</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Steroid hormones</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">72</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">147</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">81</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">333</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Transcription</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">281</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">420</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">623</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,324</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                <label>Figure 3. </label>
                <caption>
                    <title>Graphs of the pathway &#x2018;Apoptosis induced DNA fragmentation&#x2019;.</title>
                    <p>Both graphs were extracted from the same BioPAX file. 
                        <bold>A</bold>) Graph recovered using the new P2G function; 
                        <bold>B</bold>) Graph recovered using P2RG function. In both panels blue nodes are proteins or protein complexes, white nodes are non-protein entities. Black encircled nodes are found in both graphs and red encircled nodes are only detected with the new P2G function. Names of the nodes are in 
                        <xref ref-type="table" rid="T2">Table 2</xref>.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11188/46718377-3928-4892-a2ff-e29e987478ec_figure3.gif"/>
            </fig>
            <p>As an example, we discuss here the &#x2018;Apoptosis induced DNA fragmentation&#x2019; pathway, which is in the &#x201c;Programmed Cell Death&#x2019; category (
                <xref ref-type="fig" rid="f3">Figure 3</xref>). When the information in the BioPAX file is reconstructed with the P2RG function, the pathway has seven nodes (
                <xref ref-type="fig" rid="f3">Figure 3.B</xref>); with the P2G function the same pathway has 16 more nodes (
                <xref ref-type="fig" rid="f3">Figure 3.A</xref>). Detailed information on these nodes, as retrieved with P2G and P2RG, is given in 
                <xref ref-type="table" rid="T2">Table 2</xref> and demonstrates the additional information retrieved by P2G. The node &#x2018;Complex4169&#x2019;, which is found in the cytosol, translocates to the nucleus where it is called &#x2018;Complex4238&#x2019;. However, this information is only available from the P2G function because the node &#x2018;Complex4169&#x2019; does not regulate any other interaction or node. The presence of extra nodes in the P2G retrieved graph (
                <xref ref-type="fig" rid="f3">Figure 3</xref>) also visualizes that &#x2018;Complex4240&#x2019; breaks up into its&#x2019; individual components after being cleaved by Caspase-3 (&#x2018;Complex4238&#x2019;). This extra information is very useful for researchers analysing the phenomena represented by the pathway. In case P2G retrieved pathways graphs are used for analysis (e.g, differential gene expression analysis) the presence of these extra nodes may improve biological interpretation of experimental data.</p>
            <table-wrap id="T2" orientation="portrait" position="anchor">
                <label>Table 2. </label>
                <caption>
                    <title>Node names and locations of the &#x201c;Apoptosis induced DNA fragmentation&#x201d; pathway.</title>
                    <p>The first column has the names of the nodes in the pathwayas depicted in 
                        <xref ref-type="fig" rid="f3">Figure 3</xref>. The second column has the actual name of the node and the third column the cellular location of the node. All this information is represented as given in Reactome version 51. The nodes shown with a black outline in 
                        <xref ref-type="fig" rid="f3">Figure 3</xref> are shown here in bold font.</p>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Node</th>
                            <th align="left" colspan="1" rowspan="1">Name</th>
                            <th align="left" colspan="1" rowspan="1">Location</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Protein8776</td>
                            <td align="left" colspan="1" rowspan="1">DFFB</td>
                            <td align="left" colspan="1" rowspan="1">Cytosol</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Protein8777</td>
                            <td align="left" colspan="1" rowspan="1">DFFA</td>
                            <td align="left" colspan="1" rowspan="1">Cytosol</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Complex4232</td>
                            <td align="left" colspan="1" rowspan="1">DFFA : DFFB</td>
                            <td align="left" colspan="1" rowspan="1">Cytosol</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Complex4233</td>
                            <td align="left" colspan="1" rowspan="1">Importin alpha : Importin beta</td>
                            <td align="left" colspan="1" rowspan="1">Cytosol</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Complex4234</td>
                            <td align="left" colspan="1" rowspan="1">DFF : associated with Importin alpha : Importin beta</td>
                            <td align="left" colspan="1" rowspan="1">Cytosol</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Complex4235</td>
                            <td align="left" colspan="1" rowspan="1">DFF : associated with Importin alpha : Importin beta</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Complex4169</td>
                            <td align="left" colspan="1" rowspan="1">Active CASP3</td>
                            <td align="left" colspan="1" rowspan="1">Cytosol</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Complex4238</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Active CASP3</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Complex4236</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>DFFA : DFFB</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Complex4239</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Caspase cleaved DFFA</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Complex4240</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Caspase cleaved DFFA : DFFB</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Protein8779</td>
                            <td align="left" colspan="1" rowspan="1">DFFB</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Protein8784</td>
                            <td align="left" colspan="1" rowspan="1">DFFA fragment</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Protein8785</td>
                            <td align="left" colspan="1" rowspan="1">DFFA fragment</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Protein8783</td>
                            <td align="left" colspan="1" rowspan="1">DFFA fragment</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Complex4241</td>
                            <td align="left" colspan="1" rowspan="1">DFFB homodimer</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>PhysicalEntity567</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>DFFB homodimer/homooligomer</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Complex2061</td>
                            <td align="left" colspan="1" rowspan="1">Histone H1 bound chromatin DNA</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Complex4242</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>DFFB associated with chromatin</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Protein8786</td>
                            <td align="left" colspan="1" rowspan="1">HMGB1/HMGB2</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">PhysicalEntity109</td>
                            <td align="left" colspan="1" rowspan="1">DNA</td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Complex4243</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>HMGB1/HMGB2 &#x2013; bound chromatin</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>Complex4244</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>DFF cleaved DNA</bold>
                            </td>
                            <td align="left" colspan="1" rowspan="1">Nucleoplasm</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusion</title>
            <p>P2G is a useful addition to the rBiopaxParser package because it retrieves all the components of a pathway from the database and provides complete graphical information for both signalling as well as metabolic pathways. The P2G function (
                <italic toggle="yes">pathway2Graph</italic>) is currently available in the 
                <italic toggle="yes">rBiopaxParser</italic> package in the Bioconductor 3.4 release.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>The input data for this package is the BioPAX format of any pathway database. We used the Reactome database which is freely available for download in different formats from the website 
                <ext-link ext-link-type="uri" xlink:href="http://www.reactome.org">www.reactome.org</ext-link>. A subset of this database is given as 
                <xref ref-type="other" rid="SM1">Supplementary file 1</xref>.</p>
        </sec>
        <sec>
            <title>Software availability</title>
            <p>
                <bold>Software available from</bold>: The function pathway2Graph is available in the latest version of the R package 
                <italic toggle="yes">rBiopaxParser</italic> and can be installed from Bioconductor.</p>
            <p>
                <bold>Latest source code</bold>: 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/frankkramer-lab/rBiopaxParser/tree/2.12.0">https://github.com/frankkramer-lab/rBiopaxParser/tree/2.12.0</ext-link>
            </p>
            <p>
                <bold>Archived source code as at the time of publication</bold>: 
                <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.61618">http://dx.doi.org/10.5281/zenodo.61618</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup>
            </p>
            <p>
                <bold>Software license</bold>: 
                <ext-link ext-link-type="uri" xlink:href="https://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html">GPL-2</ext-link>
            </p>
        </sec>
    </body>
    <back>
        <sec id="SM1" sec-type="supplementary-material">
            <title>Supplementary material</title>
            <p>
                <bold>Subset of Reactome database.</bold>
            </p>
            <p>This .owl file contains information on four pathways from the Reactome v51 BioPAX level 3 database. This format can be loaded into the R environment using the rBiopaxParser package and used to test the P2G function and obtain graphs which were used as the basis for 
                <xref ref-type="fig" rid="f3">Figure 3</xref>. More information on loading and processing this file format can be found in the package documentation.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/9582/19c8ac7b-96b2-4db4-a78a-c2defed535ae.owl">Click here to access the data </ext-link>
            </p>
        </sec>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Mitrea</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Taghavi</surname>
                            <given-names>Z</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bokanizad</surname>
                            <given-names>B</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Methods and approaches in the topology-based analysis of biological pathways.</article-title>
                    <source>
						
                        <italic toggle="yes">Front Physiol.</italic>
					</source>
                    <year>2013</year>;<volume>4</volume>:<fpage>278</fpage>.
                    <pub-id pub-id-type="pmid">24133454</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fphys.2013.00278</pub-id>
                    <pub-id pub-id-type="pmcid">3794382</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Demir</surname>
                            <given-names>E</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cary</surname>
                            <given-names>MP</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Paley</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>The BioPAX community standard for pathway data sharing.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Biotechnol.</italic>
					</source>
                    <year>2010</year>;<volume>28</volume>(<issue>9</issue>):<fpage>935</fpage>&#x2013;<lpage>942</lpage>.
                    <pub-id pub-id-type="pmid">20829833</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.1666</pub-id>
                    <pub-id pub-id-type="pmcid">3001121</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Kramer</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bayerlov&#x00e1;</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bei&#x00df;barth</surname>
                            <given-names>T</given-names>
                        </name>
					</person-group>:
                    <article-title>R-based software for the integration of pathway data into bioinformatic algorithms.</article-title>
                    <source>
						
                        <italic toggle="yes">Biology (Basel).</italic>
					</source>
                    <year>2014</year>;<volume>3</volume>(<issue>1</issue>):<fpage>85</fpage>&#x2013;<lpage>100</lpage>.
                    <pub-id pub-id-type="pmid">24833336</pub-id>
                    <pub-id pub-id-type="doi">10.3390/biology3010085</pub-id>
                    <pub-id pub-id-type="pmcid">4009765</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Kramer</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bayerlov&#x00e1;</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Klemm</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>rBiopaxParser--an R package to parse, modify and visualize BioPAX data.</article-title>
                    <source>
						
                        <italic toggle="yes">Bioinformatics.</italic>
					</source>
                    <year>2013</year>;<volume>29</volume>(<issue>4</issue>):<fpage>520</fpage>&#x2013;<lpage>522</lpage>.
                    <pub-id pub-id-type="pmid">23274212</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/bts710</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>van Dam</surname>
                            <given-names>JC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Koehorst</surname>
                            <given-names>JJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Schaap</surname>
                            <given-names>PJ</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>RDF2Graph a tool to recover, understand and validate the ontology of an RDF resource.</article-title>
                    <source>
						
                        <italic toggle="yes">J Biomed Semantics.</italic>
					</source>
                    <year>2015</year>;<volume>6</volume>:<fpage>39</fpage>.
                    <pub-id pub-id-type="pmid">26500754</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s13326-015-0038-9</pub-id>
                    <pub-id pub-id-type="pmcid">4619317</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Kramer</surname>
                            <given-names>F</given-names>
                        </name>
					</person-group>:
                    <article-title>rBiopaxParser 2.12.0 [Data set].</article-title>
                    <source>
					
                        <italic toggle="yes">Zenodo.</italic>
					</source>
                    <year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.61618">Data Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report18420">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.11188.r18420</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Fink</surname>
                        <given-names>Lynn</given-names>
                    </name>
                    <xref ref-type="aff" rid="r18420a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r18420a1">
                    <label>1</label>University of Queensland, Diamantina Institute, Woolloongabba, QLD, Australia</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>19</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Fink L</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport18420" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9582.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The article is much clearer now.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report18423">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.11188.r18423</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Floor</surname>
                        <given-names>Stephen N.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r18423a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-9965-9694</uri>
                </contrib>
                <aff id="r18423a1">
                    <label>1</label>Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>13</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Floor SN</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport18423" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9582.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have addressed the (minor) issues I had with the article previously, thanks!</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17476">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10320.r17476</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ellrott</surname>
                        <given-names>Kyle</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17476a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r17476a1">
                    <label>1</label>Oregon Health &amp; Science University, Portland, OR, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>2</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Ellrott K</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17476" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9582.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors of this paper describe a new function provided by the rBiopaxParser library, which is a R based system for parsing BioPax documents. BioPax is coded in RDF, which is a linked data format that describes the subject matter using graph triples in the form of a Subject, Predicate and Object. Their pathway2Graph and the old gathwayRegulatoryGraph essentially attempt to do graph transformations, taking the RDF graph and converting it to a graph that is suitable for analysis.&#x00a0;</p>
            <p> </p>
            <p> The paper&#x2019;s primary metric for demonstrating improvements in their method is measuring the number of non-zero pathways retrieved after extracting data from Reactome. In addition, I tested code and their supplied supplemental data.&#x00a0;</p>
            <p> </p>
            <p> ```</p>
            <p> &gt; library(rBiopaxParser)</p>
            <p> &gt; biopax=readBiopax("19c8ac7b-96b2-4db4-a78a-c2defed535ae.owl")</p>
            <p> &gt; a &lt;- pathway2RegulatoryGraph(biopax, "Pathway1020")</p>
            <p> &gt; b &lt;- pathway2Graph(biopax, "Pathway1020")</p>
            <p> </p>
            <p> &gt; nodes(a)</p>
            <p> &#x00a0;[1] "CASP3" &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;"CASP3(176-277)"</p>
            <p> &#x00a0;[3] "DFFA(1-?)" &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;"DFFB"</p>
            <p> &#x00a0;[5] "DFFA(118-331)" &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;"DFFA(1-117)"</p>
            <p> &#x00a0;[7] "DFFA(225-331)" &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0;"DFFA(118-224)"</p>
            <p> &#x00a0;[9] "Histone H1" &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; &#x00a0; "DFF40 homodimer/homooligomer"</p>
            <p> [11] "HMGB1/HMGB2"</p>
            <p> </p>
            <p> &gt; nodes(b)</p>
            <p> &#x00a0;[1] "Protein8776" &#x00a0; &#x00a0; &#x00a0; "Protein8777" &#x00a0; &#x00a0; &#x00a0; "Complex4232"</p>
            <p> &#x00a0;[4] "Complex4233" &#x00a0; &#x00a0; &#x00a0; "Complex4234" &#x00a0; &#x00a0; &#x00a0; "Complex4235"</p>
            <p> &#x00a0;[7] "Complex4169" &#x00a0; &#x00a0; &#x00a0; "Complex4238" &#x00a0; &#x00a0; &#x00a0; "Complex4236"</p>
            <p> [10] "Complex4239" &#x00a0; &#x00a0; &#x00a0; "Complex4240" &#x00a0; &#x00a0; &#x00a0; "Protein8779"</p>
            <p> [13] "Protein8784" &#x00a0; &#x00a0; &#x00a0; "Protein8785" &#x00a0; &#x00a0; &#x00a0; "Protein8783"</p>
            <p> [16] "Complex4241" &#x00a0; &#x00a0; &#x00a0; "PhysicalEntity567" "Complex2061"</p>
            <p> [19] "Complex4242" &#x00a0; &#x00a0; &#x00a0; "Protein8786" &#x00a0; &#x00a0; &#x00a0; "PhysicalEntity109"</p>
            <p> [22] "Complex4243" &#x00a0; &#x00a0; &#x00a0; "Complex4244"</p>
            <p> </p>
            <p> ```</p>
            <p> </p>
            <p> From the old version (a) to the new extraction (b), for this particular pathway the number of nodes went from 11 to 23. Interestingly, the names of the vertices became less descriptive in the newer method, going from gene names like &#x2018;CASP3&#x2019; to strings extracted from the RDF urls, like &#x2018;Protein8776&#x2019;.</p>
            <p> </p>
            <p> It would seem that the authors claim, that the number of extracted elements from the BioPax files is accurate. But what is slightly confusing is the significance of this change. Was the original method faulty and this is a bug fix? Or is the method being updated to deal with the BioPax standard as it changes from version 1 to 2 to 3?&#x00a0;</p>
            <p> </p>
            <p> This seems like a minor but necessary change to the library to make it have a better extract information from BioPax. More explanation about the nature of this change, and how the new parsing strategy improved how the library was able to deal with all of the various types of relationship classes in the BioPax format would make a big difference in helping to illustrate the improvements mentioned in this paper.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17940">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10320.r17940</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Coller</surname>
                        <given-names>Hilary Ann</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17940a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r17940a1">
                    <label>1</label>Department of Molecular, Cell and Developmental Biology, University of California Los Angeles, Los Angeles, CA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>24</day>
                <month>11</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Coller HA</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17940" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9582.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have developed a new function that allows the user to build a regulatory network in a graph format based on pathway information. In the version that the authors developed, the output graph includes regulatory and non-regulatory interactions and allows the viewer to more fully comprehend the underlying network. Nodes in the network represent classes, while the edges show the relationships among these classes. An example of the approach is provided with the Reactome database. The authors&#x2019; approach, P2G, was used to analyze data for 1,666 pathways. P2G returned more nodes than was retrieved by the earlier version in 85% of the cases. The software will be available as part of the Bioconductor 3.4 release. This will likely be a valuable addition to the Bioconductor package that will provide scientists with a means for generating graphical and intuitive networks from gene expression and metabolic data. The manuscript is clear with an appropriate title and abstract. The article is clearly written and the conclusions are based on the data.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <sub-article article-type="response" id="comment2337-17940">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Benis</surname>
                            <given-names>Nirupama</given-names>
                        </name>
                        <aff>Wageningen University and Research, The Netherlands</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>30</day>
                    <month>11</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for your comments.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17305">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10320.r17305</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Floor</surname>
                        <given-names>Stephen N.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17305a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-9965-9694</uri>
                </contrib>
                <aff id="r17305a1">
                    <label>1</label>Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>7</day>
                <month>11</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Floor SN</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17305" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9582.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have developed a new function in the 
                <italic>rBiopaxParser </italic>package to generate figures from BioPAX formatted biological pathway data. This new function, called 
                <italic>pathway2Graph</italic> (P2G) replaces an older function called 
                <italic>pathway2RegulatoryGraph</italic> (P2RG). P2G includes more interaction terms than P2RG, and therefore generates more complete interaction graphs. That said, as written the changes are mysterious - why did the original function limit the interaction types when generating graphs and what is advantageous about including more interactions? Addition of the P2G function is a small improvement to the rBiopaxParser package that will be useful, but the discussion of P2G's advantages and disadvantages should be expanded.</p>
            <p> </p>
            <p> Specific points: 
                <list list-type="order">
                    <list-item>
                        <p>The language throughout is highly technical, potentially compromising its readability to end users.</p>
                    </list-item>
                    <list-item>
                        <p>The edge labels in Figure 1 are very difficult to read. As these are discussed in the text, their font size and/or weight should be increased.</p>
                    </list-item>
                    <list-item>
                        <p>It would increase the readability of this software tool article if the authors described a (biological) scenario where the P2G function would be uniquely useful compared to P2RG. It&#x2019;s obvious that including more interaction types will lead to more complete graphs, but in what scenario would this be useful for a user?</p>
                    </list-item>
                    <list-item>
                        <p>Why were all interaction types not originally included in P2RG? The advantages and any potential disadvantages of including all interaction types should be discussed more.</p>
                    </list-item>
                    <list-item>
                        <p>Currently, the difference between P2G and P2RG is rather minor (1,666 vs 1,548) &#x2013; might this difference change in the future if more edges are added through the interaction types that are unique to P2G?</p>
                    </list-item>
                </list> </p>
            <p> The work is technically sound and presents a useful extension to the rBiopaxParser package, but the paper describing this work will be useful to a broader audience if changes similar to those above are incorporated.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2338-17305">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Benis</surname>
                            <given-names>Nirupama</given-names>
                        </name>
                        <aff>Wageningen University and Research, The Netherlands</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>30</day>
                    <month>11</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for your comments. In the new version we have expanded the Introduction with non-technical specifics that should explain the basic differences between the functions to a broader audience. Figure 2 (Figure 1 in the previous version) has been changed to increase visibility of edge labels. In order to explain the differences between the outputs of the two functions we have added a section in the Methods and Results section where we describe a particular pathway (&#x2018;Apoptosis induced DNA fragmentation&#x2019;) in terms of the extra information gained by using the new function. A new table (Table 2) with more biological information on this pathway has also been added.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17096">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10320.r17096</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Fink</surname>
                        <given-names>Lynn</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17096a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r17096a1">
                    <label>1</label>University of Queensland, Diamantina Institute, Woolloongabba, QLD, Australia</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>2</day>
                <month>11</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Fink L</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17096" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9582.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This articles describes the addition of a new function to the extant rBiopaxParser R library. This new function converts a BioPAX-formatted pathway of gene or protein interactions into a graphical structure that is human-viewable. This function supercedes an earlier function which performed the same task, but was unable to convert all nodes and egdes in the pathway. This resulted in a loss of information. This loss is now remedied, however, with the new and improved function.</p>
            <p> </p>
            <p> I installed the package and tested it and it seemed to work flawlessly, although I didn't try anything fancy. I imagine that the new function is a significant advance for people working on these pathways routinely as it must have been frustrating to have missing data.</p>
            <p> </p>
            <p> My only reservation about the paper is that I was unclear on the point of the new function until I'd read most of the paper. Perhaps the authors could extend the last paragraph of the introduction to be more clear about why the new function was necessary and how it has improved the R package.&#x00a0;Although this request is a small change, the article is confusing as it is and making this would be a big improvement.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2339-17096">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Benis</surname>
                            <given-names>Nirupama</given-names>
                        </name>
                        <aff>Wageningen University and Research, The Netherlands</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>30</day>
                    <month>11</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for your comments. The Introduction has been expanded with an image (now Figure 1) to emphasize the differences between the two functions.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
