<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.9050.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Bioinformatics</subject>
                    </subj-group>
                    <subj-group>
                        <subject>Genomics</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>CoNet app: inference of biological association networks using Cytoscape</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Faust</surname>
                        <given-names>Karoline</given-names>
                    </name>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Raes</surname>
                        <given-names>Jeroen</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Center for the Biology of Disease, VIB, Leuven, 3000, Belgium</aff>
                <aff id="a2">
                    <label>2</label>Microbiology Unit, Faculty of Sciences and Bioengineering Sciences, VUB, Brussel, 1050, Belgium</aff>
                <aff id="a3">
                    <label>3</label>Department of Microbiology and Immunology, REGA Institute, KU Leuven, 3000, Belgium</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:karoline.faust@vib-kuleuven.be">karoline.faust@vib-kuleuven.be</email>
                </corresp>
                <fn fn-type="con">
                    <p>K.F. developed the Cytoscape plugin, J.R. initiated and supervised the work. K.F. wrote the paper. Both authors agreed to the final content of the article.</p>
                </fn>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>The authors declare that they have no competing interests.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>27</day>
                <month>6</month>
                <year>2016</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2016</year>
            </pub-date>
            <volume>5</volume>
            <elocation-id>1519</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>21</day>
                    <month>6</month>
                    <year>2016</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Faust K and Raes J</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/5-1519/pdf"/>
            <abstract>
                <p>Here we present the Cytoscape app version of our association network inference tool CoNet. Though CoNet was developed with microbial community data from sequencing experiments in mind, it is designed to be generic and can detect associations in any data set where biological entities (such as genes, metabolites or species) have been observed repeatedly. The CoNet app supports Cytoscape 2.x and 3.x and offers a variety of network inference approaches, which can also be combined. Here we briefly describe its main features and illustrate its use on microbial count data obtained by 16S rDNA sequencing of arctic soil samples. The CoNet app is available at: 
                    <ext-link ext-link-type="uri" xlink:href="http://apps.cytoscape.org/apps/conet">http://apps.cytoscape.org/apps/conet</ext-link>.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>network generation</kwd>
                <kwd>network construction</kwd>
                <kwd>network inference</kwd>
                <kwd>association networks</kwd>
                <kwd>microbial networks</kwd>
                <kwd>CoNet</kwd>
                <kwd>Cytoscape</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>K. F. and J.R. are supported by the Research Foundation Flanders (FWO) and the Flemish agency for Innovation by Science and Technology (IWT).</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Modern sequencing technology in combination with dedicated analysis pipelines allows determining the relative abundances of microbial community members, thereby obtaining microbial count data. Such community profiling experiments have been carried out for thousands of samples from a variety of ecosystems, ranging from the world&#x2019;s oceans (
                <xref ref-type="bibr" rid="ref-1">Bork 
                    <italic toggle="yes">et al.</italic>, 2015</xref>) to the human gut (
                <xref ref-type="bibr" rid="ref-6">Falony 
                    <italic toggle="yes">et al.</italic>, 2016</xref>; 
                <xref ref-type="bibr" rid="ref-19">The Human Microbiome Project Consortium, 2012</xref>).</p>
            <p>The analysis of species abundance patterns has a long tradition in ecology (
                <xref ref-type="bibr" rid="ref-4">Connor &amp; Simberloff, 1979</xref>; 
                <xref ref-type="bibr" rid="ref-5">Diamond, 1975</xref>; 
                <xref ref-type="bibr" rid="ref-11">Gotelli &amp; McCabe, 2002</xref>). More specifically, co-occurrence analysis detects significant co-occurrences or mutual exclusions across samples, which are interpreted as representing ecological relationships such as mutualism or competition or being due to similar responses to environmental factors. Co-occurrence analysis is an instance of network inference, an exploratory data analysis technique that attempts to unravel relationships between objects from repeated observations. The large number of microbial count tables resulting from the multitude of recent sequencing projects (e.g. 
                <xref ref-type="bibr" rid="ref-1">Bork 
                    <italic toggle="yes">et al.</italic>, 2015</xref>; 
                <xref ref-type="bibr" rid="ref-6">Falony 
                    <italic toggle="yes">et al.</italic>, 2016</xref>; 
                <xref ref-type="bibr" rid="ref-10">Gilbert 
                    <italic toggle="yes">et al.</italic>, 2014</xref>; 
                <xref ref-type="bibr" rid="ref-19">The Human Microbiome Project Consortium, 2012</xref>) opens the way to unraveling the complex relationships between microorganisms from their abundances across samples. CoNet was developed to carry out microbial network inference, but its generic design makes it applicable to any data set where objects have been observed repeatedly.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods/Implementation</title>
            <p>The CoNet app wraps the CoNet command line tool. The command line and Cytoscape 2.&#x00d7; app version are implemented in Java 1.6, whereas the Cytoscape 3.&#x00d7; app version requires Java 1.7.</p>
            <sec>
                <title>Implementation challenges and decisions</title>
                <p>In general, the CoNet app is designed with minimum contact to Cytoscape, to ensure consistent behavior across different Cytoscape versions and to ease porting to future Cytoscape versions. The CoNet app is linked to Cytoscape only via its main menu and graph visualization classes. The Cytoscape-version-specific implementation of the graph visualization class is loaded via reflection at run time and is entirely separated from graph generation.</p>
                <p>A major challenge for the implementation of the CoNet app is inclusion of the large number of options available in CoNet, which allows users to customize each network inference step, from data preprocessing via threshold setting, network construction and assessment of significance. This problem was solved by implementing a single user input handling class, which collects and checks user input from the various menus and submits it to CoNet once the GO button is pushed. This design allows to export and to read in user settings files, which make experiments carried out with the CoNet app more reproducible.</p>
                <p>Another challenge is the command line support. Network inference from large data sets is not feasible within Cytoscape and CoNet is best run on command line for these cases. To facilitate this step for the inexperienced user, the current settings of the CoNet app can be exported as a command line call, by clicking the "Generate command line call" button. This call can then be executed on command line by including the CoNet jar file in the class path. Networks generated on command line can be loaded either via Cytoscape network import functions (if saved in gml format ((
                    <xref ref-type="bibr" rid="ref-12">Himsolt</xref>)) or more conveniently via the CoNet app (if saved in the custom gdl format). The CoNet app's manual includes a step-by-step tutorial for command line usage.</p>
                <p>The CoNet app also integrates the popular network inference R Bioconductor package minet (
                    <xref ref-type="bibr" rid="ref-18">Meyer 
                        <italic toggle="yes">et al.</italic>, 2008</xref>). We decided to integrate it loosely via Rserve, a Java-R bridge capable of transferring R objects to Java and 
                    <italic toggle="yes">vice versa</italic> (
                    <ext-link ext-link-type="uri" xlink:href="http://rforge.net/Rserve/">http://rforge.net/Rserve/</ext-link>). Thus, advanced users can install and launch the Rserve server in R and configure the Rserve client settings (i.e. host and port) in CoNet app's configuration menu. The CoNet app's manual explains Rserve installation and usage.</p>
                <p>Finally, we also implemented solutions for error and help display. The CoNet app displays help pages in html format, which allows the user to follow links within these pages. The CoNet app's pdf manual is compiled from the help pages using prince (
                    <ext-link ext-link-type="uri" xlink:href="http://www.princexml.com/">http://www.princexml.com/</ext-link>). Each menu is linked to its specific help page, easing navigation.</p>
                <p>When an error has been captured, an error report is generated that includes the error message as well as the CoNet app's current settings.</p>
            </sec>
            <sec>
                <title>Network inference workflow</title>
                <p>CoNet takes a presence/absence, count or abundance matrix as input, where rows represent the objects of interest and columns their observations across locations or time points. Optionally, a second input matrix can be provided. This is of interest when two different measurements have been made for the same samples, for instance counts of microorganisms and concentrations of metabolites. CoNet's output consists of a network where significantly associated objects are connected by edges.</p>
                <p>Depending on the data type, a number of filters needs to be applied. For instance, for 16S rDNA count data, taxa with too few non-zero observations need to be removed and the data needs to be normalized or rarefied to account for sequencing depth differences. In the next step, the user can select from a number of different correlations (Pearson, Spearman, Kendall), similarities (mutual information, Steinhaus, distance correlation etc.) or dissimilarities (Kullback Leibler, Euclidean, Bray Curtis, Jensen-Shannon 
                    <italic toggle="yes">etc.</italic>) to score the association strength between the objects. For presence/absence (also termed incidence) data, the hypergeometric distribution or Jaccard distance can be chosen for the same purpose. CoNet's special strength is its capability to combine multiple such measures and/or to combine these measures with other network inference algorithms, e.g. those implemented in minet. The idea behind such an ensemble approach to network inference is to exploit the fact that different methods make different mistakes. If erroneous edges predicted by one method are not supported by the others, they can be filtered out, thereby reducing the number of false positives. The thresholds for the measures can be either set manually (using sliding windows for bounded measures) or automatically, by specifying the desired number of edges in the output network. The network can then be displayed either as a multigraph (with as many edges between two objects as selected measures) or as a graph (where scores of individual measures are combined). Optionally, the significance of the associations can be computed, 
                    <italic toggle="yes">e.g.</italic> with a permutation test. Multiple testing correction can be performed with either Bonferroni or Benjamini-Hochberg procedures. 
                    <xref ref-type="fig" rid="f1">Figure 1</xref> summarizes this workflow.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Network inference workflow in CoNet.</title>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9740/7968dcee-8c6a-4181-93fb-f5b5fbe313fc_figure1.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Special features</title>
                <p>CoNet offers a series of features that distinguish it from other network inference tools, such as its support for object groups. This feature allows a user to assign objects to different groups (
                    <italic toggle="yes">e.g</italic>. metabolites and enzymes). Relationships can then be inferred only between different object types (resulting in a bipartite network) or only within the same object type. CoNet's treatment of two input matrices is built upon this feature.</p>
                <p>Furthermore, CoNet can handle row metadata, which allows for instance to infer links between objects at different hierarchical levels (
                    <italic toggle="yes">e.g.</italic> between order Lactobacillales and genus Ureaplasma) while preventing links between different levels of the same hierarchy (e.g.  Lactobacillales and Lactobacillaceae). CoNet can also parse sample metadata such as temperature or oxygen concentration, which are then correlated with the objects in the input matrix while being excluded from normalization. In addition, CoNet recognizes abundance tables generated from biom files (
                    <xref ref-type="bibr" rid="ref-17">McDonald 
                        <italic toggle="yes">et al.</italic>, 2012</xref>) and, in its Cytoscape 3.&#x00d7; version, reads biom files in HDF5 format directly, using the BiomIO Java library (
                    <xref ref-type="bibr" rid="ref-14">Ladau</xref>). Phylogenetic lineages in these tables are automatically parsed and displayed as node attributes of the resulting network. CoNet also computes a few node properties, such as a node's total edge number as well as the number of positive and negative edges, the total row sum and the number of samples in which the object was observed (e.g. was different from zero or a missing value).</p>
                <p>To ease the selection of suitable preprocessing steps, CoNet can display input matrix properties and recommendations based on them. Importantly, CoNet can also handle missing values, by omitting sample pairs with missing values from the association strength calculation. Finally, CoNet supports a few input and output network formats absent in Cytoscape, including adjacency matrices (import), dot (the format of GraphViz (
                    <ext-link ext-link-type="uri" xlink:href="http://www.graphviz.org/">http://www.graphviz.org/</ext-link>)) and VisML (VisANT's format (
                    <xref ref-type="bibr" rid="ref-13">Hu 
                        <italic toggle="yes">et al.</italic>, 2013</xref>)) (both for export).</p>
            </sec>
        </sec>
        <sec sec-type="results">
            <title>Results</title>
            <sec>
                <title>Use case: microbial relationships in the arctic soil</title>
                <p>We demonstrate the abilities of the CoNet app on a real-world example taken from the Qiita database (
                    <xref ref-type="bibr" rid="ref-20">The Qiita Development Team, 2015</xref>). The Qiita database, which merges the previously separated QIIME and EMP databases, is a rich resource for processed 16S rDNA sequence data: each study is accompanied by a microbial count file in biom format computed from the raw sequence data with the QIIME pipeline (
                    <xref ref-type="bibr" rid="ref-2">Caporaso 
                        <italic toggle="yes">et al.</italic>, 2010</xref>).</p>
                <p>In our example, we will demonstrate how to build an association network from microbial count data obtained from arctic soil samples (
                    <xref ref-type="bibr" rid="ref-3">Chu 
                        <italic toggle="yes">et al.</italic>, 2010</xref>). This data set was chosen for its sample number (sufficient to compute associations but short run times) as well as for the biological insights that are gained from the network analysis. The example showcases the CoNet app's ability to compute associations between higher taxonomic levels and to take environmental metadata into account, which is important for the interpretation of predicted microbial relationships.</p>
                <p>In the Qiita database, the arctic soil study can be found under the title "Soil bacterial diversity in the Arctic is not fundamentally different from that found in other biomes" (study identifier: 104, see 
                    <xref ref-type="other" rid="SM">Supplementary material</xref>). This data set consists of 52 soil samples from the arctic tundra, which were sequenced with Roche FLX using primers targeting the V1V2 region of the 16S rDNA. The processed data can be downloaded from the Qiita study page (in Data Types, click on 16S, then click on the URL appearing below, expand the Files network, click on the file object containing BIOM in its name and then download the file with suffix .biom). The study also provides a mapping file with sample metadata (on the Qiita study page, click Sample Information and then the Sample Info button). We extract the pH of each sample by loading the sample information file into Excel, selecting the sample_name and ph columns and saving them to a separate, tab-delimited file.</p>
            </sec>
            <sec>
                <title>Combining multiple measures</title>
                <p>The CoNet app is composed of the main window and several menus, including a "Data" menu with input and output options, a "Preprocessing and filter" menu, a "Methods" menu to select network construction methods, a "Merge" menu where the user can specify how results from different network construction methods should be merged, a "Randomization" menu for the assessment of edge significance and finally a "Config" menu for configuration.</p>
                <p>In the following, we will build a network from the arctic tundra biom file. First, in the "Data" menu, the arctic tundra biom file is selected and the option "Biom file in HDF5" is enabled (direct biom file parsing is only supported in the Cytoscape 3.&#x00d7; version of the CoNet app). In the sub-menu "Metadata and Features", the option "explore links between higher-level taxa" is enabled together with the option "Parent-child exclusion" to compute correlations between higher-level taxa while preventing edges between taxa within the same lineage (e.g. Lactobacillales and Lactobacillaceae). Sample metadata (pH in this case) are passed to the CoNet app via the "Select file" button in the "Features" corner of the "Metadata and Features" sub-menu. Both "Transpose" and "Match samples" need to be enabled to convert sample metadata into rows and to match sample metadata identifiers to biom file identifiers.</p>
                <p>In the "Preprocessing and filtering" menu, the parameter "row_minocc" is set to 20 to discard taxa with less than 20 non-zero values across samples. The sum of the discarded rows can be kept by enabling "Keep sum of filtered rows". In addition, "col_norm" is activated to divide each matrix entry by the sum of its corresponding column, thus avoiding the inference of spurious links due to sequencing depth differences.</p>
                <p>In the "Methods" menu, Pearson, Spearman, Bray Curtis, Kullback Leibler and mutual information are selected. Their thresholds can be automatically set such that 1,000 top-scoring and 1,000 bottom-scoring edges (for anti-correlations) are included for each measure in the initial network, by typing "1000" as the value of the edge selection parameter and enabling "Top and bottom" in the "Threshold setting" sub-menu. At this stage, pushing "GO" will result in a multigraph, where microbial taxa are connected by up to five different measure-specific edges.</p>
            </sec>
            <sec>
                <title>Assessment of edge significance</title>
                <p>The significance of edges, that is their p-values, is computed in two CoNet launches, the first of which generates the permutation distributions and an intermediate network and the second the bootstrap distributions and the final network.</p>
                <p>For the first launch, the user selects the "edgeScores" routine in the "Randomization" menu, with "shuffle_rows" as resampling parameter, and enables "Renormalize" (for details on renormalization, see 
                    <xref ref-type="bibr" rid="ref-7">Faust 
                        <italic toggle="yes">et al.</italic>, 2012</xref>). The user then specifies a folder and a file name to export permutation scores and enables "Save randomizations" in the "Save" corner of the "Randomization" menu. Pushing "GO" will then launch the computation of edge- and measure-specific permutation distributions. Permutation alone is sufficient to set p-values on the edges, but we found that a combination of permutation and bootstrap is more stringent (
                    <xref ref-type="bibr" rid="ref-7">Faust 
                        <italic toggle="yes">et al.</italic>, 2012</xref>). Thus, the network generated in this first step should be considered as an intermediate result.</p>
                <p>In order to compute bootstrap distributions and the final network, the user prepares a second CoNet launch, by selecting the "bootstrap" resampling method and a p-value merging method, for instance "brown" (Brown 1975), in the "Randomization" menu. P-value merging will unite measure-specific p-values for the same edge into a single edge-specific p-value. "Renormalize" is disabled and "benjaminihochberg" is selected as the multiple testing correction method. In the "Save" corner of the "Randomization" menu, another file name should be specified to store bootstrap distributions in a separate file. P-values of the final network are computed from both permutation and bootstrap distributions, thus previously generated permutation distributions have to be loaded into the CoNet app. This is done by selecting the permutation file generated in the previous step with the "Load null distributions" button. Pushing "GO" will then result in the final network, shown in 
                    <xref ref-type="fig" rid="f2">Figure 2A</xref>.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <p>
                            <bold>A</bold>: Result network obtained for bacterial counts from the arctic soil 16S rDNA example data set, downloaded from the Qiita database. 
                            <bold>B</bold>: Same as A, but with negative edges discarded. The remaining edges form clusters with different microbial composition. 
                            <bold>C</bold>: Neighbors of the pH node form two clusters: one correlated and the other anti-correlated to pH, which reflects the opposite pH preferences of the cluster members.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9740/7968dcee-8c6a-4181-93fb-f5b5fbe313fc_figure2.gif"/>
                </fig>
                <p>The CoNet app does not layout resulting networks, to leave the choice of the (potentially time-consuming) layout algorithm to the user. Here, the "Organic" layout from yFiles was applied and nodes were colored according to their class using Cytoscape's node coloring functionality.</p>
                <p>Once permutation and bootstrap distributions have been computed, network generation can be quickly repeated by loading both distributions via the "Load null distributions" and "Load randomization file" buttons, respectively. 
                    <xref ref-type="fig" rid="f2">Figure 2B</xref> shows the same network re-generated from pre-computed distributions, but with "positive edges only" enabled in the "Preprocessing and filter" menu. 
                    <xref ref-type="fig" rid="f2">Figure 2C</xref> displays the neighbors of the pH node, which were selected and instantiated as a separate network using Cytoscape's node selection function "First neighbors of selected nodes" for undirected networks.</p>
                <p>The computation of permutation and bootstrap distributions took ~5 minutes each for 100 iterations on a standard laptop.</p>
                <p>Input and settings files for the use case can be found in the 
                    <xref ref-type="other" rid="SM">Supplementary material</xref>.</p>
            </sec>
        </sec>
        <sec sec-type="discussion">
            <title>Discussion</title>
            <sec>
                <title>Insights into arctic soil microbiota</title>
                <p>After removal of negative edges, the arctic soil network forms two prominent clusters (
                    <xref ref-type="fig" rid="f2">Figure 2B</xref>), which are enriched with representatives of different classes, such that one cluster features mostly members of the Solibacteres and Acidobacteria, whereas the other consists mostly of Alphaproteobacteria and Chloracidobacteria. When examining the neighbors of the pH node (
                    <xref ref-type="fig" rid="f2">Figure 2C</xref>), members of the former cluster are found to be anti-correlated to pH, whereas members of the latter are correlated to it. Thus, network analysis helps to identify pH as a major driving factor for microbial soil communities, as has been found previously (
                    <xref ref-type="bibr" rid="ref-9">Fierer &amp; Jackson, 2006</xref>). The correlations with pH have also been described by the authors of the soil study (
                    <xref ref-type="bibr" rid="ref-3">Chu 
                        <italic toggle="yes">et al.</italic>, 2010</xref>). However, network analysis adds more details (correlations are computed on lower taxonomic levels) and discovers additional taxonomic groups impacted by pH, e.g. Chloracidobacteria. Furthermore, network inference suggests candidates for cross-feeding. For instance, the neighboring nodes of 
                    <italic toggle="yes">Bradyrhizobium</italic>, a nitrogen fixer that produces ammonium, may represent taxa that depend on ammonium as main nitrogen source.</p>
            </sec>
            <sec>
                <title>Related apps</title>
                <p>The CoNet app offers mostly similarity-based network inference. Complementary apps that implement various Bayesian network inference algorithms are Cyni Toolbox (
                    <ext-link ext-link-type="uri" xlink:href="http://www.proteomics.fr/Sysbio/CyniProject">http://www.proteomics.fr/Sysbio/CyniProject</ext-link>), bayelviraApp (
                    <ext-link ext-link-type="uri" xlink:href="http://apps.cytoscape.org/apps/bayelviraapp">http://apps.cytoscape.org/apps/bayelviraapp</ext-link>) and MONET (
                    <xref ref-type="bibr" rid="ref-15">Lee &amp; Lee, 2005</xref>). ARACNE (
                    <ext-link ext-link-type="uri" xlink:href="http://apps.cytoscape.org/apps/aracne">http://apps.cytoscape.org/apps/aracne</ext-link>) exploits mutual information to build networks (
                    <xref ref-type="bibr" rid="ref-16">Margolin 
                        <italic toggle="yes">et al.</italic>, 2006</xref>). ExpressionCorrelation (
                    <ext-link ext-link-type="uri" xlink:href="http://www.baderlab.org/Software/ExpressionCorrelation">http://www.baderlab.org/Software/ExpressionCorrelation</ext-link>) and MetaNetter (
                    <ext-link ext-link-type="uri" xlink:href="http://apps.cytoscape.org/apps/metanetter">http://apps.cytoscape.org/apps/metanetter</ext-link>) also offer similarity-based network inference techniques, in case of the former specialized to gene expression and in the latter to metabolomics data. Results from these different network inference approaches could be combined with Cytoscape tools such as Merge Networks.</p>
            </sec>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusion</title>
            <p>In this article, we have demonstrated the CoNet app on a typical 16S data set. Alternative use cases are for instance the inference of function networks (
                <italic toggle="yes">i.e.</italic> co-occurrence of orthologous gene groups) from metagenomics or metatranscriptomics data or taxon-metabolite networks from 16S and metabolomics data.</p>
            <p>We hope that CoNet's integration into Cytoscape will lower the barrier for its employment by users less familiar with the command line version. Due to its flexibility and comprehensiveness, CoNet can be useful in a variety of applications and we thus hope it will find a broad user base.</p>
        </sec>
        <sec>
            <title>Software availability</title>
            <p>CoNet app page: 
                <ext-link ext-link-type="uri" xlink:href="http://apps.cytoscape.org/apps/conet">http://apps.cytoscape.org/apps/conet</ext-link>
			</p>
            <p>CoNet tool web page: 
                <ext-link ext-link-type="uri" xlink:href="http://systemsbiology.vub.ac.be/conet">http://systemsbiology.vub.ac.be/conet</ext-link>
			</p>
            <p>Latest source code: 
                <ext-link ext-link-type="uri" xlink:href="http://sourceforge.net/projects/conet/">http://sourceforge.net/projects/conet/</ext-link>
			</p>
            <p>Archived source code as at the time of publication: Zenodo, Biological network inference in Cytoscape, doi: 
                <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.55715">10.5281/zenodo.55715</ext-link> (
                <xref ref-type="bibr" rid="ref-8">Faust &amp; Raes, 2016</xref>)</p>
            <p>License: GNU General Public License version 2.0</p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgements</title>
            <p>We would like to thank Gipsi Lima-Mendez and other members of the Raes lab, as well as all users of the CoNet app that have sent us constructive feedback or error reports that helped to improve this app. We further are indebted to Fah Sathirapongsasuti, Curtis Huttenhower and Jean-S&#x00e9;bastien Lerat, who significantly contributed to the command line version of CoNet.</p>
        </ack>
        <sec id="SM" sec-type="supplementary-material">
            <title>Supplementary material</title>
            <p>Use case data in CoNet app: inference of biological association networks using Cytoscape.</p>
            <p>This file contains microbial count data, sample metadata, permutation settings and bootstrap settings associated with this submission. Description of each dataset is provided in the text file.</p>
            <p>
				
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/9050/7f963ab1-8969-4333-98f0-62060ffcbc8a.zip">Click here to access the data</ext-link>.</p>
        </sec>
        <ref-list>
            <ref id="ref-1">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Bork</surname>
                            <given-names>P</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bowler</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>de Vargas</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Tara Oceans. 
                        <italic toggle="yes">Tara</italic> Oceans studies plankton at planetary scale. Introduction.</article-title>
                    <source>
						
                        <italic toggle="yes">Science.</italic>
					</source>
                    <year>2015</year>;<volume>348</volume>(<issue>6237</issue>):<fpage>873</fpage>.
                    <pub-id pub-id-type="pmid">25999501</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.aac5605</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Caporaso</surname>
                            <given-names>JG</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kuczynski</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Stombaugh</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>QIIME allows analysis of high-throughput community sequencing data.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Methods.</italic>
					</source>
                    <year>2010</year>;<volume>7</volume>(<issue>5</issue>):<fpage>335</fpage>&#x2013;<lpage>336</lpage>.
                    <pub-id pub-id-type="pmid">20383131</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmeth.f.303</pub-id>
                    <pub-id pub-id-type="pmcid">3156573</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Chu</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Fierer</surname>
                            <given-names>N</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lauber</surname>
                            <given-names>CL</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Soil bacterial diversity in the Arctic is not fundamentally different from that found in other biomes.</article-title>
                    <source>
						
                        <italic toggle="yes">Environ Microbiol.</italic>
					</source>
                    <year>2010</year>;<volume>12</volume>(<issue>11</issue>):<fpage>2998</fpage>&#x2013;<lpage>3006</lpage>.
                    <pub-id pub-id-type="pmid">20561020</pub-id>
                    <pub-id pub-id-type="doi">10.1111/j.1462-2920.2010.02277.x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Connor</surname>
                            <given-names>EF</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Simberloff</surname>
                            <given-names>D</given-names>
                        </name>
					</person-group>:
                    <article-title>The Assembly of Species Communities: Chance or Competition?</article-title>
                    <source>
						
                        <italic toggle="yes">Ecology.</italic>
					</source>
                    <year>1979</year>;<volume>60</volume>(<issue>6</issue>):<fpage>1132</fpage>&#x2013;<lpage>1140</lpage>.
                    <pub-id pub-id-type="doi">10.2307/1936961</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Diamond</surname>
                            <given-names>JM</given-names>
                        </name>
					</person-group>:
                    <article-title>Assembly of species communities</article-title>. In
                    <italic toggle="yes">Ecology and evolution of communities</italic>. Cody M, Diamond JM eds., Harvard University Press,<year>1975</year>;<fpage>342</fpage>&#x2013;<lpage>444</lpage>.
                    <ext-link ext-link-type="uri" xlink:href="https://books.google.co.in/books?id=j_idbVxwzpQC&amp;pg=PA342">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Falony</surname>
                            <given-names>G</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Joossens</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Vieira-Silva</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Population-level analysis of gut microbiome variation.</article-title>
                    <source>
						
                        <italic toggle="yes">Science.</italic>
					</source>
                    <year>2016</year>;<volume>352</volume>(<issue>6285</issue>):<fpage>560</fpage>&#x2013;<lpage>564</lpage>.
                    <pub-id pub-id-type="pmid">27126039</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.aad3503</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Faust</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Sathirapongsasuti</surname>
                            <given-names>JF</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Izard</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Microbial co-occurrence relationships in the human microbiome.</article-title>
                    <source>
						
                        <italic toggle="yes">PLoS Comput Biol.</italic>
					</source>
                    <year>2012</year>;<volume>8</volume>(<issue>7</issue>):<fpage>e1002606</fpage>.
                    <pub-id pub-id-type="pmid">22807668</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002606</pub-id>
                    <pub-id pub-id-type="pmcid">3395616</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Faust</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Raes</surname>
                            <given-names>J</given-names>
                        </name>
					</person-group>:
                    <article-title>Biological network inference in Cytoscape.</article-title>
                    <source>
						
                        <italic toggle="yes">Zenodo.</italic>
					</source>
                    <year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.55715">Data Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Fierer</surname>
                            <given-names>N</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Jackson</surname>
                            <given-names>RB</given-names>
                        </name>
					</person-group>:
                    <article-title>The diversity and biogeography of soil bacterial communities.</article-title>
                    <source>
						
                        <italic toggle="yes">Proc Natl Acad Sci U S A.</italic>
					</source>
                    <year>2006</year>;<volume>103</volume>(<issue>3</issue>):<fpage>626</fpage>&#x2013;<lpage>631</lpage>.
                    <pub-id pub-id-type="pmid">16407148</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.0507535103</pub-id>
                    <pub-id pub-id-type="pmcid">1334650</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Gilbert</surname>
                            <given-names>JA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Jansson</surname>
                            <given-names>JK</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Knight</surname>
                            <given-names>R</given-names>
                        </name>
					</person-group>:
                    <article-title>The Earth Microbiome project: successes and aspirations.</article-title>
                    <source>
						
                        <italic toggle="yes">BMC Biol.</italic>
					</source>
                    <year>2014</year>;<volume>12</volume>:<fpage>69</fpage>.
                    <pub-id pub-id-type="pmid">25184604</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12915-014-0069-1</pub-id>
                    <pub-id pub-id-type="pmcid">4141107</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Gotelli</surname>
                            <given-names>NJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>McCabe</surname>
                            <given-names>DJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Species Co-Occurrence: A Meta-Analysis of J. M. Diamond's Assembly Rules Model.</article-title>
                    <source>
						
                        <italic toggle="yes">Ecology.</italic>
					</source>
                    <year>2002</year>;<volume>83</volume>(<issue>8</issue>):<fpage>2091</fpage>&#x2013;<lpage>2096</lpage>.
                    <pub-id pub-id-type="doi">10.2307/3072040</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Himsolt</surname>
                            <given-names>M</given-names>
                        </name>
					</person-group>:
                    <article-title>GML: A portable Graph File Format [Online]</article-title>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.fim.uni-passau.de/fileadmin/files/lehrstuhl/brandenburg/projekte/gml/gml-technical-report.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>Z</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Chang</surname>
                            <given-names>YC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>VisANT 4.0: Integrative network platform to connect genes, drugs, diseases and therapies.</article-title>
                    <source>
						
                        <italic toggle="yes">Nucleic Acids Res.</italic>
					</source>
                    <year>2013</year>;<volume>41</volume>(<issue>Web Server issue</issue>):<fpage>W225</fpage>&#x2013;<lpage>W231</lpage>.
                    <pub-id pub-id-type="pmid">23716640</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkt401</pub-id>
                    <pub-id pub-id-type="pmcid">3692070</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Ladau</surname>
                            <given-names>J</given-names>
                        </name>
					</person-group>:
                    <article-title>Lightweight, portable library for working with HDF5 BIOM files using Java [Online]</article-title>.
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/jladau/BiomIO">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>PH</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>D</given-names>
                        </name>
					</person-group>:
                    <article-title>Modularized learning of genetic interaction networks from biological annotations and mRNA expression data.</article-title>
                    <source>
						
                        <italic toggle="yes">Bioinformatics.</italic>
					</source>
                    <year>2005</year>;<volume>21</volume>(<issue>11</issue>):<fpage>2739</fpage>&#x2013;<lpage>2747</lpage>.
                    <pub-id pub-id-type="pmid">15797909</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/bti406</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Margolin</surname>
                            <given-names>AA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Nemenman</surname>
                            <given-names>I</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Basso</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context.</article-title>
                    <source>
						
                        <italic toggle="yes">BMC Bioinformatics.</italic>
					</source>
                    <year>2006</year>;<volume>7</volume>(<issue>Suppl 1</issue>):<fpage>S7</fpage>.
                    <pub-id pub-id-type="pmid">16723010</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-7-S1-S7</pub-id>
                    <pub-id pub-id-type="pmcid">1810318</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>McDonald</surname>
                            <given-names>D</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Clemente</surname>
                            <given-names>JC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kuczynski</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome.</article-title>
                    <source>
						
                        <italic toggle="yes">GigaScience.</italic>
					</source>
                    <year>2012</year>;<volume>1</volume>(<issue>1</issue>):<fpage>7</fpage>.
                    <pub-id pub-id-type="pmid">23587224</pub-id>
                    <pub-id pub-id-type="doi">10.1186/2047-217X-1-7</pub-id>
                    <pub-id pub-id-type="pmcid">3626512</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Meyer</surname>
                            <given-names>PE</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lafitte</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bontempi</surname>
                            <given-names>G</given-names>
                        </name>
					</person-group>:
                    <article-title>
                        <italic toggle="yes">minet</italic>: A R/Bioconductor package for inferring large transcriptional networks using mutual information.</article-title>
                    <source>
						
                        <italic toggle="yes">BMC Bioinformatics.</italic>
					</source>
                    <year>2008</year>;<volume>9</volume>:<fpage>461</fpage>.
                    <pub-id pub-id-type="pmid">18959772</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-9-461</pub-id>
                    <pub-id pub-id-type="pmcid">2630331</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <mixed-citation publication-type="journal">
                    <collab>The Human Microbiome Project Consortium</collab>:
                    <article-title>A framework for human microbiome research.</article-title>
                    <source>
						
                        <italic toggle="yes">Nature.</italic>
					</source>
                    <year>2012</year>;<volume>486</volume>(<issue>7402</issue>):<fpage>215</fpage>&#x2013;<lpage>221</lpage>.
                    <pub-id pub-id-type="pmid">22699610</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature11209</pub-id>
                    <pub-id pub-id-type="pmcid">3377744</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <mixed-citation publication-type="confproc">
                    <collab>The Qiita Development Team</collab>:
                    <article-title>Qiita: report of progress towards an open access microbiome data analysis and visualization platform</article-title>. In:
                    <italic toggle="yes">14th Python in Science Conference (SCIPY 2015)</italic>,<year>2015</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://conference.scipy.org/proceedings/scipy2015/pdfs/qiita_development_team.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report15254">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.9740.r15254</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Eiler</surname>
                        <given-names>Alexander</given-names>
                    </name>
                    <xref ref-type="aff" rid="r15254a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r15254a1">
                    <label>1</label>Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>12</day>
                <month>9</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Eiler A</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport15254" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9050.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>Be more precious, here. "The idea behind such an ensemble approach to network inference is to exploit the fact that different methods make different mistakes."&#x00a0;</p>
            <p> </p>
            <p> These are different statistical inferences so based on the underlaying algorithms results will be different. Same may be better suited for parametric or none-parametric data, some perform better with larger or smaller sample numbers. The different methods have also different statistical power do identify significances. Some may produce more false positives or false negatives than others. Some guidance and references to statistical literature could be provided in the article.</p>
            <p> </p>
            <p> I really liked to see an implementation that calculates false discovery rate (after Benjamin Hochberg) over all statistical comparisons.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2219-15254">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Faust</surname>
                            <given-names>Karoline</given-names>
                        </name>
                        <aff>VUB, Belgium</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>7</day>
                    <month>10</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>"These are different statistical inferences so based on the underlaying algorithms results will be different."</p>
                <p> Thanks for pointing this out. We have now added an overview table of the strengths and weaknesses of selected measures available in CoNet. We also added a paragraph that discusses the different ways in which these measures can be combined in CoNet.</p>
                <p> </p>
                <p> "I really liked to see an implementation that calculates false discovery rate (after Benjamin Hochberg) over all statistical comparisons."</p>
                <p> CoNet does allow computing false discovery rate over all statistical comparisons either by setting the number of initial edges sufficiently high or by setting the thresholds on the individual measures sufficiently low. We have added this remark to the article.</p>
                <p> Despite the fact that multiple testing correction is in most cases only applied after edges are discarded through initial filtering, CoNet is among the microbial network inference tools with the lowest false positive rates tested in Weiss et al., The ISME Journal 2016 (
                    <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/pubmed/26905627">https://www.ncbi.nlm.nih.gov/pubmed/26905627</ext-link>, supplementary Figure 10) .</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report14620">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.9740.r14620</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Wilmes</surname>
                        <given-names>Paul</given-names>
                    </name>
                    <xref ref-type="aff" rid="r14620a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <contrib contrib-type="author">
                    <name>
                        <surname>Heintz-Buschart</surname>
                        <given-names>Anna</given-names>
                    </name>
                    <xref ref-type="aff" rid="r14620a1">1</xref>
                    <role>Co-referee</role>
                </contrib>
                <aff id="r14620a1">
                    <label>1</label>Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg City, Luxembourg</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>13</day>
                <month>7</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Wilmes P and Heintz-Buschart A</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport14620" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9050.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The article describes a Cytoscape plugin &#x201c;CoNet app&#x201d; designed for the inference of networks from microbial abundance or incidence matrices. The effort combining a versatile network inference tool with a user-friendly and widely used network visualization and analysis framework, such as Cytoscape is very valuable to the community.</p>
            <p> </p>
            <p> I would suggest certain improvements to the article to make it in itself more valuable for potential users to judge the applicability of the plugin to their datasets.&#x00a0; &#x00a0;</p>
            <p> </p>
            <p> 
                <bold>Introduction:</bold> As the authors of the plugin are well aware (being co-authors of &#x201c;Correlation detection strategies in microbial data sets vary widely in sensitivity and precision&#x201d; 
                <sup>
                    <xref ref-type="bibr" rid="rep-ref-14620-1">1</xref>
                </sup>), co-abundance or co-occurrence analysis is an approach to ecological data interpretation that is not without caveats and as such, the article is lacking both mention of limitations of the approach and references to the successful use cases of earlier versions of CoNet. I suggest including both in the introduction.</p>
            <p> </p>
            <p> The introduction also does little to explain the approach to potential users who are not familiar with the concept. E.g. the sentence &#x201c;More specifically, co-occurrence analysis detects significant co-occurrences or mutual exclusions across samples, which are interpreted as representing ecological relationships such as mutualism or competition or being due to similar responses to environmental factors.&#x201d; mixes up observations and analyses with interpretation. Similarly, relating to the first sentence of the introduction, microbial count data are not obtained from relative abundances, but microbial counts taken to infer relative abundances (the sentence is also ambiguous as to what these abundances are relative to). Furthermore, the second but last sentence of the introduction &#x201c;The large number of microbial count tables resulting from the multitude of recent sequencing projects&#x2026;&#x201d; can be interpreted to advise for the co-analysis of results from different studies, which is most often not possible. These parts should be revised for clarity.</p>
            <p> </p>
            <p> 
                <bold>Methods/Implementation:</bold> More details on the algorithms would be useful, or alternatively references to other publications which describe CoNet, as relates to the following points: 
                <list list-type="bullet">
                    <list-item>
                        <p>&#x201c;its capability to combine multiple such measures and/or to combine these measures with other network inference algorithms&#x201d;,</p>
                    </list-item>
                    <list-item>
                        <p>&#x201c;CoNet can also parse sample metadata such as temperature or oxygen concentration, which are then correlated with the objects in the input matrix while being excluded from normalization.&#x201d; and</p>
                    </list-item>
                    <list-item>
                        <p>&#x201c;Phylogenetic lineages in these tables are automatically parsed&#x201d;. Also, what are positive and negative edges? How is mutual information integrated with measures which can be positive or negative?</p>
                    </list-item>
                </list> 
                <bold>Use case</bold>: It would be helpful to shortly describe the size of the dataset (number of OTUs and number of samples) as part of the sentence &#x201c;This data set was chosen for its sample number (sufficient to compute associations but short run times) as well as for the biological insights that are gained from the network analysis.&#x201d; A general advice on the required sample number and or relationship between numbers of analyzed features and sample numbers would also be helpful. In addition, are the 100 iterations performed in this example a realistic number of iterations to be used in such an analysis?</p>
            <p> </p>
            <p> The formulation &#x201c;The significance of edges, that is their p-values&#x201d; is a bit unfortunate. On a similar note, next to the permutations, is there a way in CoNet or the CoNet app to assess association strengths? An example of how the assessment of edge significance affects network size and structure would be informative.</p>
            <p> Figures: The large heading in Figure 1 should be removed. Figure 2 would benefit from a heading. The labels of figure 2 are not legible. It is unclear from the text and not mentioned in the legend, how the &#x201c;classes&#x201d; used for coloring nodes are defined. Are these classes in the taxonomic sense or different kinds of data? The color scheme for positive and negative edges should be explained. In panel C, the pH node should be more clearly pointed out.</p>
            <p> 
                <bold>Small comments:</bold>
            </p>
            <p> The referenced &#x201c;Brown 1975&#x201d; does not appear in the references.</p>
            <p> </p>
            <p> The capitalization of &#x201c;P-value&#x201d; is inconsistent.</p>
            <p> </p>
            <p> As the buttons in the app are actually called that, refer to &#x201c;Data menu&#x201d;, &#x201c;Preprocessing and filter menu&#x201d; etc.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however we have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-14620-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Correlation detection strategies in microbial data sets vary widely in sensitivity and precision.</article-title>
                        <source>
                            <italic>ISME J</italic>
                        </source>.<year>2016</year>;<volume>10</volume>(<issue>7</issue>) :
                        <elocation-id>10.1038/ismej.2015.235</elocation-id>
                        <fpage>1669</fpage>-<lpage>81</lpage>
                        <pub-id pub-id-type="pmid">26905627</pub-id>
                        <pub-id pub-id-type="doi">10.1038/ismej.2015.235</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment2220-14620">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Faust</surname>
                            <given-names>Karoline</given-names>
                        </name>
                        <aff>VUB, Belgium</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>7</day>
                    <month>10</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>"The effort combining a versatile network inference tool with a user-friendly and widely used network visualization and analysis framework, such as Cytoscape is very valuable to the community."</p>
                <p> We would like to thank the reviewer for this appreciation of our work.</p>
                <p> </p>
                <p> 
                    <bold>Introduction</bold>
                </p>
                <p> In response to the reviewers' comments, we have rewritten the introduction, thereby rephrasing problematic sentences, pointing out limitations of microbial network inference and citing the evaluation. We also added a paragraph in the discussion to mention applications of CoNet.</p>
                <p> </p>
                <p> 
                    <bold>Methods/Implementation</bold> 
                    <list list-type="bullet">
                        <list-item>
                            <p>&#x201c;its capability to combine multiple such measures and/or to combine these measures with other network inference algorithms&#x201d;,</p>
                        </list-item>
                    </list> We included an overview table comparing selected measures of association. We also added a paragraph on how measures can be combined in CoNet. 
                    <list list-type="bullet">
                        <list-item>
                            <p>&#x201c;CoNet can also parse sample metadata such as temperature or oxygen concentration, which are then correlated with the objects in the input matrix while being excluded from normalization.&#x201d;</p>
                        </list-item>
                    </list> We improved this explanation of CoNet's treatment of sample metadata. 
                    <list list-type="bullet">
                        <list-item>
                            <p>&#x201c;Phylogenetic lineages in these tables are automatically parsed&#x201d;.</p>
                        </list-item>
                    </list> We provided an example to better explain what we mean. 
                    <list list-type="bullet">
                        <list-item>
                            <p>Also, what are positive and negative edges? How is mutual information integrated with measures which can be positive or negative?</p>
                        </list-item>
                    </list> We added an explanation.</p>
                <p> </p>
                <p> 
                    <bold>Use case</bold>
                </p>
                <p> </p>
                <p> The OTU number was added to the following sentence (which already listed the sample number):</p>
                <p> 
                    <italic>This data set consists of 4,022 operating taxonomic units and 52 soil samples from the arctic tundra, which were sequenced with Roche FLX using primers targeting the V1V2 region of the 16S rDNA.</italic>
                </p>
                <p> </p>
                <p> "A general advice on the required sample number and or relationship between numbers of analyzed features and sample numbers would also be helpful."</p>
                <p> In general, the number of false positives increases with decreasing sample number. While assessment of significance counter-balances this effect, it is unreasonable to compute a correlation from a few observations only, even if it is strongly significant. However, we cannot provide a formula to compute where exactly to put the cut-off.</p>
                <p> </p>
                <p> "In addition, are the 100 iterations performed in this example a realistic number of iterations to be used in such an analysis?"</p>
                <p> </p>
                <p> We saw previously that there is no big difference between networks computed with 100 or 1000 iterations. The reason is that we are not computing p-values from a pure permutation test, where small p-values can only be reached by performing a sufficient number of iterations. Instead, we compute the p-value parametrically as the mean of the permutation distribution under the bootstrap distribution. Estimating the mean and standard deviation of normal distributions is less sensitive to iteration number than computing parameter-free p-values. We added this explanation to the text.</p>
                <p> </p>
                <p> "The formulation &#x201c;The significance of edges, that is their p-values&#x201d; is a bit unfortunate. On a similar note, next to the permutations, is there a way in CoNet or the CoNet app to assess association strengths? An example of how the assessment of edge significance affects network size and structure would be informative."</p>
                <p> The p-value is an assessment of association strength. So are the scores of the measures themselves, e.g. Pearson's r and Spearman's rho, which are correlated with the p-value. We have added a remark explaining this to the text.</p>
                <p> Assessing the significance usually discards edges from the initial network, in some cases even removing all initial edges. The number of edges removed depends on the initially selected thresholds. In the use case, the initial network consists of 10000 edges, 1546 of which remain after assessment of significance and merging of measure-specific p-values into a single p-value. The exact edge number in the final network may vary slightly from run to run, due to variations in the permutation and bootstrap distributions.</p>
                <p> </p>
                <p> "Figures: The large heading in Figure 1 should be removed."</p>
                <p> This heading was not intended as a Figure heading but as a heading to divide the text. We improved the layout and added headers to Figure 1 and 2.</p>
                <p> "Figure 2 would benefit from a heading. The labels of figure 2 are not legible."</p>
                <p> Our aim here was to show the networks as obtained with the CoNet app when executing the use case, but we understand the point of the reviewer. As a compromise, we have now removed the labels and added a class-level color code.</p>
                <p> "It is unclear from the text and not mentioned in the legend, how the &#x201c;classes&#x201d; used for coloring nodes are defined. Are these classes in the taxonomic sense or different kinds of data?"</p>
                <p> These are taxonomic classes. We clarified this in the caption of Figure 2.</p>
                <p> "The color scheme for positive and negative edges should be explained."</p>
                <p> We added an explanation of the color scheme in the main text and to the caption of Figure 2.</p>
                <p> "In panel C, the pH node should be more clearly pointed out."</p>
                <p> The pH node stands out by differing in shape from the taxon nodes. We have clarified this by adding a legend to Figure 2.</p>
                <p> </p>
                <p> 
                    <bold>Small comments:</bold>
                </p>
                <p> "The referenced &#x201c;Brown 1975&#x201d; does not appear in the references."</p>
                <p> </p>
                <p> We excuse for this oversight. We have added the reference.</p>
                <p> </p>
                <p> "The capitalization of &#x201c;P-value&#x201d; is inconsistent."</p>
                <p> We now use p-value with a lower case p, unless it is the first word of a new sentence, where we use the upper case P.</p>
                <p> </p>
                <p> "As the buttons in the app are actually called that, refer to &#x201c;Data menu&#x201d;, &#x201c;Preprocessing and filter menu&#x201d; etc. "</p>
                <p> Done</p>
            </body>
        </sub-article>
    </sub-article>
</article>
