<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.168305.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>ggEDA: Visualisations for exploratory data analysis using tiled one-dimensional graphics and parallel coordinate plots</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>El-Kamand</surname>
                        <given-names>Sam</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2270-8088</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Quinn</surname>
                        <given-names>Julian M.W.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Cowley</surname>
                        <given-names>Mark J.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Computational Biology, Children's Cancer Institute Australia, Sydney, New South Wales, 2052, Australia</aff>
                <aff id="a2">
                    <label>2</label>School of Clinical Medicine, University of New South Wales, Sydney, New South Wales, 2052, Australia</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:selkamand@ccia.org.au">selkamand@ccia.org.au</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>13</day>
                <month>11</month>
                <year>2025</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2025</year>
            </pub-date>
            <volume>14</volume>
            <elocation-id>1248</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>7</day>
                    <month>11</month>
                    <year>2025</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 El-Kamand S et al.</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/14-1248/pdf"/>
            <abstract>
                <p>Exploratory data analysis (EDA) involves summarising trends within a dataset to help uncover data quality issues and generate hypotheses. However, identifying relationships between multiple features often requires extensive coding, manual inspection and statistical modelling. Here, we introduce the ggEDA R package, which streamlines multidimensional data exploration by providing two turnkey and complementary visualisation strategies. ggEDA generates interactive parallel coordinate plots (PCPs) well suited for examining large datasets with mostly quantitative features, and introduces tiled one-dimensional plots that more effectively show missingness and reveal categorical relationships in smaller datasets. ggEDA reduces the amount of code and time required to detect multi-feature relationships that may otherwise require statistical modelling or thorough manual review to identify. To make ggEDA visualisations accessible to a wider audience we also developed interactiveEDA, a web app that enables non-programmers to explore and interpret data patterns interactively. ggEDA and interactiveEDA are available at 
                    <uri xlink:href="https://github.com/CCICB/ggEDA">https://github.com/CCICB/ggEDA</uri> and 
                    <uri xlink:href="https://github.com/CCICB/interactiveEDA">https://github.com/CCICB/interactiveEDA</uri> respectively.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>R</kwd>
                <kwd>visualisation</kwd>
                <kwd>exploratory data analysis</kwd>
                <kwd>multidimensional</kwd>
                <kwd>parallel coordinate plots</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="https://doi.org/10.13039/501100001111">
                    <funding-source>Cancer Australia</funding-source>
                    <award-id>1165556</award-id>
                </award-group>
                <award-group id="fund-2">
                    <funding-source>Australian Medical Research Future Fund (MRFF Emerging Priorities and Consumer-Driven Research Initiative)</funding-source>
                </award-group>
                <award-group id="fund-3">
                    <funding-source>Luminesce Alliance &#x2013; Innovation for Children&#x2019;s Health</funding-source>
                </award-group>
                <award-group id="fund-4">
                    <funding-source>My Room Children&#x2019;s Cancer Charity</funding-source>
                </award-group>
                <funding-statement>We acknowledge support from Cancer Australia (grant 1165556) and My Room Children&#x2019;s Cancer Charity, as well as the Australian Medical Research Future Fund (MRFF Emerging Priorities and Consumer-Driven Research Initiative)&#13;
&#13;
This work was also supported by Luminesce Alliance &#x2013; Innovation for Children&#x2019;s Health. Luminesce Alliance is a not- for-profit cooperative joint venture between the Sydney Children&#x2019;s Hospitals Network, the Children&#x2019;s Medical Research Institute, the Children&#x2019;s Cancer Institute, the University of Sydney, and the UNSW Sydney. It has been established with the support of the NSW Government to coordinate and integrate pediatric research.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>Exploratory data analysis (EDA) reveals relationships between data features, informing hypothesis generation and downstream analyses. It can also identify data-quality issues such as missingness, bias, and unexpected distribution structure. The R ecosystem already includes popular EDA packages such as 
                <bold>skimr</bold>, which textually summarises completeness and descriptive statistics for individual features (1-dimensional), and 
                <bold>GGally</bold>, which graphically describes pairwise feature correlations (2-dimensional) or multi-feature relationships through PCPs (n-dimensional). 
                <bold>ggEDA</bold> enhances this ecosystem by providing interactive versions of standard n-dimensional visualisations like PCPs and introducing tiled one-dimensional visualisations that more effectively show missingness and relationships between categorical features in smaller datasets. Together, these visualisations provide key advantages over other EDA packages, most notably an ability to reveal a greater variety of multidimensional patterns (
                <xref ref-type="fig" rid="f1">
Figure 1</xref>).</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>
Figure 1. </label>
                <caption>
                    <title>Comparison of R packages that create visualisations commonly used for exploratory data analysis, including ComplexHeatmap,
                        <sup>
                            <xref ref-type="bibr" rid="ref1">1</xref>
                        </sup> Data Explorer,
                        <sup>
                            <xref ref-type="bibr" rid="ref2">2</xref>
                        </sup> skimr,
                        <sup>
                            <xref ref-type="bibr" rid="ref3">3</xref>
                        </sup> GGally
                        <sup>
                            <xref ref-type="bibr" rid="ref4">4</xref>
                        </sup> and ggpcp.
                        <sup>
                            <xref ref-type="bibr" rid="ref5">5</xref>
                        </sup>
                    </title>
                    <p>Due to documented reproducibility issues, ggpcp features could not be verified first-hand.</p>
                </caption>
                <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185479/92c3d683-28d8-4f99-8d3d-4f0618fb5bee_figure1.gif"/>
            </fig>
        </sec>
        <sec id="sec2" sec-type="methods">
            <title>Methods</title>
            <sec id="sec3">
                <title>Implementation</title>
                <p>ggEDA is implemented as a standard R package and published on CRAN and the R-universe. The interactiveEDA web app was written using the shiny framework
                    <sup>
                        <xref ref-type="bibr" rid="ref6">6</xref>
                    </sup> and takes ggEDA as a dependency to separate the user-interface codebase from the underlying business logic, which is easier to test. interactiveEDA is compiled into a purely client-side web-assembly app using shinylive
                    <sup>
                        <xref ref-type="bibr" rid="ref7">7</xref>
                    </sup> and hosted as a static web-page on GitHub Pages. Code to produce visualisations is run in the client&#x2019;s browser instead of a third party server outside the direct control of end-users. The distributed nature of compute also provides scaling benefits compared to traditional server-side shiny apps that quickly slow as concurrent users grow. These security and scalability benefits do come at the cost of slower application startup time.</p>
            </sec>
            <sec id="sec4">
                <title>Operation</title>
                <p>The ggEDA R package can be installed from CRAN (
                    <monospace>

                        <styled-content style="color:#0000FF">install.packages</styled-content>(
                        <styled-content style="color:#BA2121">&#x201c;ggEDA&#x201d;</styled-content>)</monospace>). It is compatible with Mac OS X, Windows, and all Unix-like operating systems where R (&#x2265;3.5.0) can be installed. Package dependencies are described on the 
                    <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/web/packages/ggEDA/index.html">

                        <styled-content style="color:#CC6638">ggEDA CRAN listing</styled-content>
</ext-link>.</p>
                <p>Data can be explored using the interactiveEDA web app in any modern browser that supports WebAssembly. We performed the most extensive testing in Chrome (version 137.0.7151.120), however the app is also compatible with Firefox, Safari and Microsoft Edge.</p>
            </sec>
        </sec>
        <sec id="sec5">
            <title>Use cases</title>
            <p>To demonstrate how ggEDA and interactiveEDA support exploratory data analysis, we present a series of use cases that highlight their capabilities in visualising multidimensional datasets.</p>
            <sec id="sec6">
                <title>Creating parallel coordinate plots</title>
                <p>PCPs are a well-established EDA visualisation that reveal trends in predominantly quantitative datasets and detect outliers in one or more dimensions. Quantitative features are represented as a series of parallel axes with samples visualised as lines passing through each axis at the point of its value. Correlative relationships are revealed when feature axes are ordered based on line crossing minimisation algorithms or mutual information with a categorical feature. ggEDA can produce interactive PCPs from any dataset with quantitative features using the 
                    <monospace>

                        <styled-content style="color:#0000FF">ggparallel</styled-content>(data)</monospace> command (
                    <xref ref-type="fig" rid="f2">
Figure 2</xref>).</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>
Figure 2. </label>
                    <caption>
                        <title>

                            <bold>ggEDA</bold> parallel coordinate plots of the dry beans imaging dataset.
                            <sup>
                                <xref ref-type="bibr" rid="ref9">9</xref>
                            </sup>
                        </title>
                        <p>A) Visualising 16 morphological features of 13,611 grains from common dry bean species reveals clear correlations amongst size-related attributes (Area, Perimeter and Axis Length). Bombay beans were the largest, most convex variety; B) Highlighting a single subclass simplifies both comparison against the full cohort and identification of within-class outliers. For example, Dermason beans (red) are smaller in size than other varieties. One Dermason bean grain had unusually low roundness, highly atypical for this variety.</p>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185479/92c3d683-28d8-4f99-8d3d-4f0618fb5bee_figure2.gif"/>
                </fig>
                <p>PCPs scale well with large datasets but have several limitations. Visualising the relationships between multiple categorical variables is challenging. Missing data is also difficult to meaningfully represent. For this reason, ggEDA introduces a complementary visualisation composed of vertically aligned tile and bar plots.</p>
            </sec>
            <sec id="sec7">
                <title>Creating tiled one-dimensional graphics</title>
                <p>For small datasets (n &lt; 1000), ggEDA can represent features as distinct, vertically aligned bar or tile plots, with plot types auto-selected based on whether variables are categorical or numeric (
                    <xref ref-type="fig" rid="f3">
Figure 3</xref>).</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>
Figure 3. </label>
                    <caption>
                        <title>

                            <bold>ggEDA</bold> visualisations of common datasets revealing: A) Petals of the 
                            <italic toggle="yes">setosa</italic> species of iris are drastically smaller than other iris species; B) The majority of individuals who perished during the Titanic disaster were adult males; C) 
                            <italic toggle="yes">Gentoo</italic> penguins from Biscoe Island have shallower bill depths than 
                            <italic toggle="yes">Chinstrap</italic> or 
                            <italic toggle="yes">Adelie</italic> penguins, despite their increased body mass.</title>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185479/92c3d683-28d8-4f99-8d3d-4f0618fb5bee_figure3.gif"/>
                </fig>
            </sec>
            <sec id="sec8">
                <title>Identifying complex multidimensional patterns</title>
                <p>To demonstrate the advantages of 
                    <bold>ggEDA</bold>, we created the artificial 
                    <italic toggle="yes">Lazy Birdwatcher</italic> dataset. It describes magpie observations by two birdwatchers, one of whom routinely skips birdwatching on weekends. This introduces a missing data pattern dependent on both the birdwatcher and day of the week. The multidimensional pattern becomes immediately apparent from 
                    <bold>ggEDA</bold> stacked tile plots despite being difficult to detect using one-dimensional EDA tools like 
                    <bold>skimr</bold>, or two-dimensional tools like 
                    <bold>ggpairs</bold> from the 
                    <bold>GGally</bold> package (
                    <xref ref-type="fig" rid="f4">
Figure 4</xref>). Despite being n-dimensional, all PCP plot implementations in R also fail to uncover this trend due to either exclusion of missing data or inability to represent clearly the relationships between categorical features.</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>
Figure 4. </label>
                    <caption>
                        <title>Visualisation of the Lazy Birdwatcher dataset using the 
                            <bold>ggEDA</bold> package reveals a pattern of missingness (indicated by exclamation marks) dependent on multiple variables, Birdwatcher and Day (A).</title>
                        <p>This pattern is difficult to detect using one-dimensional EDA tools like 
                            <bold>skimr</bold> (B) or two-dimensional tools like ggpairs from the 
                            <bold>GGally</bold> package (C).</p>
                    </caption>
                    <graphic id="gr4" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185479/92c3d683-28d8-4f99-8d3d-4f0618fb5bee_figure4.gif"/>
                </fig>
            </sec>
            <sec id="sec9">
                <title>Exploring datasets using the interactiveEDA web-app
</title>
                <p>Despite the advancements provided by ggEDA and other tools in the R ecosystem, a key limitation remains: accessibility for non-programmers, particularly when visualising n-dimensional data. All existing R implementations lack graphical user interfaces (
                    <xref ref-type="fig" rid="f1">
Figure 1</xref>). While shiny web apps offer a potential solution, they often require uploading datasets to external servers, raising privacy concerns. To address these limitations, we developed interactiveEDA, a web-assembly compiled client-side web app for secure, interactive data exploration (
                    <xref ref-type="fig" rid="f5">
Figure 5</xref>). Operating entirely in the browser, interactiveEDA ensures data remains on the user&#x2019;s machine, increasing ease of use without compromising data privacy. interactiveEDA is available at 
                    <ext-link ext-link-type="uri" xlink:href="https://github.com/CCICB/interactiveEDA">https://github.com/CCICB/interactiveEDA</ext-link>.</p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>
Figure 5. </label>
                    <caption>
                        <title>Screenshot of 
                            <bold>interactiveEDA</bold>, a web-app providing a graphical user interface for code-free generation of 
                            <bold>ggEDA</bold> visualisations.</title>
                    </caption>
                    <graphic id="gr5" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185479/92c3d683-28d8-4f99-8d3d-4f0618fb5bee_figure5.gif"/>
                </fig>
            </sec>
        </sec>
        <sec id="sec10">
            <title>Summary</title>
            <p>ggEDA provides two complementary visualisation strategies for exploratory data analysis: interactive parallel coordinate plots for high-dimensional quantitative data and tiled one-dimensional graphics for exploring missingness and categorical relationships in smaller datasets. These tools help uncover complex patterns and data quality issues with minimal coding. For users without programming experience, the same visualisations are available through the interactiveEDA web app.</p>
        </sec>
        <sec id="sec11">
            <title>Software availability</title>
            <p>

                <bold>ggEDA:</bold>

                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Install using 
                            <styled-content style="color:#0000FF">install.packages</styled-content>(
                            <styled-content style="color:#BA2121">&#x201c;ggEDA&#x201d;</styled-content>).</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Available from CRAN: 
                            <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/web/packages/ggEDA/">https://cran.r-project.org/web/packages/ggEDA/</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Source code: 
                            <ext-link ext-link-type="uri" xlink:href="https://github.com/CCICB/ggEDA">https://github.com/CCICB/ggEDA</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Archived release: 
                            <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.17290896">https://doi.org/10.5281/zenodo.17290896</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>License: 
                            <ext-link ext-link-type="uri" xlink:href="https://github.com/CCICB/ggEDA/blob/main/LICENSE.md">

                                <styled-content style="color:#CC6638">MIT</styled-content>
</ext-link>
                        </p>
                    </list-item>
                </list>
            </p>
            <p>

                <bold>interactiveEDA:</bold>

                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Available at: 
                            <ext-link ext-link-type="uri" xlink:href="https://ccicb.github.io/interactiveEDA/">https://ccicb.github.io/interactiveEDA/</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Source code: 
                            <ext-link ext-link-type="uri" xlink:href="https://github.com/CCICB/interactiveEDA">https://github.com/CCICB/interactiveEDA</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Archived release: 
                            <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.17290912">https://doi.org/10.5281/zenodo.17290912</ext-link>
                        </p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>License: 
                            <ext-link ext-link-type="uri" xlink:href="https://github.com/CCICB/interactiveEDA/blob/main/LICENSE.md">

                                <styled-content style="color:#CC6638">MIT</styled-content>
</ext-link>
                        </p>
                    </list-item>
                </list>
            </p>
        </sec>
    </body>
    <back>
        <sec id="sec12" sec-type="data-availability">
            <title>Data availability</title>
            <p>Figshare. DryBeans. 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.29614133.v3">https://doi.org/10.6084/m9.figshare.29614133.v3</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref8">8</xref>
                </sup>
            </p>
            <p>This project contains the following underlying data:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>dry_beans.csv: Sourced from the UCI Machine Learning Repository.
                            <sup>
                                <xref ref-type="bibr" rid="ref10">10</xref>
                            </sup> Originally published by Koklu and &#x00d6;zkan in 2020.
                            <sup>
                                <xref ref-type="bibr" rid="ref9">9</xref>
                            </sup> A random subsample (n = 1000) is packaged with ggEDA (
                            <monospace>ggEDA::minibeans</monospace>). Used in 
                            <xref ref-type="fig" rid="f2">
Figure 2</xref>.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Mini beans.csv</p>
                    </list-item>
                </list>
            </p>
            <p>Data is available under the terms of the 
                <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY 4.0</ext-link> license.</p>
            <p>Figshare. ggEDA. 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.30350887.v2">https://doi.org/10.6084/m9.figshare.30350887.v2</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup>
            </p>
            <p>This project contains the following underlying data:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>iris.csv: Included with base R. Originally published by Anderson in 1935.
                            <sup>
                                <xref ref-type="bibr" rid="ref12">12</xref>
                            </sup> Used in 
                            <xref ref-type="fig" rid="f3">
Figure 3</xref>.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>titanic.raw.csv: Loaded from the 
                            <monospace>datarium</monospace> R package.
                            <sup>
                                <xref ref-type="bibr" rid="ref13">13</xref>
                            </sup> Originally published by the British Board of Trade in 1990.
                            <sup>
                                <xref ref-type="bibr" rid="ref14">14</xref>
                            </sup> Used in 
                            <xref ref-type="fig" rid="f3">
Figure 3</xref>.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>penguins.csv: Loaded from the 
                            <monospace>palmerpenguins</monospace> R package.
                            <sup>
                                <xref ref-type="bibr" rid="ref15">15</xref>
                            </sup> Originally published by Gorman 
                            <italic toggle="yes">et al.</italic> in 2014.
                            <sup>
                                <xref ref-type="bibr" rid="ref16">16</xref>
                            </sup> Used in 
                            <xref ref-type="fig" rid="f3">
Figure 3</xref>.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>lazy_birdwatcher.csv: Artificial dataset bundled with the 
                            <monospace>ggEDA</monospace> R package (
                            <monospace>ggEDA::lazy_birdwatcher</monospace>). Used in 
                            <xref ref-type="fig" rid="f4">
Figure 4</xref>.</p>
                    </list-item>
                </list>
            </p>
            <p>Data is available under the terms of the 
                <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/publicdomain/zero/1.0/">CC0 license</ext-link>.</p>
        </sec>
        <ack>
            <title>Acknowledgements</title>
            <p>We thank the developers of the packages integral to ggEDA, especially David Gohel for ggiraph,
                <sup>
                    <xref ref-type="bibr" rid="ref17">17</xref>
                </sup> which enables its interactivity, and Thomas Lin Pedersen for patchwork
                <sup>
                    <xref ref-type="bibr" rid="ref18">18</xref>
                </sup> and ggplot2 maintenance. We also acknowledge Hadley Wickham and all contributors to ggplot2.
                <sup>
                    <xref ref-type="bibr" rid="ref19">19</xref>
                </sup> The ggEDA graphical user interface (EDA) was made possible thanks to creators and maintainers of shiny,
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup> shinylive
                <sup>
                    <xref ref-type="bibr" rid="ref7">7</xref>
                </sup> and webR.
                <sup>
                    <xref ref-type="bibr" rid="ref20">20</xref>
                </sup>
            </p>
            <p>We thank the Australian BioCommons for advice and research computing support.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gu</surname>
                            <given-names>Z</given-names>
                        </name>
</person-group>:
                    <article-title>Complex heatmap visualization.</article-title>
                    <source>

                        <italic toggle="yes">iMeta.</italic>
</source>
                    <year>2022</year>;<volume>1</volume>(<issue>3</issue>):<fpage>e43</fpage>.
                    <pub-id pub-id-type="pmid">38868715</pub-id>
                    <pub-id pub-id-type="doi">10.1002/imt2.43</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10989952</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cui</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>DataExplorer: Automate Data Exploration and Treatment.</article-title>
                    <source>

                        <italic toggle="yes">R package version 0.8.3.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.DataExplorer</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=DataExplorer">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Waring</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Quinn</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>McNamara</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>
                        <italic toggle="yes">skimr:</italic> Compact and Flexible Summaries of Data.</article-title>
                    <source>

                        <italic toggle="yes">R package version 2.1.5.</italic>
</source>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.skimr</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=skimr">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Schloerke</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cook</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Larmarange</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>GGally: Extension to&#x2019;ggplot2&#x2019;.</article-title>
                    <source>

                        <italic toggle="yes">R package version 2.2.1.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.GGally</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=GGally">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>VanderPlas</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ge</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Unwin</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Penguins Go Parallel: A Grammar of Graphics Framework for Generalized Parallel Coordinate Plots.</article-title>
                    <source>

                        <italic toggle="yes">J. Comput. Graph. Stat.</italic>
</source>
                    <year>2023</year>;<volume>32</volume>(<issue>4</issue>):<fpage>1572</fpage>&#x2013;<lpage>1587</lpage>.
                    <pub-id pub-id-type="doi">10.1080/10618600.2023.2195462</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chang</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Allaire</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>shiny: Web Application Framework for R.</article-title>
                    <source>

                        <italic toggle="yes">R package version 1.9.1.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.shiny</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=shiny">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Schloerke</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chang</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stagg</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>shinylive: Run&#x2019;shiny&#x2019; Applications in the Browser.</article-title>
                    <source>

                        <italic toggle="yes">R package version 0.3.0.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.shinylive</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=shinylive">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>El-Kamand</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Dry Beans.</article-title>
                    <source>

                        <italic toggle="yes">Figshare.</italic>
</source>
                    <year>2025</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.29614133.v3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Koklu</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>&#x00d6;zkan</surname>
                            <given-names>IA</given-names>
                        </name>
</person-group>:
                    <article-title>Multiclass classification of dry beans using computer vision and machine learning techniques.</article-title>
                    <source>

                        <italic toggle="yes">Comput. Electron. Agric.</italic>
</source>
                    <year>2020</year>;<volume>174</volume>:<fpage>105507</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.compag.2020.105507</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="other">
                    <collab>Dry Bean</collab>:
                    <article-title>UCI Machine Learning Repository.</article-title>
                    <pub-id pub-id-type="doi">10.24432/C50S4B.2020</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>El-Kamand</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>ggEDA Datasets.</article-title>
                    <source>

                        <italic toggle="yes">Figshare.</italic>
</source>
                    <year>2025</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.30350887.v2</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Anderson</surname>
                            <given-names>E</given-names>
                        </name>
</person-group>:
                    <article-title>The Irises of the Gasp&#x00e9; Peninsula.</article-title>
                    <source>

                        <italic toggle="yes">Bulletin of the American Iris Society.</italic>
</source>
                    <year>1935</year>;<volume>59</volume>:<fpage>2</fpage>&#x2013;<lpage>5</lpage>.</mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kassambara</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>datarium: Data Bank for Statistical Analysis and Visualization.</article-title>
                    <source>

                        <italic toggle="yes">R package version 0.1.0.</italic>
</source>
                    <year>2019</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.datarium</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=datarium">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="book">
                    <collab>British Board of Trade</collab>:
                    <source>

                        <italic toggle="yes">Report on the Loss of the&#x2019;Titanic&#x2019; (S.S.) British Board of Trade Inquiry Report (reprint).</italic>
</source>
                    <publisher-loc>Gloucester, UK</publisher-loc>:
                    <publisher-name>Allan Sutton Publishing</publisher-name>;<year>1990</year>.</mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Horst</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hill</surname>
                            <given-names>AP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gorman</surname>
                            <given-names>KB</given-names>
                        </name>
</person-group>:
                    <article-title>palmerpenguins: Palmer Archipelago (Antarctica) penguin data.</article-title>
                    <source>

                        <italic toggle="yes">R package version 0.1.0.</italic>
</source>
                    <year>2020</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.3960218</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://allisonhorst.github.io/palmerpenguins/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gorman</surname>
                            <given-names>KB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Williams</surname>
                            <given-names>TD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fraser</surname>
                            <given-names>WR</given-names>
                        </name>
</person-group>:
                    <article-title>Ecological Sexual Dimorphism and Environmental Variability within a Community of Antarctic Penguins (Genus Pygoscelis).</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2014</year>;<volume>9</volume>(<issue>3</issue>):<fpage>e90081</fpage>&#x2013;<lpage>e90014</lpage>.
                    <pub-id pub-id-type="pmid">24599330</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0090081</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3943793</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <label>17</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gohel</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Skintzos</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>ggiraph: Make&#x2019;ggplot2&#x2019; Graphics Interactive.</article-title>
                    <source>

                        <italic toggle="yes">R package version 0.8.10.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.ggiraph</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://davidgohel.github.io/ggiraph/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <label>18</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Pedersen</surname>
                            <given-names>TL</given-names>
                        </name>
</person-group>:
                    <article-title>patchwork: The Composer of Plots.</article-title>
                    <source>

                        <italic toggle="yes">R package version 1.2.0.9000.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.32614/cran.package.patchwork</pub-id>
                    <ext-link ext-link-type="uri" xlink:href="https://patchwork.data-imaginist.com">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <label>19</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wickham</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">ggplot2: Elegant Graphics for Data Analysis.</italic>
</source>
                    <publisher-loc>New York</publisher-loc>:
                    <publisher-name>Springer-Verlag</publisher-name>;<year>2016</year>.
                    <pub-id pub-id-type="doi">10.32614/CRAN.package.ggplot2</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <label>20</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Stagg</surname>
                            <given-names>GW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lionel</surname>
                            <given-names>H</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>webR: The statistical language R compiled to WebAssembly via Emscripten.</article-title>
                    <year>2023</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://docs.r-wasm.org/webr/latest/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report473463">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.185479.r473463</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>de Oliveira</surname>
                        <given-names>Maria Cristina Ferreira</given-names>
                    </name>
                    <xref ref-type="aff" rid="r473463a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-4729-5104</uri>
                </contrib>
                <aff id="r473463a1">
                    <label>1</label>Computer Science, Universidade de Sao Paulo, S&#x00e3;o Carlos, State of S&#x00e3;o Paulo, Brazil</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>20</day>
                <month>4</month>
                <year>2026</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 de Oliveira MCF</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport473463" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.168305.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The paper introduces a novel exploratory data analysis package for the R ecosystem. The package, named ggEDA, that incorporates two interactive visualizations for multidimensional data: tiled 1-dimensional bar plots and parallel coordinate plots.&#x00a0;</p>
            <p> </p>
            <p> Implementation details and installation instructions are provided in the text.&#x00a0; Use cases for typical datasets are provided to illustrate the techniques' use.&#x00a0; A web-based application is also provided for non-programmers.&#x00a0;</p>
            <p> </p>
            <p> The contribution nicely complements the facilities already available in R, and the authors clearly express the advantages and limitations, e.g., regarding the scalability of the techniques.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>computer science, data visualization, data mining.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report472021">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.185479.r472021</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Vel&#x00e1;squez-Zapata</surname>
                        <given-names>Valeria</given-names>
                    </name>
                    <xref ref-type="aff" rid="r472021a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r472021a1">
                    <label>1</label>GreenLight Biosciences, Durham, NC, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>10</day>
                <month>4</month>
                <year>2026</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Vel&#x00e1;squez-Zapata V</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport472021" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.168305.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The article introduces ggEDA, an R package designed to streamline exploratory data analysis (EDA) by providing two complementary visualization strategies: interactive parallel coordinate plots (PCPs) and tiled one-dimensional graphics. By automating complex visualizations, ggEDA helps users analyze multidimensional patterns and data quality issues that might otherwise require extensive manual review or statistical modeling.</p>
            <p> </p>
            <p> To increase accessibility for non-programmers, the authors developed interactiveEDA, a web-based application built using the a shiny framework. Overall, ggEDA and interactiveEDA together provide a powerful and user-friendly suite for multidimensional data exploration. The authors also provide example datasets so users can explore the functions.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Bioinformatics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
