<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.117334.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Case Study</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>MRI data harmonization across sites using ComBat enhances classification of meningioma and glioma brain-tumors in dogs: a case study</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Nandy</surname>
                        <given-names>Debmalya</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-1640-6895</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Yang</surname>
                        <given-names>Xinyi</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Jin</surname>
                        <given-names>Xin</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Griffin</surname>
                        <given-names>Lynn</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no" equal-contrib="yes">
                    <name>
                        <surname>Kechris</surname>
                        <given-names>Katerina</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no" equal-contrib="yes">
                    <name>
                        <surname>Xing</surname>
                        <given-names>Fuyong</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Biostatistics &amp; Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, Colorado, 80045, USA</aff>
                <aff id="a2">
                    <label>2</label>Center for Innovative Design &amp; Analysis, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, Colorado, 80045, USA</aff>
                <aff id="a3">
                    <label>3</label>Department of Environmental and Radiological Health Sciences, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, Colorado, 80523, USA</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:debmalya.nandy@cuanschutz.edu">debmalya.nandy@cuanschutz.edu</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>7</day>
                <month>7</month>
                <year>2022</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2022</year>
            </pub-date>
            <volume>11</volume>
            <elocation-id>759</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>21</day>
                    <month>6</month>
                    <year>2022</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Nandy D et al.</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/11-759/pdf"/>
            <abstract>
                <p>
                    <bold>Background:</bold> Magnetic resonance imaging (MRI) in clinical patients is often evaluated for diagnostic purposes. However, to develop a disease classifier, imaging data can be &#x201c;noisy&#x201d;, as in being heterogeneous (e.g., obtained from multiple sites), having significant crossover between normal and pathological processes, being highly imbalanced for the outcome variable (i.e., unequal numbers of cases and controls), or due to a lack of accurate quantitative analysis tools that are transferable, easily usable, and accurate to generate the final image variables for machine learning analyses.</p>
                <p>
                    <bold>Methods:</bold> In this article, we demonstrate the effectiveness of ComBat harmonization of heterogeneous MRI data on dogs&#x2019; brains, collected across multiple sites, prior to using them in the random forest (RF) classifier to attempt to differentiate the meningioma and the glioma tumor-types. We consider three image variables generated from each of the brain scans and three clinical covariates &#x2013; age, sex, and breedtype &#x2013; for each subject. The scans are generated either at Colorado State University (CSU) or outside CSU. We compare the RF classifier performance in identifying the two tumor types, with and without preprocessing the data with ComBat site-specific harmonization.</p>
                <p>
                    <bold>Results:</bold> The post-ComBat disease classification accuracy measures &#x2013; sensitivity, specificity, and total accuracy &#x2013; indicate an overall significant edge in the RF performance compared to their without-ComBat counterparts across different scenarios. Moreover, incorporating both the image variables and the clinical covariates in the RF model results in the highest total accuracy.</p>
                <p>
                    <bold>Conclusions:</bold> Use of MRI data in combination with clinical covariates is more informative than using only clinical covariates in classifying meningioma and glioma brain-tumors in dogs. Moreover, as a preprocessing step for MRI data, we recommend adjusting for the site-specific variability using ComBat harmonization prior to performing downstream analyses, such as disease classification.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Brain MRI</kwd>
                <kwd>Canines</kwd>
                <kwd>ComBat</kwd>
                <kwd>Data harmonization</kwd>
                <kwd>Multiple sites</kwd>
                <kwd>Meningioma</kwd>
                <kwd>Glioma</kwd>
                <kwd>Random forest classification</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/100000002">
                    <funding-source>National Institutes of Health</funding-source>
                    <award-id>UL1TR002535</award-id>
                    <award-id>NCIU01CA235488</award-id>
                </award-group>
                <funding-statement>Funds provided by the NIH/NCATS Colorado CTSA Grant Number UL1TR002535 supported DN, KK, and FX. These funds were distributed through the Translational Methods Pilot Awards 2020-2021 (Biostatistics / Bioinformatics category) offered by the Colorado Clinical and Translational Sciences Institute (CCTSI) at the University of Colorado Anschutz Medical Campus, Aurora, Colorado. DN and KK were also supported by NIH grant NCI U01CA235488. The contents of this research are the authors&#x2019; sole responsibility and do not necessarily represent official NIH views.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>Magnetic resonance imaging (MRI), a powerful technology to detect abnormalities in human and animal organs,
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup> can be challenging for clinically differential diagnosis.
                <sup>
                    <xref ref-type="bibr" rid="ref10">10</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref14">14</xref>
                </sup> In omics sciences, data normalization (henceforth, &#x201c;harmonization&#x201d;) is a crucial preprocessing step prior to downstream analyses,
                <sup>
                    <xref ref-type="bibr" rid="ref15">15</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref21">21</xref>
                </sup> mitigating any spurious effects on the scientific conclusions incorporated due to undesired sources of variation, such as batch effects, intrinsic factors within the subjects, and scanning sites. Such harmonization is also essential for MRI data, as the signal intensities in these data are measured in arbitrary units that vary across study-visits and patients.
                <sup>
                    <xref ref-type="bibr" rid="ref22">22</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref25">25</xref>
                </sup>
            </p>
            <p>In this study, we demonstrate the effectiveness of a batch-effect correction tool, 
                <ext-link ext-link-type="uri" xlink:href="https://bit.ly/fortin-ComBat-git">ComBat</ext-link>,
                <sup>
                    <xref ref-type="bibr" rid="ref26">26</xref>
                </sup> widely used in transcriptomics
                <sup>
                    <xref ref-type="bibr" rid="ref27">27</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref28">28</xref>
                </sup> but also adopted for radiomics data,
                <sup>
                    <xref ref-type="bibr" rid="ref29">29</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref30">30</xref>
                </sup> in adjusting for undesirable effects of multiple sites on MRI signal intensities (SIs). We chose ComBat due to its superior performance in removing site-specific unwanted variations from fractional anisotropy and mean diffusivity maps in diffusor tensor MRI.
                <sup>
                    <xref ref-type="bibr" rid="ref29">29</xref>
                </sup> In their study, the authors considered only controls, used data that were from two &#x201c;pure&#x201d; sites, and implemented a sophisticated image-processing pipeline to generate the tissue outcome labels, which resulted in final measurements on the image variables (voxels) having dimensions in the order of 10,000&#x2019;s. In our case, however, each subject is diseased (meningioma/glioma) and the data come from two &#x201c;impure&#x201d; sites, i.e., the &#x201c;outside&#x201d; site consists of multiple non-CSU sites, the data thus potentially being noisy due to heterogeneous MRI scanners/protocols used. Notably, such site-heterogeneity can be commonplace to ensure a sufficient sample size. Additionally, we use only three manually recorded image variables, available for all subjects across the sites. Via the downstream performance of the ensemble machine learning classification tool, random forest
                <sup>
                    <xref ref-type="bibr" rid="ref31">31</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref34">34</xref>
                </sup> (RF), our study thus aims to demonstrate the utility of ComBat harmonization in a &#x201c;non-ideal&#x201d; yet practical scenario.</p>
        </sec>
        <sec id="sec2" sec-type="methods">
            <title>Methods</title>
            <sec id="sec3">
                <title>Study population and data generation</title>
                <p>We use n = 244 subjects (dogs) in our study, belonging to one of the following four subpopulations: 1) glioma, scanned at the Colorado State University &#x2013; Veterinary Teaching Hospital (CSU-VTH), n = 39; 2) glioma, obtained from a site outside CSU, n = 20; 3) meningioma scanned at the CSU-VTH, n = 106; and 4) meningioma, obtained from a site outside CSU, n = 79. Note that we treat the subjects as coming from only two sites -- CSU and &#x201c;outside&#x201d;. However, the &#x201c;outside&#x201d; site actually consists of 36 unique sites (
                    <xref ref-type="table" rid="T1">Table 1</xref>).</p>
                <table-wrap id="T1" orientation="portrait" position="float">
                    <label>Table 1. </label>
                    <caption>
                        <title>List of 36 unique sites that we combinedly call the &#x201c;outside&#x201d; site.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Advanced Veterinary Care</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Animal Emergency &amp; Speciality Center (AESC)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">AMI Diamond Hill</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">The ANIC</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Animal Imaging</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Animal Neurology &amp; MRI Center</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Aspen Valley Hospital</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Blackmore</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Boulder Road Veterinary Specialists</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Brain Med BRN</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Canada West vet Specialists</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Chicago Veterinary MRI</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Chicago Veterinary Emergency &amp; Specialty Center (CVESC)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">The Ohio State University (OSU) Veterinary Hospital</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Diagnostic Radiology Institute</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Esaote S.p.A</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">GCVNN</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">University of Utah Imaging &amp; Neurosciences Center (INC)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">ISU Vet Teaching Hospital</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Michigan State University (MSU)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">PPER_S</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Rocky Mountain Veterinary Neurology</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Tacoma Vet Imaging</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Texas A &amp; M Veterinary Teaching Hospital</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">University of Missouri Veterinary Health Center</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">VCA Alameda East Veterinary Hospital</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">VCA North West (NW) Veterinary Specialists</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">VCA Veterinary Specialists of Northern Colorado (VSNC)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Veterinary Specialty Center Tucson</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Veterinary Imaging, LLC</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Veterinary Neurology</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Veterinary Neurological Center (VNC) Phoenix</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Veterinary Speciality Hospital of SanDiego (VSHSD)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Western Orthopedics and Sports Medicine</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">WestVet</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Wheat Ridge Animal Hospital</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>DN and XY, the two &#x201c;processors&#x201d;, generate the data used in the final analyses. DN scans through the conclusion of each patient&#x2019;s brain MRI diagnostic report stored in the CSU-VTH Philips IntelliSpace PACS (picture archiving and communication system) Radiology software (henceforth referred to as &#x201c;PACS&#x201d;) database, labeling the associated brain tumor-type as either &#x201c;glioma&#x201d; or &#x201c;meningioma&#x201d; based on the radiologist&#x2019;s/principal interpreter&#x2019;s conclusion including terms such as &#x201c;likely&#x201d;/&#x201c;most likely&#x201d;/&#x201c;most consistent&#x201d;, etc. Therefore, these binary tumor-type labels are not based on surgical, histopathological evidence and are used as the outcome variable in the downstream RF classification (see the &#x201c;Statistical analysis&#x201d; section). Since we do not have access to the diagnostic reports for the subjects from the &#x201c;outside&#x201d; site, we consider instead the corresponding ones from the CSU PACS database that are closest to their original exam dates.</p>
                <p>For each patient, we only consider the transverse/axial section, T1-weighted, post-contrast scans (typically labeled as &#x201c;Trans T1 +C&#x201d;). The processors scan through all the slices within each patient&#x2019;s respective DICOM file and select up to three representative slices in which the cancerous lesions are most prominently visible (i.e., highest contrast) by naked eye. Note that, among the 244 subjects, we settle with only one suitable slice for seven subjects and two for six subjects (
                    <italic toggle="yes">Extended data</italic>: Table S1).
                    <sup>
                        <xref ref-type="bibr" rid="ref47">47</xref>
                    </sup> Then, within each chosen slice, two circular regions of interest (ROIs) are drawn encompassing the densest parts visually examined, one each on the lesion and on the &#x201c;normal&#x201d; tissue, using the PACS software in-built &#x201c;drawing&#x201d; tool. Also note that, as &#x201c;normal&#x201d; tissue, we choose facial muscle for seven meningioma subjects and muscle of mastication for the rest (
                    <italic toggle="yes">Extended data</italic>: Table S2).
                    <sup>
                        <xref ref-type="bibr" rid="ref48">48</xref>
                    </sup> From each of these two ROIs, three statistics for the SIs are noted: mean, standard deviation, and the central point-value. See 
                    <xref ref-type="fig" rid="f1">Figure 1</xref> for an example.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Example of data generation from circular regions of interest (ROIs) &#x2013; diseased lesion (A) and normal tissue (B) &#x2013; drawn within the same slice (axial T1-weighted, post-contrast) using the PACS software tools.</title>
                        <p>This subject (dog) belongs to the &#x201c;meningioma outside&#x201d; subpopulation, i.e., its brain MRI is performed at a non-CSU site and diagnosed with meningioma tumor-type. The normal tissue chosen in (B) is muscle of mastication. The means and the standard deviations of the SIs within the two ROIs are indicated beside the circles drawn and the central point-value SIs are indicated at the bottom of the slides, outside of the parentheses.</p>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/129171/1324e15f-f42a-454b-9e8a-ce08b185d629_figure1.gif"/>
                </fig>
                <p>Besides the three MRI variables, for each patient we also record the following covariates: three clinical &#x2013; age (in months) at the time of MRI scan, sex (male, female, male castrated, female spade/spayed), and breedtype; six related to MRI scanner &#x2013; repetition time (TR), echo time (TE), number of excitations (NEX), slice thickness (mm), frequency phase (X x Y), and field-of-view reconstruction (FOV recon; cm); and one technical &#x2013; processor.</p>
                <p>Note that, for the final analysis, we use both sex and breedtype as binary variables: sex (female/male) and breedtype (non-brachycephalic/brachycephalic). Data on frequency phase are used as two independent scanner covariates. Due to the presence of missing data, we eventually omit the &#x201c;FOV Recon&#x201d; scanner covariate from the final analysis. Thus, we have three binary covariates &#x2013; sex, breedtype, and processor, coded as 0/1; the rest are treated as continuous variables. See 
                    <xref ref-type="table" rid="T2">Table 2</xref> for a summary of all of the final variables used in our analyses.</p>
                <table-wrap id="T2" orientation="portrait" position="float">
                    <label>Table 2. </label>
                    <caption>
                        <title>Summary of the three clinical covariates, one technical covariate, six magnetic resonance imaging (MRI) scanner covariates, and three MR curated image variables used in our statistical analyses.</title>
                        <p>The data are grouped based on the four subpopulations as indicated in the columns. Apart from the three binary covariates &#x2013; sex, breedtype, and processor &#x2013; that are coded as 0/1, the rest are treated as continuous variables; each cell-value indicates the range in the top line and the median (median absolute deviation in parentheses) in the bottom line.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="middle"/>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Meningioma CSU (n = 106)</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Meningioma outside (n = 79)</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Glioma CSU (n = 39)</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Glioma outside (n = 20)</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="center" colspan="5" rowspan="1" valign="middle">
                                    <bold>Clinical covariates</bold>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Age (in months)</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">18-204
                                    <break/>119.5 (32.617)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">53-210
                                    <break/>123 (28.169)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">16-178
                                    <break/>94 (56.339)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">38-167
                                    <break/>99 (26.687)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Sex (F/M)</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">54/52</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">40/39</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">21/18</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">14/6</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Breed-Type (Brachycephalic/Non-brachycephalic)</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">15/91</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">7/72</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">15/24</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">10/10</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="5" rowspan="1" valign="middle">
                                    <bold>Technical covariate</bold>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Processor (DN/XY)</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">54/52</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">39/40</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">20/19</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">9/11</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="5" rowspan="1" valign="middle">
                                    <bold>MRI scanner covariates</bold>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>TR</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">300 &#x2013; 1003
                                    <break/>573 (84.757)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">350 &#x2013; 2100
                                    <break/>600 (171.982)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">400 &#x2013; 859
                                    <break/>566.664 (103.284)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">250 &#x2013; 1310
                                    <break/>584.50 (148.26)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>TE</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">8-15.62
                                    <break/>13.016 (2.988)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">3.25 &#x2013; 26
                                    <break/>14.358 (5.400)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">7.984-15.048
                                    <break/>13 (3.011)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">2.92 &#x2013; 26
                                    <break/>11.787 (3.950)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>NEX</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1 &#x2013; 4
                                    <break/>3 (1.483)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1 &#x2013; 4
                                    <break/>2 (1.483)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1 &#x2013; 4
                                    <break/>3 (1.483)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1 &#x2013; 4
                                    <break/>2 (0.741)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Thickness (in mm)</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">2 &#x2013; 4
                                    <break/>3 (1.483)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">2 &#x2013; 5
                                    <break/>3 (0.445)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">2 &#x2013; 4
                                    <break/>3 (1.483)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">2.5 &#x2013; 5
                                    <break/>3 (0)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Frequency Phase 1</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">192 &#x2013; 320
                                    <break/>288 (47.443)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">192 &#x2013; 512
                                    <break/>256 (47.443)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">256 &#x2013; 320
                                    <break/>288 (47.443)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">192 &#x2013; 512
                                    <break/>256 (11.861)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Frequency Phase 2</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">192 &#x2013; 224
                                    <break/>224 (0)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">72 &#x2013; 320
                                    <break/>224 (47.443)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">192 &#x2013; 256
                                    <break/>224 (0)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">144 &#x2013; 256
                                    <break/>195.50 (52.632)</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="5" rowspan="1" valign="middle">
                                    <bold>Image variables</bold>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>&#x03bc; (adj-mean [SI])</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.846-3.138
                                    <break/>1.932 (0.262)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1.220-2.953
                                    <break/>2.007 (0.380)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1.170-2.704
                                    <break/>1.591 (0.310)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1.076-2.860
                                    <break/>1.743 (0.378)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>&#x03bc; (adj-SD [SI])</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.748-6.832
                                    <break/>2.328 (0.840)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.850-7.497
                                    <break/>1.652 (0.610)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.870-6.912
                                    <break/>2.234 (1.438)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.704-3.882
                                    <break/>1.504 (0.550)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>&#x03bc; (adj-cent [SI])</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.914-3.220
                                    <break/>1.975 (0.327)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1.119-3.436
                                    <break/>2.041 (0.399)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1.144-2.739
                                    <break/>1.582 (0.297)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">1.159-2.985
                                    <break/>1.722 (0.370)</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
            </sec>
            <sec id="sec4">
                <title>Statistical analysis</title>
                <p>
                    <italic toggle="yes">Preprocessing of the data and final variables</italic>
                </p>
                <p>For each of up to three selected slices corresponding to each sample, we first normalize the mean, the standard deviation, and the central point-value of the SIs within the diseased ROI by taking respective ratios to the normal ROI within that same slice (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>). We call these three measures adj-mean (SI), adj-SD (SI), and adj-cent (SI), respectively. Next, for each sample, we compute the means of these adjusted measures across the selected slices. These three summarized measures, respectively referred to as &#x03bc; (adj-mean (SI)), &#x03bc; (adj-SD (SI)), and &#x03bc; (adj-cent (SI)), are used as the final three image variables in the subsequent analyses (Figures S1 and S2).
                    <sup>
                        <xref ref-type="bibr" rid="ref50">50</xref>
                    </sup>
                    <sup>&#x2013;</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref55">55</xref>
                    </sup> The intercorrelations among the three continuous image variables and the disease labels (0 = glioma, 1 = meningioma) are shown in Figure S3.
                    <sup>
                        <xref ref-type="bibr" rid="ref56">56</xref>
                    </sup>
                    <sup>,</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref57">57</xref>
                    </sup> We note that, for both CSU and outside sites &#x03bc; (adj-mean (SI)) and &#x03bc; (adj-cent (SI)) are maximally correlated with the disease labels and the correlations among the &#x03bc; (adj-SD (SI)) and disease labels are negligible. Among the continuous covariates across both sites, while age (in months), &#x03bc; (adj-mean (SI)), and &#x03bc; (adj-cent (SI)) resemble a Gaussian distribution, those of others deviate greatly from it (data not shown).</p>
            </sec>
            <sec id="sec5">
                <title>Tumor classification</title>
                <p>For the classification of meningioma and glioma brain-tumors (glioma treated as the &#x201c;positive&#x201d; class), we apply RF
                    <sup>
                        <xref ref-type="bibr" rid="ref31">31</xref>
                    </sup>
                    <sup>&#x2013;</sup>
                    <sup>
                        <xref ref-type="bibr" rid="ref34">34</xref>
                    </sup> and evaluate classification performance based on sensitivity, specificity, and total accuracy, benchmarked via &#x201c;lower&#x201d; and &#x201c;upper&#x201d; bounds (
                    <xref ref-type="table" rid="T3">Table 3</xref>). Using the same site for training and test sets, we expect better RF classification performance (upper bound) compared to when using different sites (lower bound).</p>
                <table-wrap id="T3" orientation="portrait" position="float">
                    <label>Table 3. </label>
                    <caption>
                        <title>Choice of sites for the computation of the &#x201c;lower&#x201d; and the &#x201c;upper&#x201d; bounds of random forest (RF) classification metrics.</title>
                        <p>M: Meningioma, G: Glioma. For &#x201c;lower&#x201d; bound computations, we use all the samples within the outside site (n = 99, M/G = 79/20) to train the RF model, and randomly subsample n = 38 subjects from the CSU population, ensuring M/G = 19/19 representation, for the test set. For &#x201c;upper&#x201d; bound computations, we randomly subsample n = 79 meningioma CSU subjects from the remaining 87 for the training sets and use the same test sets as those used for the lower bounds.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="middle"/>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Training set (n = 99, M/G = 79/20)</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Test set (n = 38, M/G = 19/19)</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Lower bound</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">Outside</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">CSU</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>Upper bound</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">CSU</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">CSU</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>For the &#x201c;lower&#x201d; bound calculations, we use all the samples within the outside site (n = 99, M/G = 79/20) to train the RF classifier, and randomly subsample n = 38 subjects from the CSU population, ensuring M/G = 19/19 representation, for the test set. Note that, the training set for the lower bound have 4:1 imbalanced class distribution in the outcome, which we adjust for using the Synthetic Minority Oversampling TEchnique (SMOTE),
                    <sup>
                        <xref ref-type="bibr" rid="ref35">35</xref>
                    </sup> using arguments 
                    <italic toggle="yes">perc.over</italic> = 3 and 
                    <italic toggle="yes">perc.under</italic> = 1.45 within the 
                    <italic toggle="yes">smote()</italic> function. The size of a final training set is thus increased to n = 159 (M/G = 79/80). We use the original n = 79 meningioma samples and the n = 80 glioma cases that are generated using SMOTE. Within this training set, we tune the parameters of the RF classifier using 5-fold cross-validation repeated 25 times, and using all possible combinations of predictor variables in the model via the 
                    <italic toggle="yes">mtry</italic> argument in the 
                    <italic toggle="yes">train()</italic> function. For the &#x201c;upper&#x201d; bound calculations, we keep the identical test set compositions as in lower bound computations, and form the training set by randomly subsampling n = 79 &#x201c;meningioma CSU&#x201d; subjects from the remaining 87. We repeat this exercise of computing lower and upper bounds 75 times, each time with a different training-test split. Finally, we report the medians (and median absolute deviations) of the classification metrics across these 75 random samples; see 
                    <xref ref-type="table" rid="T5">Table 5</xref> for an example.</p>
            </sec>
            <sec id="sec6">
                <title>Scenarios studied</title>
                <p>We investigate the RF classifier performance at the lower and upper bounds for the following scenarios:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>[
                                <bold>Case 0: one scenario</bold>] We examine the effectiveness of using three clinical covariates only in classifying the tumor types. No image, technical, and scanner covariates are used, and therefore, no ComBat harmonization is involved.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>[
                                <bold>Case 1: four scenarios</bold>] We use the three image variables in ComBat. Besides, we either use the three clinical covariates or not in ComBat and in subsequent RF, thus giving rise to four scenarios (a &#x2013; d; 
                                <xref ref-type="table" rid="T4">Table 4</xref>). We do not use any technical and scanner covariates in ComBat.</p>
                        </list-item>
                    </list>
                </p>
                <table-wrap id="T4" orientation="portrait" position="float">
                    <label>Table 4. </label>
                    <caption>
                        <title>Schematic table of four scenarios in Case 1 indicating use of the three clinical covariates in the ComBat harmonization and in the random forest (RF) classification model.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="2" rowspan="2" valign="middle"/>
                                <th align="center" colspan="2" rowspan="1" valign="middle">ComBat: 3 Clinical covariates</th>
                            </tr>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="middle">No</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Yes</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="middle">
                                    <bold>Random Forest: 3 Clinical covariates</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <bold>No</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">Scenario a</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">Scenario b</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <bold>Yes</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">Scenario d</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">Scenario c</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>To assess the impact of ComBat harmonization on RF classification performance, we conduct nonparametric tests (Wilcoxon&#x2019;s signed-rank paired one-sided tests with continuity correction) to examine whether a post-ComBat classification metric lower bound is: (1) significantly greater than that for its pre-ComBat counterpart, and (2) significantly lower than the corresponding upper bound (
                    <xref ref-type="table" rid="T5">Table 5</xref>). Glioma is treated as the &#x201c;positive&#x201d; class in classification and, therefore, sensitivity measures the proportion of true glioma cases correctly identified, specificity measures the proportion of true meningioma cases correctly identified, and total accuracy measures the total proportion of true meningioma and glioma cases correctly identified.</p>
                <table-wrap id="T5" orientation="portrait" position="float">
                    <label>Table 5. </label>
                    <caption>
                        <title>Random forest (RF) classification median (median absolute deviation in parentheses) sensitivity (&#x201c;Sens&#x201d;), specificity (&#x201c;Spec&#x201d;), and total accuracies (&#x201c;Tot Acc&#x201d;) corresponding to Case 1, scenarios a &#x2013; d (
                            <xref ref-type="table" rid="T4">Table 4</xref>).</title>
                        <p>The medians and median absolute deviations of the classifiation metrics are computed based on 75 repetitions of random training/test splits. Values closer to 1 indicate better performance. For post-ComBat lower bounds: 1) bold indicates significantly greater value (
                            <italic toggle="yes">p</italic>-value &lt; 0.05, Wilcoxon&#x2019;s signed-rank paired one-sided test with continuity correction) compared to the corresponding pre-ComBat lower bound; 2) underline indicates corresponding upper bound is not significantly higher. Therefore, bold and underline together indicate the best results using ComBat.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="1" rowspan="3" valign="middle"/>
                                <th align="center" colspan="9" rowspan="1" valign="middle">Lower bound</th>
                                <th align="center" colspan="3" rowspan="1" valign="middle">Upper bound</th>
                            </tr>
                            <tr>
                                <th align="center" colspan="3" rowspan="1" valign="middle">Pre-ComBat</th>
                                <th align="center" colspan="3" rowspan="1" valign="middle">Post-ComBat Clinical covariates = NO</th>
                                <th align="center" colspan="3" rowspan="1" valign="middle">Post-ComBat Clinical covariates = YES</th>
                                <th align="center" colspan="3" rowspan="1" valign="middle">No ComBat</th>
                            </tr>
                            <tr>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Sens</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Spec</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Tot Acc</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Sens</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Spec</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Tot Acc</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Sens</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Spec</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Tot Acc</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Sens</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Spec</th>
                                <th align="center" colspan="1" rowspan="1" valign="middle">Tot Acc</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="3" rowspan="1" valign="middle">
                                    <bold>Scenario a</bold>
                                </td>
                                <td align="center" colspan="3" rowspan="1" valign="middle">
                                    <bold>Scenario b</bold>
                                </td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>RF Clinical covariates = NO</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.474 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.684 (0.156)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.605 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <underline>
                                        <bold>0.579</bold>
                                    </underline> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <bold>0.737</bold> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <underline>
                                        <bold>0.658</bold>
                                    </underline> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <underline>
                                        <bold>0.632</bold>
                                    </underline> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <bold>0.737</bold> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <underline>
                                        <bold>0.658</bold>
                                    </underline> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.526 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.789 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.658 (0.039)</td>
                            </tr>
                            <tr>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="3" rowspan="1" valign="middle">
                                    <bold>Scenario d</bold>
                                </td>
                                <td align="center" colspan="3" rowspan="1" valign="middle">
                                    <bold>Scenario c</bold>
                                </td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">
                                    <bold>RF Clinical covariates = YES</bold>
                                </td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.526 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.789 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.684 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.526 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <underline>
                                        <bold>0.842</bold>
                                    </underline> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <bold>0.684</bold> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.526 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <underline>
                                        <bold>0.842</bold>
                                    </underline> (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">
                                    <underline>
                                        <bold>0.711</bold>
                                    </underline> (0.039)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.632 (0.156)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.842 (0.078)</td>
                                <td align="center" colspan="1" rowspan="1" valign="middle">0.711 (0.078)</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
            </sec>
        </sec>
        <sec id="sec7" sec-type="results">
            <title>Results</title>
            <p>Below we discuss the full set of results for the scenarios in Cases 0 and 1.
                <sup>
                    <xref ref-type="bibr" rid="ref43">43</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref46">46</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref50">50</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref57">57</xref>
                </sup> Note that, besides these two cases, we also examine the results of another case (Case 2) in which, alongside the three image variables, we include one technical covariate and six scanner covariates (see the &#x201c;Study population and data generation&#x201d; section) in the ComBat step. However, since the essence of these results is mostly similar to that of Case 1, we set them aside as &#x201c;Extended data&#x201d; (
                <italic toggle="yes">Extended data</italic>: Table S3).
                <sup>
                    <xref ref-type="bibr" rid="ref49">49</xref>
                </sup>
            </p>
            <sec id="sec8">
                <title>Using only three clinical covariates in the RF classification model (no ComBat harmonization involved)</title>
                <p>Using only the clinical covariates of the subjects in the RF model (Case 0), the lower bound total accuracies are not significantly lower than those for upper bounds: both medians = 57.9%; 
                    <italic toggle="yes">p</italic>-value = 0.332 (
                    <xref ref-type="fig" rid="f2">Figure 2</xref>). The lower bounds of the sensitivity and the specificity measures are also not significantly lower than those for the upper bounds: 
                    <italic toggle="yes">p</italic>-values 0.133 and 0.884 respectively. Thus, the distributions of the age/sex/breed-type between meningioma/glioma subjects do not vary significantly across sites. For example, exact 
                    <italic toggle="yes">p</italic>-values corresponding to the Pearson&#x2019;s chi-squared tests (with Yates&#x2019; continuity correction) on the two 2&#x00d7;2 contingency tables for sex and breed-type distributions across CSU and Outside sites are 0.762 and 0.604, respectively. Also, among all scenarios, RF achieves the lowest medians of total accuracy and sensitivity in this case, which indicates an overall poor predictive strength of using only clinical covariates in the RF model (
                    <xref ref-type="fig" rid="f2">Figures 2</xref> and 
                    <xref ref-type="fig" rid="f3">3</xref>).</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Boxplots of random forest (RF) classification metrics corresponding to Case 0: &#x201c;tota&#x201d; = total accuracy, &#x201c;sens&#x201d; = sensitivity, and &#x201c;spec&#x201d; = specificity.</title>
                        <p>L, U: lower bound (black) and upper bound (blue) obtained from RF models using only three clinical covariates.</p>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/129171/1324e15f-f42a-454b-9e8a-ce08b185d629_figure2.gif"/>
                </fig>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>Boxplots of random forest (RF) classification metrics: (A) total accuracy, (B) sensitivity, and (C) specificity, corresponding to Case 0 (&#x201c;c0&#x201d;) and Case 1 pre-ComBat and post-ComBat scenarios a (&#x201c;1a&#x201d;) and b (&#x201c;1b&#x201d;); see 
                            <xref ref-type="table" rid="T4">Table 4</xref>.</title>
                        <p>L.c0, U.c0: lower bound (black) and upper bound (magenta) obtained from RF models using only three clinical covariates; no ComBat harmonization involved; L, L.CB, U: pre-ComBat lower bound (red), post-ComBat lower bounds (green, 1a; blue, 1b), and upper bound (cyan) obtained from RF models using only three image variables.</p>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/129171/1324e15f-f42a-454b-9e8a-ce08b185d629_figure3.gif"/>
                </fig>
            </sec>
            <sec id="sec9">
                <title>Using only three image variables in the RF classification model</title>
                <p>
                    <italic toggle="yes">Pre-harmonization</italic>
                </p>
                <p>
                    <underline>Total accuracy:</underline> Using only the image variables in the RF model, the lower bound total accuracy (pre-ComBat) does not differ significantly from that using only three clinical covariates (Case 0): medians 60.5% vs. 57.9%; 
                    <italic toggle="yes">p</italic>-value = 0.270. However, the upper bound total accuracy is significantly higher than that in Case 0: medians 65.8% vs. 57.9%; 
                    <italic toggle="yes">p</italic>-value = 4.06 E-07 (
                    <xref ref-type="fig" rid="f3">Figure 3-A</xref>).</p>
                <p>
                    <underline>Sensitivity:</underline> Using only the image variables in the RF model, the lower bound sensitivity (pre-ComBat) is significantly higher than that using only three clinical covariates (Case 0): medians 47.4% vs. 42.1%; 
                    <italic toggle="yes">p</italic>-value = 9.68 E-04. Similarly, the upper bound sensitivity is also significantly higher than that in Case 0: medians 52.6% vs. 47.4%; 
                    <italic toggle="yes">p</italic>-value = 6.58 E-04 (
                    <xref ref-type="fig" rid="f3">Figure 3-B</xref>).</p>
                <p>
                    <underline>Specificity:</underline> Using only the image variables in the RF model, interestingly, the lower bound specificity (pre-ComBat) is significantly lower than that using only three clinical covariates (Case 0): medians 68.4% vs. 73.7%; 
                    <italic toggle="yes">p</italic>-value = 3.31 E-03. However, the upper bound specificity is significantly higher than that in Case 0: medians 78.9% vs. 73.7%; 
                    <italic toggle="yes">p</italic>-value = 5.67 E-05 (
                    <xref ref-type="fig" rid="f3">Figure 3-C</xref>).</p>
                <p>
                    <italic toggle="yes">Post-harmonization</italic>
                </p>
                <p>
                    <underline>Total accuracy:</underline> Using post-ComBat harmonization (scenarios a, b), the total accuracy lower bounds are significantly higher compared to their pre-ComBat and Case 0 counterparts. For example, post-ComBat with only three image variables (scenario a): (1) vs. pre-ComBat: medians 65.8% vs. 60.5%; 
                    <italic toggle="yes">p</italic>-value = 2.64 E-08 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f3">Figure 3-A</xref>) and (2) vs. using only the clinical covariates (Case 0): medians 65.8% vs. 57.9%; 
                    <italic toggle="yes">p</italic>-value = 4.98 E-08 (
                    <xref ref-type="fig" rid="f3">Figure 3-A</xref>).</p>
                <p>
                    <underline>Sensitivity:</underline> Using post-ComBat harmonization (scenarios a, b), the sensitivity lower bounds are significantly higher compared to their pre-ComBat and Case 0 counterparts. For example, post-ComBat with only three image variables (scenario a): (1) vs. pre-ComBat: medians 57.9% vs. 47.4%; 
                    <italic toggle="yes">p</italic>-value = 4.33 E-08 (
                    <xref ref-type="table" rid="T5">Table 5</xref> and 
                    <xref ref-type="fig" rid="f3">Figure 3-B</xref>) and (2) vs. using only the clinical covariates (Case 0): medians 57.9% vs. 42.1%; 
                    <italic toggle="yes">p</italic>-value = 7.88 E-11 (
                    <xref ref-type="fig" rid="f3">Figure 3-B</xref>).</p>
                <p>
                    <underline>Specificity:</underline> Using post-ComBat harmonization (scenarios a, b), the specificity lower bounds are significantly higher compared to their pre-ComBat counterparts. For example, post-ComBat with only three image variables (scenario a) vs. pre-ComBat: medians 73.7% vs. 68.4%; 
                    <italic toggle="yes">p</italic>-value = 1.16 E-03 (
                    <xref ref-type="table" rid="T5">Table 5</xref> and 
                    <xref ref-type="fig" rid="f3">Figure 3-C</xref>). Interestingly though, these post-ComBat lower bounds are not significantly higher than that using only the clinical covariates (Case 0): all three medians 73.7%; 
                    <italic toggle="yes">p</italic>-values (scenarios a and b vs. Case 0) = 0.347 and 0.359, respectively (
                    <xref ref-type="fig" rid="f3">Figure 3-C</xref>).</p>
                <p>These results confirm that using just the three image variables in the RF model, ComBat harmonization enhances the RF classification performance (except for specificity) compared to that in pre-ComBat and when using only the clinical covariates.</p>
            </sec>
            <sec id="sec10">
                <title>Using three image variables and three clinical covariates in the RF classification model</title>
                <p>
                    <italic toggle="yes">Pre-harmonization</italic>
                </p>
                <p>
                    <underline>Total accuracy:</underline> Using the image variables and the clinical covariates in the RF model, the lower bound total accuracy (pre-ComBat) is significantly higher than that using only three image variables in RF: medians 68.4% vs. 60.5%; p-value = 7.48 E-09. Similarly, the upper bound total accuracy is also significantly higher: medians 71.1% vs. 65.8%; p-value = 3.64 E-07 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f4">Figure 4-A</xref>).</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Boxplots of random forest (RF) classification metrics: (A) total accuracy, (B) sensitivity, and (C) specificity, corresponding to Case 1 pre-ComBat (RF model using only the image variables and using both the image variables and the clinical covariates) and post-ComBat scenarios c (&#x201c;1c&#x201d;) and d (&#x201c;1d&#x201d;); see 
                            <xref ref-type="table" rid="T4">Table 4</xref>.</title>
                        <p>L3, U3: pre-ComBat lower bound (black) and upper bound (magenta) obtained from RF models using only three image variables; L6, L6.CB, U6: pre-ComBat lower bound (red), post-ComBat lower bounds (green, 1c; blue, 1d), and upper bound (cyan) obtained from RF models using three image variables and three clinical covariates.</p>
                    </caption>
                    <graphic id="gr4" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/129171/1324e15f-f42a-454b-9e8a-ce08b185d629_figure4.gif"/>
                </fig>
                <p>
                    <underline>Sensitivity:</underline> Using the image variables and the clinical covariates in the RF model, the lower bound sensitivity (pre-ComBat) is significantly higher than that using only three image variables in RF: medians 52.6% vs. 47.4%; p-value = 8.77 E-04. Similarly, the upper bound sensitivity is also significantly higher: medians 63.2% vs. 52.6%; p-value = 1.76 E-06 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f4">Figure 4-B</xref>).</p>
                <p>
                    <underline>Specificity:</underline> Using the image variables and the clinical covariates in the RF model, the lower bound specificity (pre-ComBat) is significantly higher than that using only three image variables in RF: medians 78.9% vs. 68.4%; p-value = 2.33 E-10. Similarly, the upper bound specificity is also significantly higher: medians 84.2% vs. 78.9%; p-value = 2.90 E-03 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f4">Figure 4-C</xref>).</p>
                <p>
                    <italic toggle="yes">Post-harmonization</italic>
                </p>
                <p>
                    <underline>Total accuracy:</underline> Using post-ComBat harmonization (scenarios c, d), the total accuracy lower bounds are significantly higher compared to their pre-ComBat and post-ComBat with only image variables in RF counterparts. For example, post-ComBat using three image variables and three clinical covariates (scenario c): (1) vs. pre-ComBat: medians 71.1% vs 68.4%; p-value = 8.80 E-04 and (2) vs. using only image variables in the RF model (scenario b): medians 71.1% vs. 65.8%; p-value = 1.84 E-04. Moreover, comparing between post-ComBat scenarios c and d: medians 71.1% vs 68.4%, p-value = 6.97 E-03 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f4">Figure 4-A</xref>).</p>
                <p>
                    <underline>Sensitivity:</underline> Using post-ComBat harmonization (scenarios c, d), the sensitivity lower bounds are not significantly higher compared to their pre-ComBat counterparts. For example, post-ComBat using three image variables and three clinical covariates (scenario c) vs. pre-ComBat: both medians 52.6%; p-value = 0.953 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f4">Figure 4-B</xref>). However, this post-ComBat sensitivity lower bound in scenario c is significantly higher than that using only image variables (scenario d): both medians 52.6%; p-value = 0.0177. Interestingly, post-ComBat sensitivity in scenario c (and d) deteriorates significantly compared to those when not using the clinical covariates in the RF model in scenario b (and scenario a): medians 52.6% vs. 63.2% (52.6% vs. 57.9%); p-value = 2.07 E-05 (6.93 E-05; 
                    <xref ref-type="table" rid="T5">Table 5</xref>).</p>
                <p>
                    <underline>Specificity:</underline> Using post-ComBat harmonization (scenarios c, d), the specificity lower bounds are again significantly higher compared to their pre-ComBat counterparts. For example, post-ComBat specificity lower bound using three image variables and three clinical covariates (scenario c) vs. pre-ComBat: medians 84.2% vs. 78.9%; p-value = 9.44 E-10 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f4">Figure 4-C</xref>). This post-ComBat specificity lower bound in scenario c is also significantly higher than that using only image variables (scenario d): both medians 84.2%; p-value = 2.69 E-03 (
                    <xref ref-type="table" rid="T5">Table 5</xref>, 
                    <xref ref-type="fig" rid="f4">Figure 4-C</xref>) and compared to those when not using the clinical covariates in the RF model (scenario b): medians 84.2% vs. 73.7%; p-value = 3.05 E-12 (
                    <xref ref-type="table" rid="T5">Table 5</xref>).</p>
                <p>These results confirm that using the image variables and clinical covariates together in the RF model, with or without ComBat harmonization, results in better RF classification performance (except for sensitivity) than using only the image variables. Furthermore, using the image variables as well as the clinical covariates in both ComBat harmonization and the RF model provides the highest total accuracy and specificity across all scenarios.</p>
            </sec>
        </sec>
        <sec id="sec11" sec-type="discussion">
            <title>Discussion</title>
            <p>In this case-study, we demonstrate the efficacy of MRI data harmonization using ComBat in enhancing the downstream RF classification performance. Utilizing the clinical covariates along with the image variables both in ComBat and RF (Case 1, scenario c) results in the highest total accuracy. When adjusting for the technical and scanner covariates in ComBat (Case 2), we only notice significant improvements in specificity (correct identification of true meningioma cases; scenarios c, d) compared to when not using them (Case 1; 
                <xref ref-type="table" rid="T5">Tables 5</xref> and S3). For both cases, RF achieves the highest specificity with the clinical covariates included in the model, irrespective of including them in ComBat (e.g., maximum median value for Case 1 is 84.2%, scenarios c, d; 
                <xref ref-type="table" rid="T5">Table 5</xref>). Of all cases and scenarios, RF attains the highest sensitivity (correct identification of true glioma cases) when we include the clinical covariates in ComBat but not in the classification model in Case 1 (maximum median value is 63.2%, scenario b; 
                <xref ref-type="table" rid="T5">Table 5</xref>).</p>
            <p>In summary, we confirm the overall effectiveness of ComBat harmonization in adjusting for the site-specific variability even for our &#x201c;non-ideal&#x201d; as a practically feasible, noisy, low-dimensional, manually processed MRI dataset.</p>
            <sec id="sec12">
                <title>Limitations</title>
                <p>The highest median total accuracy we obtain is 71.1% (Case 1, scenario c). However, among the 75 repetitions, we do notice up to a maximum of 84.2%. The challenge in attaining any higher total accuracy is mainly poised by low sensitivity, i.e., correct identification of true glioma cases, possibly due to: 1) insufficient predictors &#x2013; we have used three available, manually generated image variables and three covariates for our analyses; 2) the possible minor mislabeling of the tumor-types or imprecise ROIs because the labels are based on the visual inspection and subjective, expert conclusion of the examining radiologists at the CSU-VTH and not confirmed via surgical histopathology, or because the ROIs in each scan-slice are drawn by two non-radiologists, and hence can possibly incur imprecise diseased/normal ROIs; 3) non-homogeneous sites &#x2013; ComBat performance can potentially sharpen further with more homogeneous composition of the &#x201c;outside&#x201d; site; 4) an imbalanced outcome classes &#x2013; although we address the severe class imbalance, a more balanced distribution in the original data may enhance RF performance
                    <sup>
                        <xref ref-type="bibr" rid="ref36">36</xref>
                    </sup>; and 5) the choice of class imbalance adjustor and classifier &#x2013; one may choose a different class-imbalance adjustment, such as &#x201c;over-sampling&#x201d;,
                    <sup>
                        <xref ref-type="bibr" rid="ref37">37</xref>
                    </sup> or a different classifier, such as logistic regression.
                    <sup>
                        <xref ref-type="bibr" rid="ref38">38</xref>
                    </sup> However, our initial exploration suggests that the SMOTE-RF combination provides better results than those of some other alternatives (data not shown).</p>
            </sec>
        </sec>
        <sec id="sec13">
            <title>Data availability</title>
            <sec id="sec14">
                <title>Underlying data</title>
                <p>Figshare: Image and Covariates Data on CSU-Meningioma Subjects. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19497671.v1">https://doi.org/10.6084/m9.figshare.19497671.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref43">43</xref>
                    </sup>
                </p>
                <p>Figshare: Image and Covariates Data on CSU-Glioma Subjects. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19497683.v1">https://doi.org/10.6084/m9.figshare.19497683.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref44">44</xref>
                    </sup>
                </p>
                <p>Figshare: Image and Covariates Data on Outside-Meningioma Subjects. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19497686.v1">https://doi.org/10.6084/m9.figshare.19497686.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref45">45</xref>
                    </sup>
                </p>
                <p>Figshare: Image and Covariates Data on Outside-Glioma Subjects. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19497692.v1">https://doi.org/10.6084/m9.figshare.19497692.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref46">46</xref>
                    </sup>
                </p>
            </sec>
            <sec id="sec15">
                <title>Extended data</title>
                <p>Figshare: Table S1: Number of Subjects with Less Than Three Image Slices Selected. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19497701.v3">https://doi.org/10.6084/m9.figshare.19497701.v3</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref47">47</xref>
                    </sup>
                </p>
                <p>Figshare: Table S2: Number of Subjects for Whom Facial Muscle is Used as Normal Tissue. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19497707.v2">https://doi.org/10.6084/m9.figshare.19497707.v2</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref48">48</xref>
                    </sup>
                </p>
                <p>Figshare: Table S3: Case 2 Full Results. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498832">https://doi.org/10.6084/m9.figshare.19498832</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref49">49</xref>
                    </sup>
                </p>
                <p>Figshare: Figure S1-A. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498934.v1">https://doi.org/10.6084/m9.figshare.19498934.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref50">50</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>New_CSUOut-MeninGlio_boxplot_final_meanSI.png (Boxplots of means [across up to three slices] of normalized mean of signal intensities measured on 244 subjects distributed across four subpopulations).</p>
                        </list-item>
                    </list>
                </p>
                <p>Figshare: Figure S1-B. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498937.v1">https://doi.org/10.6084/m9.figshare.19498937.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref51">51</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>New_CSUOut-MeninGlio_boxplot_final_sdSI.png (Boxplots of means [across up to three slices] of normalized standard deviation of signal intensities measured on 244 subjects distributed across four subpopulations).</p>
                        </list-item>
                    </list>
                </p>
                <p>Figshare: Figure S1-C. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498940.v1">https://doi.org/10.6084/m9.figshare.19498940.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref52">52</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>New_CSUOut-MeninGlio_boxplot_final_centSI.png (Boxplots of means [across up to three slices] of normalized central point-value of signal intensities measured on 244 subjects distributed across four subpopulations).</p>
                        </list-item>
                    </list>
                </p>
                <p>Figshare: Figure S2-A. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498943.v1">https://doi.org/10.6084/m9.figshare.19498943.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref53">53</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Processors_allGroups_boxplot_final_meanSI.png (Boxplots of means [across up to three slices] of normalized mean of signal intensities measured by two processors [&#x201c;XY&#x201d; and &#x201c;DN&#x201d;] on 244 subjects distributed across four subpopulations: GC = &#x201c;Glio-CSU&#x201d;, MC = &#x201c;Menin-CSU&#x201d;, GO = &#x201c;Glio-Out&#x201d;, MO = &#x201c;Menin-Out&#x201d;).</p>
                        </list-item>
                    </list>
                </p>
                <p>Figshare: Figure S2-B. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498946.v1">https://doi.org/10.6084/m9.figshare.19498946.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref54">54</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Processors_allGroups_boxplot_final_sdSI.png (Boxplots of means [across up to three slices] of normalized standard deviation of signal intensities measured by two processors [&#x201c;XY&#x201d; and &#x201c;DN&#x201d;] on 244 subjects distributed across four subpopulations: GC = &#x201c;Glio-CSU&#x201d;, MC = &#x201c;Menin-CSU&#x201d;, GO = &#x201c;Glio-Out&#x201d;, MO = &#x201c;Menin-Out&#x201d;).</p>
                        </list-item>
                    </list>
                </p>
                <p>Figshare: Figure S2-C. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498949.v1">https://doi.org/10.6084/m9.figshare.19498949.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref55">55</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Processors_allGroups_boxplot_final_centSI.png (Boxplots of means [across up to three slices] of normalized central point-value of signal intensities measured by two processors [&#x201c;XY&#x201d; and &#x201c;DN&#x201d;] on 244 subjects distributed across four subpopulations: GC = &#x201c;Glio-CSU&#x201d;, MC = &#x201c;Menin-CSU&#x201d;, GO = &#x201c;Glio-Out&#x201d;, MO = &#x201c;Menin-Out&#x201d;).</p>
                        </list-item>
                    </list>
                </p>
                <p>Figshare: Figure S3-A. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498952.v1">https://doi.org/10.6084/m9.figshare.19498952.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref56">56</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>meninglioCSU_corr_final_3img_dislab.png (Pearson&#x2019;s correlations among the three image variables and the disease labels [&#x201c;dis.lab&#x201d;; meningioma = 1, glioma = 0] within CSU subjects).</p>
                        </list-item>
                    </list>
                </p>
                <p>Figshare: Figure S3-B. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.6084/m9.figshare.19498964.v1">https://doi.org/10.6084/m9.figshare.19498964.v1</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref57">57</xref>
                    </sup>
                </p>
                <p>This project contains the following extended data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>meninglioOut_corr_final_3img_dislab.png (Pearson&#x2019;s correlations among the three image variables and the disease labels [&#x201c;dis.lab&#x201d;; meningioma = 1, glioma = 0] within &#x201c;Outside&#x201d; subjects).</p>
                        </list-item>
                    </list>
                </p>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">Creative Commons Zero &#x201c;No rights reserved&#x201d; data waiver</ext-link> (CC0 1.0 Public domain dedication).</p>
            </sec>
        </sec>
        <sec id="sec16">
            <title>Software availability</title>
            <p>Source code available from: 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/KechrisLab/ComBat_dogBrainMRI/tree/MRI">https://github.com/KechrisLab/ComBat_dogBrainMRI/tree/MRI</ext-link>
            </p>
            <p>Archived source code at time of publication: 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.6632525">https://doi.org/10.5281/zenodo.6632525</ext-link>.
                <sup>
                    <xref ref-type="bibr" rid="ref58">58</xref>
                </sup>
            </p>
            <p>License: 
                <ext-link ext-link-type="uri" xlink:href="https://opensource.org/licenses/GPL-3.0">GNU General Public License v3.0</ext-link>
            </p>
            <p>We generated all imaging data using the Philips IntelliSpace PACS Radiology software v4.4 (Philips Healthcare Informatics, Inc, 4100 East Third Avenue, Suite 101, Foster City, CA 94404, USA); license purchased by the CSU-VTH. We performed all of the statistical analyses and generate all of the figures using the 
                <italic toggle="yes">R statistical software</italic>, version 4.1.0.
                <sup>
                    <xref ref-type="bibr" rid="ref39">39</xref>
                </sup> We implemented the ComBat data harmonization using the 
                <italic toggle="yes">neuroCombat</italic> R software package, which is publicly available in Jean-Philippe Fortin&#x2019;s GitHub: 
                <ext-link ext-link-type="uri" xlink:href="https://bit.ly/fortin-ComBat-git">https://bit.ly/fortin-ComBat-git</ext-link>, and the SMOTE imbalanced class adjustment using the 
                <italic toggle="yes">smote()</italic> function within the 
                <italic toggle="yes">performanceEstimation</italic> CRAN package.
                <sup>
                    <xref ref-type="bibr" rid="ref40">40</xref>
                </sup> For the RF classifier, we use 
                <italic toggle="yes">method = &#x201c;rf&#x201d;</italic> input argument in the 
                <italic toggle="yes">train()</italic> function and compute the classification performance evaluation metrics using the 
                <italic toggle="yes">confusionMatrix()</italic> function, both within the 
                <italic toggle="yes">caret</italic> CRAN package.
                <sup>
                    <xref ref-type="bibr" rid="ref41">41</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref42">42</xref>
                </sup> As a freely available alternative to PACS for a DICOM viewer and imaging data generator, we suggest 
                <ext-link ext-link-type="uri" xlink:href="https://horosproject.org/about/">Horos</ext-link>.</p>
        </sec>
        <sec id="sec17">
            <title>Ethical approval</title>
            <p>Approval of VCS #2018-162 &#x201c;Lymphotropic Nanoparticle Enhanced MRI for Diagnosis of Metastatic Disease in Canine Head and Neck Tumors&#x201d; was obtained by Dr. Lynn Griffin on June 4, 2018, and subsequently on August 8, 2019 (for amendment to increase the approved animal numbers), from the Colorado State University Veterinary Teaching Hospital Clinical Review Board. The Clinical Review Board consists of 14 faculty members (as of August 8, 2019) from the College of Veterinary Medicine and Biomedical Sciences including a standing member of IACUC, the Hospital Director, and the Chair of the Department of Clinical Sciences.</p>
            <p>Client consent was obtained from the respective owners of all dogs included in this study to use all obtained images and medical data for the purposes of research. Consent for publication is not applicable.</p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgements</title>
            <p>We are grateful to Kevin Kirsch of Colorado State University for providing us generous support on the setup of and access to CSU remote workstation and the Philips IntelliSpace PACS software and promptly clarifying our doubts as and when they appeared. We also thank Debashis Ghosh of Colorado School of Public Health and Natalie Serkova of University of Colorado School of Medicine for providing valuable suggestions and recommendations.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <article-title>Chapter Two: Lifespan Communication Methodology.</article-title>
                    <source>

                        <italic toggle="yes">The Handbook of Lifespan Communication.</italic>
</source>
                    <pub-id pub-id-type="doi">10.3726/978-1-4539-1367-3/14</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Runge</surname>
                            <given-names>VM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Magnetic Resonance Imaging and Computed Tomography of the Brain-50 Years of Innovation, With a Focus on the Future.</article-title>
                    <source>

                        <italic toggle="yes">Investig. Radiol.</italic>
</source>
                    <year>2015</year>;<volume>50</volume>:<fpage>551</fpage>&#x2013;<lpage>556</lpage>.
                    <pub-id pub-id-type="pmid">26050021</pub-id>
                    <pub-id pub-id-type="doi">10.1097/RLI.0000000000000170</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sutton</surname>
                            <given-names>BP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ouyang</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Karampinos</surname>
                            <given-names>DC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Current trends and challenges in MRI acquisitions to investigate brain function.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Psychophysiol.</italic>
</source>
                    <year>2009</year>;<volume>73</volume>:<fpage>33</fpage>&#x2013;<lpage>42</lpage>.
                    <pub-id pub-id-type="pmid">19236896</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.ijpsycho.2008.12.020</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Villanueva-Meyer</surname>
                            <given-names>JE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mabray</surname>
                            <given-names>MC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cha</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Current Clinical Brain Tumor Imaging.</article-title>
                    <source>

                        <italic toggle="yes">Neurosurgery.</italic>
</source>
                    <year>2017</year>;<volume>81</volume>:<fpage>397</fpage>&#x2013;<lpage>415</lpage>.
                    <pub-id pub-id-type="pmid">28486641</pub-id>
                    <pub-id pub-id-type="doi">10.1093/neuros/nyx103</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Henson</surname>
                            <given-names>JW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ulmer</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Harris</surname>
                            <given-names>GJ</given-names>
                        </name>
</person-group>:
                    <article-title>Brain tumor imaging in clinical trials.</article-title>
                    <source>

                        <italic toggle="yes">AJNR Am. J. Neuroradiol.</italic>
</source>
                    <year>2008</year>;<volume>29</volume>:<fpage>419</fpage>&#x2013;<lpage>424</lpage>.
                    <pub-id pub-id-type="pmid">18272557</pub-id>
                    <pub-id pub-id-type="doi">10.3174/ajnr.A0963</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Baig</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Klein</surname>
                            <given-names>JP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mechtler</surname>
                            <given-names>LL</given-names>
                        </name>
</person-group>:
                    <article-title>Imaging of Brain Tumors.</article-title>
                    <source>

                        <italic toggle="yes">Continuum.</italic>
</source>
                    <year>2016</year>;<volume>22</volume>:<fpage>1529</fpage>&#x2013;<lpage>1552</lpage>.
                    <pub-id pub-id-type="pmid">27740987</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Di Ieva</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Le Reste</surname>
                            <given-names>P-J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Carsin-Nicol</surname>
                            <given-names>B</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Diagnostic Value of Fractal Analysis for the Differentiation of Brain Tumors Using 3-Tesla Magnetic Resonance Susceptibility-Weighted Imaging.</article-title>
                    <source>

                        <italic toggle="yes">Neurosurgery.</italic>
</source>
                    <year>2016</year>;<volume>79</volume>:<fpage>839</fpage>&#x2013;<lpage>846</lpage>.
                    <pub-id pub-id-type="pmid">27332779</pub-id>
                    <pub-id pub-id-type="doi">10.1227/NEU.0000000000001308</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jin</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Brain Glioma Localization Diagnosis Based on Magnetic Resonance Imaging.</article-title>
                    <source>

                        <italic toggle="yes">World Neurosurg.</italic>
</source>
                    <year>2021</year>;<volume>149</volume>:<fpage>325</fpage>&#x2013;<lpage>332</lpage>.
                    <pub-id pub-id-type="pmid">32992057</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.wneu.2020.09.113</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Maclellan</surname>
                            <given-names>MJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ober</surname>
                            <given-names>CP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Feeney</surname>
                            <given-names>DA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Evaluation of diffusion-weighted magnetic resonance imaging at 3.0 Tesla for differentiation between intracranial neoplastic and noninfectious inflammatory lesions in dogs.</article-title>
                    <source>

                        <italic toggle="yes">J. Am. Vet. Med. Assoc.</italic>
</source>
                    <year>2019</year>;<volume>255</volume>:<fpage>71</fpage>&#x2013;<lpage>77</lpage>.
                    <pub-id pub-id-type="pmid">31194666</pub-id>
                    <pub-id pub-id-type="doi">10.2460/javma.255.1.71</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zacharaki</surname>
                            <given-names>EI</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Classification of brain tumor type and grade using MRI texture and shape in a machine learning scheme.</article-title>
                    <source>

                        <italic toggle="yes">Magn. Reson. Med.</italic>
</source>
                    <year>2009</year>;<volume>62</volume>:<fpage>1609</fpage>&#x2013;<lpage>1618</lpage>.
                    <pub-id pub-id-type="doi">10.1002/mrm.22147</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>KY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>MM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Malayil Lincoln</surname>
                            <given-names>CM</given-names>
                        </name>
</person-group>:
                    <article-title>Adult Primary Brain Neoplasm, Including 2016 World Health Organization Classification.</article-title>
                    <source>

                        <italic toggle="yes">Neuroimaging Clin. N. Am.</italic>
</source>
                    <year>2021</year>;<volume>31</volume>:<fpage>121</fpage>&#x2013;<lpage>138</lpage>.
                    <pub-id pub-id-type="pmid">33220825</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.nic.2020.09.011</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>NU</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Response assessment criteria for brain metastases: proposal from the RANO group.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Oncol.</italic>
</source>
                    <year>2015</year>;<volume>16</volume>:<fpage>e270</fpage>&#x2013;<lpage>e278</lpage>.
                    <pub-id pub-id-type="pmid">26065612</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S1470-2045(15)70057-4</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>NU</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Challenges relating to solid tumour brain metastases in clinical trials, part 1: patient population, response, and progression. A report from the RANO group.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Oncol.</italic>
</source>
                    <year>2013</year>;<volume>14</volume>:<fpage>e396</fpage>&#x2013;<lpage>e406</lpage>.
                    <pub-id pub-id-type="pmid">23993384</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S1470-2045(13)70311-5</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>NU</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Challenges relating to solid tumour brain metastases in clinical trials, part 2: neurocognitive, neurological, and quality-of-life outcomes. A report from the RANO group.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Oncol.</italic>
</source>
                    <year>2013</year>;<volume>14</volume>:<fpage>e407</fpage>&#x2013;<lpage>e416</lpage>.
                    <pub-id pub-id-type="doi">10.1016/S1470-2045(13)70308-5</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ghandi</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Beer</surname>
                            <given-names>MA</given-names>
                        </name>
</person-group>:
                    <article-title>Group normalization for genomic data.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2012</year>;<volume>7</volume>:<fpage>e38695</fpage>.
                    <pub-id pub-id-type="pmid">22912661</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0038695</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>M&#x00fc;ller</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Removing Batch Effects from Longitudinal Gene Expression &#x2013; Quantile Normalization Plus ComBat as Best Approach for Microarray Transcriptome Data.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2016</year>;<volume>11</volume>:<fpage>e0156594</fpage>.
                    <pub-id pub-id-type="pmid">27272489</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0156594</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sun</surname>
                            <given-names>Z</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Batch effect correction for genome-wide methylation data with Illumina Infinium platform.</article-title>
                    <source>

                        <italic toggle="yes">BMC Med. Genet.</italic>
</source>
                    <year>2011</year>;<volume>4</volume>:<fpage>84</fpage>.
                    <pub-id pub-id-type="pmid">22171553</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1755-8794-4-84</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>X</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Normalization Methods for the Analysis of Unbalanced Transcriptome Data: A Review.</article-title>
                    <source>

                        <italic toggle="yes">Front. Bioeng. Biotechnol.</italic>
</source>
                    <year>2019</year>;<volume>7</volume>.
                    <pub-id pub-id-type="pmid">32039167</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fbioe.2019.00358</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Misra</surname>
                            <given-names>BB</given-names>
                        </name>
</person-group>:
                    <article-title>Data normalization strategies in metabolomics: Current challenges, approaches, and tools.</article-title>
                    <source>

                        <italic toggle="yes">Eur. J. Mass Spectrom.</italic>
</source>
                    <year>2020</year>;<volume>26</volume>:<fpage>165</fpage>&#x2013;<lpage>174</lpage>.
                    <pub-id pub-id-type="pmid">32276547</pub-id>
                    <pub-id pub-id-type="doi">10.1177/1469066720918446</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cuevas-Delgado</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dudzik</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Miguel</surname>
                            <given-names>V</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Data-dependent normalization strategies for untargeted metabolomics&#x2014;a case study.</article-title>
                    <source>

                        <italic toggle="yes">Anal. Bioanal. Chem.</italic>
</source>
                    <year>2020</year>;<volume>412</volume>:<fpage>6391</fpage>&#x2013;<lpage>6405</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s00216-020-02594-9</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yang</surname>
                            <given-names>Q</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>NOREVA: enhanced normalization and evaluation of time-course and multi-class metabolomic data.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2020</year>;<volume>48</volume>:<fpage>W436</fpage>&#x2013;<lpage>W448</lpage>.
                    <pub-id pub-id-type="pmid">32324219</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkaa258</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Reinhold</surname>
                            <given-names>JC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dewey</surname>
                            <given-names>BE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Carass</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Evaluating the Impact of Intensity Normalization on MR Image Synthesis.</article-title>
                    <source>

                        <italic toggle="yes">Proc. SPIE Int. Soc. Opt. Eng.</italic>
</source>
                    <year>2019</year>;<volume>10949</volume>.</mixed-citation>
            </ref>
            <ref id="ref23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Shinohara</surname>
                            <given-names>RT</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Statistical normalization techniques for magnetic resonance imaging.</article-title>
                    <source>

                        <italic toggle="yes">Neuroimage Clin.</italic>
</source>
                    <year>2014</year>;<volume>6</volume>:<fpage>9</fpage>&#x2013;<lpage>19</lpage>.
                    <pub-id pub-id-type="pmid">25379412</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.nicl.2014.08.008</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Carr&#x00e9;</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Standardization of brain MR images across machines and protocols: bridging the gap for MRI-based radiomics.</article-title>
                    <source>

                        <italic toggle="yes">Sci. Rep.</italic>
</source>
                    <year>2020</year>;<volume>10</volume>:<fpage>12340</fpage>.
                    <pub-id pub-id-type="pmid">32704007</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41598-020-69298-z</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mirzaalian</surname>
                            <given-names>H</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multi-site harmonization of diffusion MRI data in a registration framework.</article-title>
                    <source>

                        <italic toggle="yes">Brain Imaging Behav.</italic>
</source>
                    <year>2018</year>;<volume>12</volume>:<fpage>284</fpage>&#x2013;<lpage>295</lpage>.
                    <pub-id pub-id-type="pmid">28176263</pub-id>
                    <pub-id pub-id-type="doi">10.1007/s11682-016-9670-y</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Johnson</surname>
                            <given-names>WE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rabinovic</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Adjusting batch effects in microarray expression data using empirical Bayes methods.</article-title>
                    <source>

                        <italic toggle="yes">Biostatistics.</italic>
</source>
                    <year>2007</year>;<volume>8</volume>:<fpage>118</fpage>&#x2013;<lpage>127</lpage>.
                    <pub-id pub-id-type="doi">10.1093/biostatistics/kxj037</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Conesa</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A survey of best practices for RNA-seq data analysis.</article-title>
                    <source>

                        <italic toggle="yes">Genome Biol.</italic>
</source>
                    <year>2016</year>;<volume>17</volume>:<fpage>13</fpage>.
                    <pub-id pub-id-type="pmid">26813401</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s13059-016-0881-8</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Guinney</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The consensus molecular subtypes of colorectal cancer.</article-title>
                    <source>

                        <italic toggle="yes">Nat. Med.</italic>
</source>
                    <year>2015</year>;<volume>21</volume>:<fpage>1350</fpage>&#x2013;<lpage>1356</lpage>.
                    <pub-id pub-id-type="doi">10.1038/nm.3967</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fortin</surname>
                            <given-names>J-P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Harmonization of multi-site diffusion tensor imaging data.</article-title>
                    <source>

                        <italic toggle="yes">NeuroImage.</italic>
</source>
                    <year>2017</year>;<volume>161</volume>:<fpage>149</fpage>&#x2013;<lpage>170</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.neuroimage.2017.08.047</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fortin</surname>
                            <given-names>J-P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Harmonization of cortical thickness measurements across scanners and sites.</article-title>
                    <source>

                        <italic toggle="yes">NeuroImage.</italic>
</source>
                    <year>2018</year>;<volume>167</volume>:<fpage>104</fpage>&#x2013;<lpage>120</lpage>.
                    <pub-id pub-id-type="pmid">29155184</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.neuroimage.2017.11.024</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liaw</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Matthew</surname>
                            <given-names>W</given-names>
                        </name>
</person-group>:
                    <article-title>Classification and Regression by randomForest.</article-title>
                    <source>

                        <italic toggle="yes">R News.</italic>
</source>
                    <year>2002</year>;<volume>2</volume>:<fpage>18</fpage>&#x2013;<lpage>22</lpage>.</mixed-citation>
            </ref>
            <ref id="ref32">
                <label>32</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ho</surname>
                            <given-names>TK</given-names>
                        </name>
</person-group>:
                    <article-title>Random decision forests.</article-title>
                    <source>

                        <italic toggle="yes">Proceedings of 3
                            <sup>rd</sup> International Conference on Document Analysis and Recognition.</italic>
</source>
                    <pub-id pub-id-type="doi">10.1109/icdar.1995.598994</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref33">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ho</surname>
                            <given-names>TK</given-names>
                        </name>
</person-group>:
                    <article-title>The random subspace method for constructing decision forests.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Trans. Pattern Anal. Mach. Intell.</italic>
</source>
                    <year>1998</year>;<volume>20</volume>:<fpage>832</fpage>&#x2013;<lpage>844</lpage>.
                    <pub-id pub-id-type="doi">10.1109/34.709601</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref34">
                <label>34</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hastie</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tibshirani</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Friedman</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition.</italic>
</source>
                    <publisher-name>Springer Science &amp; Business Media</publisher-name>;<year>2009</year>.</mixed-citation>
            </ref>
            <ref id="ref35">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chawla</surname>
                            <given-names>NV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bowyer</surname>
                            <given-names>KW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hall</surname>
                            <given-names>LO</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>SMOTE: Synthetic Minority Over-sampling Technique.</article-title>
                    <source>

                        <italic toggle="yes">J. Artif. Intell. Res.</italic>
</source>
                    <year>2002</year>;<volume>16</volume>:<fpage>321</fpage>&#x2013;<lpage>357</lpage>.
                    <pub-id pub-id-type="doi">10.1613/jair.953</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tharwat</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Classification assessment methods.</article-title>
                    <source>

                        <italic toggle="yes">Applied Computing and Informatics.</italic>
</source>
                    <year>2021</year>;<volume>17</volume>:<fpage>168</fpage>&#x2013;<lpage>192</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.aci.2018.08.003</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <label>37</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yap</surname>
                            <given-names>BW</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>An Application of Oversampling, Undersampling, Bagging and Boosting in Handling Imbalanced Datasets.</article-title>
                    <source>

                        <italic toggle="yes">Lecture Notes in Electrical Engineering.</italic>
</source>
                    <year>2014</year>:<fpage>13</fpage>&#x2013;<lpage>22</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-981-4585-18-7_2</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref38">
                <label>38</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hosmer</surname>
                            <given-names>DW</given-names>
                            <suffix>Jr</suffix>
                        </name>

                        <name name-style="western">
                            <surname>Lemeshow</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sturdivant</surname>
                            <given-names>RX</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Applied Logistic Regression.</italic>
</source>
                    <publisher-name>John Wiley &amp; Sons</publisher-name>;<year>2013</year>.</mixed-citation>
            </ref>
            <ref id="ref39">
                <label>39</label>
                <mixed-citation publication-type="other">
                    <collab>R Core Team</collab>:
                    <source>

                        <italic toggle="yes">R: A Language and Environment for Statistical Computing.</italic>
</source>
                    <year>2021</year>.</mixed-citation>
            </ref>
            <ref id="ref40">
                <label>40</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Torgo</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <article-title>An Infra-Structure for Performance Estimation and Experimental Comparison of Predictive Models in R.</article-title>
                    <source>

                        <italic toggle="yes">CoRR.</italic>
</source>
                    <year>2014</year>;<volume>abs/1412.0436</volume>.</mixed-citation>
            </ref>
            <ref id="ref41">
                <label>41</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kuhn</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">caret: Classification and Regression Training.</italic>
</source>
                    <year>2021</year>.</mixed-citation>
            </ref>
            <ref id="ref42">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Irizarry</surname>
                            <given-names>RA</given-names>
                        </name>
</person-group>:
                    <article-title>The caret package.</article-title>
                    <source>

                        <italic toggle="yes">Introduction to Data Science.</italic>
</source>
                    <year>2019</year>:<fpage>523</fpage>&#x2013;<lpage>528</lpage>.
                    <pub-id pub-id-type="doi">10.1201/9780429341830-30</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <label>43</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Image and Covariates Data on CSU-Meningioma Subjects. figshare. [Dataset].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19497671.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref44">
                <label>44</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Image and Covariates Data on CSU-Glioma Subjects. figshare. [Dataset].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19497683.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref45">
                <label>45</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Image and Covariates Data on Outside-Meningioma Subjects. figshare. [Dataset].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19497686.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref46">
                <label>46</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Image and Covariates Data on Outside-Glioma Subjects. figshare. [Dataset].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19497692.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref47">
                <label>47</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Table S1: Number of Subjects with Less Than Three Image Slices Selected. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19497701.v3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref48">
                <label>48</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Table S2: Number of Subjects for Whom Facial Muscle is Used as Normal Tissue. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19497707.v2</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref49">
                <label>49</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Table S3: Case 2 Full Results. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498832</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref50">
                <label>50</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S1-A. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498934.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref51">
                <label>51</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S1-B. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498937.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref52">
                <label>52</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S1-C. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498940.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref53">
                <label>53</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S2-A. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498943.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref54">
                <label>54</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S2-B. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498946.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref55">
                <label>55</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S2-C. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498949.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref56">
                <label>56</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S3-A. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498952.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref57">
                <label>57</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Figure S3-B. figshare. [Extended data].</article-title>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.6084/m9.figshare.19498964.v1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref58">
                <label>58</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nandy</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>KechrisLab/ComBat_dogBrainMRI: ComBat for Dog-Brain MRI Data Harmonization (MRI).</article-title>
                    <source>

                        <italic toggle="yes">Zenodo. [Software].</italic>
</source>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.6632525</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report276291">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.129171.r276291</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Crawford</surname>
                        <given-names>Abbe</given-names>
                    </name>
                    <xref ref-type="aff" rid="r276291a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2176-5873</uri>
                </contrib>
                <aff id="r276291a1">
                    <label>1</label>Clinical Science and Services, Royal Veterinary College, North Mymms, Hatfield, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>28</day>
                <month>5</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Crawford A</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport276291" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.117334.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This study uses a tool to &#x201c;normalise&#x201d;/reduce heterogeneity in MRI data from multiple institutes prior to analysis with a Random Forest classifier.&#x00a0; As a proof of principle this study&#x00a0; provides useful information and has a valuable goal, but evaluation in a population of confirmed tumour types (rather than presumptive diagnoses) is encouraged.&#x00a0;</p>
            <p> </p>
            <p> I struggled to grasp exactly what the ComBat harmonization entails? How was this performed? I assume it is more than your manual comparison of the lesion and &#x201c;normal&#x201d; ROIs? This should be clarified in the methods.</p>
            <p> </p>
            <p> I do not have relevant experience with computer learning and so cannot interrogate the RF methodology and data interpretation.</p>
            <p> </p>
            <p> Abstract-</p>
            <p> You conclude that &#x201c;Use of MRI data in combination with clinical covariates is more informative than using only clinical covariates in classifying meningioma and glioma brain-tumors in dogs&#x201d; &#x2013; this seems fairly intuitive and perhaps not the strongest conclusion from your study?&#x201d;</p>
            <p> </p>
            <p> Methods:</p>
            <p> Can you explain the justification and potential limitations of your decision to compare 1 site with 36 others? &#x00a0;</p>
            <p> </p>
            <p> Why were only T1W+C sequences used for analysis?</p>
            <p> </p>
            <p> &#x201c;encompassing the densest parts visually examined&#x201d; &#x2013; please explain what you mean by densest, also why and how this was selected? And why did you compare brain tumour to muscle, rather than normal brain parenchyma? How large was the selected ROI and was this standardised?</p>
            <p> </p>
            <p> I am struggling to understand how the SMOTE option allows you to &#x201c;generate&#x201d; more gliomas than you had in your starting population? This might be obvious to those using this technique but would be helpful to explain for a broader readership.</p>
            <p> </p>
            <p> Could you label each box on the boxplots to improve readability/interpretation?</p>
            <p> </p>
            <p> I think that the manuscript would be strengthened by expanding the discussion to consider the clinical application of this and how the work could be developed in the future, also potential options for improvement (particularly considering the poor sensitivity and specificity documented).&#x00a0; Presumably incorporation of more imaging sequences (other than T1+C) and additional clinical variables could improve accuracy?</p>
            <p>Is the case presented with sufficient detail to be useful for teaching or other practitioners?</p>
            <p>Partly</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>I cannot comment. A qualified statistician is required.</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Is the background of the case&#x2019;s history and progression described in sufficient detail?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>This article covers methods and data analysis that are outwith my area of expertise (which is in veterinary clinical neurology and basic cellular neuroscience).</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report207729">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.129171.r207729</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Jeffery</surname>
                        <given-names>Nick</given-names>
                    </name>
                    <xref ref-type="aff" rid="r207729a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r207729a1">
                    <label>1</label>Department of Small Animal Clinical Sciences, Texas A&amp;M University, College Station, TX, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>16</day>
                <month>10</month>
                <year>2023</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2023 Jeffery N</copyright-statement>
                <copyright-year>2023</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport207729" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.117334.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>General comments</p>
            <p> </p>
            <p> It is an interesting and important idea to apply data normalization to veterinary MR images, so as to be able to accumulate data from many sources. This manuscript does achieve a demonstration of this process reasonably well but it might be dependent on who the authors consider their target audience to be. This is not an easy manuscript to read and understand to readers who are not specialist data analysts (I am not a specialist data analyst). I think i just about understand the gist of what the authors have done but&#x00a0; don't fully understand the methods for sure.</p>
            <p> </p>
            <p> It would help to explain some of the methods in non-technical jargon, and use fewer abbreviations. One aspect that is not clear to me is how the test and training set results have been used to produce the results displayed in Table 5 &amp; Figures 2-4. Another aspect is that although ComBat is introduced in the introduction it is not described what it does in this context (and - roughly - how it works).&#x00a0;</p>
            <p> </p>
            <p> From a veterinary clinical perspective, 'Case 0' is a bit of a straw man - since few clinicians would consider those clinical covariates sufficient to distinguish the type of tumor. Also, the overall diagnostic ability - albeit against a poor gold standard (clinician opinion on images) is very poor. Anything less than about 95% sensitivity and specificity severely limits the clinical utility of a diagnostic test. I realize that the purpose of the manuscript is to examine ComBat - and it does show some benefit, but there is a need for further refinement - that perhaps could be alluded to in the discussion section.&#x00a0;</p>
            <p> </p>
            <p> </p>
            <p> Specific comments 
                <list list-type="order">
                    <list-item>
                        <p>More information / background required about ComBat</p>
                    </list-item>
                    <list-item>
                        <p>More information required about SMOTE. It is not 100% clear what this is doing.&#x00a0;</p>
                    </list-item>
                    <list-item>
                        <p>Needs better explanation of the relationship between training and test sets and the displayed results.</p>
                    </list-item>
                    <list-item>
                        <p>More information on the selection of the disease ROIs. How big were they? How were they selected by the investigator? In clinical diagnosis the differentiation of these tumor types usually relies quite heavily on the pattern of contrast enhancement (plus localization) and so if investigators were only selecting regions of homogenous enhancement it would seem unlikely that ANY RF investigation would permit differentiation.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the case presented with sufficient detail to be useful for teaching or other practitioners?</p>
            <p>Partly</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>I cannot comment. A qualified statistician is required.</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Is the background of the case&#x2019;s history and progression described in sufficient detail?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Veterinary clinical research; bench research in neuroscience</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
</article>
