<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.168092.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Impact of sample size on optimisation algorithms for the MLP used in the prediction of client subscription to a term deposit</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Botlhoko</surname>
                        <given-names>Tshegofatso</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-4939-7582</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Volition Montshiwa</surname>
                        <given-names>Tlhalitshi</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3168-3441</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Business Statistics &amp; Operations Research, North West University Faculty of Economic and Management Sciences, Potchefstroom, North West, 2735, South Africa</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:volition.montshiwa@nwu.ac.za">volition.montshiwa@nwu.ac.za</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>22</day>
                <month>12</month>
                <year>2025</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2025</year>
            </pub-date>
            <volume>14</volume>
            <elocation-id>1426</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>12</day>
                    <month>12</month>
                    <year>2025</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Botlhoko T and Volition Montshiwa T</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/14-1426/pdf"/>
            <abstract>
                <sec>
                    <title>Background</title>
                    <p>One of the disadvantages of the multilayer perception (MLP), which is a machine learning (ML) algorithm used in various fields, includes the uncontrollable growth of the number of total parameters, which may make MLP redundant in such high dimensions, and the uncontrollable growing stack of layers that ignores spatial information. Optimization algorithms were developed to determine the optimum number of parameters for MLP.</p>
                </sec>
                <sec>
                    <title>Methods</title>
                    <p>In this paper, the performances of the Genetic Algorithm (GA), Grasshopper Optimization Algorithm (GOA), and Covariance Matrix Adaptation Evolution Strategy (CMA-ES) are compared. The study also sought to determine the impact of sample size variations on these optimization algorithms. A dataset on the direct marketing campaigns of a Portuguese banking institution from the UCI Machine Learning Repository with a sample size of 4 521 was used. Synthetic Minority Oversampling Technique (SMOTE) was applied to balance the binary dependent variables for the training data across various sample sizes.</p>
                </sec>
                <sec>
                    <title>Results</title>
                    <p>Based on the classification accuracy, specificity, sensitivity, precision, F-score, and execution time, the MLP based on CMA-ES (CMA-ES-MLP) was identified as the best classifier overall, as it maintained high rates of these classification metrics and was the second fastest to train. CMA-ES-MLP with a training sample of 5 114 was our ideal classifier, and it competes well with the classifiers that have been built by previous studies that used the same dataset.</p>
                </sec>
                <sec>
                    <title>Conclusions</title>
                    <p>The study found no consistent increase or decrease in the classification performance of the algorithms as the sample size increased, and the metrics fluctuated rapidly across sample sizes. It is recommended that future studies be conducted to compare the best-performing classifiers identified in previous studies with the CMA-ES-MLP in this study under the same experimental conditions.</p>
                </sec>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Multilayer Perceptron (MLP); Genetic Algorithm (GA); Grasshopper Optimization Algorithm (GOA); Covariance Matrix Adaptation Evolution Strategy (CMA-ES); Machine Learning; Term Deposit Subscription.</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec id="sec5" sec-type="intro">
            <title>1.1 Introduction</title>
            <p>The multidisciplinary field of data mining includes Information Technology (IT), Artificial Intelligence (AI), Machine Learning (ML), statistics, pattern recognition, data retrieval, Neural Networks (NN), and information-based systems.
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>
                </sup> This study focused on ML classification algorithms and classifiers. A classifier is an algorithm that links input data to a specific category.
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup> More specifically, this study focuses on the Multilayer Perceptron (MLP) classifier because it is one of the most-used algorithms in data science and in recent studies
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref7">7</xref>
                </sup> because of its flexibility and ability to differentiate data that can be split linearly
                <sup>
                    <xref ref-type="bibr" rid="ref8">8</xref>
                </sup> defined the MLP as a feedforward artificial neural network (ANN) that comprises the input layer, at least one hidden layer, and the output layer, which are connected by nodes. MLP is also of interest in this study because it is applicable to various fields such as speech recognition, image recognition, text classification, and machine translation software.</p>
            <p>Although it is applicable across various disciplines, a disadvantage of MLP is that the number of total parameters in it can grow uncontrollably, whereby the number of perceptrons in layer one is multiplied by the number of parameters in layer two, which is then multiplied by the number of parameters in layer three and so on. This is inefficient because of redundancy in such high dimensions. In addition, (
                <sup>
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup>: 400) stated that when flattened vectors are used as inputs, this uncontrollably growing stack of layers ignores the spatial information. These multiplying parameters can be difficult to control; hence, optimization algorithms were established to determine the optimum number of parameters for the MLP.</p>
            <p>Reference 
                <xref ref-type="bibr" rid="ref10">[10]</xref> defined an algorithm as a process or equation that solves a problem by following a predetermined set of steps. Reference 
                <xref ref-type="bibr" rid="ref11">[11]</xref> described optimization techniques as analytical approaches that use differential calculus to find the best solution. Reference 
                <xref ref-type="bibr" rid="ref12">[12]</xref> further explained that the purpose of optimization techniques is mainly to handle problems that cannot be handled by classifiers. These problems consist of functions with a single variable, functions with multiple variables and no constraints, and functions with multiple variables with both equality and inequality constraints. A variety of optimization algorithms have been developed, and because of their adaptable and flexible searching processes, they have demonstrated a great degree of promise in solving optimization issues. In addition,
                <sup>
                    <xref ref-type="bibr" rid="ref13">13</xref>
                </sup> mentioned their capacity to use specific statistical tools to display satisfactory performance on MLP classification methods, as well as their efficiency in resolving linear and non-linear problems by avoiding local optima and balancing the exploration and exploitation trends.</p>
            <p>According to,
                <sup>
                    <xref ref-type="bibr" rid="ref14">14</xref>
                </sup> there are several optimization algorithms used in optimizing the MLP, including the Bayesian optimization algorithms (BOA), binary particle swarm optimization (BPSO), Covariance Matrix Adaptation Evolution Strategy (CMA-ES), Differential Evolution (DE), FireFly Algorithm (FFA), genetic algorithms (GA), grasshopper optimization algorithm (GOA), and particle swarm optimization (PSO). Other optimization algorithms include the hybrid meta-heuristic approach, which was used in the study by,
                <sup>
                    <xref ref-type="bibr" rid="ref15">15</xref>,
                    <xref ref-type="bibr" rid="ref16">16</xref>
                </sup> and it has been compared to other newly developed optimization algorithms that were used to form hybrid MLP models such as the Gloworm Swarm Optimization-MLP (GSO-MLP), Biogeographical-Based Optimization-MLP (BBO-MLP), and Genetic Algorithm-MLP (GA-MLP).</p>
            <p>The scope of this study is limited to the Genetic Algorithm (GA), grasshopper optimization algorithm (GOA), and Covariance Matrix Adaptation Evolution Strategy (CMA-ES). This is because the literature comparing these novel evolutionary optimization algorithms is scarce. Therefore, although they are known to be better performers than older algorithms, the best optimization algorithm for the MLP between GA, GOA, and CMA-ES remains unknown. It is imperative to determine the most efficient optimization algorithm for an optimal MLP because each optimization technique has various reliability, strength, efficiency, utilization, and limitations. According to,
                <sup>
                    <xref ref-type="bibr" rid="ref17">17</xref>
                </sup> one of the disadvantages of not knowing the most efficient optimization algorithm is that it cannot determine the best level of local optima. It can also waste time for end-users of MLPs (i.e., non-statisticians/non-data scientists) to compare the optimization algorithms before fitting their MLPs, as opposed to having a study such as the current study to refer to, which has already compared these algorithms and has recommended the most efficient one(s).</p>
            <p>This study also intended to explore the effect of changes in sample size on the efficiency of GA, GOA, and CMA-ES. This is because the increase in the sample size is known by some studies to be able to improve the accuracy and robustness of many statistical methods, as detailed by studies such as those conducted by
                <sup>
                    <xref ref-type="bibr" rid="ref18">18</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref21">21</xref>
                </sup> highlighted that when the focus is on individualised outcome risk prediction, it has been shown that extremely large datasets might be needed for ML techniques. The authors explained that for binary outcomes, ML techniques could require more than ten times as many events for each predictor to achieve a small amount of over-fitting compared with classic modelling techniques such as logistic regression and might show instability and high optimism
                <sup>
                    <xref ref-type="bibr" rid="ref21">21</xref>
                </sup> explained that when dealing with optimization algorithms and sample size, it is vital to ensure accurate predictions in key subgroups and to consider the accurate sample size when using an existing dataset to avoid overfitting. On the other hand, although some studies advocate for a large dataset for ML algorithms,
                <sup>
                    <xref ref-type="bibr" rid="ref22">22</xref>
                </sup> explained that a study with a sample size that is too small has a higher risk of missing a meaningful underlying difference, while one with a sample size that is too large may be more expensive than necessary.</p>
            <p>It is evident that sample size affects the efficiency of ML algorithms. However, the efficiency of GA, GOA, and CMA-ES when used in optimizing the MLP relative to the sample size remains unknown, and to the best of our knowledge, this has never been explored before in a single study. In this study, efficiency refers to a measure of the quality of the optimization algorithms depending on the sample size, which is evaluated using measures such as specificity, sensitivity/recall, and execution time. Therefore, this study intended to determine the impact of sample size on the efficiency of GA, GOA, and CMA-ES when used for optimizing the MLP, with a focus on these due to their wide application in various studies
                <sup>
                    <xref ref-type="bibr" rid="ref23">23</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref27">27</xref>
                </sup> and because of their known effectiveness and flexibility.</p>
            <p>A comparison of GA, GOA, and CMA-ES in optimizing MLP and the effect of sample size on the performance of these algorithms is the main objective of this study. However, as an area of application, these methods are applied to predict the likelihood of subscribing to a term deposit following telephone-based direct marketing by a banking institution. This has been the focus of application of ML classifiers in several previous studies, including.
                <sup>
                    <xref ref-type="bibr" rid="ref28">28</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref30">30</xref>
                </sup> Therefore, this study intends to extend the literature in this area, which has caught the attention of many researchers when comparing the performance of ML classifiers. More details on the ML classifiers applied and the conclusions reached from these previous studies are detailed in 
                <xref ref-type="table" rid="T2">
Table 2</xref> in the dataset section of this paper.</p>
            <sec id="sec6">
                <title>1.2 Related works on evaluation of optimisation algorithms for the MLP</title>
                <p>Several previous studies that explored the efficiency of various optimization algorithms for MLP in different areas of application showed that the most efficient optimized MLP varies depending on the area of application, sample sizes, and evaluation metrics implemented in such studies. From the studies reviewed, the most common area of research is information technology
                    <sup>
                        <xref ref-type="bibr" rid="ref31">31</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref34">34</xref>
                    </sup> followed by the medical sector.
                    <sup>
                        <xref ref-type="bibr" rid="ref35">35</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref37">37</xref>
                    </sup> To extend the study by,
                    <sup>
                        <xref ref-type="bibr" rid="ref38">38</xref>
                    </sup> who focused on the financial sector, the current study uses a financial dataset, but it includes the CMA-ES-MLP and GOA-MLP, which are compared to the basic MLP and GA-MLP, which were also included in the study by,
                    <sup>
                        <xref ref-type="bibr" rid="ref38">38</xref>
                    </sup> but across different sample sizes rather than only one. From the studies reviewed, the sample sizes ranged from 400 to 8367, but only one sample was used per study. As such, the current study expands the scope of these studies by comparing the basic MLP and its optimized variates using different samples to determine the effect of sample size on the performance of these ML algorithms.</p>
                <p>The literature shows that in all the studies, the optimized versions of the MLPs were selected as the best performers, and not the basic MLP, which was not optimized. This is evident from the studies conducted by
                    <sup>
                        <xref ref-type="bibr" rid="ref38">38</xref>
                    </sup> in which the diversity-considered GA-MLP ensemble algorithm (DGAMLPE) outperformed the unoptimized basic MLP,
                    <sup>
                        <xref ref-type="bibr" rid="ref35">35</xref>
                    </sup> in which DGAMLPE outperformed the basic MLP, and,
                    <sup>
                        <xref ref-type="bibr" rid="ref31">31</xref>
                    </sup> in which the GOA-MLP outperformed the basic MLP. &#x201c;This implies that indeed the optimised variates of the MLP can improve the basic MLP, and it is also seen that optimisation algorithms give the MLP a competitive advantage over other ML classifiers such as Random Forest (RF), Extreme Gradient Boost (X-GBoost), Weighted Count of Errors and Correct (WCEC), and Deep Belief Network-Support Vector Machine (DBN-SVM), Logistic Regression (LR), K-Nearest Neighbors (K-NN), Decision Tree Classifier (DTC), Support Vector Machine (SVM), Random Forest Classifier (RFC), and Ensemble&#x201d;
                    <sup>
                        <xref ref-type="bibr" rid="ref35">35</xref>
                    </sup>:314). Considering these findings from the literature, the researchers were interested in optimization algorithms for the MLP in the current study. To extend the literature, the researchers included the CMA-ES-MLP in the competing models and explored the effect of sample size on these ML algorithms.</p>
                <p>The most used optimized variates of the MLP from previous studies are GA-MLP,
                    <sup>
                        <xref ref-type="bibr" rid="ref16">16</xref>,
                        <xref ref-type="bibr" rid="ref32">32</xref>,
                        <xref ref-type="bibr" rid="ref34">34</xref>,
                        <xref ref-type="bibr" rid="ref35">35</xref>,
                        <xref ref-type="bibr" rid="ref37">37</xref>
                    </sup> followed by PSO-MLP,
                    <sup>
                        <xref ref-type="bibr" rid="ref16">16</xref>,
                        <xref ref-type="bibr" rid="ref31">31</xref>,
                        <xref ref-type="bibr" rid="ref32">32</xref>,
                        <xref ref-type="bibr" rid="ref34">34</xref>,
                        <xref ref-type="bibr" rid="ref39">39</xref>
                    </sup> and GOA-MLP,
                    <sup>
                        <xref ref-type="bibr" rid="ref31">31</xref>,
                        <xref ref-type="bibr" rid="ref33">33</xref>,
                        <xref ref-type="bibr" rid="ref39">39</xref>
                    </sup> but none of these studies included CMA-ES-MLP, which implies that the performance of CMA-ES-MLP against GA-MLP and GOA-MLP remains an area that requires further research. This study bridges this gap. It also appears that the most frequently used accuracy metric from the reviewed studies is classification accuracy,
                    <sup>
                        <xref ref-type="bibr" rid="ref16">16</xref>,
                        <xref ref-type="bibr" rid="ref31">31</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref39">39</xref>
                    </sup> followed by the F-measure.
                    <sup>
                        <xref ref-type="bibr" rid="ref33">33</xref>,
                        <xref ref-type="bibr" rid="ref35">35</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref38">38</xref>
                    </sup> The negative and positive predictive values appear to be the least used accuracy metrics.
                    <sup>
                        <xref ref-type="bibr" rid="ref31">31</xref>,
                        <xref ref-type="bibr" rid="ref35">35</xref>
                    </sup> Other classification evaluation metrics used in previous studies included sensitivity/recall, specificity, and precision. Similarly, the current study also implemented the classification accuracy, precision, sensitivity/recall, specificity, F-measure, and execution time to compare optimized algorithms based on the popularity of these metrics in previous studies. Including a variety of comparison metrics in a single study assists in minimizing the model selection bias that may be experienced when very few similar metrics are used in the comparison and selection of the most efficient model.</p>
            </sec>
        </sec>
        <sec id="sec7" sec-type="methods">
            <title>1.3 Method</title>
            <sec id="sec8">
                <title>1.3.1 Dataset</title>
                <p>The data used in this study is a secondary dataset on the direct marketing campaigns of a Portuguese banking institution. The dataset was obtained from the UCI Machine Learning Repository of the Center for Machine Learning and Intelligent Systems. The primary contributor to the data is.
                    <sup>
                        <xref ref-type="bibr" rid="ref28">28</xref>
                    </sup> The dataset can be accessed at 
                    <ext-link ext-link-type="uri" xlink:href="https://archive.ics.uci.edu/ml/datasets/Bank+Marketing">https://archive.ics.uci.edu/ml/datasets/Bank+Marketing</ext-link>. The dataset has a total of 4 521 observations, and 11 variables were selected for use as attributes (see 
                    <xref ref-type="table" rid="T1">
Table 1</xref>) in this study to predict whether a client will subscribe to a term deposit following the marketing campaign. That is, the binary variable &#x201c;has the client subscribed to a term deposit&#x201d; from the dataset is used as a dependent variable (binary; 0 is no and 1 is yes).</p>
                <table-wrap id="T1" orientation="portrait" position="float">
                    <label>
Table 1. </label>
                    <caption>
                        <title>Description of features.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Name of variables</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Description of variable</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Variable type/category</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Age</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Client&#x2019;s age</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Numeric</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Type of Job</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">The type of job of client</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Admin, blue collar, entrepreneur, housemaid, management, retired, self-employed, services, student, technician, unemployed, and unknown</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Marital Status</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">What is the marital status of the client?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Divorced, married, single, unknown</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Educational level</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Highest qualification of client</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Basic 4y, basic 6y, basic 9y, high school, illiterate, professional course, university degree, and unknown</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Default</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Does the client have credit in default?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">No, yes, and unknown</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Housing</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Does the client have housing loan?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">No, yes, and unknown</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Loan</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Does the client have a personal loan?</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">No, yes, and unknown</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Contact</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Contact communication type</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Cellular and telephone</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Day</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">The last contact day of the week</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Monday, Tuesday, Wednesday, Thursday, Friday
</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Duration</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Last contact duration, in seconds (numeric)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">e.g., if duration = 0 then y = 'no'</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Outcome of Previous Marketing Campaign</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="left" colspan="1" rowspan="1" valign="top">Failure, non-existent, and success</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>To mimic different sample sizes which are needed to study the impact of sample size on the efficiency of the MLP optimisation algorithms, nine (9) random samples of different sizes (varying by 10%) were drawn with replacement from the 4521. Samples were randomly selected at 10% difference using stratified sampling, in which the dependent variable was used as the stratum to ensure that the samples maintained the distribution of the main dataset in the dependent variable. The following random sample sizes were created: 10% (n = 452), 20% (n = 904), 30% (n = 1356), 40% (n = 1808), 50% (n = 2261), 60% (n = 2713), 70% (n = 3165), 80% (n = 3617), 90% (n = 4069), and the entire dataset, which contained 100% of the observations (n = 4521). The variables described in 
                    <xref ref-type="table" rid="T1">
Table 1</xref> were used as independent variables or features.</p>
                <p>All categorical features with at least three (3) classes from 
                    <xref ref-type="table" rid="T1">
Table 1</xref> were converted to dummy variables using the one-hot encoding technique, which converts classes of the categorical variable to a vector that contains 1 and 0, denoting the presence and absence of the feature, respectively, which led to an increase in the number of features used in the paper to 42. Previous studies that have been conducted that focused on the application and/or comparison of ML classifiers (including MLP and its variates) on the dataset chosen for this study are summarized in 
                    <xref ref-type="table" rid="T2">
Table 2</xref>.</p>
                <table-wrap id="T2" orientation="portrait" position="float">
                    <label>
Table 2. </label>
                    <caption>
                        <title>A summary of studies on the comparison and application of ML classifiers on using the dataset on direct marketing campaigns of a Portuguese banking institution from the UCI repository.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Authors</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Classifiers compared or applied</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Best model</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Moro et al. (2014)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">LR, DT, NN, and SVM.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>NN</bold> with an Area under the area of the receiver operating characteristic curve (AUC) of 0.8 and area of the LIFT cumulative curve (ALIFT) of 0.7</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Ghatasheh et al. (2020)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Meta-Cost-MLP, Cost Sensitive Classifier-MLP, MLP (Baseline), DL-MLP, J48, LL, DT, Very Fast Decision Rules (VFDR) and RF.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>Meta-cost MLP</bold> with recall of 0.808, precision of 0.771, Geometric mean of 78.93%, and Classification accuracy of 77.48%.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Moro et al. (2011)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">NB, DT and SVM.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>SVM</bold> with AUC of 0.938 and ALIFT=0.887.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Asare-Frempong and Jayabalan (2017)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">MLP, DT (C4.5), LR and RF</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>RF</bold> with classification accuracy of 86.08% and AUC of 92.7%.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Moro et al. (2015)
                                    <sup>
                                        <xref ref-type="bibr" rid="ref42">42</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Customer lifetime value (LTV) based NN (LTV-NN), baseline NN (with no historical data),
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>LTV-NN
</bold> increased the AUC of the baseline-NN from 0.8002 to 0.8609, while ALIFT improved from 0.6701 to 0.7044 where AUC was at least 0.84, and ALIFT was at least 0.69.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Elsalamony (2014)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">MLP, NB, LR, and the Ross Quinlan new DT (C5.0).</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Based on the testing dataset, MLP produced the highest classification accuracy of 90.49%, LR the highest sensitivity of 65.53%, and C5.0 yielded specificity of 93.23%.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Zaki et al. (2024)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Stochastic Gradient Descent (SGD) Classifier, k-nearest neighbour Classifier, and Random Forest Classifier.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>DT</bold> with a classification accuracy of 87.5%, a negative predictive value (NPV) of 93%, and a positive predictive value (PPV) of 87.8%.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">&#x0141;ady&#x017c;y&#x0144;ski et al. (2019)
                                    <sup>
                                        <xref ref-type="bibr" rid="ref44">44</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">RF, classification and regression tree (CART) and deep belief learning implemented in H2O framework, and deep belief networks implemented in H2O framework with l1 regularization parameter added.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>CART</bold> with a precision of 9.01% and recall of 67.27%, and the authors commented it is the most efficient in terms of computing power.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Pavlovi&#x0107; et al. (2014)
                                    <sup>
                                        <xref ref-type="bibr" rid="ref45">45</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">DT</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>DT</bold> yielded classification accuracy of 88.51%, sensitivity of 93.6%, specificity of 50.1%, AUC of 70.5%, and Brier of 20.5%.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Karim and Rahman (2013)
                                    <sup>
                                        <xref ref-type="bibr" rid="ref46">46</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">NB and DT (4.5).</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>DT (C4.5)</bold> with classification accuracy of 94%, precision for &#x201c;yes&#x201d; of 79.1%, precision for &#x201c;no&#x201d; of 95.5% and AUC of 93.3, but the DT (C4.5) was 5.78 seconds slower to train than the NB.</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Kim and Street (2004)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Baseline ANN and Genetic Algorithm (GA) based ANN (GA-ANN).</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>GA-ANN
</bold>.</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>
                    <xref ref-type="table" rid="T2">
Table 2</xref> shows that 2004 to date, several ML classifiers have been evaluated to predict the likelihood of a client to subscribe to a term loan following a direct marketing campaign by the bank using data from a Portuguese banking institution. In general, the table shows that the results vary depending on the setting such as the number of attributes, the number of observations in the data, and the number of training times to mention a few. Most of these studies included neural networks
                    <sup>
                        <xref ref-type="bibr" rid="ref29">29</xref>,
                        <xref ref-type="bibr" rid="ref40">40</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref43">43</xref>,
                        <xref ref-type="bibr" rid="ref47">47</xref>
                    </sup> including the basic MLP and its variates such as Meta-Cost-MLP, Cost sensitive classifier-MLP, and the GA based ANN (GA-ANN). Although it appears in most previous studies, the basic neural networks classifier was only found to be the best performer when compared to LR and DT and SVM in the study by.
                    <sup>
                        <xref ref-type="bibr" rid="ref28">28</xref>
                    </sup> However, whenever its modified variates were included in the comparison, these variates were found to be best performers against the basic MLP such as in the study by
                    <sup>
                        <xref ref-type="bibr" rid="ref29">29</xref>
                    </sup> in which the Meta-Cost-MLP outperformed the basic MLP and other classifiers such as (J48, LL, DT, VFDR), and in the study by
                    <sup>
                        <xref ref-type="bibr" rid="ref47">47</xref>
                    </sup> in which the GA-ANN outperformed the baseline ANN. These results show that making improves to the basic MLP can improve its performance, hence this paper extend literature around the enhancement of the neural networks (specifically the MLP) as done by some authors in 
                    <xref ref-type="table" rid="T2">
Table 2</xref>, by comparing GA, GOA and CMA-ES optimisation algorithms for the MLP using the direct marketing data used in studies that are summarise in this table. It is evident from 
                    <xref ref-type="table" rid="T2">
Table 2</xref> that these optimisation algorithms have never been compared in a single study using the dataset that was used by the studies in 
                    <xref ref-type="table" rid="T2">
Table 2</xref>.</p>
            </sec>
            <sec id="sec9">
                <title>1.4 Data analysis methods</title>
                <p>

                    <bold>1.4.1 Data balancing</bold>
                </p>
                <p>The data in this study were split into 80% training data and 20% testing data, which is a commonly used train-to-testing data-splitting ratio. A Synthetic Minority Oversampling Technique (SMOTE) was used to balance the training samples
                    <sup>
                        <xref ref-type="bibr" rid="ref48">48</xref>
                    </sup> defined SMOTE as one of the most used oversampling techniques to solve imbalanced data problems, and it aims to balance class distributions by randomly increasing minority class examples by replicating them
                    <sup>
                        <xref ref-type="bibr" rid="ref48">48</xref>
                    </sup> explained that SMOTE uses linear interpolation to generate the virtual training records. These synthetic data were generated through a random selection of at least one 
                    <italic toggle="yes">k</italic>-nearest neighbor for each observation in the minority class.
                    <sup>
                        <xref ref-type="bibr" rid="ref48">48</xref>
                    </sup> In this study, SMOTE was chosen because of its advantage in reducing the risk of overfitting and its wide application in many previous studies, such as.
                    <sup>
                        <xref ref-type="bibr" rid="ref48">48</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref52">52</xref>
                    </sup>
                </p>
                <p>From 
                    <xref ref-type="fig" rid="f1">
Figure 1</xref>, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>Y</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the point under consideration, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>Y</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mn>1</mml:mn>
                        </mml:math>
</inline-formula> to 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>Y</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mn>4</mml:mn>
                        </mml:math>
</inline-formula> are the nearest neighbors, and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>w</mml:mi>
                                <mml:mn>1</mml:mn>
                            </mml:msub>
                        </mml:math>
</inline-formula> to 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>w</mml:mi>
                                <mml:mn>4</mml:mn>
                            </mml:msub>
                        </mml:math>
</inline-formula> represent the synthetic data generated by the randomized interjection
                    <sup>
                        <xref ref-type="bibr" rid="ref53">53</xref>
                    </sup> explained that synthetic samples are generated by considering the difference between the nearest neighbor and the feature vector
                    <sup>
                        <xref ref-type="bibr" rid="ref53">53</xref>
                    </sup> further explained that the difference is multiplied by a random number between 1 and 0 and then added to the feature vector under consideration. 
                    <xref ref-type="table" rid="T3">
Table 3</xref> presents balanced training data from the original dataset.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>
Figure 1. </label>
                    <caption>
                        <title>Example of how to generate synthetic data using SMOTE (
                            <sup>
                                <xref ref-type="bibr" rid="ref53">53</xref>
                            </sup>:1414).</title>
                        <p>In 
                            <xref ref-type="fig" rid="f1">
Figure 1</xref> explains how SMOTE randomly generates synthetic data (
                            <inline-formula>

                                <mml:math display="inline">
                                    <mml:msub>
                                        <mml:mi>w</mml:mi>
                                        <mml:mn>1</mml:mn>
                                    </mml:msub>
                                </mml:math>
</inline-formula> to 
                            <inline-formula>

                                <mml:math display="inline">
                                    <mml:msub>
                                        <mml:mi>w</mml:mi>
                                        <mml:mn>4</mml:mn>
                                    </mml:msub>
                                </mml:math>
</inline-formula>) to balance the imbalanced dataset by taking the difference between the nearest neighbours (
                            <inline-formula>

                                <mml:math display="inline">
                                    <mml:msub>
                                        <mml:mi>Y</mml:mi>
                                        <mml:mrow>
                                            <mml:mi>i</mml:mi>
                                            <mml:mn>1</mml:mn>
                                        </mml:mrow>
                                    </mml:msub>
                                </mml:math>
</inline-formula> to 
                            <inline-formula>

                                <mml:math display="inline">
                                    <mml:msub>
                                        <mml:mi>Y</mml:mi>
                                        <mml:mrow>
                                            <mml:mi>i</mml:mi>
                                            <mml:mn>4</mml:mn>
                                        </mml:mrow>
                                    </mml:msub>
                                </mml:math>
</inline-formula>) of the data point under consideration (
                            <inline-formula>

                                <mml:math display="inline">
                                    <mml:msub>
                                        <mml:mi>Y</mml:mi>
                                        <mml:mi>i</mml:mi>
                                    </mml:msub>
                                    <mml:mo stretchy="true">)</mml:mo>
                                    <mml:mspace width="0.25em"/>
                                </mml:math>
</inline-formula>and multiplying 
                            <inline-formula>

                                <mml:math display="inline">
                                    <mml:msub>
                                        <mml:mi>Y</mml:mi>
                                        <mml:mi>i</mml:mi>
                                    </mml:msub>
                                </mml:math>
</inline-formula>by a random number between 0 and 1, and then adding it to the feature vector under consideration.
                            <sup>
                                <xref ref-type="bibr" rid="ref53">53</xref>
                            </sup>
                        </p>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure1.gif"/>
                </fig>
                <table-wrap id="T3" orientation="portrait" position="float">
                    <label>
Table 3. </label>
                    <caption>
                        <title>Frequencies of the dependent variable in the SMOTE balanced training sets across the sample sizes.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="4" rowspan="1" valign="top">Unbalanced data</th>
                                <th align="left" colspan="4" rowspan="1" valign="top">Balanced data</th>
                            </tr>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Sample Size</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Client Subscription</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">N</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">%</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Sample Size</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Client Subscription</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">N</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
%</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 362</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">313</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">86</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 626</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">313</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">49</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">14</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">313</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 723</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">640</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">89</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 1280</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">640</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">83</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">640</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 1085</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">970</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">89</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 1940</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">970</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">115</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">970</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 1446</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1296</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">90</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 2592</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1296</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">150</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">10</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1296</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 1809</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1602</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">89</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 3024</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1602</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">207</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1602</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 2170</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1916</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">88</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 3832</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1916</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">254</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1916</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 2026</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1791</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">88</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 3582</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1791</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">235</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1791</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 2894</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2557</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">88</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 5114</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2557</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">337</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2557</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 3255</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2880</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">88</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 5760</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2880</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">375</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2880</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 3617</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3199</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">88</td>
                                <td align="left" colspan="1" rowspan="2" valign="top">n = 6398</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unsubscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3199</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">418</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Subscribed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3199</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">50</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>
                    <xref ref-type="table" rid="T3">
Table 3</xref> shows that the class of the dependent variable is balanced after using SMOTE specifically for the training data samples. In all the samples, equal numbers of unsubscribed participants and subscribed participants are observed.</p>
                <p>

                    <bold>1.4.2 Multilayer Perceptron (MLP)</bold>
                </p>
                <p>Explained that MLP was invented in 1958 at the Cornell Aeronautical Laboratory by Frank Rosenblatt, funded by the Office of Naval Research in the United States.
                    <sup>
                        <xref ref-type="bibr" rid="ref54">54</xref>
                    </sup> Further explained that although it was originally designed as a machine rather than a program, the perceptron was first implemented in IBM 704 as software before being implemented in specially designed hardware as the &#x201c;Mark 1 perceptron.&#x201d; In addition,
                    <sup>
                        <xref ref-type="bibr" rid="ref55">55</xref>
                    </sup> explained that the purpose of this machine is image recognition; it has 400 photocells arranged in an array and randomly connected to the &#x201c;neurons.&#x201d; According to the author, electric motors update the weights during learning, and the weights are encoded in the potentiometers. The flexibility of the MLP has enabled its function in various activities.
                    <sup>
                        <xref ref-type="bibr" rid="ref56">56</xref>
                    </sup> It has only been used for image recognition,
                    <sup>
                        <xref ref-type="bibr" rid="ref57">57</xref>,
                        <xref ref-type="bibr" rid="ref58">58</xref>
                    </sup> speech recognition,
                    <sup>
                        <xref ref-type="bibr" rid="ref59">59</xref>
                    </sup> and machine translation software.
                    <sup>
                        <xref ref-type="bibr" rid="ref60">60</xref>
                    </sup> Currently, it can be used for text data,
                    <sup>
                        <xref ref-type="bibr" rid="ref62">61</xref>,
                        <xref ref-type="bibr" rid="ref63">62</xref>
                    </sup> speech recognition,
                    <sup>
                        <xref ref-type="bibr" rid="ref58">58</xref>
                    </sup> and other types of data. MLP can be fitted using various software, such as Waikato Environment for Knowledge Analysis 3.9 (WEKA), Statistical Package for the Social Sciences (SPSS), and Python. With the use of optimization algorithms, such as those being compared in this study, MLPs have become very useful, convenient, and easy to use.</p>
                <p>The MLP consists of an input and an output layer with one or more hidden layers of non-linear activating nodes.
                    <sup>
                        <xref ref-type="bibr" rid="ref64">63</xref>
                    </sup> Each node in one layer connects with a certain weight to every node in the following layer.
                    <sup>
                        <xref ref-type="bibr" rid="ref64">63</xref>
                    </sup> In the input layer, the activations, which were defined by
                    <sup>
                        <xref ref-type="bibr" rid="ref65">64</xref>
                    </sup> as the source of the MLP&#x2019;s power, were determined using the following equation:
                    <disp-formula id="e1">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>b</mml:mi>
                                <mml:mi>j</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:munderover>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mrow>
                                    <mml:mi>i</mml:mi>
                                    <mml:mo>=</mml:mo>
                                    <mml:mn>0</mml:mn>
                                </mml:mrow>
                                <mml:mi>D</mml:mi>
                            </mml:munderover>
                            <mml:msubsup>
                                <mml:mi>w</mml:mi>
                                <mml:mi mathvariant="italic">ij</mml:mi>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mn>1</mml:mn>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                            </mml:msubsup>
                            <mml:msub>
                                <mml:mi>x</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(1)</label>
</disp-formula>
                </p>
                <p>The first layer involves 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>M</mml:mi>
                        </mml:math>
</inline-formula> linear combinations of the d-dimensional input for
                    <disp-formula id="e2">

                        <mml:math display="block">
                            <mml:mi>i</mml:mi>
                            <mml:mspace width="0.25em"/>
                            <mml:mn>1</mml:mn>
                            <mml:mo>,</mml:mo>
                            <mml:mn>2</mml:mn>
                            <mml:mo>,</mml:mo>
                            <mml:mo>&#x2026;</mml:mo>
                            <mml:mo>,</mml:mo>
                            <mml:mi mathvariant="normal">M</mml:mi>
                            <mml:mspace width="0.25em"/>
                            <mml:mtext>and</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mi>j</mml:mi>
                            <mml:mo>=</mml:mo>
                            <mml:mn>1</mml:mn>
                            <mml:mo>,</mml:mo>
                            <mml:mn>2</mml:mn>
                            <mml:mo>,</mml:mo>
                            <mml:mo>&#x2026;</mml:mo>
                            <mml:mo>,</mml:mo>
                            <mml:mi mathvariant="normal">d</mml:mi>
                            <mml:mo>,</mml:mo>
                        </mml:math>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msubsup>
                                <mml:mi>w</mml:mi>
                                <mml:mi mathvariant="italic">ij</mml:mi>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mn>1</mml:mn>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                            </mml:msubsup>
                        </mml:math>
</inline-formula> are the weights for node 
                    <italic toggle="yes">j</italic> in layer 1 for incoming node 
                    <italic toggle="yes">I</italic> and (1) indicates that this is the first layer of the network. Each activation was then transformed by a non-linear activation function 
                    <italic toggle="yes">g.</italic>
                </p>
                <p>In this study, tanh was used as the activation function for the hidden layer
                    <sup>
                        <xref ref-type="bibr" rid="ref66">65</xref>
                    </sup> described the Tanh function as a smoother, zero-center function, with a range between -1 and 1. The Tanh function is defined by the following equation sourced from
                    <sup>
                        <xref ref-type="bibr" rid="ref66">65</xref>
                    </sup>:
                    <disp-formula id="e3">

                        <mml:math display="block">
                            <mml:mi>f</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>x</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>=</mml:mo>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mfrac>
                                    <mml:mrow>
                                        <mml:msup>
                                            <mml:mi>e</mml:mi>
                                            <mml:mi>x</mml:mi>
                                        </mml:msup>
                                        <mml:mo>&#x2212;</mml:mo>
                                        <mml:msup>
                                            <mml:mi>e</mml:mi>
                                            <mml:mrow>
                                                <mml:mo>&#x2212;</mml:mo>
                                                <mml:mi>x</mml:mi>
                                            </mml:mrow>
                                        </mml:msup>
                                    </mml:mrow>
                                    <mml:mrow>
                                        <mml:msup>
                                            <mml:mi>e</mml:mi>
                                            <mml:mi>x</mml:mi>
                                        </mml:msup>
                                        <mml:mo>+</mml:mo>
                                        <mml:msup>
                                            <mml:mi>e</mml:mi>
                                            <mml:mrow>
                                                <mml:mo>&#x2212;</mml:mo>
                                                <mml:mi>x</mml:mi>
                                            </mml:mrow>
                                        </mml:msup>
                                    </mml:mrow>
                                </mml:mfrac>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(2)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>x</mml:mi>
                        </mml:math>
</inline-formula> is an input to the neuron and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>e</mml:mi>
                        </mml:math>
</inline-formula> is Euler&#x2019;s number.</p>
                <p>A sigmoid function was used as the activation function for the output layer.
                    <sup>
                        <xref ref-type="bibr" rid="ref66">65</xref>
                    </sup> defined the sigmoid as a non-linear activation used mostly in feedforward neutral networks. &#x201c;It is a bounded differentiable real function, defined for real input values, with positive derivatives everywhere and some degree of smoothness&#x201d; (
                    <sup>
                        <xref ref-type="bibr" rid="ref66">65</xref>
                    </sup>:5). The sigmoid activation function is given by the following relationship, sourced from
                    <sup>
                        <xref ref-type="bibr" rid="ref66">65</xref>
                    </sup>:
                    <disp-formula id="e4">

                        <mml:math display="block">
                            <mml:mi>f</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>x</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:mn>1</mml:mn>
                                <mml:mrow>
                                    <mml:mn>1</mml:mn>
                                    <mml:mo>+</mml:mo>
                                    <mml:mo>exp</mml:mo>
                                    <mml:mrow>
                                        <mml:mo stretchy="true">(</mml:mo>
                                        <mml:mo>&#x2212;</mml:mo>
                                        <mml:msub>
                                            <mml:mi>b</mml:mi>
                                            <mml:mi>j</mml:mi>
                                        </mml:msub>
                                        <mml:mo stretchy="true">)</mml:mo>
                                    </mml:mrow>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(3)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>f</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>x</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
</inline-formula> corresponds to the outputs of the basis functions and is interpreted as the output of the hidden units.</p>
                <p>

                    <bold>1.4.3 Covariance Matrix Adaptation Evolution Strategy (CMA-ES)</bold>
                </p>
                <p>The Covariance Matrix Adaptation Evolution Strategy (CMA-ES) was developed by Hansen et al. in 2003.
                    <sup>
                        <xref ref-type="bibr" rid="ref67">66</xref>
                    </sup> According to,
                    <sup>
                        <xref ref-type="bibr" rid="ref68">67</xref>
                    </sup> the algorithm&#x2019;s theoretical underpinnings include variable metrics, and the CMA-ES uses maximum-likelihood updates in conjunction with a stochastic variable-metric approach. In an algorithm that quickly converges to the global optimum across a wide class of functions, the covariance matrix maximizes likelihood while resembling an expectation-maximization algorithm
                    <sup>
                        <xref ref-type="bibr" rid="ref69">68</xref>,
                        <xref ref-type="bibr" rid="ref70">69</xref>
                    </sup> explained that the CMA-ES algorithm has certain drawbacks, such as its performance becoming slow if the number of model parameters that need to be estimated is large. The approximation of gradients without assuming or requiring their existence is another flaw of this algorithm. CMA-ES is a plausible candidate for an effective parameter estimation algorithm,
                    <sup>
                        <xref ref-type="bibr" rid="ref71">70</xref>
                    </sup> but it must be tested against other algorithms to ascertain its efficiency, particularly when the sample size is varied.</p>
                <p>The CMA-ES samples from the multivariate normal distribution search rank the sampled points according to their fitness function values. The multivariate normal distribution can be calculated using the following equation obtained by
                    <sup>
                        <xref ref-type="bibr" rid="ref72">71</xref>
                    </sup>: 
                    <disp-formula id="e5">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>x</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>&#x223c;</mml:mo>
                            <mml:mi>N</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>m</mml:mi>
                                    <mml:mi>k</mml:mi>
                                </mml:msub>
                                <mml:mo>,</mml:mo>
                                <mml:msubsup>
                                    <mml:mi>&#x03c3;</mml:mi>
                                    <mml:mi>k</mml:mi>
                                    <mml:mn>2</mml:mn>
                                </mml:msubsup>
                                <mml:msub>
                                    <mml:mi>C</mml:mi>
                                    <mml:mi>k</mml:mi>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(4)</label>
</disp-formula>

                    <disp-formula id="e6">

                        <mml:math display="block">
                            <mml:mo>&#x223c;</mml:mo>
                            <mml:msub>
                                <mml:mi>m</mml:mi>
                                <mml:mi>k</mml:mi>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>&#x03c3;</mml:mi>
                                <mml:mi>k</mml:mi>
                            </mml:msub>
                            <mml:mo>&#x00d7;</mml:mo>
                            <mml:mi>N</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mn>0</mml:mn>
                                <mml:mo>,</mml:mo>
                                <mml:msub>
                                    <mml:mi>C</mml:mi>
                                    <mml:mi>k</mml:mi>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(5)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>m</mml:mi>
                                <mml:mi>k</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the distribution average and recent favorite solution to the optimization problem, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>&#x03c3;</mml:mi>
                                <mml:mi>k</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the step size, and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>C</mml:mi>
                                <mml:mi>k</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the symmetric and positive definite</p>
                <p>The fitness function for the CMA-ES is defined as:
                    <disp-formula id="e7">

                        <mml:math display="block">
                            <mml:mi>f</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>x</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>=</mml:mo>
                            <mml:mi>g</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msup>
                                    <mml:mi>x</mml:mi>
                                    <mml:mi>T</mml:mi>
                                </mml:msup>
                                <mml:mi mathvariant="italic">Hx</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(6)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi mathvariant="italic">Hx</mml:mi>
                        </mml:math>
</inline-formula> is the Hessian matrix of 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>f</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>x</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
</inline-formula> and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msup>
                                <mml:mi>x</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msup>
                        </mml:math>
</inline-formula> is the transpose of 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>x</mml:mi>
                        </mml:math>
</inline-formula>.</p>
                <p>The mean distribution is then updated to a weighted average using the following equation:
                    <disp-formula id="e8">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>m</mml:mi>
                                <mml:mi mathvariant="italic">new</mml:mi>
                            </mml:msub>
                            <mml:mo>&#x2190;</mml:mo>
                            <mml:munderover>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mrow>
                                    <mml:mi>i</mml:mi>
                                    <mml:mo>=</mml:mo>
                                    <mml:mn>1</mml:mn>
                                </mml:mrow>
                                <mml:mi mathvariant="normal">&#x03bc;</mml:mi>
                            </mml:munderover>
                            <mml:msub>
                                <mml:mi>w</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:msub>
                                <mml:mi>x</mml:mi>
                                <mml:mrow>
                                    <mml:mi>i</mml:mi>
                                    <mml:mo>:</mml:mo>
                                    <mml:mi>&#x03bb;</mml:mi>
                                </mml:mrow>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:mi>m</mml:mi>
                            <mml:mo>+</mml:mo>
                            <mml:munderover>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mrow>
                                    <mml:mspace width="0.25em"/>
                                    <mml:mi>i</mml:mi>
                                    <mml:mo>=</mml:mo>
                                    <mml:mn>1</mml:mn>
                                </mml:mrow>
                                <mml:mi mathvariant="normal">&#x03bc;</mml:mi>
                            </mml:munderover>
                            <mml:mi mathvariant="italic">wi</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>x</mml:mi>
                                    <mml:mrow>
                                        <mml:mi>i</mml:mi>
                                        <mml:mo>:</mml:mo>
                                        <mml:mi>&#x03bb;</mml:mi>
                                    </mml:mrow>
                                </mml:msub>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:mi>m</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(7)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>m</mml:mi>
                                <mml:mi mathvariant="italic">new</mml:mi>
                            </mml:msub>
                            <mml:mspace width="0.25em"/>
                        </mml:math>
</inline-formula>is the new distribution mean, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>&#x03bc;</mml:mi>
                        </mml:math>
</inline-formula> is the number of parameters, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>&#x03bb;</mml:mi>
                        </mml:math>
</inline-formula> is the population size, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>m</mml:mi>
                        </mml:math>
</inline-formula> is the mean vector, and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>w</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the recombination weight.</p>
                <p>The isotropic evolution is then updated using the following equation:
                    <disp-formula id="e9">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>&#x03c3;</mml:mi>
                            </mml:msub>
                            <mml:mo>&#x2190;</mml:mo>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mn>1</mml:mn>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:msub>
                                    <mml:mi>C</mml:mi>
                                    <mml:mi>&#x03c3;</mml:mi>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>&#x03c3;</mml:mi>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msqrt>
                                <mml:mrow>
                                    <mml:mn>1</mml:mn>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msup>
                                        <mml:mrow>
                                            <mml:mo stretchy="true">(</mml:mo>
                                            <mml:mn>1</mml:mn>
                                            <mml:mo>&#x2212;</mml:mo>
                                            <mml:msub>
                                                <mml:mi>C</mml:mi>
                                                <mml:mi>&#x03c3;</mml:mi>
                                            </mml:msub>
                                            <mml:mo stretchy="true">)</mml:mo>
                                        </mml:mrow>
                                        <mml:mn>2</mml:mn>
                                    </mml:msup>
                                </mml:mrow>
                            </mml:msqrt>
                            <mml:msqrt>
                                <mml:msub>
                                    <mml:mi>&#x03bc;</mml:mi>
                                    <mml:mi>w</mml:mi>
                                </mml:msub>
                            </mml:msqrt>
                            <mml:msubsup>
                                <mml:mi>C</mml:mi>
                                <mml:mi>k</mml:mi>
                                <mml:mfrac>
                                    <mml:mn>1</mml:mn>
                                    <mml:mn>2</mml:mn>
                                </mml:mfrac>
                            </mml:msubsup>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mi>m</mml:mi>
                                        <mml:mrow>
                                            <mml:mi>k</mml:mi>
                                            <mml:mo>+</mml:mo>
                                            <mml:mn>1</mml:mn>
                                        </mml:mrow>
                                    </mml:msub>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msub>
                                        <mml:mi>m</mml:mi>
                                        <mml:mi>k</mml:mi>
                                    </mml:msub>
                                </mml:mrow>
                                <mml:msub>
                                    <mml:mi>&#x03c3;</mml:mi>
                                    <mml:mi>k</mml:mi>
                                </mml:msub>
                            </mml:mfrac>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(8)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>&#x03c3;</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the evolution path, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mn>1</mml:mn>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:msub>
                                    <mml:mi>C</mml:mi>
                                    <mml:mi>&#x03c3;</mml:mi>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
</inline-formula> is the discount factor, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msqrt>
                                <mml:mrow>
                                    <mml:mn>1</mml:mn>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msup>
                                        <mml:mrow>
                                            <mml:mo stretchy="true">(</mml:mo>
                                            <mml:mn>1</mml:mn>
                                            <mml:mo>&#x2212;</mml:mo>
                                            <mml:msub>
                                                <mml:mi>C</mml:mi>
                                                <mml:mi>&#x03c3;</mml:mi>
                                            </mml:msub>
                                            <mml:mo stretchy="true">)</mml:mo>
                                        </mml:mrow>
                                        <mml:mn>2</mml:mn>
                                    </mml:msup>
                                </mml:mrow>
                            </mml:msqrt>
                        </mml:math>
</inline-formula> is the complement for the discounted variance, and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msqrt>
                                <mml:msub>
                                    <mml:mi>&#x03bc;</mml:mi>
                                    <mml:mi>w</mml:mi>
                                </mml:msub>
                            </mml:msqrt>
                            <mml:msubsup>
                                <mml:mi>C</mml:mi>
                                <mml:mi>k</mml:mi>
                                <mml:mfrac>
                                    <mml:mn>1</mml:mn>
                                    <mml:mn>2</mml:mn>
                                </mml:mfrac>
                            </mml:msubsup>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mi>m</mml:mi>
                                        <mml:mrow>
                                            <mml:mi>k</mml:mi>
                                            <mml:mo>+</mml:mo>
                                            <mml:mn>1</mml:mn>
                                        </mml:mrow>
                                    </mml:msub>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msub>
                                        <mml:mi>m</mml:mi>
                                        <mml:mi>k</mml:mi>
                                    </mml:msub>
                                </mml:mrow>
                                <mml:msub>
                                    <mml:mi>&#x03c3;</mml:mi>
                                    <mml:mi>k</mml:mi>
                                </mml:msub>
                            </mml:mfrac>
                        </mml:math>
</inline-formula> are distributed as 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>N</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mn>0</mml:mn>
                                <mml:mo>,</mml:mo>
                                <mml:mi>I</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
</inline-formula> under neutral selection.
                    <disp-formula id="e10">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>&#x03c3;</mml:mi>
                                <mml:mrow>
                                    <mml:mi>k</mml:mi>
                                    <mml:mo>+</mml:mo>
                                    <mml:mn>1</mml:mn>
                                </mml:mrow>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mi>&#x03c3;</mml:mi>
                                <mml:mi>k</mml:mi>
                            </mml:msub>
                            <mml:mo>&#x00d7;</mml:mo>
                            <mml:mo>exp</mml:mo>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mfrac>
                                    <mml:msub>
                                        <mml:mi>C</mml:mi>
                                        <mml:mi>&#x03c3;</mml:mi>
                                    </mml:msub>
                                    <mml:msub>
                                        <mml:mi>d</mml:mi>
                                        <mml:mi>&#x03c3;</mml:mi>
                                    </mml:msub>
                                </mml:mfrac>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mfrac>
                                        <mml:mrow>
                                            <mml:mo>|</mml:mo>
                                            <mml:mo>|</mml:mo>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mi>&#x03c3;</mml:mi>
                                            </mml:msub>
                                            <mml:mo>|</mml:mo>
                                            <mml:mo>|</mml:mo>
                                        </mml:mrow>
                                        <mml:mrow>
                                            <mml:mi>E</mml:mi>
                                            <mml:mrow>
                                                <mml:mo>|</mml:mo>
                                                <mml:mo>|</mml:mo>
                                                <mml:mi>N</mml:mi>
                                                <mml:mrow>
                                                    <mml:mo stretchy="true">(</mml:mo>
                                                    <mml:mn>0</mml:mn>
                                                    <mml:mo>,</mml:mo>
                                                    <mml:mi>I</mml:mi>
                                                    <mml:mo stretchy="true">)</mml:mo>
                                                </mml:mrow>
                                                <mml:mo>|</mml:mo>
                                                <mml:mo>|</mml:mo>
                                            </mml:mrow>
                                        </mml:mrow>
                                    </mml:mfrac>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:mn>1</mml:mn>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(9)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mfrac>
                                <mml:msub>
                                    <mml:mi>C</mml:mi>
                                    <mml:mi>&#x03c3;</mml:mi>
                                </mml:msub>
                                <mml:msub>
                                    <mml:mi>d</mml:mi>
                                    <mml:mi>&#x03c3;</mml:mi>
                                </mml:msub>
                            </mml:mfrac>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mfrac>
                                    <mml:mrow>
                                        <mml:mo>|</mml:mo>
                                        <mml:mo>|</mml:mo>
                                        <mml:msub>
                                            <mml:mi>p</mml:mi>
                                            <mml:mi>&#x03c3;</mml:mi>
                                        </mml:msub>
                                        <mml:mo>|</mml:mo>
                                        <mml:mo>|</mml:mo>
                                    </mml:mrow>
                                    <mml:mrow>
                                        <mml:mi>E</mml:mi>
                                        <mml:mrow>
                                            <mml:mo>|</mml:mo>
                                            <mml:mo>|</mml:mo>
                                            <mml:mi>N</mml:mi>
                                            <mml:mrow>
                                                <mml:mo stretchy="true">(</mml:mo>
                                                <mml:mn>0</mml:mn>
                                                <mml:mo>,</mml:mo>
                                                <mml:mi>I</mml:mi>
                                                <mml:mo stretchy="true">)</mml:mo>
                                            </mml:mrow>
                                            <mml:mo>|</mml:mo>
                                            <mml:mo>|</mml:mo>
                                        </mml:mrow>
                                    </mml:mrow>
                                </mml:mfrac>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:mn>1</mml:mn>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>
</inline-formula> is unbiased about 0 under unbiased selection.
                    <disp-formula id="e11">

                        <mml:math display="block">
                            <mml:mi>E</mml:mi>
                            <mml:mrow>
                                <mml:mo>|</mml:mo>
                                <mml:mo>|</mml:mo>
                                <mml:mi>N</mml:mi>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mn>0</mml:mn>
                                    <mml:mo>,</mml:mo>
                                    <mml:mi>I</mml:mi>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                                <mml:mo>|</mml:mo>
                                <mml:mo>|</mml:mo>
                            </mml:mrow>
                            <mml:mo>=</mml:mo>
                            <mml:msqrt>
                                <mml:mn>2</mml:mn>
                            </mml:msqrt>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mfrac>
                                    <mml:mrow>
                                        <mml:mi mathvariant="normal">n</mml:mi>
                                        <mml:mo>+</mml:mo>
                                        <mml:mn>1</mml:mn>
                                    </mml:mrow>
                                    <mml:mn>2</mml:mn>
                                </mml:mfrac>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mfrac>
                                    <mml:mi mathvariant="normal">n</mml:mi>
                                    <mml:mn>2</mml:mn>
                                </mml:mfrac>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(10)</label>
</disp-formula>

                    <disp-formula id="e12">

                        <mml:math display="block">
                            <mml:mo>&#x2248;</mml:mo>
                            <mml:msqrt>
                                <mml:mi>n</mml:mi>
                            </mml:msqrt>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mn>1</mml:mn>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:mfrac>
                                    <mml:mn>1</mml:mn>
                                    <mml:mrow>
                                        <mml:mn>4</mml:mn>
                                        <mml:mi>n</mml:mi>
                                    </mml:mrow>
                                </mml:mfrac>
                                <mml:mo>+</mml:mo>
                                <mml:mfrac>
                                    <mml:mn>1</mml:mn>
                                    <mml:mrow>
                                        <mml:mn>21</mml:mn>
                                        <mml:msup>
                                            <mml:mi>n</mml:mi>
                                            <mml:mn>2</mml:mn>
                                        </mml:msup>
                                    </mml:mrow>
                                </mml:mfrac>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(11)</label>
</disp-formula>
                </p>
                <p>Update of the covariance matrix adopted is described as follows:
                    <disp-formula id="e13">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>c</mml:mi>
                            </mml:msub>
                            <mml:mo>&#x2190;</mml:mo>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mn>1</mml:mn>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:msub>
                                    <mml:mi>C</mml:mi>
                                    <mml:mi>c</mml:mi>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>c</mml:mi>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mn>1</mml:mn>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mn>0</mml:mn>
                                    <mml:mo>,</mml:mo>
                                    <mml:mi>&#x03b1;</mml:mi>
                                    <mml:msqrt>
                                        <mml:mi>n</mml:mi>
                                    </mml:msqrt>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                            </mml:msub>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mo>|</mml:mo>
                                <mml:mo>|</mml:mo>
                                <mml:msub>
                                    <mml:mi>p</mml:mi>
                                    <mml:mi>&#x03c3;</mml:mi>
                                </mml:msub>
                                <mml:mo>|</mml:mo>
                                <mml:mo>|</mml:mo>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:msqrt>
                                <mml:mrow>
                                    <mml:mn>1</mml:mn>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msup>
                                        <mml:mrow>
                                            <mml:mo stretchy="true">(</mml:mo>
                                            <mml:mn>1</mml:mn>
                                            <mml:mo>&#x2212;</mml:mo>
                                            <mml:msub>
                                                <mml:mi>c</mml:mi>
                                                <mml:mi>c</mml:mi>
                                            </mml:msub>
                                            <mml:mo stretchy="true">)</mml:mo>
                                        </mml:mrow>
                                        <mml:mn>2</mml:mn>
                                    </mml:msup>
                                </mml:mrow>
                            </mml:msqrt>
                            <mml:msqrt>
                                <mml:msub>
                                    <mml:mi>&#x03bc;</mml:mi>
                                    <mml:mi>w</mml:mi>
                                </mml:msub>
                            </mml:msqrt>
                            <mml:mspace width="0.25em"/>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mi>m</mml:mi>
                                        <mml:mrow>
                                            <mml:mi>k</mml:mi>
                                            <mml:mo>+</mml:mo>
                                            <mml:mn>1</mml:mn>
                                        </mml:mrow>
                                    </mml:msub>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msub>
                                        <mml:mi>m</mml:mi>
                                        <mml:mi>k</mml:mi>
                                    </mml:msub>
                                </mml:mrow>
                                <mml:msub>
                                    <mml:mi>&#x03c3;</mml:mi>
                                    <mml:mi>k</mml:mi>
                                </mml:msub>
                            </mml:mfrac>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(12)</label>
</disp-formula>
                </p>
                <p>The CME-ES is finally updated using:
                    <disp-formula id="e14">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>C</mml:mi>
                                <mml:mrow>
                                    <mml:mi>k</mml:mi>
                                    <mml:mo>+</mml:mo>
                                    <mml:mn>1</mml:mn>
                                </mml:mrow>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mn>1</mml:mn>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:msub>
                                    <mml:mi>c</mml:mi>
                                    <mml:mn>1</mml:mn>
                                </mml:msub>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:msub>
                                    <mml:mi>c</mml:mi>
                                    <mml:mi>&#x03bc;</mml:mi>
                                </mml:msub>
                                <mml:mo>+</mml:mo>
                                <mml:msub>
                                    <mml:mi>c</mml:mi>
                                    <mml:mi>s</mml:mi>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:msub>
                                <mml:mi>C</mml:mi>
                                <mml:mi>k</mml:mi>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>c</mml:mi>
                                <mml:mn>1</mml:mn>
                            </mml:msub>
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>c</mml:mi>
                            </mml:msub>
                            <mml:msubsup>
                                <mml:mi>p</mml:mi>
                                <mml:mi>c</mml:mi>
                                <mml:mi>T</mml:mi>
                            </mml:msubsup>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>c</mml:mi>
                                <mml:mi>&#x03bc;</mml:mi>
                            </mml:msub>
                            <mml:munderover>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mrow>
                                    <mml:mi>i</mml:mi>
                                    <mml:mo>=</mml:mo>
                                    <mml:mn>1</mml:mn>
                                </mml:mrow>
                                <mml:mi>&#x03bc;</mml:mi>
                            </mml:munderover>
                            <mml:msub>
                                <mml:mi>w</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mi>x</mml:mi>
                                        <mml:mrow>
                                            <mml:mi>i</mml:mi>
                                            <mml:mo>:</mml:mo>
                                            <mml:mi>&#x03bb;</mml:mi>
                                        </mml:mrow>
                                    </mml:msub>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msub>
                                        <mml:mi>m</mml:mi>
                                        <mml:mi>k</mml:mi>
                                    </mml:msub>
                                </mml:mrow>
                                <mml:msub>
                                    <mml:mi>&#x03c3;</mml:mi>
                                    <mml:mi>k</mml:mi>
                                </mml:msub>
                            </mml:mfrac>
                            <mml:mspace width="0.25em"/>
                            <mml:msup>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mfrac>
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>x</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>i</mml:mi>
                                                    <mml:mo>:</mml:mo>
                                                    <mml:mi>&#x03bb;</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>&#x2212;</mml:mo>
                                            <mml:msub>
                                                <mml:mi>m</mml:mi>
                                                <mml:mi>k</mml:mi>
                                            </mml:msub>
                                        </mml:mrow>
                                        <mml:msub>
                                            <mml:mi>&#x03c3;</mml:mi>
                                            <mml:mi>k</mml:mi>
                                        </mml:msub>
                                    </mml:mfrac>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                                <mml:mi>T</mml:mi>
                            </mml:msup>
                        </mml:math>

                        <label>(13)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>c</mml:mi>
                                <mml:mi>s</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the small variance loss, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>c</mml:mi>
                                <mml:mn>1</mml:mn>
                            </mml:msub>
                        </mml:math>
</inline-formula>is the learning rate for updating the covariance matrix, and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>c</mml:mi>
                                <mml:mi>&#x03bc;</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the learning rate for rank-
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>&#x03bc;</mml:mi>
                        </mml:math>
</inline-formula> for updating the covariance matrix.</p>
                <p>

                    <bold>1.4.4 Genetic Algorithm (GA)</bold>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref74">[73]</xref> proposed a learning machine called the Genetic Algorithm (GA), which paralleled the principles of evolution. Barricelli (1954) pointed out that the first computer simulation of evolution was created in 1954 at the Institute for Advanced Study in Princeton, New Jersey, thanks to the efforts of Barricelli
                    <sup>
                        <xref ref-type="bibr" rid="ref74">73</xref>
                    </sup> found that GA has some limitations, such as repeated evaluation of the fitness function and difficulties in working with dynamic datasets; it tends to converge to a local optimum or even arbitrary points, instead of the global optimum of the problem. &#x201c;A better solution is only in comparison to other solutions, and the stop criterion is not clear in every problem&#x201d; (
                    <sup>
                        <xref ref-type="bibr" rid="ref74">73</xref>
                    </sup>:226). On the other hand, GA has been noticed to be a very efficient and effective technique for both optimisation and ML applications.
                    <sup>
                        <xref ref-type="bibr" rid="ref75">74</xref>
                    </sup> Another advantage of GA is that it requires less information about the problem
                    <sup>
                        <xref ref-type="bibr" rid="ref76">75</xref>,
                        <xref ref-type="bibr" rid="ref77">76</xref>
                    </sup> stated that GA can work very well on mixed (discrete and/or continuous) problems. &#x201c;The GA can be applied in real world situations such as engineering design, to make the design cycle process fast and economical, and in robotics too, to create learning robots which will behave as humans and will do tasks like cooking and laundry&#x201d; (
                    <sup>
                        <xref ref-type="bibr" rid="ref78">77</xref>
                    </sup>: 347).</p>
                <p>The efficiency of GAs depends on mutation and crossover operators and their relationships. &#x201c;To determine the most appropriate operators, different mutation and crossover operators are used and they are compared with each other since GA involves a process of complex interaction between its parameters&#x201d;
                    <sup>
                        <xref ref-type="bibr" rid="ref79">78</xref>
                    </sup> suggested that for the algorithm to perform best, the population size must range between 50 and 100 observations. In this study, we verified this recommendation by studying the effectiveness of GA in different sample sizes
                    <sup>
                        <xref ref-type="bibr" rid="ref80">79</xref>
                    </sup> stated that the algorithm comprises four main steps: selection, reproduction, replacement, and termination. The steps are as follows:</p>
                <p>

                    <italic toggle="yes">1.4.4.1 Selection</italic>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref80">[80]</xref> explained that by choosing the reproduction of offspring, the primary goal of this phase is to identify the area with the highest likelihood of producing a solution to the problem that is superior to that of the previous generation. The authors add that the selection of individuals will then be arranged in pairs of two to enhance reproduction
                    <sup>
                        <xref ref-type="bibr" rid="ref80">79</xref>
                    </sup> also explained that individuals will then pass on their genes to the next generation. &#x201c;The GA uses the fitness proportionate selection technique to ensure that useful solutions are used for recombination&#x201d; (
                    <sup>
                        <xref ref-type="bibr" rid="ref80">79</xref>
                    </sup>: 3). Fitness proportion selection is defined by the author as the most popular method of parent selection, where every individual can become a parent with a probability that is proportional to its fitness. &#x201c;Fitter individuals have a higher chance of mating and propagating their features to the next generation. Therefore, such a selection strategy applies a selection pressure to the more fit individuals in the population, evolving better individuals over time&#x201d;(
                    <sup>
                        <xref ref-type="bibr" rid="ref81">80</xref>
                    </sup>: 16). The fitness proportionate selection can be calculated using the following equation adopted from
                    <sup>
                        <xref ref-type="bibr" rid="ref81">80</xref>
                    </sup>:
                    <disp-formula id="e15">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:msub>
                                    <mml:mi>f</mml:mi>
                                    <mml:mi>i</mml:mi>
                                </mml:msub>
                                <mml:mrow>
                                    <mml:msubsup>
                                        <mml:mo>&#x2211;</mml:mo>
                                        <mml:mrow>
                                            <mml:mspace width="0em"/>
                                            <mml:mi>j</mml:mi>
                                            <mml:mo>=</mml:mo>
                                            <mml:mn>1</mml:mn>
                                        </mml:mrow>
                                        <mml:mi>N</mml:mi>
                                    </mml:msubsup>
                                    <mml:msub>
                                        <mml:mi>f</mml:mi>
                                        <mml:mi>i</mml:mi>
                                    </mml:msub>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(14)</label>
</disp-formula>where
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mspace width="0.25em"/>
                            <mml:msub>
                                <mml:mi>f</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> denotes the fitness of individual 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>i</mml:mi>
                        </mml:math>
</inline-formula> in the population, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>N</mml:mi>
                        </mml:math>
</inline-formula> denotes the number of individuals in the population, and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>p</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula>
 denotes the probability.</p>
                <p>

                    <italic toggle="yes">1.4.4.2 Reproduction</italic>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref80">[80]</xref> explained that the algorithm applies variation operators to the parent population during the reproduction phase, creating a child population. This phase has four main operators, crossover, mutation, replacement, and termination, which are discussed below.</p>
                <p>

                    <italic toggle="yes">1.4.4.3 Crossover</italic>
                </p>
                <p>According to,
                    <sup>
                        <xref ref-type="bibr" rid="ref82">81</xref>
                    </sup> the crossover operator swaps the genetic information of two parents to produce offspring
                    <sup>
                        <xref ref-type="bibr" rid="ref82">81</xref>
                    </sup> also explained that this is performed on parent pairs that are selected randomly to generate a child population of equal size to the parent population. For this study, a single-point crossover was considered. &#x201c;Single point crossover works in such a way that a parent organism string is selected. All data beyond this point in the organism string were swapped between the two parent organisms. Strings are characterized by positional bias&#x201d; (
                    <sup>
                        <xref ref-type="bibr" rid="ref82">81</xref>
                    </sup>: 13).</p>
                <p>

                    <italic toggle="yes">1.4.4.4 Mutation</italic>
                </p>
                <p>The mutation operator adds genetic information to the new child population. According to,
                    <sup>
                        <xref ref-type="bibr" rid="ref83">82</xref>
                    </sup> the operator achieves this by flipping some bits in the chromosome to solve the problem of local minima and enhance diversification. In the present study, a bit-flip mutation was considered. &#x201c;Bit flip mutation works in such a way that it selects one or more random bits and flip them. This can only be done for binary encoded GA&#x2019;s&#x201d; (
                    <sup>
                        <xref ref-type="bibr" rid="ref83">82</xref>
                    </sup>: 47).</p>
                <p>

                    <italic toggle="yes">1.4.4.5 Replacement</italic>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref80">[80]</xref> elucidated that the replacement operator acts as the final generational step to replace the old population with the new child population. In this study, a generational replacement operator is used, where the previous generation is replaced with a newly generated child population.</p>
                <p>

                    <italic toggle="yes">1.4.4.6 Termination</italic>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref80">[80]</xref> explains that termination is only possible in specific situations, such as having reached an absolute number of generations but not having improved the population for X iterations or the objective function value reaching a pre-defined threshold
                    <sup>
                        <xref ref-type="bibr" rid="ref80">79</xref>
                    </sup> cited a genetic algorithm example in which a counter was maintained to record generations for which the population did not improve. &#x201c;Initially, we set the counter to zero. Each time we do not generate an offspring, which is better than the individuals in the population, we increase the counter. However, if the fitness of any offspring is better, then we reset the counter to zero&#x201d; (
                    <sup>
                        <xref ref-type="bibr" rid="ref80">79</xref>
                    </sup>: 2). The author also stated that the algorithm terminates when the counter reaches a predetermined value.</p>
                <p>

                    <bold>1.4.5 Grasshopper Optimisation Algorithm (GOA)</bold>
                </p>
                <p>The Grasshopper Optimisation Algorithm (GOA) is a new swarm intelligence algorithm and population-based method developed by Seyedali Mirjalili in 2017.
                    <sup>
                        <xref ref-type="bibr" rid="ref84">83</xref>
                    </sup> According to the authors, the GOA mainly observes the behavior of grasshopper swarms and their social interactions. Every grasshopper in the population represents a solution, and its location within the swarm is determined by three forces: wind advection, the force of gravity applied to it, and social interactions with other grasshoppers.
                    <sup>
                        <xref ref-type="bibr" rid="ref85">84</xref>
                    </sup> The process of optimizing the grasshopper algorithm involves several steps, including initialization, creation, and evaluation of the first population, identification of the best overall solution, updating the decreasing coefficient parameter, mapping the grasshopper&#x2019;s distance, and updating the solution.
                    <sup>
                        <xref ref-type="bibr" rid="ref86">85</xref>
                    </sup>
                </p>
                <p>
Reference 
                    <xref ref-type="bibr" rid="ref87">[87]</xref> explained that the GOA can improve the average fitness of all grasshoppers, which helps the GOA effectively increase the first randomly generated solutions. The algorithm can be computed using software such as Matrix Laboratory (MATLAB) and Python. No information relating to the GOA in comparison with other algorithms has emerged, as this is a newly developed algorithm. Therefore, little is known about the efficiency of this algorithm compared to its predecessors; hence, the proposed study seeks to expand the scope of this algorithm. Grasshopper position (
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>X</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula>) calculations depend on three types of forces: social interactions and other grasshoppers, wind advection, and gravitational force.
                    <sup>
                        <xref ref-type="bibr" rid="ref88">87</xref>
                    </sup> All equations used in the description of the GOA in this study were sourced from
                    <sup>
                        <xref ref-type="bibr" rid="ref88">87</xref>
                    </sup> the grasshopper&#x2019;s position is defined as:
                    <disp-formula id="e16">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>X</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mi>S</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>G</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>A</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(15)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>X</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mspace width="0.25em"/>
                        </mml:math>
</inline-formula>defines the position of the 
                    <italic toggle="yes">i</italic>-th grasshopper, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>S</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the social interaction, 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>G</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mspace width="0.25em"/>
                        </mml:math>
</inline-formula>is the gravitational force on the 
                    <italic toggle="yes">i</italic>-th grasshopper, and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>A</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is wind advection.</p>
                <p>From 
                    <xref ref-type="disp-formula" rid="e16">
Equation 15</xref>, social interaction is defined as:
                    <disp-formula id="e17">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>S</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:msubsup>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mrow>
                                    <mml:mi>J</mml:mi>
                                    <mml:mo>=</mml:mo>
                                    <mml:mn>1</mml:mn>
                                </mml:mrow>
                                <mml:mi>N</mml:mi>
                            </mml:msubsup>
                            <mml:mi>s</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>d</mml:mi>
                                    <mml:mi mathvariant="italic">ij</mml:mi>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mover accent="true">
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mi>d</mml:mi>
                                        <mml:mi mathvariant="italic">ij</mml:mi>
                                    </mml:msub>
                                    <mml:mo>,</mml:mo>
                                </mml:mrow>
                                <mml:mo stretchy="true">&#x0302;</mml:mo>
                            </mml:mover>
                        </mml:math>

                        <label>(16)</label>
</disp-formula>where 
                    <italic toggle="yes">d
                        <sub>ij</sub>
                    </italic> is the distance between grasshopper 
                    <italic toggle="yes">i</italic> and grasshopper 
                    <italic toggle="yes">j</italic> in the 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msup>
                                <mml:mi>d</mml:mi>
                                <mml:mi mathvariant="italic">th</mml:mi>
                            </mml:msup>
                        </mml:math>
</inline-formula>dimension.</p>
                <p>From 
                    <xref ref-type="disp-formula" rid="e16">
Equation 15</xref>, the gravitational force (
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mi>G</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo stretchy="true">)</mml:mo>
                        </mml:math>
</inline-formula> on the grasshopper is computed as follows:
                    <disp-formula id="e18">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>G</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:mo>&#x2212;</mml:mo>
                            <mml:msub>
                                <mml:mi>g</mml:mi>
                                <mml:mover accent="true">
                                    <mml:msub>
                                        <mml:mi>e</mml:mi>
                                        <mml:mi>g</mml:mi>
                                    </mml:msub>
                                    <mml:mo stretchy="true">&#x0302;</mml:mo>
                                </mml:mover>
                            </mml:msub>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(17)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mo>&#x2212;</mml:mo>
                            <mml:mi>g</mml:mi>
                        </mml:math>
</inline-formula> denotes the gravitational constant and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi>e</mml:mi>
                                    <mml:mo stretchy="true">&#x0302;</mml:mo>
                                </mml:mover>
                                <mml:mi>g</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> is the unit vector towards the center of the earth.</p>
                <p>From 
                    <xref ref-type="disp-formula" rid="e16">
Equation 15</xref>, the wind advection is computed as follows:
                    <disp-formula id="e19">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>A</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi mathvariant="italic">ue</mml:mi>
                                    <mml:mo stretchy="true">&#x0302;</mml:mo>
                                </mml:mover>
                                <mml:mi>w</mml:mi>
                            </mml:msub>
                            <mml:mo>,</mml:mo>
                        </mml:math>

                        <label>(18)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>u</mml:mi>
                        </mml:math>
</inline-formula> is a constant drift and 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi>e</mml:mi>
                                    <mml:mo stretchy="true">&#x0302;</mml:mo>
                                </mml:mover>
                                <mml:mi>g</mml:mi>
                            </mml:msub>
                        </mml:math>
</inline-formula> represents a unity vector towards the direction of the wind.</p>
                <p>When substituting 
                    <xref ref-type="disp-formula" rid="e17">
Equations 16</xref>&#x2013;
                    <xref ref-type="disp-formula" rid="e19">18</xref> into 
                    <xref ref-type="disp-formula" rid="e16">
Equation 15</xref>, the position of the current grasshopper becomes.
                    <disp-formula id="e20">

                        <mml:math display="block">
                            <mml:msub>
                                <mml:mi>X</mml:mi>
                                <mml:mi>i</mml:mi>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:munderover>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mtable>
                                    <mml:mtr>
                                        <mml:mtd>
                                            <mml:maligngroup/>
                                            <mml:mi>j</mml:mi>
                                            <mml:mo>=</mml:mo>
                                            <mml:mn>1</mml:mn>
                                        </mml:mtd>
                                    </mml:mtr>
                                    <mml:mtr>
                                        <mml:mtd>
                                            <mml:maligngroup/>
                                            <mml:mi>j</mml:mi>
                                            <mml:mo>&#x2260;</mml:mo>
                                            <mml:mn>1</mml:mn>
                                        </mml:mtd>
                                    </mml:mtr>
                                </mml:mtable>
                                <mml:mi>N</mml:mi>
                            </mml:munderover>
                            <mml:mi>s</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mo>|</mml:mo>
                                <mml:msub>
                                    <mml:mi>x</mml:mi>
                                    <mml:mi>j</mml:mi>
                                </mml:msub>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:msub>
                                    <mml:mi>x</mml:mi>
                                    <mml:mi>i</mml:mi>
                                </mml:msub>
                                <mml:mo>|</mml:mo>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mi>x</mml:mi>
                                        <mml:mi>j</mml:mi>
                                    </mml:msub>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msub>
                                        <mml:mi>x</mml:mi>
                                        <mml:mi>i</mml:mi>
                                    </mml:msub>
                                </mml:mrow>
                                <mml:msub>
                                    <mml:mi>d</mml:mi>
                                    <mml:mi mathvariant="italic">ij</mml:mi>
                                </mml:msub>
                            </mml:mfrac>
                            <mml:mo>&#x2212;</mml:mo>
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi mathvariant="italic">ge</mml:mi>
                                    <mml:mo stretchy="true">&#x0302;</mml:mo>
                                </mml:mover>
                                <mml:mi>g</mml:mi>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi mathvariant="italic">ue</mml:mi>
                                    <mml:mo stretchy="true">&#x0302;</mml:mo>
                                </mml:mover>
                                <mml:mi>w</mml:mi>
                            </mml:msub>
                        </mml:math>

                        <label>(19)</label>
</disp-formula>where 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>N</mml:mi>
                        </mml:math>
</inline-formula> is the total number of grasshoppers.</p>
                <p>
Reference 
                    <xref ref-type="bibr" rid="ref84">[84]</xref> explained how the pseudocode of the GOA algorithm works. The GOA starts optimization by creating a set of random solutions; the search agents then update their positions, followed by the determination of the position of the best target obtained thus far, and this position is updated in each iteration.
                    <sup>
                        <xref ref-type="bibr" rid="ref84">83</xref>
                    </sup> Additionally, the distances between grasshoppers were normalized in each iteration
                    <sup>
                        <xref ref-type="bibr" rid="ref84">83</xref>
                    </sup> stated that position updating is performed iteratively until the end criterion is satisfied. Finally, the position and fitness of the best target are returned as the best approximation of the global optimum.</p>
                <p>

                    <bold>1.4.6 Model comparison criteria</bold>
                </p>
                <p>Precision, sensitivity/recall, F-score, classification accuracy, sensitivity, specificity, and execution time were used to evaluate and compare the optimization algorithms for the MLP, as described in this section. The classifier with the highest precision, recall, F-score, accuracy rate, sensitivity, specificity, and lowest execution time is preferred.</p>
                <p>Classification accuracy (also referred to as overall accuracy) was described by
                    <sup>
                        <xref ref-type="bibr" rid="ref89">88</xref>
                    </sup> as the number of correct forecasts divided by the total number of forecasts. It is the most straightforward clustering quality measure proposed by
                    <sup>
                        <xref ref-type="bibr" rid="ref90">89</xref>
                    </sup> to assess the clustering results related to the ground truth.
                    <sup>
                        <xref ref-type="bibr" rid="ref89">88</xref>
                    </sup> Classification accuracy was calculated by
                    <sup>
                        <xref ref-type="bibr" rid="ref89">88</xref>
                    </sup> as follows:
                    <disp-formula id="e21">

                        <mml:math display="block">
                            <mml:mtext mathvariant="italic">Accuracy</mml:mtext>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:mtext mathvariant="italic">True Positives</mml:mtext>
                                    <mml:mo>+</mml:mo>
                                    <mml:mtext mathvariant="italic">True Negatives</mml:mtext>
                                </mml:mrow>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mtext mathvariant="italic">Positives</mml:mtext>
                                    <mml:mo>+</mml:mo>
                                    <mml:mtext mathvariant="italic">Negative</mml:mtext>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(20)</label>
</disp-formula>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref91">[91]</xref> characterized specificity as a proportion of the extent of real negatives that are effectively distinguished, and they described the specificity equation as follows:
                    <disp-formula id="e22">

                        <mml:math display="block">
                            <mml:mtext mathvariant="italic">specificity</mml:mtext>
                            <mml:mo mathvariant="bold">=</mml:mo>
                            <mml:mfrac>
                                <mml:mtext mathvariant="italic">True Negatives</mml:mtext>
                                <mml:mrow>
                                    <mml:mtext mathvariant="italic">True Negatives</mml:mtext>
                                    <mml:mo>+</mml:mo>
                                    <mml:mtext mathvariant="italic">False Positives</mml:mtext>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(21)</label>
</disp-formula>
                </p>
                <p>Precision was defined by
                    <sup>
                        <xref ref-type="bibr" rid="ref92">91</xref>
                    </sup> as a measure of how close a series of measurements are to one another. The author explained that precise measurements are highly reproducible, even if the measurements are not near the correct value. Precision was calculated as follows
                    <sup>
                        <xref ref-type="bibr" rid="ref92">91</xref>
                    </sup>:
                    <disp-formula id="e23">

                        <mml:math display="block">
                            <mml:mtext mathvariant="italic">Precision</mml:mtext>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:mtext mathvariant="italic">True Positives</mml:mtext>
                                <mml:mrow>
                                    <mml:mtext mathvariant="italic">True Positives</mml:mtext>
                                    <mml:mo>+</mml:mo>
                                    <mml:mtext mathvariant="italic">False Positives</mml:mtext>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(22)</label>
</disp-formula>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref91">[91]</xref> characterize the sensitivity/recall rate as a measure of the proportion of real positives that are accurately identified. The following equation for recall/sensitivity was adopted from
                    <sup>
                        <xref ref-type="bibr" rid="ref91">90</xref>
                    </sup>:
                    <disp-formula id="e24">

                        <mml:math display="block">
                            <mml:mtext mathvariant="italic">Sensitivity</mml:mtext>
                            <mml:mo>/</mml:mo>
                            <mml:mtext mathvariant="italic">recall</mml:mtext>
                            <mml:mo mathvariant="bold">=</mml:mo>
                            <mml:mfrac>
                                <mml:mtext mathvariant="italic">True Positives</mml:mtext>
                                <mml:mrow>
                                    <mml:mtext mathvariant="italic">True Positives</mml:mtext>
                                    <mml:mo>+</mml:mo>
                                    <mml:mtext mathvariant="italic">False Negatives</mml:mtext>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(23)</label>
</disp-formula>
                </p>
                <p>Reference 
                    <xref ref-type="bibr" rid="ref93">[93]</xref> defined the F-measure as a weighted harmonic mean of recall and precision. There are several motivations for this choice
                    <sup>
                        <xref ref-type="bibr" rid="ref93">92</xref>
                    </sup> explains that the harmonic mean is commonly appropriate when averaging rates or frequencies, but there are also a set of theoretical reasons. The author further explains that the mean allows differential weighting of recall and precision, but they are commonly given equal weights. The F-measure was computed as follows:
                    <disp-formula id="e25">

                        <mml:math display="block">
                            <mml:mi>F</mml:mi>
                            <mml:mo>=</mml:mo>
                            <mml:mn>2</mml:mn>
                            <mml:mo>&#x2217;</mml:mo>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:mtext mathvariant="italic">Precision</mml:mtext>
                                    <mml:mo>&#x2217;</mml:mo>
                                    <mml:mtext mathvariant="italic">Recall</mml:mtext>
                                </mml:mrow>
                                <mml:mrow>
                                    <mml:mtext mathvariant="italic">Precision</mml:mtext>
                                    <mml:mo>+</mml:mo>
                                    <mml:mtext mathvariant="italic">Recall</mml:mtext>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(24)</label>
</disp-formula>
                </p>
                <p>Execution time is defined by
                    <sup>
                        <xref ref-type="bibr" rid="ref94">
93</xref>
                    </sup> as the amount of time spent by the system executing a given task, including the amount of time it spends executing runtime or system services.</p>
            </sec>
        </sec>
        <sec id="sec10" sec-type="results">
            <title>1.5 Results</title>
            <p>To ease the presentation and interpretation of the results, the results are presented by plotting each classification metric of all the ML classifiers under comparison across the sample sizes in 
                <xref ref-type="fig" rid="f2">
Figures 2</xref> to 
                <xref ref-type="fig" rid="f8">8</xref>.</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>
Figure 2. </label>
                <caption>
                    <title>Classification accuracy for the basic MLP, GA-MLP, GOA-MLP and CMA-ES-MLP by sample size.</title>
                    <p>The line graphs represent the overall classification accuracy of the basic MLP and each optimised MLP to determine the impact of various sample sizes on their classification accuracy. The classification accuracy is rate at which the model correctly classifies all the observations (both non-subscriptions and subscriptions).</p>
                </caption>
                <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure2.gif"/>
            </fig>
            <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                <label>
Figure 3. </label>
                <caption>
                    <title>Precision rate for the basic MLP, GA-MLP, GOA-MLP and CMA-ES-MLP by sample size.</title>
                    <p>The line graphs represent the overall precision rate of the basic MLP and each optimised MLP across the various sample sizes. This was to determine the impact of various sample sizes on the percentage of the term deposit subscriptions that are correctly classified by the models under comparison out of all the cases that were predicted as term deposit subscriptions by these models.</p>
                </caption>
                <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure3.gif"/>
            </fig>
            <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                <label>
Figure 4. </label>
                <caption>
                    <title>Sensitivity/recall for the basic MLP, GA-MLP, GOA-MLP and CMA-ES-MLP by sample size.</title>
                    <p>The line graphs represent the sensitivity rate of the basic MLP and each optimised MLP across the various sample sizes. This was to determine the impact of various sample sizes on the percentage of the term deposit subscriptions that are correctly classified by the models under comparison out of all the term deposit subscriptions from the testing datasets.</p>
                </caption>
                <graphic id="gr4" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure4.gif"/>
            </fig>
            <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                <label>
Figure 5. </label>
                <caption>
                    <title>Specificity rates for the basic MLP, GA-MLP, GOA-MLP and CMA-ES-MLP by sample size.</title>
                    <p>The line graphs represent the specificity rate of the basic MLP and each optimised MLP across the various sample sizes. This was to determine the impact of various sample sizes on the percentage of the term deposit non-subscriptions that are correctly classified by the models under comparison out of all the term deposit non-subscriptions from the testing datasets.</p>
                </caption>
                <graphic id="gr5" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure5.gif"/>
            </fig>
            <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                <label>
Figure 6. </label>
                <caption>
                    <title>F-measure rates for the basic MLP, GA-MLP, GOA-MLP and CMA-ES-MLP by sample size.</title>
                    <p>The line graphs represent the F-measure rate of the basic MLP and each optimised MLP across the various sample sizes. This was to determine the impact of various sample sizes on the harmonic mean of precision and recall. That is, how the sample size impacts the ability of the models under comparison to balance precision and recall.</p>
                </caption>
                <graphic id="gr6" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure6.gif"/>
            </fig>
            <fig fig-type="figure" id="f7" orientation="portrait" position="float">
                <label>
Figure 7. </label>
                <caption>
                    <title>Execution times for the basic MLP, GA-MLP, GOA-MLP and CMA-ES-MLP by sample size.</title>
                    <p>The line graphs represent the execution time of the basic MLP and each optimised MLP across the various sample sizes. This was to determine the impact of various sample sizes on the time it takes to complete the processes of deriving each model.</p>
                </caption>
                <graphic id="gr7" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure7.gif"/>
            </fig>
            <fig fig-type="figure" id="f8" orientation="portrait" position="float">
                <label>
Figure 8. </label>
                <caption>
                    <title>Mean of classification metrics for the basic MLP, GA-MLP, GOA-MLP and CMA-ES-MLP across the samples.</title>
                    <p>The line graphs represent the performance of the basic MLP and each optimised MLP across the various sample sizes on average. This was to determine the impact of various sample sizes on the average performance of the models. The mean classification performance was computed by taking the average of overall classification accuracy, precision, sensitivity and specificity.</p>
                </caption>
                <graphic id="gr8" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/185252/62644aa1-63cc-499e-8114-200f47dda971_figure8.gif"/>
            </fig>
            <p>
                <xref ref-type="fig" rid="f2">
Figure 2</xref> shows that, for the basic MLP classifier, the classification accuracy values fluctuate and do not follow a clear increasing or decreasing trend with different sample sizes. The GA-MLP shows that the classification accuracy values also fluctuate and do not exhibit a consistent pattern with the change in the sample size. For GOA-MLP, the classification accuracy values show fluctuations but are more stable than those of its competitors. For CMA-ES-MLP, the classification accuracy values appear to fluctuate with a significant drop for the 60% sample (n = 3832), but in most of the sample sizes (10% (n = 626), 20% (n = 1280), 40% (n = 2592), 70% (n = 3582), 80% (n = 5114), 90% (n = 5114), and 100% (n = 6398)), this classifier has the highest overall classification accuracy rates; that is, the ability to classify both subscribers and non-subscribers from the datasets. The precision rates for all classifiers across sample sizes are shown in 
                <xref ref-type="fig" rid="f3">
Figure 3</xref>.</p>
            <p>
                <xref ref-type="fig" rid="f3">
Figure 3</xref> shows that the precision values for all the models fluctuated as the sample sizes increased and did not show a consistent upward or downward pattern as the sample size increased. Generally, relatively high rates of precision are shown for the CMA-ES-MLP in most of the sample sizes (10% (n = 626), 20% (n = 1280), 30% (n = 1940), 40% (n = 2592), 50% (n = 3204), 60% (n = 3832), 70% (n = 3582), 80% (n = 5114), and 100% (n = 6398)). This implies that CMA-ES-MLP has the highest ability to correctly classify positive cases (subscribers) out of all predicted positives compared to GA-MLP, GOA-MLP, and the basic MLP in most instances. The sensitivity/recall rates for all classifiers across sample sizes are presented in 
                <xref ref-type="fig" rid="f4">
Figure 4</xref>.</p>
            <p>
                <xref ref-type="fig" rid="f4">
Figure 4</xref> shows a sharp increase from the smallest sample size (10% (n = 626)) to the second-smallest sample size (20% (n = 1280)) in the sensitivity/recall rate for the basic MLP. Thereafter, a steady increase was observed until n = 2592, followed by fluctuating values of sensitivity in the remaining sample sizes. Generally, the basic MLP with no optimization yielded the lowest sensitivity rates across all sample sizes compared to its competitors (except for the full dataset (n = 6398)). The second lowest sensitivity/recall rates were observed for GA-MLP across all samples, except for the 70% sample (n = 3582) and the 80% sample (n = 5114), so generally GA-MLP is the second worst performer among all four models. 
                <xref ref-type="fig" rid="f4">
Figure 4</xref> shows that the sensitivity/recall values for GOA-MLP and CMA-ES-MLP also show fluctuations but are generally relatively higher than those of the basic MLP and GA-MLP for most samples.</p>
            <p>The sensitivity/recall rates for CMA-ES-MLP decreased slowly as the sample size increased (except when n = 5760). In general, the sensitivity rates for GA-MLP, GOA-MLP, and CMA-ES-MLP are more stable across the sample sizes relative to those derived from the basic MLP without optimization because they do not fluctuate rapidly, as in the case of the basic MLP. In most instances, GA-MLP and CMA-ES-MLP correctly classified the negatives (non-subscribers) better than GOA-MLP and basic MLP. The specificity rates for all classifiers across sample sizes are shown in 
                <xref ref-type="fig" rid="f5">
Figure 5</xref>.</p>
            <p>
                <xref ref-type="fig" rid="f5">
Figure 5</xref> shows that for the basic MLP, the specificity values are relatively low and fluctuate with different sample sizes. The specificity for GA-MLP was highest for the smallest sample size (10% (n = 626)), followed by an upward trend between the second-smallest sample (20% (n = 1280)) and the fifth-lowest or sixth highest (50% (n = 3204)). Thereafter, it fluctuates, but for most sample sizes, its values are greater than those of the basic MLP and lower than those of the GOA-MLP. The specificity values for CMA-ES-MLP generally increased from the sixth highest sample (50% (3204)) to the full dataset (n = 6398) as the sample size increased. Generally, the CMA-ES-MLP classifies the positives (subscribers) correctly more accurately than the basic MLP, GA-MLP, and GOA-MLP. The F-measure rates for all classifiers across sample sizes are shown in 
                <xref ref-type="fig" rid="f6">
Figure 6</xref>.</p>
            <p>
                <xref ref-type="fig" rid="f6">
Figure 6</xref> shows that for the basic MLP, the F-measure appears to fluctuate with different sample sizes without forming a clear upward or downward trend, and the basic MLP yielded the lowest F-measure across all sample sizes. For GA-MLP, generally, there seems to be an increase in the F-measure as the sample size increases from 20% (n=1280) to 50% (n=3240) and from 60% (3832) to 90% (5760); however, GA-MLP is the second worst performer in terms of the F-measure. 
                <xref ref-type="fig" rid="f6">
Figure 6</xref> also shows that the F-measure for the GOA-MLP fluctuates, and there is a significant drop in its performance for the whole dataset (n=1940); however, this classifier is generally the second-best performer in terms of the F-measure, after the CMA-ES-MLP. The execution times for all classifiers across sample sizes are shown in 
                <xref ref-type="fig" rid="f7">
Figure 7</xref>.</p>
            <p>
                <xref ref-type="fig" rid="f7">
Figure 7</xref> shows that the basic MLP was the fastest to train, followed by CMA-ES-MLP. For the GA-MLP and GOA-MLP algorithms, there was an increasing trend whereby, as the sample sizes increased, the execution time also increased for these classifiers, but GA-MLP was the most expensive model when the sample size was at least 5114. The means of the classification metrics for all classifiers across the sample sizes are shown in 
                <xref ref-type="fig" rid="f8">
Figure 8</xref>.</p>
            <p>
                <xref ref-type="fig" rid="f8">
Figure 8</xref> shows that the CMA-ES-MLP algorithm consistently achieved the highest mean accuracy across different sample sizes (except for the 50% sample size (n=3204)), indicating that it is the most accurate model overall. The GA-MLP and GOA-MLP algorithms showed varied performance, but for most sample sizes (10% (n=626), 20% (n=1280), 30% (n=1940), 40% (n=2592), 70% (n=3582), 80% (n=5114), and 100% (n=6398)), GA-GOA-MLP provided more accurate classifications than GA-MLP. The basic MLP algorithm consistently achieved the lowest mean classification accuracy, indicating its poor performance compared to its optimized variates. In general, the classifiers can be ranked in descending order of mean classification accuracy: CMA-ES-MLP, GOA-MLP, GA-MLP, and basic MLP.</p>
        </sec>
        <sec id="sec11" sec-type="conclusion">
            <title>1.6 Conclusion</title>
            <p>This study was conducted to determine the impact of sample size on the classification ability and efficiency of GA, GOA, and CMA-CS, which are optimization algorithms for the MLP. The comparison was performed using line graphs of precision, F-measure, accuracy, sensitivity/recall, specificity, and execution time for basic MLP, GA-MLP, GOA-MLP, and CMA-ES-MLP across the ten samples. The line charts did not reveal a defined relationship between the performance of the classifiers across the sample sizes because the plots varied rapidly as the sample size increased. However, the execution time showed a clearer pattern as the sample size increased. The results revealed that GOA-MLP had more stable classification accuracy values than its competitors. Generally, the sensitivity rates for GA-MLP, GOA-MLP, and CMA-ES-MLP were more stable across the sample sizes relative to those derived from the basic MLP without optimization, since they did not fluctuate rapidly like those of the basic MLP.</p>
            <p>The researchers concluded that the CMA-ES-MLP is the best model for this study in general because it maintains high rates of classification accuracy, F-measure, precision, and specificity for most sample sizes, and was the second-best performing classifier execution time. Furthermore, the mean classification metric results revealed that the CMA-ES-MLP algorithm consistently achieved the highest mean accuracy across nine different sample sizes, indicating that it is the most accurate model overall. The CMA-ES-MLP optimizer was identified as the most efficient optimization algorithm for an optimum MLP, as it was generally the most accurate optimizer, and it provided a lower execution time than GA-MLP and GOA-MLP, which did not increase noticeably as the sample size increased, implying that the CMA-ES optimizer is the most efficient optimizer for an optimum MLP compared with GA and GOA across all samples.</p>
            <p>Generally, the sample size affects the performance of the MLP because the values of the classification metrics do not remain constant as the sample size changes. However, the results revealed that the values of the accuracy metrics for all the models fluctuated as the sample size increased, and there was no consistent increase or decrease in the classification performance of the algorithms as the sample size increased. On the other hand, the execution times for the GA and GOA optimizers increased as the sample size increased, but the execution time of the basic MLP remained the lowest and was almost constant as the sample size increased. Although CMA-ES had the lowest execution time compared to GOA and GA, it increased slightly when the sample size was at least 5114.</p>
        </sec>
        <sec id="sec12">
            <title>Contribution</title>
            <p>This study compared the performance of the basic MLP to MLPs optimized using GA, GOA, and CMA-ES, which has not been done in other studies; therefore, this is a contribution to the literature on MLP and optimization algorithms. Through this study, it is now known that the performance of MLP, GA-MLP, CMA-ES-MLP, GO-MLP, and GOA-MLP varies rapidly across the sample sizes, so we cannot generalize that the larger the sample size, the better the model, or vice versa. This novel knowledge extends the literature on ML classifiers, especially MLP. From the execution time results, the change in sample sizes revealed that the basic MLP was the fastest, followed by the CMA-ES-MLP, whereas in the other models, as the sample size increased, the execution time also increased. This implies that the CMA-ES-MLP is not just the most accurate, but also less expensive and has proven to be more stable in terms of training time as the sample size increases. This implies that the training time for the CMA-ES-MLP is least affected by the change in the datasets and using it with large datasets is likely not to affect its training time significantly as opposed to the GA and GOA. These results contribute novel knowledge about the efficiency of CMA-ES in optimizing the MLP.</p>
            <p>The findings of this study also showed that training the MLP and its optimized variates on different samples that are randomly drawn from a larger dataset may aid in identifying the sample that can yield the most accurate classifier, as opposed to training the classifiers using one training dataset. More specifically, the selected model CMA-ES-MLP yielded the highest accuracy (overall classification accuracy, precision, and specificity) when the sample size was 5114, which is less than that of the mother dataset of 6398 observations. The best CMA-ES-MLP identified in this study competes well with classifiers that were the best performers from previous studies using the same dataset. For example, the best CMA-ES-MLP that was identified as the performer in this study has a classification accuracy of 90.18%, which is higher than that of the Meta-cost MLP (77.48%),
                <sup>
                    <xref ref-type="bibr" rid="ref29">29</xref>
                </sup> RF (86.08%),
                <sup>
                    <xref ref-type="bibr" rid="ref41">41</xref>
                </sup> and DT (87.5%).
                <sup>
                    <xref ref-type="bibr" rid="ref30">30</xref>
                </sup> This comparison does not ignore the fact that in some previous studies, the setting was different from that used in our study. It is recommended that a future study using the classifiers that were identified as the best from previous studies in 
                <xref ref-type="table" rid="T2">
Table 2</xref> and the CMA-ES-MLP from this study be conducted to compare these classifiers under the same setting. The recommendations drawn from this study contribute new possible areas of research around ML classifiers, and the implications of the findings from this study contribute to a novel, accurate, and efficient approach to predicting the likelihood of a potential client subscribing to a term deposit using CMA-ES-MLP.</p>
        </sec>
        <sec id="sec13">
            <title>Ethical considerations</title>
            <p>This paper was written using parts of a PhD study whose proposal was presented at the school colloquium, where it received approval. It was subsequently submitted to the School Scientific Committee for approval as well. Then the proposal approved by the North-West University&#x2019;s Faculty of Economic and Management Sciences Research Scientific Committee (FEMS-REC) on 30 June 2023, with the study classified as minimal risk. The ethics approval number is NWU-00684-22-A4.</p>
        </sec>
    </body>
    <back>
        <sec id="sec16" sec-type="data-availability">
            <title>Data availability</title>
            <p>The data used in this study is a secondary dataset on direct marketing campaigns of a Portuguese banking institution named &#x201c;Bank Marketing.&#x201d; The dataset was obtained from the UCI Machine Learning Repository by the Center for Machine Learning and Intelligent Systems. The primary contributor for the data is.
                <sup>
                    <xref ref-type="bibr" rid="ref28">28</xref>
                </sup> The dataset can be accessed through 
                <ext-link ext-link-type="uri" xlink:href="https://archive.ics.uci.edu/ml/datasets/Bank+Marketing">https://archive.ics.uci.edu/ml/datasets/Bank+Marketing</ext-link>. DOI: 10.24432/C5K306. The researchers took some random samples to mimic different sample sizes so that they can successfully achieve the objective of study which is to determine the impact of sample size on the performance of optimisation algorithms for the MLP used in the prediction of client subscription to a term deposit. The dataset is licensed under 
                <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY 4.0 license</ext-link> which allows for its sharing and adaptation for any purpose (which imply that research purposes is included) provided that the appropriate credit is given (which is done in this paper in section 1.3.1).</p>
        </sec>
        <ack>
            <title>Acknowledgements</title>
            <p>The authors of this research acknowledge North-West University (NWU) for availing resources to support this research.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mythili</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shanavas</surname>
                            <given-names>AM</given-names>
                        </name>
</person-group>:
                    <article-title>An Analysis of students&#x2019; performance using classification algorithms.</article-title>
                    <source>

                        <italic toggle="yes">IOSR Journal of Computer Engineering.</italic>
</source>
                    <year>2014</year>;<volume>16</volume>(<issue>1</issue>):<fpage>63</fpage>&#x2013;<lpage>69</lpage>.
                    <pub-id pub-id-type="doi">10.9790/0661-16136369</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tomar</surname>
                            <given-names>GS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chaudhari</surname>
                            <given-names>NS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Barbosa</surname>
                            <given-names>JLV</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <source>

                        <italic toggle="yes">International conference on intelligent computing and smart communication 2019: Proceedings of ICSC 2019.</italic>
</source>
                    <publisher-name>Springer Nature</publisher-name>;<year>2020</year>.</mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jan</surname>
                            <given-names>Z</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <chapter-title>Performance of machine learning techniques in protein fold recognition problem.</chapter-title>
                    <source>

                        <italic toggle="yes">2010 International Conference on Information Science and Applications.</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2010</year>; pp.<fpage>1</fpage>&#x2013;<lpage>6</lpage>.</mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Stottinger</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hanbury</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sebe</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sparse color interest points for image retrieval and object categorization.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Trans. Image Process.</italic>
</source>
                    <year>2012</year>;<volume>21</volume>(<issue>5</issue>):<fpage>2681</fpage>&#x2013;<lpage>2692</lpage>.
                    <pub-id pub-id-type="pmid">22294029</pub-id>
                    <pub-id pub-id-type="doi">10.1109/TIP.2012.2186143</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gulia</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vohra</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rani</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Liver patient classification using intelligent techniques.</article-title>
                    <source>

                        <italic toggle="yes">International Journal of Computer Science and Information Technologies.</italic>
</source>
                    <year>2014</year>;<volume>5</volume>(<issue>4</issue>):<fpage>5110</fpage>&#x2013;<lpage>5115</lpage>.</mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Di</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shafiq</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>AlRegib</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <chapter-title>Patch-level MLP classification for improved fault detection.</chapter-title>
                    <source>

                        <italic toggle="yes">SEG Technical Program Expanded Abstracts 2018.</italic>
</source>
                    <publisher-name>Society of Exploration Geophysicists</publisher-name>;<year>2018</year>; pp.<fpage>2211</fpage>&#x2013;<lpage>2215</lpage>.</mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>&#x00c7;&#x0131;&#x011f;&#x015f;ar</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>&#x00dc;nal</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Comparison of data mining classification algorithms determining the default risk.</article-title>
                    <source>

                        <italic toggle="yes">Sci. Program.</italic>
</source>
                    <year>2019</year>;<volume>2019</volume>:<fpage>1</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="doi">10.1155/2019/8706505</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jamuna</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Karpagavalli</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vijaya</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <chapter-title>Classification of seed cotton yield based on the growth stages of cotton crop using machine learning techniques.</chapter-title>
                    <source>

                        <italic toggle="yes">2010 International Conference on Advances in Computer Engineering.</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2010</year>; pp.<fpage>312</fpage>&#x2013;<lpage>315</lpage>.</mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Harikrishnan</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sethi</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pandey</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <chapter-title>Handwritten digit recognition with feed-forward multi-layer perceptron and convolutional neural network architectures.</chapter-title>
                    <source>

                        <italic toggle="yes">2020 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2020</year>; pp.<fpage>398</fpage>&#x2013;<lpage>402</lpage>.</mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rouse</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <chapter-title>Internet of Things (IOT),[ONLINE]</chapter-title>
                    <source>

                        <italic toggle="yes">Internet-of-Things [Acedido em 23 Junho 2015].</italic>
</source>
                    <year>2014</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://whatis.techtarget.com/definition">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hull</surname>
                            <given-names>DG</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Optimal control theory for applications.</italic>
</source>
                    <publisher-name>Springer Science &amp; Business Media</publisher-name>;<year>2013</year>.</mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fern&#x00e1;ndez</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>L&#x00f3;pez</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Galar</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches.</article-title>
                    <source>

                        <italic toggle="yes">Knowl.-Based Syst.</italic>
</source>
                    <year>2013</year>;<volume>42</volume>:<fpage>97</fpage>&#x2013;<lpage>110</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.knosys.2013.01.018</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>HARIT</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Optimizing Weights And Biases in MLP Using Whale Optimization Algorithm.</italic>
</source>
                    <publisher-name>Durham University</publisher-name>;<year>2022</year>.</mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Abdel-Basset</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>El-Shahat</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>El-Henawy</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A new fusion of grey wolf optimizer algorithm with a two-phase mutation for feature selection.</article-title>
                    <source>

                        <italic toggle="yes">Expert Syst. Appl.</italic>
</source>
                    <year>2020</year>;<volume>139</volume>:<fpage>112824</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.eswa.2019.112824</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bhesdadiya</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jangir</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jangir</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Training multi-layer perceptron in neural network using whale optimization algorithm.</article-title>
                    <source>

                        <italic toggle="yes">Indian J. Sci. Technol.</italic>
</source>
                    <year>2016</year>;<volume>9</volume>(<issue>19</issue>):<fpage>28</fpage>&#x2013;<lpage>36</lpage>.</mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Alboaneen</surname>
                            <given-names>DA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tianfield</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <chapter-title>Sentiment analysis via multi-layer perceptron trained by meta-heuristic optimisation.</chapter-title>
                    <source>

                        <italic toggle="yes">2017 IEEE International Conference on Big Data (Big Data).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2017</year>; pp.<fpage>4630</fpage>&#x2013;<lpage>4635</lpage>.</mixed-citation>
            </ref>
            <ref id="ref17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Aljarah</surname>
                            <given-names>I</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Faris</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mirjalili</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Optimizing connection weights in neural networks using the whale optimization algorithm.</article-title>
                    <source>

                        <italic toggle="yes">Soft. Comput.</italic>
</source>
                    <year>2018</year>;<volume>22</volume>:<fpage>1</fpage>&#x2013;<lpage>15</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s00500-016-2442-1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Anderson</surname>
                            <given-names>SF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kelley</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maxwell</surname>
                            <given-names>SE</given-names>
                        </name>
</person-group>:
                    <article-title>Sample-size planning for more accurate statistical power: A method adjusting sample effect sizes for publication bias and uncertainty.</article-title>
                    <source>

                        <italic toggle="yes">Psychol. Sci.</italic>
</source>
                    <year>2017</year>;<volume>28</volume>(<issue>11</issue>):<fpage>1547</fpage>&#x2013;<lpage>1562</lpage>.
                    <pub-id pub-id-type="pmid">28902575</pub-id>
                    <pub-id pub-id-type="doi">10.1177/0956797617723724</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kyriazos</surname>
                            <given-names>TA</given-names>
                        </name>
</person-group>:
                    <article-title>Applied psychometrics: sample size and sample power considerations in factor analysis (EFA, CFA) and SEM in general.</article-title>
                    <source>

                        <italic toggle="yes">Psychology.</italic>
</source>
                    <year>2018</year>;<volume>09</volume>(<issue>08</issue>):<fpage>2207</fpage>&#x2013;<lpage>2230</lpage>.
                    <pub-id pub-id-type="doi">10.4236/psych.2018.98126</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <label>20</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Uttley</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Power analysis, sample size, and assessment of statistical assumptions&#x2014;Improving the evidential value of lighting research.</italic>
</source>
                    <publisher-name>Leukos</publisher-name>;<year>2019</year>.</mixed-citation>
            </ref>
            <ref id="ref21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Riley</surname>
                            <given-names>RD</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Calculating the sample size required for developing a clinical prediction model.</article-title>
                    <source>

                        <italic toggle="yes">BMJ.</italic>
</source>
                    <year>2020</year>;<volume>368</volume>.
                    <pub-id pub-id-type="doi">10.1136/bmj.m441</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gibson</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Huisman</surname>
                            <given-names>HJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Designing image segmentation studies: statistical power, sample size and reference standard quality.</article-title>
                    <source>

                        <italic toggle="yes">Med. Image Anal.</italic>
</source>
                    <year>2017</year>;<volume>42</volume>:<fpage>44</fpage>&#x2013;<lpage>59</lpage>.
                    <pub-id pub-id-type="pmid">28772163</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.media.2017.07.004</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5666910</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Taud</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mas</surname>
                            <given-names>J-F</given-names>
                        </name>
</person-group>:
                    <article-title>Multilayer perceptron (MLP).</article-title>
                    <source>

                        <italic toggle="yes">Geomatic approaches for modeling land change scenarios.</italic>
</source>
                    <year>2018</year>;<fpage>451</fpage>&#x2013;<lpage>455</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-3-319-60801-3_27</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bisong</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bisong</surname>
                            <given-names>E</given-names>
                        </name>
</person-group>:
                    <article-title>The multilayer perceptron (MLP).</article-title>
                    <source>

                        <italic toggle="yes">Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners.</italic>
</source>
                    <year>2019</year>;<fpage>401</fpage>&#x2013;<lpage>405</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-1-4842-4470-8_31</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zare</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pourghasemi</surname>
                            <given-names>HR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vafakhah</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Landslide susceptibility mapping at Vaz Watershed (Iran) using an artificial neural network model: a comparison between multilayer perceptron (MLP) and radial basic function (RBF) algorithms.</article-title>
                    <source>

                        <italic toggle="yes">Arab. J. Geosci.</italic>
</source>
                    <year>2013</year>;<volume>6</volume>:<fpage>2873</fpage>&#x2013;<lpage>2888</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s12517-012-0610-x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fath</surname>
                            <given-names>AH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Madanifar</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Abbasi</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>Implementation of multilayer perceptron (MLP) and radial basis function (RBF) neural networks to predict solution gas-oil ratio of crude oil systems.</article-title>
                    <source>

                        <italic toggle="yes">Petroleum.</italic>
</source>
                    <year>2020</year>;<volume>6</volume>(<issue>1</issue>):<fpage>80</fpage>&#x2013;<lpage>91</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.petlm.2018.12.002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mohammadi</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ataei</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kakaei</surname>
                            <given-names>RK</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Prediction of the production rate of chain saw machine using the multilayer perceptron (MLP) neural network.</article-title>
                    <source>

                        <italic toggle="yes">Civil Engineering Journal.</italic>
</source>
                    <year>2018</year>;<volume>4</volume>(<issue>7</issue>):<fpage>1575</fpage>&#x2013;<lpage>1583</lpage>.
                    <pub-id pub-id-type="doi">10.28991/cej-0309196</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Moro</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cortez</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rita</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>A data-driven approach to predict the success of bank telemarketing.</article-title>
                    <source>

                        <italic toggle="yes">Decis. Support. Syst.</italic>
</source>
                    <year>2014</year>;<volume>62</volume>:<fpage>22</fpage>&#x2013;<lpage>31</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.dss.2014.03.001</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ghatasheh</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Faris</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>AlTaharwa</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Business analytics in telemarketing: Cost-sensitive analysis of bank campaigns using artificial neural networks.</article-title>
                    <source>

                        <italic toggle="yes">Appl. Sci.</italic>
</source>
                    <year>2020</year>;<volume>10</volume>(<issue>7</issue>):<fpage>2581</fpage>.
                    <pub-id pub-id-type="doi">10.3390/app10072581</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zaki</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khodadadi</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lim</surname>
                            <given-names>WH</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Predictive Analytics and Machine Learning in Direct Marketing for Anticipating Bank Term Deposit Subscriptions.</article-title>
                    <source>

                        <italic toggle="yes">American Journal of Business and Operations Research.</italic>
</source>
                    <year>2024</year>;<volume>11</volume>(<issue>1</issue>):<fpage>79</fpage>&#x2013;<lpage>88</lpage>.
                    <pub-id pub-id-type="doi">10.54216/AJBOR.110110</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ghaleb</surname>
                            <given-names>SA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mohamad</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fadzli</surname>
                            <given-names>SA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>E-mail spam classification using grasshopper optimization algorithm and neural networks.</article-title>
                    <source>

                        <italic toggle="yes">Comput., Mater. Continua.</italic>
</source>
                    <year>2022</year>;<volume>71</volume>(<issue>3</issue>):<fpage>4749</fpage>&#x2013;<lpage>4766</lpage>.
                    <pub-id pub-id-type="doi">10.32604/cmc.2022.020472</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref32">
                <label>32</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Das</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jena</surname>
                            <given-names>AK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nayak</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <chapter-title>A novel PSO based back propagation learning-MLP (PSO-BP-MLP) for classification.</chapter-title>
                    <source>

                        <italic toggle="yes">Computational Intelligence in Data Mining-Volume 2: Proceedings of the International Conference on CIDM, 20-21 December 2014.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2015</year>; pp.<fpage>461</fpage>&#x2013;<lpage>471</lpage>.</mixed-citation>
            </ref>
            <ref id="ref33">
                <label>33</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Michira</surname>
                            <given-names>MK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rimiru</surname>
                            <given-names>RM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mwangi</surname>
                            <given-names>WR</given-names>
                        </name>
</person-group>:
                    <chapter-title>Improved multilayer perceptron neural networks weights and biases based on the grasshopper optimization algorithm to predict student performance on ambient learning.</chapter-title>
                    <source>

                        <italic toggle="yes">Proceedings of the 2023 7th international conference on machine learning and soft computing.</italic>
</source>
                    <year>2023</year>; pp.<fpage>61</fpage>&#x2013;<lpage>68</lpage>.</mixed-citation>
            </ref>
            <ref id="ref34">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yuan</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Moayedi</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>The performance of six neural-evolutionary classification techniques combined with multi-layer perception in two-layered cohesive slope stability analysis and failure recognition.</article-title>
                    <source>

                        <italic toggle="yes">Eng. Comput.</italic>
</source>
                    <year>2020</year>;<volume>36</volume>:<fpage>1705</fpage>&#x2013;<lpage>1714</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s00366-019-00791-4</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref35">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Abdollahi</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Keshandehghan</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gardaneh</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Accurate detection of breast cancer metastasis using a hybrid model of artificial intelligence algorithm.</article-title>
                    <source>

                        <italic toggle="yes">Archives of Breast Cancer.</italic>
</source>
                    <year>2020</year>;<fpage>18</fpage>&#x2013;<lpage>24</lpage>.
                    <pub-id pub-id-type="doi">10.32768/abc.20207118-24</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mishra</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tripathy</surname>
                            <given-names>HK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mallick</surname>
                            <given-names>PK</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>EAGA-MLP&#x2014;an enhanced and adaptive hybrid classification model for diabetes diagnosis.</article-title>
                    <source>

                        <italic toggle="yes">Sensors.</italic>
</source>
                    <year>2020</year>;<volume>20</volume>(<issue>14</issue>):<fpage>4036</fpage>.
                    <pub-id pub-id-type="pmid">32698547</pub-id>
                    <pub-id pub-id-type="doi">10.3390/s20144036</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7411768</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <label>37</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dweekat</surname>
                            <given-names>OY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lam</surname>
                            <given-names>SS</given-names>
                        </name>
</person-group>:
                    <chapter-title>Cervical cancer diagnosis using an integrated system of principal component analysis, genetic algorithm, and multilayer perceptron.</chapter-title>
                    <source>

                        <italic toggle="yes">Healthcare.</italic>
</source>
                    <publisher-name>MDPI</publisher-name>;<year>2022</year>; vol.<volume>10</volume>(<issue>10</issue>): p.<fpage>2002</fpage>.</mixed-citation>
            </ref>
            <ref id="ref38">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Financial distress prediction with a novel diversity-considered GA-MLP ensemble algorithm.</article-title>
                    <source>

                        <italic toggle="yes">Neural. Process. Lett.</italic>
</source>
                    <year>2022</year>;<volume>54</volume>(<issue>2</issue>):<fpage>1175</fpage>&#x2013;<lpage>1194</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s11063-021-10674-9</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref39">
                <label>39</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ghaleb</surname>
                            <given-names>SA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mohamad</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Abdullah</surname>
                            <given-names>EFHS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <chapter-title>Spam classification based on supervised learning using grasshopper optimization algorithm and artificial neural network.</chapter-title>
                    <source>

                        <italic toggle="yes">Advances in Cyber Security: Second International Conference, ACeS 2020, Penang, Malaysia, December 8-9, 2020, Revised Selected Papers 2.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2021</year>; pp.<fpage>420</fpage>&#x2013;<lpage>434</lpage>.</mixed-citation>
            </ref>
            <ref id="ref40">
                <label>40</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Moro</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Laureano</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cortez</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Using data mining for bank direct marketing: An application of the crisp-dm methodology.</article-title>
                    <year>2011</year>.</mixed-citation>
            </ref>
            <ref id="ref41">
                <label>41</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Asare-Frempong</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jayabalan</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <chapter-title>Predicting customer response to bank direct telemarketing campaign.</chapter-title>
                    <source>

                        <italic toggle="yes">2017 International Conference on Engineering Technology and Technopreneurship (ICE2T).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2017</year>; pp.<fpage>1</fpage>&#x2013;<lpage>4</lpage>.</mixed-citation>
            </ref>
            <ref id="ref42">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Moro</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cortez</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rita</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Using customer lifetime value and neural networks to improve the prediction of bank deposit subscription in telemarketing campaigns.</article-title>
                    <source>

                        <italic toggle="yes">Neural Comput. &amp; Applic.</italic>
</source>
                    <year>2015</year>;<volume>26</volume>:<fpage>131</fpage>&#x2013;<lpage>139</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s00521-014-1703-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <label>43</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Elsalamony</surname>
                            <given-names>HA</given-names>
                        </name>
</person-group>:
                    <article-title>Bank direct marketing analysis of data mining techniques.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Comput. Appl.</italic>
</source>
                    <year>2014</year>;<volume>85</volume>(<issue>7</issue>):<fpage>12</fpage>&#x2013;<lpage>22</lpage>.
                    <pub-id pub-id-type="doi">10.5120/14852-3218</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref44">
                <label>44</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>&#x0141;ady&#x017c;y&#x0144;ski</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>&#x017b;bikowski</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gawrysiak</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Direct marketing campaigns in retail banking with the use of deep learning and random forests.</article-title>
                    <source>

                        <italic toggle="yes">Expert Syst. Appl.</italic>
</source>
                    <year>2019</year>;<volume>134</volume>:<fpage>28</fpage>&#x2013;<lpage>35</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.eswa.2019.05.020</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref45">
                <label>45</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Pavlovi&#x0107;</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Relji&#x0107;</surname>
                            <given-names>MM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ja&#x0107;imovi&#x0107;</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Application of Data Mining in direct marketing.</article-title>
                    <source>

                        <italic toggle="yes">Industrija.</italic>
</source>
                    <year>2014</year>;<volume>42</volume>(<issue>1</issue>):<fpage>189</fpage>&#x2013;<lpage>201</lpage>.
                    <pub-id pub-id-type="doi">10.5937/industrija42-5087</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref46">
                <label>46</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Karim</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rahman</surname>
                            <given-names>RM</given-names>
                        </name>
</person-group>:
                    <article-title>Decision tree and naive bayes algorithm for classification and generation of actionable knowledge for direct marketing.</article-title>
                    <year>2013</year>.</mixed-citation>
            </ref>
            <ref id="ref47">
                <label>47</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Street</surname>
                            <given-names>WN</given-names>
                        </name>
</person-group>:
                    <article-title>An intelligent system for customer targeting: a data mining approach.</article-title>
                    <source>

                        <italic toggle="yes">Decis. Support. Syst.</italic>
</source>
                    <year>2004</year>;<volume>37</volume>(<issue>2</issue>):<fpage>215</fpage>&#x2013;<lpage>228</lpage>.
                    <pub-id pub-id-type="doi">10.1016/S0167-9236(03)00008-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref48">
                <label>48</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fern&#x00e1;ndez</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Garcia</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Herrera</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary.</article-title>
                    <source>

                        <italic toggle="yes">J. Artif. Intell. Res.</italic>
</source>
                    <year>2018</year>;<volume>61</volume>:<fpage>863</fpage>&#x2013;<lpage>905</lpage>.
                    <pub-id pub-id-type="doi">10.1613/jair.1.11192</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref49">
                <label>49</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Karabulut</surname>
                            <given-names>EM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ibrikci</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>Effective automated prediction of vertebral column pathologies based on logistic model tree with SMOTE preprocessing.</article-title>
                    <source>

                        <italic toggle="yes">J. Med. Syst.</italic>
</source>
                    <year>2014</year>;<volume>38</volume>:<fpage>1</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10916-014-0050-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref50">
                <label>50</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bahaweres</surname>
                            <given-names>RB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Agustian</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hermadi</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <chapter-title>Software defect prediction using neural network based smote.</chapter-title>
                    <source>

                        <italic toggle="yes">2020 7th International Conference on Electrical Engineering, Computer Sciences and Informatics (EECSI).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2020</year>; pp.<fpage>71</fpage>&#x2013;<lpage>76</lpage>.</mixed-citation>
            </ref>
            <ref id="ref51">
                <label>51</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>X</given-names>
                        </name>
</person-group>:
                    <chapter-title>Phishing detection method based on borderline-smote deep belief network.</chapter-title>
                    <source>

                        <italic toggle="yes">Security, Privacy, and Anonymity in Computation, Communication, and Storage: SpaCCS 2017 International Workshops, Guangzhou, China, December 12-15, 2017, Proceedings 10.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2017</year>; pp.<fpage>45</fpage>&#x2013;<lpage>53</lpage>.</mixed-citation>
            </ref>
            <ref id="ref52">
                <label>52</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Su</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Song</surname>
                            <given-names>X</given-names>
                        </name>
</person-group>:
                    <chapter-title>Research on intrusion detection method based on improved smote and XGBoost.</chapter-title>
                    <source>

                        <italic toggle="yes">Proceedings of the 8th International Conference on Communication and Network Security.</italic>
</source>
                    <year>2018</year>; pp.<fpage>37</fpage>&#x2013;<lpage>41</lpage>.</mixed-citation>
            </ref>
            <ref id="ref53">
                <label>53</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hussein</surname>
                            <given-names>AS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yohannese</surname>
                            <given-names>CW</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A-SMOTE: A new preprocessing approach for highly imbalanced datasets by improving SMOTE.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Comput. Intell. Syst.</italic>
</source>
                    <year>2019</year>;<volume>12</volume>(<issue>2</issue>):<fpage>1412</fpage>&#x2013;<lpage>1422</lpage>.
                    <pub-id pub-id-type="doi">10.2991/ijcis.d.191114.002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref54">
                <label>54</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Olazaran</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>A sociological study of the official history of the perceptrons controversy.</article-title>
                    <source>

                        <italic toggle="yes">Soc. Stud. Sci.</italic>
</source>
                    <year>1996</year>;<volume>26</volume>(<issue>3</issue>):<fpage>611</fpage>&#x2013;<lpage>659</lpage>.
                    <pub-id pub-id-type="doi">10.1177/030631296026003005</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref55">
                <label>55</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bishop</surname>
                            <given-names>CM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nasrabadi</surname>
                            <given-names>NM</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Pattern recognition and machine learning.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2006</year>; (no.<issue>4</issue>).</mixed-citation>
            </ref>
            <ref id="ref56">
                <label>56</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gaikwad</surname>
                            <given-names>NB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tiwari</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Keskar</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Efficient FPGA implementation of multilayer perceptron for real-time human activity classification.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Access.</italic>
</source>
                    <year>2019</year>;<volume>7</volume>:<fpage>26696</fpage>&#x2013;<lpage>26706</lpage>.
                    <pub-id pub-id-type="doi">10.1109/ACCESS.2019.2900084</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref57">
                <label>57</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yan</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shan</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Deep image: Scaling up image recognition.</article-title>
                    <source>

                        <italic toggle="yes">arXiv preprint arXiv:1501.02876.</italic>
</source>
                    <year>2015</year>; vol.<volume>7</volume>(<issue>8</issue>): p.<fpage>4</fpage>.</mixed-citation>
            </ref>
            <ref id="ref58">
                <label>58</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kanan</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cottrell</surname>
                            <given-names>GW</given-names>
                        </name>
</person-group>:
                    <article-title>Color-to-grayscale: does the method matter in image recognition?.</article-title>
                    <source>

                        <italic toggle="yes">PloS one.</italic>
</source>
                    <year>2012</year>;<volume>7</volume>(<issue>1</issue>):<fpage>e29740</fpage>.
                    <pub-id pub-id-type="pmid">22253768</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0029740</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3254613</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref59">
                <label>59</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yu</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Deng</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Automatic speech recognition.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2016</year>.</mixed-citation>
            </ref>
            <ref id="ref60">
                <label>60</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Parloff</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <chapter-title>Why deep learning is suddenly changing your life.</chapter-title>
                    <source>

                        <italic toggle="yes">Fortune.</italic>
</source>
                    <publisher-name>New York: Time Inc</publisher-name>;<year>2016</year>.</mixed-citation>
            </ref>
            <ref id="ref62">
                <label>61</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Aggarwal</surname>
                            <given-names>CC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhai</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <chapter-title>A survey of text classification algorithms.</chapter-title>
                    <source>

                        <italic toggle="yes">Mining text data.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2012</year>; pp.<fpage>163</fpage>&#x2013;<lpage>222</lpage>.</mixed-citation>
            </ref>
            <ref id="ref63">
                <label>62</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Miner</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Practical text mining and statistical analysis for non-structured text data applications.</italic>
</source>
                    <publisher-name>Academic Press</publisher-name>;<year>2012</year>.</mixed-citation>
            </ref>
            <ref id="ref64">
                <label>63</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ahishakiye</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Taremwa</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Omulo</surname>
                            <given-names>EO</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Crime prediction using decision tree (J48) classification algorithm.</article-title>
                    <source>

                        <italic toggle="yes">International Journal of Computer and Information Technology.</italic>
</source>
                    <year>2017</year>;<volume>6</volume>(<issue>3</issue>):<fpage>188</fpage>&#x2013;<lpage>195</lpage>.</mixed-citation>
            </ref>
            <ref id="ref65">
                <label>64</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Colak</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yesilbudak</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bayindir</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>Daily photovoltaic power prediction enhanced by hybrid GWO-MLP, ALO-MLP and WOA-MLP models using meteorological information.</article-title>
                    <source>

                        <italic toggle="yes">Energies.</italic>
</source>
                    <year>2020</year>;<volume>13</volume>(<issue>4</issue>):<fpage>901</fpage>.
                    <pub-id pub-id-type="doi">10.3390/en13040901</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref66">
                <label>65</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nwankpa</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ijomah</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gachagan</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Activation functions: Comparison of trends in practice and research for deep learning.</article-title>
                    <source>

                        <italic toggle="yes">arXiv preprint arXiv:1811.03378.</italic>
</source>
                    <year>2018</year>.</mixed-citation>
            </ref>
            <ref id="ref67">
                <label>66</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hansen</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>M&#x00fc;ller</surname>
                            <given-names>SD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Koumoutsakos</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES).</article-title>
                    <source>

                        <italic toggle="yes">Evol. Comput.</italic>
</source>
                    <year>2003</year>;<volume>11</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>18</lpage>.
                    <pub-id pub-id-type="pmid">12804094</pub-id>
                    <pub-id pub-id-type="doi">10.1162/106365603321828970</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref68">
                <label>67</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hansen</surname>
                            <given-names>N</given-names>
                        </name>
</person-group>:
                    <article-title>The CMA evolution strategy: A tutorial.</article-title>
                    <source>

                        <italic toggle="yes">arXiv preprint arXiv:1604.00772.</italic>
</source>
                    <year>2016</year>.</mixed-citation>
            </ref>
            <ref id="ref69">
                <label>68</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Balakrishnan</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wainwright</surname>
                            <given-names>MJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yu</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>Statistical guarantees for the EM algorithm: From population to sample-based analysis.</article-title>
                    <year>2017</year>.</mixed-citation>
            </ref>
            <ref id="ref70">
                <label>69</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Arsenault</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Poulin</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>C&#x00f4;t&#x00e9;</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Comparison of stochastic optimization algorithms in hydrological model calibration.</article-title>
                    <source>

                        <italic toggle="yes">J. Hydrol. Eng.</italic>
</source>
                    <year>2014</year>;<volume>19</volume>(<issue>7</issue>):<fpage>1374</fpage>&#x2013;<lpage>1384</lpage>.
                    <pub-id pub-id-type="doi">10.1061/(ASCE)HE.1943-5584.0000938</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref71">
                <label>70</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Suominen</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Brink</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Salmi</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>Parameter estimation of complex chemical kinetics with covariance matrix adaptation evolution strategy.</article-title>
                    <source>

                        <italic toggle="yes">Match-Communications in Mathematical and Computer Chemistry.</italic>
</source>
                    <year>2012</year>;<volume>68</volume>(<issue>2</issue>):<fpage>469</fpage>.</mixed-citation>
            </ref>
            <ref id="ref72">
                <label>71</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nielsen</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Emtiyaz</surname>
                            <given-names>KM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <chapter-title>Tractable structured natural-gradient descent using local parameterizations.</chapter-title>
                    <source>

                        <italic toggle="yes">International Conference on Machine Learning.</italic>
</source>
                    <publisher-name>PMLR</publisher-name>;<year>2021</year>; pp.<fpage>6680</fpage>&#x2013;<lpage>6691</lpage>.</mixed-citation>
            </ref>
            <ref id="ref73">
                <label>72</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Burgin</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Eberbach</surname>
                            <given-names>E</given-names>
                        </name>
</person-group>:
                    <article-title>Evolutionary Turing in the Context of Evolutionary Machines.</article-title>
                    <source>

                        <italic toggle="yes">arXiv preprint arXiv:1304.3762.</italic>
</source>
                    <year>2013</year>.</mixed-citation>
            </ref>
            <ref id="ref74">
                <label>73</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chernukhin</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zingg</surname>
                            <given-names>DW</given-names>
                        </name>
</person-group>:
                    <article-title>Multimodality and global optimization in aerodynamic design.</article-title>
                    <source>

                        <italic toggle="yes">AIAA J.</italic>
</source>
                    <year>2013</year>;<volume>51</volume>(<issue>6</issue>):<fpage>1342</fpage>&#x2013;<lpage>1354</lpage>.
                    <pub-id pub-id-type="doi">10.2514/1.J051835</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref75">
                <label>74</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bottou</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Curtis</surname>
                            <given-names>FE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nocedal</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Optimization methods for large-scale machine learning.</article-title>
                    <source>

                        <italic toggle="yes">SIAM Rev.</italic>
</source>
                    <year>2018</year>;<volume>60</volume>(<issue>2</issue>):<fpage>223</fpage>&#x2013;<lpage>311</lpage>.
                    <pub-id pub-id-type="doi">10.1137/16M1080173</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref76">
                <label>75</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>G&#x00e1;lvez</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Iglesias</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>A new iterative mutually coupled hybrid GA&#x2013;PSO approach for curve fitting in manufacturing.</article-title>
                    <source>

                        <italic toggle="yes">Appl. Soft Comput.</italic>
</source>
                    <year>2013</year>;<volume>13</volume>(<issue>3</issue>):<fpage>1491</fpage>&#x2013;<lpage>1504</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.asoc.2012.05.030</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref77">
                <label>76</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Miller</surname>
                            <given-names>TG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Xu</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>De Lamare</surname>
                            <given-names>RC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Distributed spectrum estimation based on alternating mixed discrete-continuous adaptation.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Signal Processing Letters.</italic>
</source>
                    <year>2016</year>;<volume>23</volume>(<issue>4</issue>):<fpage>551</fpage>&#x2013;<lpage>555</lpage>.
                    <pub-id pub-id-type="doi">10.1109/LSP.2016.2539328</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref78">
                <label>77</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hook</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Designing with the body: Somaesthetic interaction design.</italic>
</source>
                    <publisher-name>MIt Press</publisher-name>;<year>2018</year>.</mixed-citation>
            </ref>
            <ref id="ref79">
                <label>78</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hassanat</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Almohammadi</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Alkafaween</surname>
                            <given-names>EA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Choosing mutation and crossover ratios for genetic algorithms&#x2014;a review with a new dynamic approach.</article-title>
                    <source>

                        <italic toggle="yes">Information.</italic>
</source>
                    <year>2019</year>;<volume>10</volume>(<issue>12</issue>):<fpage>390</fpage>.
                    <pub-id pub-id-type="doi">10.3390/info10120390</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref80">
                <label>79</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hermawanto</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Genetic algorithm for solving simple mathematical equality problem.</article-title>
                    <source>

                        <italic toggle="yes">arXiv preprint arXiv:1308.4675.</italic>
</source>
                    <year>2013</year>.</mixed-citation>
            </ref>
            <ref id="ref81">
                <label>80</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Drezner</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Drezner</surname>
                            <given-names>TD</given-names>
                        </name>
</person-group>:
                    <article-title>Biologically inspired parent selection in genetic algorithms.</article-title>
                    <source>

                        <italic toggle="yes">Ann. Oper. Res.</italic>
</source>
                    <year>2020</year>;<volume>287</volume>(<issue>1</issue>):<fpage>161</fpage>&#x2013;<lpage>183</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10479-019-03343-7</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref82">
                <label>81</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mazidi</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fakhrahmad</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sadreddini</surname>
                            <given-names>MH</given-names>
                        </name>
</person-group>:
                    <article-title>A meta-heuristic approach to CVRP problem: local search optimization based on GA and ant colony.</article-title>
                    <year>2016</year>.</mixed-citation>
            </ref>
            <ref id="ref83">
                <label>82</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mirjalili</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mirjalili</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Genetic algorithm.</article-title>
                    <source>

                        <italic toggle="yes">Evolutionary algorithms and neural networks: Theory and applications.</italic>
</source>
                    <year>2019</year>;<fpage>43</fpage>&#x2013;<lpage>55</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-3-319-93025-1_4</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref84">
                <label>83</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Saremi</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mirjalili</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lewis</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Grasshopper optimisation algorithm: theory and application.</article-title>
                    <source>

                        <italic toggle="yes">Adv. Eng. Softw.</italic>
</source>
                    <year>2017</year>;<volume>105</volume>:<fpage>30</fpage>&#x2013;<lpage>47</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.advengsoft.2017.01.004</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref85">
                <label>84</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zakeri</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hokmabadi</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Efficient feature selection method using real-valued grasshopper optimization algorithm.</article-title>
                    <source>

                        <italic toggle="yes">Expert Syst. Appl.</italic>
</source>
                    <year>2019</year>;<volume>119</volume>:<fpage>61</fpage>&#x2013;<lpage>72</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.eswa.2018.10.021</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref86">
                <label>85</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lv</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>A novel meta-matching approach for ontology alignment using grasshopper optimization.</article-title>
                    <source>

                        <italic toggle="yes">Knowl.-Based Syst.</italic>
</source>
                    <year>2020</year>;<volume>201</volume>:<fpage>106050</fpage>.</mixed-citation>
            </ref>
            <ref id="ref87">
                <label>86</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Heidari</surname>
                            <given-names>AA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Faris</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Aljarah</surname>
                            <given-names>I</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>An efficient hybrid multilayer perceptron neural network with grasshopper optimization.</article-title>
                    <source>

                        <italic toggle="yes">Soft. Comput.</italic>
</source>
                    <year>2019</year>;<volume>23</volume>:<fpage>7941</fpage>&#x2013;<lpage>7958</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s00500-018-3424-2</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref88">
                <label>87</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mirjalili</surname>
                            <given-names>SZ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mirjalili</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Saremi</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Grasshopper optimization algorithm for multi-objective optimization problems.</article-title>
                    <source>

                        <italic toggle="yes">Appl. Intell.</italic>
</source>
                    <year>2018</year>;<volume>48</volume>:<fpage>805</fpage>&#x2013;<lpage>820</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10489-017-1019-8</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref89">
                <label>88</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yang</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>Temporal data clustering via weighted clustering ensemble with different representations.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Trans. Knowl. Data Eng.</italic>
</source>
                    <year>2010</year>;<volume>23</volume>(<issue>2</issue>):<fpage>307</fpage>&#x2013;<lpage>320</lpage>.
                    <pub-id pub-id-type="doi">10.1109/TKDE.2010.112</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref90">
                <label>89</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gavrilov</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Anguelov</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Indyk</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <chapter-title>Mining the stock market (extended abstract) which measure is best?.</chapter-title>
                    <source>

                        <italic toggle="yes">Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining.</italic>
</source>
                    <year>2000</year>; pp.<fpage>487</fpage>&#x2013;<lpage>496</lpage>.</mixed-citation>
            </ref>
            <ref id="ref91">
                <label>90</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lalkhen</surname>
                            <given-names>AG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>McCluskey</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Clinical tests: sensitivity and specificity.</article-title>
                    <source>

                        <italic toggle="yes">Contin. Educ. Anaesth. Crit. Care Pain.</italic>
</source>
                    <year>2008</year>;<volume>8</volume>(<issue>6</issue>):<fpage>221</fpage>&#x2013;<lpage>223</lpage>.
                    <pub-id pub-id-type="doi">10.1093/bjaceaccp/mkn041</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref92">
                <label>91</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Shepherd</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wheeler</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Selbie</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Overseer
                        <sup>&#x00ae;</sup>: accuracy, precision, error and uncertainty.</article-title>
                    <source>

                        <italic toggle="yes">Accurate and efficient use of nutrients on farms.</italic>
</source>
                    <year>2013</year>;<fpage>1</fpage>&#x2013;<lpage>8</lpage>.</mixed-citation>
            </ref>
            <ref id="ref93">
                <label>92</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Powers</surname>
                            <given-names>DM</given-names>
                        </name>
</person-group>:
                    <article-title>What the F-measure doesn&#x2019;t measure: Features, Flaws, Fallacies and Fixes.</article-title>
                    <source>

                        <italic toggle="yes">arXiv preprint arXiv:1503.06410.</italic>
</source>
                    <year>2015</year>.</mixed-citation>
            </ref>
            <ref id="ref94">
                <label>
93</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Casas</surname>
                            <given-names>I</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Taheri</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ranjan</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A balanced scheduler with data reuse and replication for scientific workflows in cloud computing systems.</article-title>
                    <source>

                        <italic toggle="yes">Futur. Gener. Comput. Syst.</italic>
</source>
                    <year>2017</year>;<volume>74</volume>:<fpage>168</fpage>&#x2013;<lpage>178</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.future.2015.12.005</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report446950">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.185252.r446950</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Mohammadagha</surname>
                        <given-names>Mohsen</given-names>
                    </name>
                    <xref ref-type="aff" rid="r446950a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0009-0007-0394-353X</uri>
                </contrib>
                <aff id="r446950a1">
                    <label>1</label>University of Texas at Arlington, Arlington, Texas, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>10</day>
                <month>1</month>
                <year>2026</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Mohammadagha M</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport446950" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.168092.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This study compares the performance of three optimization algorithms&#x2014;Genetic Algorithm (GA), Grasshopper Optimization Algorithm (GOA), and Covariance Matrix Adaptation Evolution Strategy (CMA-ES)&#x2014;for optimizing Multilayer Perceptron (MLP) neural networks across varying sample sizes. Using a Portuguese banking dataset (4,521 observations) from the UCI repository, the researchers evaluated how sample size affects classification performance when predicting client subscription to term deposits. The study employed SMOTE for data balancing and tested ten different sample sizes (10%-100%). Results identified CMA-ES-MLP as the best overall performer with high accuracy, precision, and specificity, while maintaining competitive execution time.</p>
            <p> </p>
            <p> </p>
            <p> 
                <bold>1. Is the work clearly and accurately presented and does it cite current literature?</bold>
            </p>
            <p> Recommendations:&#x00a0;Update literature review with 2023-2025 papers.</p>
            <p> </p>
            <p> 
                <bold>2. Is the study design appropriate and is the work technically sound? Yes</bold>
            </p>
            <p> </p>
            <p> 
                <bold>3. Are sufficient details of methods and analysis provided to allow replication by others?</bold>
            </p>
            <p> Recommended
                <bold>:&#x00a0;</bold>Code availability&#x200b;,&#x00a0;Random seeds&#x200b;,&#x00a0;Incomplete hyperparameter specifications&#x200b;, Software versions&#x200b;,&#x00a0;MLP architecture.</p>
            <p> </p>
            <p> 
                <bold>4. If applicable, is the statistical analysis and its interpretation appropriate?</bold>
            </p>
            <p> It can be suggested to conduct runs with statistical tests (e.g., Friedman or any other methods) and report confidence intervals.</p>
            <p> </p>
            <p> 
                <bold>5.&#x00a0;Are all the source data underlying the results available to ensure full reproducibility? </bold>Yes</p>
            <p> </p>
            <p> 
                <bold>6.&#x00a0;Are the conclusions drawn adequately supported by the results?</bold>
            </p>
            <p> It is recommended to add confidence intervals, statistical tests (e.g., McNemar's test, Friedman test with post-hoc analysis)</p>
            <p> </p>
            <p> 
                <bold>Citation Format Issues:</bold>&#x00a0;The manuscript contains improper in-text citation formatting that creates grammatically incomplete sentences. Specifically: 
                <list list-type="bullet">
                    <list-item>
                        <p>"According to,14 there are several..." is missing the author name(s) before the superscript citation</p>
                    </list-item>
                    <list-item>
                        <p>"in the study by,15,16 and it has been..." similarly omits the required author name(s)</p>
                    </list-item>
                </list> 
                <bold>Corrections needed entire manuscript:&#x00a0;</bold>
            </p>
            <p> Use narrative citations, for example: &#x201c;According to Abdel-Basset et al. [14], there are several optimization algorithms.</p>
            <p> </p>
            <p> I provide a reference demonstrating an example for statistical evaluation in machine learning research.</p>
            <p> (Reference 1)</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Civil Engineering and Computer Science</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-446950-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Evaluating machine learning performance using python for neural network models in urban transportation in New York city case study</article-title>.
                        <source>
                            <italic>Journal of Economy and Technology</italic>
                        </source>.<year>2026</year>;<volume>4</volume>:
                        <elocation-id>10.1016/j.ject.2025.11.001</elocation-id>
                        <fpage>266</fpage>-<lpage>283</lpage>
                        <pub-id pub-id-type="doi">10.1016/j.ject.2025.11.001</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment15331-446950">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Montshiwa</surname>
                            <given-names>Tlhalitshi</given-names>
                        </name>
                        <aff>Business Statistics, North West University Faculty of Economic and Management Sciences, Mahikeng, North West, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>None</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>23</day>
                    <month>1</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Comment 1:</bold> Update literature review with 2023-2025 papers.</p>
                <p> 
                    <bold>Response 1:</bold> Thank you for the comment. Kindly note that this article is derived from a PhD study conducted in 2020, which informed the initial scope and selection of the literature. Removal of references below 2023 may lead to misalignment with the theoretical framework and objectives of the study.</p>
                <p> </p>
                <p> 
                    <bold>Comment 2:</bold> Code availability&#x200b;, Random seeds&#x200b;, Incomplete hyperparameter specifications&#x200b;, Software versions&#x200b;, MLP architecture.</p>
                <p> 
                    <bold>Response 2:</bold> Thank you for this comment. The codes utilised for the analysis are available upon request. These codes include detailed information on the random seed settings, hyperparameter specifications, software versions, and the MLP architecture used in the study. This statement is included in the manuscript in data availability section.</p>
                <p> </p>
                <p> 
                    <bold>Comment 3:</bold> It can be suggested to conduct runs with statistical tests (e.g., Friedman or any other methods) and report confidence intervals.</p>
                <p> 
                    <bold>Response 3:</bold> This is a valuable suggestion, and the inclusion of confidence intervals as well as statistical comparison tests such as the Friedman and McNemar tests would enhance the robustness of the analysis. Due to the limited scope and objectives of the current study, these analyses were not implemented at this stage. However, they will be incorporated in future work to provide a more comprehensive statistical comparison of the models.</p>
                <p> </p>
                <p> 
                    <bold>Comment 4:</bold> It is recommended to add confidence intervals, statistical tests (e.g., McNemar's test, Friedman test with post-hoc analysis)</p>
                <p> 
                    <bold>Response 4:</bold> This is a valuable suggestion, and the inclusion of confidence intervals and formal statistical comparison tests such as McNemar&#x2019;s and Friedman tests would indeed strengthen the robustness of the analysis. However, given the limited scope and objectives of the current study, the focus was placed on comparative predictive performance using standard evaluation metrics. Incorporating additional inferential statistical tests is therefore left as a potential extension for future work.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
