<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.73314.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Sickle cell segmentation and classification for thalassemia aid diagnosis</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 1 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Leow</surname>
                        <given-names>Yen-Siang</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Ng</surname>
                        <given-names>Kok-Why</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-4516-4634</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Yoong</surname>
                        <given-names>Yih-Jian</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Ng</surname>
                        <given-names>Seng-Beng</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Faculty of Computing and Informatics, Multimedia University, Cyberjaya, Selangor, 63100, Malaysia</aff>
                <aff id="a2">
                    <label>2</label>Faculty of Computer Science and Information Technology, Universiti Putra Malaysia (UPM), UPM Serdang, Selangor, 43400, Malaysia</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:kwng@mmu.edu.my">kwng@mmu.edu.my</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>23</day>
                <month>11</month>
                <year>2021</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2021</year>
            </pub-date>
            <volume>10</volume>
            <elocation-id>1185</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>15</day>
                    <month>11</month>
                    <year>2021</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Leow YS et al.</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/10-1185/pdf"/>
            <abstract>
                <p>
                    <bold>Background</bold>: Thalassemia is a hereditary blood disease in which abnormal red blood cells (RBCs) carry insufficient oxygen throughout the body. Conventional methods of thalassemia detection through a complete blood count (CBC) test and peripheral blood smear image still possess a lot of weaknesses.</p>
                <p>
                    <bold>Methods</bold>: This paper proposes a hybrid segmentation method to segment the RBCs. It incorporates adaptive thresholding and canny edge method to segment the RBCs. Morphological operations are performed to clean the leftovers. Shape and texture features are extracted using the segmented masks and the gray level co-occurrence matrix. Data imbalance treatment is used for solving the imbalance cell type class in distribution. In the data resampling layer, the synthetic minority oversampling technique (SMOTE), adaptive synthetic sampling (ADASYN), and random over sampling (ROS) are performed and evaluated using the decision tree and logistic regression. In the classification layer, the decision tree, random forest classifier and support vector machine (SVM) are assessed and compared for the best performance in classification.</p>
                <p>
                    <bold>Results</bold>:The proposed method outperforms the other methods in the image segmentation layer with the structural similarity index measure (SSIM) of 89.88%. In the data resampling layer, ADASYN is employed as it is more accurate than the SMOTE and ROS. The random forest classifier is chosen at the classification layer as it is more accurate than the decision tree and support vector machine (SVM).</p>
                <p>
                    <bold>Conclusions</bold>:The proposed method is tested on the latest dataset of erythrocyteIDB3 and it solves the issues of imbalanced data due to the insufficient cell classes.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Red Blood Cell</kwd>
                <kwd>Thalassemia</kwd>
                <kwd>Image processing</kwd>
                <kwd>Image Segmentation</kwd>
                <kwd>Morphological Operations</kwd>
                <kwd>Feature Extraction</kwd>
                <kwd>Data Resampling</kwd>
                <kwd>Machine Learning</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1">
                    <funding-source>MMU IR Fund</funding-source>
                    <award-id>MMU/RMC/GRPROP/IRFUND/2020/33411</award-id>
                </award-group>
                <funding-statement>This work is supported by the funding of Multimedia University IR Fund (Ref: MMUI/210005).</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>Sickle cell disease is a hereditary disease that disrupts the efficiency of RBCs to carry sufficient oxygen to the body. Traditional methods to detect this disease relied on hematologists manually counting RBCs and classifying them based on their topologies. This has motivated us to explore and develop an automated process to detect the presence of the disease. However, this field poses a lot of challenges such as insufficient medical data, poor extraction methods, and the tedious pipeline-related issues.</p>
            <p>This paper aims to solve some of the aforementioned problems. Firstly, a hybrid segmentation method is presented to accurately segment the RBCs. Next, the data imbalance treatment is used to solve the imbalance cell type class in a distribution manner. The pipeline of the paper includes the comparison between the conventional methods in the image segmentation layer, data resampling layer and the classification layer. The proposed method involves the image acquisition, image pre-processing, image segmentation, morphological operations, image cropping and manual classification, feature extraction, data resampling, and classification. The aim is to create a workflow that is able to detect the presence of thalassemia by recognizing the types of RBCs.</p>
        </sec>
        <sec id="sec2">
            <title>Related work</title>
            <p>In image pre-processing, Rashid 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>
                </sup> and Das 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup> employed a mixture of global contrast techniques, median blurring and color space conversion to increase the contrast. Color channel conversion is used to filter the unwanted cells in the image.</p>
            <p>In image segmentation layer, an inter-combination of the thresholding, edge-based, region-based, clustering-based and artificial neural network (ANN) is used to perform the segmentation. Sharma 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>
                </sup> compared different techniques of thresholding; their findings showed that Otsu&#x2019;s thresholding performed the best among other thresholding methods. This concurred with the observation done by Das 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup> Marked water controlled segmentation was used to solve the segmentation of cells in.
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup> Tyagi 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref5">5</xref>
                </sup> indicated that the Canny Edge technique was better at filtering noise presented in the image. Achargee 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup> used Hough transformation (HT) to detect cells with varying diameter, while Seth and Palodhi
                <sup>
                    <xref ref-type="bibr" rid="ref7">7</xref>
                </sup> proposed a modified version of the HT to detect white blood cells (WBC).</p>
            <p>Feature extraction in cell morphology are classified into boundary and region descriptors. Some of the features may overlap, but generally, boundary descriptors deal with the morphology of the RBCs and region descriptors extract the geometric and texture properties. Lotfi 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup> used Fourier descriptors to represent cell morphology. Tyagi 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref5">5</xref>
                </sup> used shape features such as area, perimeter solidity, eccentricity to describe the cells. Other features involved extracting the moment invariant such as invariant to scaling, translation and rotation were proposed by Lofti 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup> Since the introduction of deep learning, many skipped this layer due to the models can automatically extract the features. Fadhel 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref8">8</xref>
                </sup> proposed extracting features from the last fully connected layers and subsequently fed the features into the support vector machine (SVM) classifier. Purwar 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup> proposed a hybrid method of features extracted from the convolutional neural network (CNN) models
                <sup>
                    <xref ref-type="bibr" rid="ref15">15</xref>
                </sup>
                <sup>-</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref17">17</xref>
                </sup> and hand clinical features extracted from a CBC test.</p>
            <p>Hortinela 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref10">10</xref>
                </sup> and Safca 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup> used SVM to classify cells based on features extracted from different layers. In multiclass classification, Lotfi 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup> compared the K-nearest neighbors and SVM method using the one 
                <italic toggle="yes">versus</italic> all approach and found that SVM performed better. In a seperate study, Dalvi 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>
                </sup> showed that an ANN with ten hidden nodes and five output nodes performed better than decision tree. Tyas 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref13">13</xref>
                </sup> used ANN with momentum backpropagation and produced the best results among the classifiers.</p>
        </sec>
        <sec id="sec3" sec-type="methods">
            <title>Methods</title>
            <p>Ethical approval number: EA1542021</p>
            <p>Ethical approval body: Research Ethic Committee 2021, Multimedia University</p>
            <sec id="sec4">
                <title>Image acquisition</title>
                <p>The dataset used in this project was obtained from the erythrocytesIDB by Pedro 
                    <italic toggle="yes">et al</italic>.
                    <sup>
                        <xref ref-type="bibr" rid="ref14">14</xref>
                    </sup> It is selected because of the image clarity and optimal cell size. Each subdirectory contains the raw image, ground truth labels, and binary segmented masks. 
                    <xref ref-type="fig" rid="f1">Figure 1</xref> shows the raw blood smear images and its ground truth label for two images.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Raw image and ground truth label.</title>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure1.gif"/>
                </fig>
            </sec>
            <sec id="sec5">
                <title>Image pre-processing</title>
                <p>As many of the images are extracted using different extraction techniques, to increase the quality of the images while standardizing all of the images, firstly, the images are contrastly increased by using the contrast limited adaptive histogram equalization (CLAHE) method. The image is primarily converted to L.A.B. color space. The L, which refers to the luminance channel is operated by CLAHE and is then merged back with the other two channels.</p>
                <p>Next, median filtering is applied to remove noise in the image. An unsharped mask operation is applied to sharpen the edges of the cells which are previously lost during the median filtering process. Image subtraction is then constructed from the original image to obtain the details and the edges. 
                    <xref ref-type="fig" rid="f2">Figure 2</xref> depicts the result after the pre-processed layer of the two images.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Image before and after pre-processing.</title>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure2.gif"/>
                </fig>
            </sec>
            <sec id="sec6">
                <title>Image segmentation</title>
                <p>A hybrid technique is proposed by merging the adaptive thresholding and the canny edge detection method. Gaussian adaptive thresholding is employed to segment the color images with varying intensities. However, this method unable to capture the edges of the cells. An additional of the canny edge detection method with the parameter of 
                    <italic toggle="yes">&#x03bc;</italic> to be set to a lower value is applied. This allows the cell edges to be captured precisely and subsequently to be added to the threshold binary image later.</p>
                <p>Lastly, the leftover edges that are protruding out are removed to show a clean finish. 
                    <xref ref-type="fig" rid="f3">Figure 3</xref> shows the result of the segmentation layer compared to the ground truth given from the dataset. The evaluation of this layer is performed by calculating the SSIM index of the binary segmented image with the ground truth binary given in the dataset.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>Segmented image to be compared with the ground truth mask.</title>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure3.gif"/>
                </fig>
            </sec>
            <sec id="sec7">
                <title>Morphological operations</title>
                <p>Morphological operations are performed on binary images to clean up the leftover noises from the previous layers. A hole-filling operation is performed to close up the holes that are created using the canny edge detection method. Next, binary opening is used to remove the small particles that are presented in the image. Finally, cells that are located at the border are removed because they do not accurately provide the full feature description of the cell. 
                    <xref ref-type="fig" rid="f4">Figure 4</xref> depicts the result of the morphological operations.</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Image after morphological operations&#x00a0;to be compared with the ground truth&#x00a0;mask.</title>
                    </caption>
                    <graphic id="gr4" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure4.gif"/>
                </fig>
            </sec>
            <sec id="sec8">
                <title>Image cropping and classification</title>
                <p>Each of the cells in the microscopic blood sample images are cropped using the masks that are segmented from the previous layers. 
                    <xref ref-type="fig" rid="f5">Figure 5</xref> shows the individual cells that are being cropped, while 
                    <xref ref-type="fig" rid="f6">Figure 6</xref> depicts the sub-category of one particular cell. Each of the cropped cells has their own directory which is named based on their origin and cell number as depicted in 
                    <xref ref-type="fig" rid="f7">Figure 7</xref>. The cells are manually classified into their own class which is based on the classification given by Pedro 
                    <italic toggle="yes">et al.</italic>
                    <sup>
                        <xref ref-type="bibr" rid="ref14">14</xref>
                    </sup>
                </p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>Figure 5. </label>
                    <caption>
                        <title>Cells that are being cropped.</title>
                    </caption>
                    <graphic id="gr5" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure5.gif"/>
                </fig>
                <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                    <label>Figure 6. </label>
                    <caption>
                        <title>Sub-directory of one cell.</title>
                    </caption>
                    <graphic id="gr6" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure6.gif"/>
                </fig>
                <fig fig-type="figure" id="f7" orientation="portrait" position="float">
                    <label>Figure 7. </label>
                    <caption>
                        <title>Directory of the circular cells.</title>
                    </caption>
                    <graphic id="gr7" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure7.gif"/>
                </fig>
            </sec>
            <sec id="sec9">
                <title>Feature extraction</title>
                <p>Two types of features are extracted: shape features and texture features.</p>
                <p>The shape features are extracted from the individually cropped masks and then to be combined into a data frame. The features extracted include the area, perimeter, major axis length, minor axis length, eccentricity, and the seven moment invariants immune to the scaling, translation, and rotation. Additional shape features which are circulatory factor and elliptical factor are calculated.</p>
                <p>Next, texture features are extracted using the gray level co-occurrence matrix (GLCM). The features extracted from the textures are the contrast, dissimilarity, homogeneity, ASM, energy and correlation.</p>
                <p>Finally, the two features are merged into a data frame with its correct label of either one of the three classes: circular, elongated, or other cells.</p>
            </sec>
            <sec id="sec10">
                <title>Data resampling</title>
                <p>The number of circulars, elongated and other types of cells total up to 316, 53 and 59 respectively. This dataset is imbalanced and may result the classifier to be overfit or bias to certain category of data.</p>
                <p>The synthetic minority oversampling technique (SMOTE), adaptive synthetic sampling (ADASYN) and random over sampling (ROS) are evaluated by using two classifiers, which are logistic regression and decision tree. 
                    <xref ref-type="fig" rid="f8">Figure 8</xref> shows the number of classes prior and after the resampling process. The bottom graph of 
                    <xref ref-type="fig" rid="f8">Figure 8</xref> demonstrations the best result for solving the imbalanced problem using ADASYN resampling technique.</p>
                <fig fig-type="figure" id="f8" orientation="portrait" position="float">
                    <label>Figure 8. </label>
                    <caption>
                        <title>ADASYN data resampling.</title>
                    </caption>
                    <graphic id="gr8" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure8.gif"/>
                </fig>
            </sec>
            <sec id="sec11">
                <title>Classification</title>
                <p>In classification, a total of three classifiers are trained and compared. The chosen classifiers are the decision tree, random forest and the support vector machine.</p>
                <p>The models are evaluated using the classification report which contains the information of accuracy, precision, recall f1-score and support.</p>
                <p>For each classifier, the receiver operating characteristic (ROC) curve is plotted to observe the results. The images that are excluded before the sampling process are tested individually for their accuracy and reliability of the model chosen.</p>
            </sec>
        </sec>
        <sec id="sec12" sec-type="Results|discussion">
            <title>Results and discussion</title>
            <sec id="sec13">
                <title>Image segmentation layer</title>
                <p>The ground truth masks are provided in the dataset. The evaluation are calculated based on the structural similarity index (SSIM) between the ground truth masks and the segmentation of the proposed method.</p>
                <p>The average of the computed SSIM index from each image is 89.88%. However, it is worth noting that the value only estimates how well the segmentation is. In 
                    <xref ref-type="fig" rid="f9">Figure 9</xref>, observing the ground truth mask, some cells are not presented in the ground truth mask (middle image), while it is presented in the proposed segmented image (right most image).</p>
                <fig fig-type="figure" id="f9" orientation="portrait" position="float">
                    <label>Figure 9. </label>
                    <caption>
                        <title>Cells missing in ground truth mask.</title>
                    </caption>
                    <graphic id="gr9" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure9.gif"/>
                </fig>
            </sec>
            <sec id="sec14">
                <title>Data resampling layer</title>
                <p>
                    <xref ref-type="fig" rid="f10">Figure 10</xref> shows the comparison of the resampling techniques between the SMOTE, ADASYN and ROS. Based on the result, logistic regression handles the unbalanced data-sets well as compared with the decision tree. ADASYN outperforms the rest at 84.49% of its accuracy through the logistic regression classifier. Also worth noting is that, even with the unbalanced dataset, its accuracy sits highly at 82.17%, which indicates that the features extracted optimally best describe the types of cells (
                    <xref ref-type="fig" rid="f11">Figure 11</xref>).</p>
                <fig fig-type="figure" id="f10" orientation="portrait" position="float">
                    <label>Figure 10. </label>
                    <caption>
                        <title>Comparison of different sampling techniques.</title>
                    </caption>
                    <graphic id="gr10" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure10.gif"/>
                </fig>
                <fig fig-type="figure" id="f11" orientation="portrait" position="float">
                    <label>Figure 11. </label>
                    <caption>
                        <title>Resampling techniques evaluation bar plot.</title>
                    </caption>
                    <graphic id="gr11" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure11.gif"/>
                </fig>
            </sec>
            <sec id="sec15">
                <title>Classification</title>
                <p>The classification layer is evaluated based on the accuracy of the classification report and the ROC curve of each classifier. 
                    <xref ref-type="fig" rid="f12">Figures 12</xref> to 
                    <xref ref-type="fig" rid="f14">14</xref> depict the model evaluation summary for each of the classifiers.</p>
                <fig fig-type="figure" id="f12" orientation="portrait" position="float">
                    <label>Figure 12. </label>
                    <caption>
                        <title>Decision tree model evaluation.</title>
                    </caption>
                    <graphic id="gr12" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure12.gif"/>
                </fig>
                <fig fig-type="figure" id="f13" orientation="portrait" position="float">
                    <label>Figure 13. </label>
                    <caption>
                        <title>Random forest model evaluation.</title>
                    </caption>
                    <graphic id="gr13" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure13.gif"/>
                </fig>
                <fig fig-type="figure" id="f14" orientation="portrait" position="float">
                    <label>Figure 14. </label>
                    <caption>
                        <title>SVM model evaluation.</title>
                    </caption>
                    <graphic id="gr14" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/76957/cea941d8-9a9b-4770-9f4e-b0d50ee02c53_figure14.gif"/>
                </fig>
                <p>From the results, the random forest classifier performs the best with the accuracy of 90% while the others are below 90%.</p>
                <p>The ROC curve for the SVM dips below the threshold where the number of incorrectly classified classes are more than the correctly classified classes, which indicates a bad classifier.</p>
                <p>Comparing the area under curve (AUC) of the ROC curve, the random forest classifier covers the most area for all the three cell classes of the circular, elongated and other types of cell, which demonstrates a superior classifier.</p>
            </sec>
        </sec>
        <sec id="sec16" sec-type="conclusion">
            <title>Conclusion</title>
            <p>We performed several comparisons and the implementation between different techniques in the segmentation layer, data resampling layer and the classification layer. These provide a good working pipeline for a complete aiding diagnosis for the sickle cell disease detection. The whole pipeline provides a precise classification for each of the cells in microscopic blood smeared images with a 90&#x2013;95% of accuracy on average.</p>
            <p>In addition, we present a strong baseline in the segmentation layer, which is able to detect some of the missing cells that were not presented in the previous work. At the same time, the data resampling layers are added to observe the effect of an unbalanced medical dataset. The combination of different models exhibits the performance of the best classification model.</p>
        </sec>
        <sec id="sec17">
            <title>Author contributions</title>
            <p>Leow YS, Ng KW, Yoong YJ and Ng SB conceived the presented idea. Leow YS carried out the experiment and wrote the manuscript. Ng KW, Yoong YJ and Ng SB supervised the project and provided feedback.</p>
        </sec>
        <sec id="sec18">
            <title>Data availability</title>
            <sec id="sec19">
                <title>Underlying data</title>
                <p>Data were obtained by permission from 
                    <ext-link ext-link-type="uri" xlink:href="http://erythrocytesidb.uib.es/">http://erythrocytesidb.uib.es/</ext-link>.</p>
                <p>Data samples were obtained from volunteer patients. This dataset was not generated nor is it owned by the authors of this article; the listed owners are Universidad de Oriente, Cuba, and Universitat de les Illes Balears, Spain. Therefore, neither the authors nor F1000Research are responsible for the content of this dataset and cannot provide information about data collection.</p>
            </sec>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgements</title>
            <p>We thank the anonymous reviewers for their careful reading of our manuscript and their insightful comments and suggestions.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rashid</surname>
                            <given-names>NZN</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mashor</surname>
                            <given-names>MY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hassan</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>Unsupervised color image segmentation of red blood cell for thalassemia disease.</article-title>
                    <source>

                        <italic toggle="yes">2015 2nd International Conference on Biomedical Engineering (ICoBE).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2015, March</year>; (pp.<fpage>1</fpage>&#x2013;<lpage>6</lpage>).</mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Das</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ghosh</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chakraborty</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Invariant moment based feature analysis for abnormal erythrocyte recognition.</article-title>
                    <source>

                        <italic toggle="yes">2010 International Conference on Systems in Medicine and Biology.</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2010, December</year>; (pp.<fpage>242</fpage>&#x2013;<lpage>247</lpage>).</mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sharma</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rathore</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vyas</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>Detection of sickle cell anaemia and thalassaemia causing abnormalities in thin smear of human blood sample using image processing.</article-title>
                    <source>

                        <italic toggle="yes">2016 International Conference on Inventive Computation Technologies (ICICT).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2016, August</year>; (Vol.<volume>3</volume>: pp.<fpage>1</fpage>&#x2013;<lpage>5</lpage>).</mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lotfi</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nazari</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sadri</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The detection of dacrocyte, schistocyte and elliptocyte cells in iron deficiency anemia.</article-title>
                    <source>

                        <italic toggle="yes">2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2015, March</year>; (pp.<fpage>1</fpage>&#x2013;<lpage>5</lpage>).</mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tyagi</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Saini</surname>
                            <given-names>LM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dahyia</surname>
                            <given-names>N</given-names>
                        </name>
</person-group>:
                    <article-title>Detection of poikilocyte cells in iron deficiency anaemia using artificial neural network.</article-title>
                    <source>

                        <italic toggle="yes">2016 International Conference on Computation of Power, Energy Information and Commuincation (ICCPEIC).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2016, April</year>; (pp.<fpage>108</fpage>&#x2013;<lpage>112</lpage>).</mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Acharjee</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chakrabartty</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Alam</surname>
                            <given-names>MI</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A semiautomated approach using GUI for the detection of red blood cells.</article-title>
                    <source>

                        <italic toggle="yes">2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2016, March</year>; (pp.<fpage>525</fpage>&#x2013;<lpage>529</lpage>).</mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Seth</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Palodhi</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>An efficient algorithm for segregation of white and red blood cells based on modified hough transform.</article-title>
                    <source>

                        <italic toggle="yes">2017 IEEE Calcutta Conference (CALCON).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2017, December</year>; (pp.<fpage>465</fpage>&#x2013;<lpage>468</lpage>).</mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Alzubaidi</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fadhel</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Al-Shamma</surname>
                            <given-names>O</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Deep learning models for classification of red blood cells in microscopy images to aid in sickle cell anemia diagnosis.</article-title>
                    <source>

                        <italic toggle="yes">Electronics</italic>
</source>.<year>2020</year>;<volume>9</volume>(<issue>3</issue>):<fpage>427</fpage>.
                    <pub-id pub-id-type="doi">10.3390/electronics9030427</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Purwar</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tripathi</surname>
                            <given-names>RK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ranjan</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Detection of microcytic hypochromia using cbc and blood film features extracted from convolution neural network by different classifiers.</article-title>
                    <source>

                        <italic toggle="yes">Multimed. Tools Appl.</italic>
</source>
                    <year>2020</year>;<volume>79</volume>(<issue>7</issue>):<fpage>4573</fpage>&#x2013;<lpage>4595</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s11042-019-07927-0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hortinela</surname>
                            <given-names>CC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Balbin</surname>
                            <given-names>JR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fausto</surname>
                            <given-names>JC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Identification of Abnormal Red Blood Cells and Diagnosing Specific Types of Anemia Using Image Processing and Support Vector Machine.</article-title>
                    <source>

                        <italic toggle="yes">2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;
(pp.<fpage>1</fpage>&#x2013;<lpage>6</lpage>).</mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Safca</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Popescu</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ichim</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Image processing techniques to identify red blood cells.</article-title>
                    <source>

                        <italic toggle="yes">2018 22nd International Conference on System Theory, Control and Computing (ICSTCC).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2018, October</year>; (pp.<fpage>93</fpage>&#x2013;<lpage>98</lpage>).</mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dalvi</surname>
                            <given-names>PT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vernekar</surname>
                            <given-names>N</given-names>
                        </name>
</person-group>:
                    <article-title>Computer aided detection of abnormal red blood cells.</article-title>
                    <source>

                        <italic toggle="yes">2016 IEEE International Conference on Recent Trends in Electronics, Information &amp; Communication Technology (RTEICT).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2016, May</year>; (pp.<fpage>1741</fpage>&#x2013;<lpage>1746</lpage>).</mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tyas</surname>
                            <given-names>DA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ratnaningsih</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Harjoko</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The classification of abnormal red blood cell on the minor thalassemia case using artificial neural network and convolutional neural network.</article-title>
                    <source>

                        <italic toggle="yes">Proceedings of the International Conference on Video and Image Processing</italic>
</source>
                    <year>2017, December</year>; (pp.<fpage>228</fpage>&#x2013;<lpage>233</lpage>).</mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Pedro</surname>
                            <given-names>MF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Grethel</surname>
                            <given-names>CS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wilkie</surname>
                            <given-names>EDF</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>erythrocytesIDB (Version 2, October 2017).</article-title>
                    <ext-link ext-link-type="uri" xlink:href="http://erythrocytesidb.uib.es/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Acharya</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kumar</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Identification and red blood cell classification using computer aided system to diagnose blood disorders.</article-title>
                    <source>

                        <italic toggle="yes">2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2017, September</year>; (pp.<fpage>2098</fpage>&#x2013;<lpage>2104</lpage>).</mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ong</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Haw</surname>
                            <given-names>SC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ng</surname>
                            <given-names>KW</given-names>
                        </name>
</person-group>:
                    <article-title>Deep Learning Based-Recommendation System: An Overview on Models, Datasets, Evaluation Metrics, and Future Trends.</article-title>
                    <source>

                        <italic toggle="yes">Proceedings of the 2019 2nd International Conference on Computational Intelligence and Intelligent Systems</italic>
</source>
                    <year>2019, November</year>; (pp.<fpage>6</fpage>&#x2013;<lpage>11</lpage>).</mixed-citation>
            </ref>
            <ref id="ref17">
                <label>17</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ang</surname>
                            <given-names>JS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ng</surname>
                            <given-names>KW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chua</surname>
                            <given-names>FF</given-names>
                        </name>
</person-group>:
                    <article-title>Modeling Time Series Data with Deep Learning: A Review, Analysis, Evaluation and Future Trend.</article-title>
                    <source>

                        <italic toggle="yes">2020 8th International Conference on Information Technology and Multimedia (ICIMU).</italic>
</source>
                    <publisher-name>IEEE</publisher-name>;<year>2020, August</year>; (pp.<fpage>32</fpage>&#x2013;<lpage>37</lpage>).</mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report362042">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.76957.r362042</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Christensen</surname>
                        <given-names>Frederik</given-names>
                    </name>
                    <xref ref-type="aff" rid="r362042a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0009-0002-2410-8852</uri>
                </contrib>
                <aff id="r362042a1">
                    <label>1</label>Aalborg University, Aalborg, Denmark</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>17</day>
                <month>2</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Christensen F</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport362042" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.73314.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors clearly states the objectives of the study and introduces a comparative analysis of different data preprocessing techniques, resampling methods and classification methods for segmenting Red Blood Cells. The final hybrid model shows great and promising results.&#x00a0;</p>
            <p> </p>
            <p> Q1: Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p> A2: Partly. 
                <list list-type="order">
                    <list-item>
                        <p>The authors&#x00a0;literature review cites relevant and current literature, however, should results from the literature cited is missing and should be included in the discussion to further enhance the conclusion.</p>
                    </list-item>
                    <list-item>
                        <p>The authors should include literature about the medical/clinical methods available for detecting sickle cell disorder (such as HPLC, DNA testing and Sequencing methods).&#x00a0;</p>
                    </list-item>
                </list> </p>
            <p> Q2: Is the study design appropriate and is the work technically sound?</p>
            <p> A2: Partly. 
                <list list-type="order">
                    <list-item>
                        <p>The study design would benefit from a flowchart of the entire process (from preprocessing to results).</p>
                    </list-item>
                    <list-item>
                        <p>It is unclear when the data resampling process takes place in the pipeline. The resampling process should be after splitting the dataset into training set and test set as to avoid data leakage between training and testing.&#x00a0;</p>
                    </list-item>
                    <list-item>
                        <p>The size of the dataset might be an issue for classification methods and should be noted as a limitation of the study.&#x00a0;</p>
                    </list-item>
                </list> </p>
            <p> Q3: Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p> A3: Partly. 
                <list list-type="order">
                    <list-item>
                        <p>Please see above. - The size of the dataset might be an issue for classification methods and should be noted as a limitation of the study.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>1. Classification of Hemoglobinopathies including sickle cell disorder and thalassemias.&#x00a0; 2. Data analysis and modelling of machine learning classification methods for clinical decision support tools</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report100976">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.76957.r100976</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Muppalaneni</surname>
                        <given-names>Naresh Babu</given-names>
                    </name>
                    <xref ref-type="aff" rid="r100976a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-8499-1918</uri>
                </contrib>
                <aff id="r100976a1">
                    <label>1</label>National Institute of Technology Silchar, Silchar, Assam, India</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>9</day>
                <month>2</month>
                <year>2022</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Muppalaneni NB</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport100976" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.73314.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have done extensive and novel work to bring this scientific work.&#x00a0;</p>
            <p> </p>
            <p> For Sickle cell, the circulatory feature is important, which is considered by authors in the feature set. The authors have extracted features based on shape and texture.&#x00a0;</p>
            <p> The authors have addressed the imbalance problem also.</p>
            <p> </p>
            <p> Please include the following in the next version of the paper: 
                <list list-type="order">
                    <list-item>
                        <p>Comparative studies of authors proposed methods and with literature</p>
                    </list-item>
                    <list-item>
                        <p>Missing dataset details like the size of the dataset, training set size, test dataset size, etc.,</p>
                    </list-item>
                    <list-item>
                        <p>Reasons for the vast difference&#x00a0;in accuracy for SVM and Decision Tree/Random Forest.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Machine Learning, Deep Learning</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
